Software Architecture: 14th European Conference, ECSA 2020, L'Aquila, Italy, September 14–18, 2020, Proceedings [1st ed.] 9783030589226, 9783030589233

This book constitutes the refereed proceedings of the 14th International Conference on Software Architecture, ECSA 2020,

267 41 22MB

English Pages XXI, 370 [379] Year 2020

Table of contents :
Front Matter ....Pages i-xxi
Front Matter ....Pages 1-1
Assessing Architecture Conformance to Coupling-Related Patterns and Practices in Microservices (Evangelos Ntentos, Uwe Zdun, Konstantinos Plakidas, Sebastian Meixner, Sebastian Geiger)....Pages 3-20
Formal Software Architectural Migration Towards Emerging Architectural Styles (Nacha Chondamrongkul, Jing Sun, Ian Warren)....Pages 21-38
Monolith Migration Complexity Tuning Through the Application of Microservices Patterns (João Franscisco Almeida, António Rito Silva)....Pages 39-54
Front Matter ....Pages 55-55
Decentralized Architecture for Energy-Aware Service Assembly (Mauro Caporuscio, Mirko D’Angelo, Vincenzo Grassi, Raffaela Mirandola)....Pages 57-72
Continuous Experimentation for Automotive Software on the Example of a Heavy Commercial Vehicle in Daily Operation (Federico Giaimo, Christian Berger)....Pages 73-88
Towards Using Probabilistic Models to Design Software Systems with Inherent Uncertainty (Alex Serban, Erik Poll, Joost Visser)....Pages 89-97
Front Matter ....Pages 99-99
Empowering SysML-Based Software Architecture Description with Formal Verification: From SysADL to CSP (Fagner Dias, Marcel Oliveira, Thais Batista, Everton Cavalcante, Jair Leite, Flavio Oquendo et al.)....Pages 101-117
A Flexible Architecture for Key Performance Indicators Assessment in Smart Cities (Martina De Sanctis, Ludovico Iovino, Maria Teresa Rossi, Manuel Wimmer)....Pages 118-135
Front Matter ....Pages 137-137
A Multi-objective Performance Optimization Approach for Self-adaptive Architectures (Davide Arcelli)....Pages 139-147
Data Stream Operations as First-Class Entities in Component-Based Performance Models (Dominik Werle, Stephan Seifermann, Anne Koziolek)....Pages 148-164
Architecture-Centric Support for Integrating Security Tools in a Security Orchestration Platform (Chadni Islam, Muhammad Ali Babar, Surya Nepal)....Pages 165-181
VisArch: Visualisation of Performance-based Architectural Refactorings (Catia Trubiani, Aldeida Aleti, Sarah Goodwin, Pooyan Jamshidi, Andre van Hoorn, Samuel Gratzl)....Pages 182-190
Front Matter ....Pages 191-191
An Initial Study on the Association Between Architectural Smells and Degradation (Sebastian Herold)....Pages 193-201
Architectural Technical Debt: A Grounded Theory (Roberto Verdecchia, Philippe Kruchten, Patricia Lago)....Pages 202-219
Does BERT Understand Code? – An Exploratory Study on the Detection of Architectural Tactics in Code (Jan Keim, Angelika Kaplan, Anne Koziolek, Mehdi Mirakhorli)....Pages 220-228
Front Matter ....Pages 229-229
Teaching Students Software Architecture Decision Making (Rafael Capilla, Olaf Zimmermann, Carlos Carrillo, Hernán Astudillo)....Pages 231-246
The PDEng Program on Software Technology (Ad T. M. Aerts, Yanja Dajsuren)....Pages 247-262
Front Matter ....Pages 263-263
Architectural Concerns for Digital Twin of the Organization (Mauro Caporuscio, Farid Edrisi, Margrethe Hallberg, Anton Johannesson, Claudia Kopf, Diego Perez-Palacin)....Pages 265-280
Quick Evaluation of a Software Architecture Using the Decision-Centric Architecture Review Method: An Experience Report (Pablo Cruz, Luis Salinas, Hernán Astudillo)....Pages 281-295
The Quest for Introducing Technical Debt Management in a Large-Scale Industrial Company (Somayeh Malakuti, Sergey Ostroumov)....Pages 296-311
Front Matter ....Pages 313-313
Determining Microservice Boundaries: A Case Study Using Static and Dynamic Software Analysis (Tiago Matias, Filipe F. Correia, Jonas Fritzsch, Justus Bogner, Hugo S. Ferreira, André Restivo)....Pages 315-332
IAS: An IoT Architectural Self-adaptation Framework (Mahyar T. Moghaddam, Eric Rutten, Philippe Lalanda, Guillaume Giraud)....Pages 333-351
A Comparison of MQTT Brokers for Distributed IoT Edge Computing (Heiko Koziolek, Sten Grüner, Julius Rückert)....Pages 352-368
Back Matter ....Pages 369-370

Recommend Papers

Software Architecture: 14th European Conference, ECSA 2020 Tracks and Workshops, L'Aquila, Italy, September 14–18, 2020, Proceedings [1st ed.] 9783030591540, 9783030591557

This book constitutes the refereed proceedings of the tracks and workshops which complemented the 14th European Conferen

569 109 48MB Read more

Software Architecture. 17th European Conference, ECSA 2023 Istanbul, Turkey, September 18–22, 2023 Proceedings 9783031425912, 9783031425929

104 20 27MB Read more

Scalable Uncertainty Management: 14th International Conference, SUM 2020, Bozen-Bolzano, Italy, September 23–25, 2020, Proceedings [1st ed.] 9783030584481, 9783030584498

This book constitutes the refereed proceedings of the 14th International Conference on Scalable Uncertainty Management,

420 65 6MB Read more

Software Architecture: First European Conference, ECSA 2007 Aranjuez, Spain, September 24-26, 2007 Proceedings [1 ed.] 9783540751311, 3540751319

Software architecture has emerged as an important subdiscipline of software engineering encompassing a broad set of lang

384 70 5MB Read more

Software Architecture: 17th European Conference, ECSA 2023, Istanbul, Turkey, September 18–22, 2023, Proceedings [1 ed.] 303142591X, 9783031425912, 9783031425929

This book constitutes the refereed proceedings of the 17th International Conference on Software Architecture, ECSA 2023,

100 46 27MB Read more

Brain Informatics: 13th International Conference, BI 2020, Padua, Italy, September 19, 2020, Proceedings [1st ed.] 9783030592769, 9783030592776

This book constitutes the refereed proceedings of the 13th International Conference on Brain Informatics, BI 2020, held

472 29 52MB Read more

Software Architecture: 15th European Conference, ECSA 2021, Virtual Event, Sweden, September 13-17, 2021, Proceedings (Programming and Software Engineering) 3030860434, 9783030860431

This book constitutes the refereed proceedings of the 15th International Conference on Software Architecture, ECSA 2021,

121 9 18MB Read more

Systems, Software and Services Process Improvement: 27th European Conference, EuroSPI 2020, Düsseldorf, Germany, September 9–11, 2020, Proceedings [1st ed.] 9783030564407, 9783030564414

This volume constitutes the refereed proceedings of the 27th European Conference on Systems, Software and Services Proce

948 48 70MB Read more

Software Architecture: First European Conference, ECSA 2007, Madrid, Spain, September 24-26, 2007, Proceedings (Lecture Notes in Computer Science, 4758) 3540751319, 9783540751311

Software architecture has emerged as an important subdiscipline of software engineering encompassing a broad set of lang

114 38 11MB Read more

Provable and Practical Security: 14th International Conference, ProvSec 2020, Singapore, November 29 – December 1, 2020, Proceedings [1st ed.] 9783030625757, 9783030625764

This book constitutes the refereed proceedings of the 14th International Conference on Provable Security, ProvSec 2020,

663 55 12MB Read more

Software Architecture: 14th European Conference, ECSA 2020, L'Aquila, Italy, September 14–18, 2020, Proceedings [1st ed.]
9783030589226, 9783030589233

Author / Uploaded
Anton Jansen
Ivano Malavolta
Henry Muccini
Ipek Ozkaya
Olaf Zimmermann

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

File loading please wait...

Citation preview

LNCS 12292

Anton Jansen · Ivano Malavolta · Henry Muccini · Ipek Ozkaya · Olaf Zimmermann (Eds.)

Software Architecture 14th European Conference, ECSA 2020 L’Aquila, Italy, September 14–18, 2020 Proceedings

Lecture Notes in Computer Science Founding Editors Gerhard Goos Karlsruhe Institute of Technology, Karlsruhe, Germany Juris Hartmanis Cornell University, Ithaca, NY, USA

Editorial Board Members Elisa Bertino Purdue University, West Lafayette, IN, USA Wen Gao Peking University, Beijing, China Bernhard Steffen TU Dortmund University, Dortmund, Germany Gerhard Woeginger RWTH Aachen, Aachen, Germany Moti Yung Columbia University, New York, NY, USA

12292

More information about this series at http://www.springer.com/series/7408

Anton Jansen Ivano Malavolta Henry Muccini Ipek Ozkaya Olaf Zimmermann (Eds.) •

•

•

•

Software Architecture 14th European Conference, ECSA 2020 L’Aquila, Italy, September 14–18, 2020 Proceedings

123

Editors Anton Jansen Koninklijke Philips N.V. Eindhoven, The Netherlands

Ivano Malavolta VU Amsterdam Amsterdam, The Netherlands

Henry Muccini University of L’Aquila L’Aquila, Italy

Ipek Ozkaya Carnegie Mellon University Pittsburg, PA, USA

Olaf Zimmermann University of Applied Sciences of Eastern Switzerland Rapperswil, Switzerland

ISSN 0302-9743 ISSN 1611-3349 (electronic) Lecture Notes in Computer Science ISBN 978-3-030-58922-6 ISBN 978-3-030-58923-3 (eBook) https://doi.org/10.1007/978-3-030-58923-3 LNCS Sublibrary: SL2 – Programming and Software Engineering © Springer Nature Switzerland AG 2020 This work is subject to copyright. All rights are reserved by the Publisher, whether the whole or part of the material is concerned, speciﬁcally the rights of translation, reprinting, reuse of illustrations, recitation, broadcasting, reproduction on microﬁlms or in any other physical way, and transmission or information storage and retrieval, electronic adaptation, computer software, or by similar or dissimilar methodology now known or hereafter developed. The use of general descriptive names, registered names, trademarks, service marks, etc. in this publication does not imply, even in the absence of a speciﬁc statement, that such names are exempt from the relevant protective laws and regulations and therefore free for general use. The publisher, the authors and the editors are safe to assume that the advice and information in this book are believed to be true and accurate at the date of publication. Neither the publisher nor the authors or the editors give a warranty, expressed or implied, with respect to the material contained herein or for any errors or omissions that may have been made. The publisher remains neutral with regard to jurisdictional claims in published maps and institutional afﬁliations. This Springer imprint is published by the registered company Springer Nature Switzerland AG The registered company address is: Gewerbestrasse 11, 6330 Cham, Switzerland

Preface

The European Conference on Software Architecture (ECSA) is the premier European conference that provides researchers and practitioners with a platform to present and discuss the most recent, innovative, and signiﬁcant ﬁndings and experiences in the ﬁeld of software architecture research and practice. This 14th edition of ECSA builds upon a series of successful European workshops on software architecture held during 2004–2006, as well as a series of European software architecture conferences during 2007–2019. This edition of ECSA had a unique nature as due to the novel coronavirus, COVID-19, it was the ﬁrst ECSA conference that was originally to be held in L’Aquila, Italy, but convened the participants around the globe virtually during September 14–18, 2020. This year’s technical program included a main research track, three keynote talks, and an industry track (included in this volume), as well as a doctoral symposium track with its own keynote, a gender diversity in software architecture track with its own keynote, and a tool demos track. In addition, ECSA 2020 also offered nine workshops on diverse topics related to the software architecture discipline, such as automotive architectures, quality-aware DevOps, and IoT systems. In addition, ECSA 2020 featured a journal ﬁrst track partnering with the Journal of Software and Systems, Elsevier, and the IEEE Software Magazine. The contributions of all these other meetings are included in the companion proceedings, published in a volume by Springer CCIS. ECSA 2020 received 103 contributions to all tracks. For the main research track, we received 60 submissions in the two main categories: full and short research papers. Based on the recommendations of the Program Committee, we accepted 12 papers as full papers and 5 additional papers as short papers. Hence the acceptance rate for full research papers was 20% for ECSA 2020. For the industrial track, we received 11 submissions and accepted 6 of them. The conference attracted papers (co-)authored by researchers, practitioners, and academia from 24 countries (Austria, Australia, Brazil, Canada, Chile, Columbia, Denmark, Ecuador, Finland, France, Germany, Italy, the Netherlands, New Zealand, Spain, Pakistan, Poland, Portugal, Romania, Sweden, Switzerland, Tunisia, the UK, and the USA). The main ECSA program had three keynotes. Professor Ivica Crnkovic from Chalmers University, Sweden, talked about “AI engineering—new challenges in system and software architecting and managing lifecycle for AI-based systems.” Professor Diomidis Spinellis, from Athens University of Economics and Business, Greece, gave a presentation on “Fifty years of sustained progress: Form, forces, and lessons of Unix architectural evolution.” The industry keynote was delivered by Michael Keeling, an experienced software engineer and the author of the book “Design It! From Programmer to Software Architect.” We are grateful to the members of the Program Committee for helping us to seek submissions and provide valuable and timely reviews. Their efforts enabled us to put together a high-quality technical program for ECSA 2020. We would like to thank the

vi

Preface

members of the Organizing Committee of ECSA 2020 for playing an enormously important role in successfully organizing the event with several tracks and collocated events, as well as the workshop organizers, who made signiﬁcant contributions to this year’s successful event. We also thank our sponsors who provided ﬁnancial support for the event: the University of L’Aquila, Italy, provided the technology infrastructure and the support needed, nExpecto, and Springer. The ECSA 2020 submission and review process was supported by the EasyChair conference management system. We acknowledge the prompt and professional support from Springer who published these proceedings in electronic volumes as part of the Lecture Notes in Computer Science series. Finally, we would like to thank the authors of all the ECSA 2020 submissions and the attendees of the conference for their participation. ECSA 2020 planning and execution took place during an unprecedented time in our history, globally we had to face a pandemic as well as understand and react to consequences of systematic racism and intolerance. As the ECSA community, we pledge to stand against racism and intolerance and strive to elevate the ideas and voices of black, indigenous, and people of color who have been historically excluded because of systemic racism. We thank the support of the software architecture community, they reacted by continuing to advance the ﬁeld of software architecture through their scientiﬁc submissions to ECSA, while staying flexible as the Organizing Committee had to pivot several times from an in-person, to hybrid, to an all-online conference. July 2020

Anton Jansen Ivano Malavolta Henry Muccini Ipek Ozkaya Olaf Zimmermann

Organization

General Chair Henry Muccini

University of L’Aquila, Italy

Steering Committee Muhammad Ali Babar Paris Avgeriou Tomas Bures Rogério de Lemos Laurence Duchien Carlos E. Cuesta David Garlan Paola Inverardi Patricia Lago Antónia Lopes Ivano Malavolta Raffaela Mirandola Henry Muccini Flavio Oquendo (Chair) Ipek Ozkaya Jennifer Pérez Bedir Tekinerdogan Danny Weyns Uwe Zdun

The University of Adelaide, Australia University of Groningen, The Netherlands Charles University, Czech Republic University of Kent, UK CRIStAL, University of Lille, France Rey Juan Carlos University, Spain Carnegie Mellon University, USA University of L’Aquila, Italy Vrije Universiteit Amsterdam, The Netherlands University of Lisbon, Portugal Vrije Universiteit Amsterdam, The Netherlands Politecnico di Milano, Italy University of L’Aquila, Italy IRISA, University of South Brittany, France Carnegie Mellon University, USA Technical University of Madrid, Spain Wageningen University, The Netherlands KU Leuven, Belgium University of Vienna, Austria

Research Track Program Committee Chairs Ivano Malavolta Ipek Ozkaya

Vrije Universiteit Amsterdam, The Netherlands Carnegie Mellon University, USA

Program Committee Jesper Andersson Paris Avgeriou Rami Bahsoon Luciano Baresi Thais Batista Steffen Becker Stefan Biffl

Linnaeus University, Sweden University of Groningen, The Netherlands University of Birmingham, UK Politecnico di Milano, Italy Federal University of Rio Grande do Norte, Brazil University of Stuttgart, Germany TU Wien, Austria

viii

Organization

Barbora Buhnova Tomas Bures Javier Cámara Rafael Capilla Jan Carlson Siobhán Clarke Vittorio Cortellessa Carlos Cuesta Rogerio De Lemos Elisabetta Di Nitto Andres Diaz Pace Khalil Drira Laurence Duchien Neil Ernst George Fairbanks Matthias Galster Ilias Gerostathopoulos Carlo Ghezzi Volker Gruhn Petr Hnetynka Paola Inverardi Pooyan Jamshidi Wouter Joosen Anne Koziolek Heiko Koziolek Patricia Lago Nuno Laranjerio Nicole Levy Grace Lewis Antónia Lopes Kristina Lundquist Sam Malek Tomi Männistö Antonio Martini Tommi Mikkonen Mehdi Mirakhorli Raffaela Mirandola Marina Mongiello Gabriel Moreno Juan Manuel Elisa Yumi Elena Navarro Flavio Oquendo Claus Pahl Liliana Pasquale Cesare Pautasso

Masaryk University, Czech Republic Charles University, Czech Republic University of York, UK Rey Juan Carlos University, Spain Malardalen University, Sweden Trinity College Dublin, Ireland University of L’Aquila, Italy Rey Juan Carlos University, Spain University of Kent, UK Politecnico di Milano, Italy UNICEN University, Argentina LAAS-CNRS, France University of Lille, France University of Victoria, Canada Google, USA University of Canterbury, New Zealand TU Munich, Germany Politecnico di Milano, Italy Universität Duisburg-Essen, Germany Charles University, Czech Republic University of L’Aquila, Italy University of South Carolina, USA KU Leuven, Belgium Karlsruhe Institute of Technology, Germany ABB Corporate Research, Germany Vrije Universiteit Amsterdam, The Netherlands University of Coimbra, Portugal CNAM, France Carnegie Mellon University, USA University of Lisbon, Portugal Malardalen University, Sweden University of California, Irvine, USA University of Helsinki, Finland University of Oslo, Norway University of Helsinki, Finland Rochester Institute of Technology, USA Politecnico di Milano, Italy Politecnico di Bari, Italy Carnegie Mellon University, USA Murillo University of Extremadura, Spain Nakagawa University of São Paulo, Brazil University of Castilla-La Mancha, Spain Université Bretagne Sud, France Free University of Bozen-Bolzano, Italy University College Dublin, LERO, Ireland USI Lugano, Switzerland

Organization

Patrizio Pelliccione Jennifer Perez Claudia Raibulet Maryam Razavian Ralf Reussner Bradley Schmerl Romina Spalazzese Girish Suryanarayana Bedir Tekinerdogan Chouki Tibermacine Rainer Weinreich Danny Weyns Uwe Zdun Liming Zhu Olaf Zimmermann

Chalmers University of Technology, Sweden Universidad Politécnica de Madrid, Spain University of Milano-Bicocca, Italy Eindhoven University of Technology, The Netherlands Karlsruhe Institute of Technology, Germany Carnegie Mellon University, USA Malmö University, Sweden Siemens Corporate Technology, India Wageningen University, The Netherlands University of Montpellier, France Johannes Kepler University Linz, Austria KU Leuven, Belgium University of Vienna, Austria The University of New South Wales, Australia Hochschule für Technik Rapperswill, Switzerland

Additional Reviewers Anastase Adonis Abdulatif Alabdulatif Maria Istela Cagnin Everton Cavalcante Milena Guessi

Axel Legay Samir Ouchani Eduardo Silva Roberto Verdecchia

Industry Track Program Committee Chairs Anton Jansen Olaf Zimmermann

Philips, The Netherlands Hochschule für Technik Rapperswil, Switzerland

Program Committee Mohsen Anvaari Andrei Furda Heiko Koziolek Thomas Kurpick Xabier Larrucea Daniel Lübke Željko Obrenović Eltjo Poort Daniele Spinosi Michael Stal Johannes Wettinger Erik Wittern Eoin Woods

ix

Independent Consultant, Norway Hitachi Rail STS, Australia ABB Corporate Research, Germany Trusted Shops, Germany Tecnalia, Spain iQuest GmbH, Germany Incision, The Netherlands CGI, The Netherlands Micron Technology, Italy Siemens, Germany Bosch, Germany IBM T.J. Watson Research Center, USA Endava, UK

x

Organization

Additional Reviewers Stefan Kapferer Mirko Stocker

Organizing Committee Proceedings Chair Mirco Franzago

University of L’Aquila, Italy

Web Chair Karthik Vaidhyanathan

Gran Sasso Science Institute, Italy

Tool Demos Chairs Paris Avgeriou Barbora Buhnova

University of Groningen, The Netherlands Masaryk University, Czech Republic

Gender Diversity in SA Chairs Javier Camara Catia Trubiani

University of York, UK Gran Sasso Science Institute, Italy

Doctoral Symposium Chairs Patrizia Scandurra Danny Weyns

DIIMM, University of Bergamo, Italy KU Leuven, Belgium

Workshops Chairs Mauro Caporuscio Anne Koziolek

Linnaeus University, Sweden Karlsruhe Institute of Technology, Germany

Journal First Chair Uwe Zdun

University of Vienna, Austria

Publicity Chairs Stéphanie Challita Juergen Musil

Inria, France TU Wien, Austria

Student Volunteer Chairs Roberta Capuano Jamal El Hecham

University of L’Aquila, Italy IRISA, France

Organization

Virtualization Chairs Claudio Di Sipio Luca Traini

University of L’Aquila, Italy University of L’Aquila, Italy

xi

Keynotes

AI Engineering — New Challenges in System and Software Architecting and Managing Lifecycle for AI-based Systems

Ivica Crnkovic Chalmers University, Gothenburg, Sweden [email protected] Abstract. Artiﬁcial Intelligence based on Machine Learning, and in particular Deep Learning, is today the fastest growing trend in software development, and literally used in all other research disciplines, with a very high impact on the modern society. However, a wide use of AI in many systems, in particular dependable systems, is still far away of being widely used. On the one hand there is a shortage of expertise, on the other hand the challenges for managing AI-based complex and dependable systems are enormous, though less known, and in general underestimated. Some aspects of these challenges are based on management of resources, including computational, data storage capacity, performance, and real-time constraints. Introduction of AI-based components, i.e. components that includes AI algorithms, require signiﬁcant changes in system and software architecture, and its successful deployment is based on many architectural decisions and on changes of the development process. This talk discusses some of these challenges, illustrate a case of Cyber-physical systems, and gives some ideas for new research in software engineering inducing software architecture, i.e. for AI engineering.

Short Bio Ivica Crnkovic is a professor of software engineering at Chalmers University, Gothenburg, Sweden. He is the director of ICT Area of Advance at Chalmers University, and the director of Chalmers AI Research Centre (CHAIR). His research interests include, software architecture, software development processes, software engineering for large complex systems, component-based software engineering, and recently Software engineering for AI. Professor Crnkovic is the author of more than 200 refereed publications on software engineering topics, and guest editor of a number of special issues in different journals and magazines, such as IEEE Software, and Elsevier JSS. He was the general chair of 40th International Conference on Software Engineering (ICSE) 2018, held in Gothenburg, 2018. Before Chalmers, Ivica Crnkovic was afﬁliated with Mälardalen University, Sweden, and before that he was employed at ABB company, Sweden, where he was responsible for software development environments and tools. More information is available on http://www.ivica-crnkovic.net

Fifty Years of Sustained Progress: Form, Forces, and Lessons of Unix Architectural Evolution

Diomidis Spinellis Department of Management Science and Technology, Athens University of Economics and Business, Greece [email protected] Abstract. Unix has evolved over ﬁve decades, shaping modern operating systems, key software technologies, and development practices. Studying the evolution of this remarkable system from an architectural perspective can provide insights on how to manage the growth of large, complex, and long-lived software systems. Along main Unix releases leading to the FreeBSD lineage we examine core architectural design decisions, the number of features, and code complexity, based on the analysis of source code, reference documentation, and related publications. We see that the growth in size has been uniform, with some notable outliers, while cyclomatic complexity has been religiously safeguarded. A large number of Unix-deﬁning design decisions were implemented right from the very early beginning, with most of them still playing a major role. Unix continues to evolve from an architectural perspective, but the rate of architectural innovation has slowed down over the system’s lifetime. Architectural technical debt has accrued in the forms of functionality duplication and unused facilities, but in terms of cyclomatic complexity it is systematically being paid back through what appears to be a self-correcting process. Some unsung architectural forces that shaped Unix are the emphasis on conventions over rigid enforcement, the drive for portability, a sophisticated ecosystem of other operating systems and development organizations, and the emergence of a federated architecture, often through the adoption of third-party subsystems. These ﬁndings allow us to form an initial theory on the architecture evolution of large, complex operating system software.

Short Bio Diomidis Spinellis is a Professor in the Department of Management Science and Technology at the Athens University of Economics and Business, Greece. His research interests include software engineering, IT security, and cloud systems engineering. He has written two award-winning, widely- translated books: “Code Reading” and “Code Quality: The Open Source Perspective”. His most recent book is “Effective Debugging: 66 Speciﬁc Ways to Debug Software and Systems”. Dr. Spinellis has also published more than 300 technical papers in journals and refereed conference proceedings, which have received more than 8000 citations. He served for a decade as a member of the

Fifty Years of Sustained Progress: Form, Forces, and Lessons of Unix

xvii

IEEE Software editorial board, authoring the regular “Tools of the Trade” column, and as the magazine’s Editor-in- Chief over the period 2015–2018. He has contributed code that ships with Apple’s macOS and BSD Unix and is the developer of UMLGraph, CScout, git-issue, and other open-source software packages, libraries, and tools. Dr. Spinellis is a senior member of the ACM and the IEEE.

Mighty Methods: Four Essential Tools for Every Software Architect’s Silver Toolbox

Michael Keeling LendingHome, USA [email protected] Abstract. It is an oversimpliﬁcation to say that we are living in extraordinary times. When my team was ﬁrst asked to work from home back in February we were happy to do our part in attempting to stem the tide of an inevitable global pandemic. While we were eager to help, we were also nervous about how suddenly distributing our co-located team would affect our way of working. And yet, after several months we’ve settled into a “new normal” that looks surprisingly similar to our way of working from Before. Much about how we worked changed, in some cases dramatically, but a handful of design methods that were central to our team remained effective even after the shift from a co-located to fully distributed context. In particular, mob programming, example mapping, architecture decision records, and visual thinking are consistently among the most versatile and reliable tools in my silver toolbox. In this talk we’ll briefly explore these four methods and speculate about what makes them effective tools for software architects in such a broad range of contexts and situations. While this is not a talk about remote work per se, we’ll attempt to use the shifting context of work we’ve all experienced to further isolate variables that might help us identify other potential mighty methods waiting for software architects to adopt.

Short Bio Michael Keeling is a software engineer at LendingHome and the author of Design It!: From Programmer to Software Architect. Prior to LendingHome, Keeling worked at IBM on the Watson Discovery Service, Vivisimo, BuzzHoney, and Black Knight Technology. Keeling has also served as an Adjunct Faculty member at Carnegie Mellon University in the Master of Software Engineering Distance Program since 2009. He holds a Master in Software Engineering from Carnegie Mellon University in Pittsburgh, PA and a Bachelor of Science in Computer Science from the College of William and Mary in Williamsburg, VA. Keeling’s current research interests include software architecture design methods, agile software development, and human factors of software engineering. He is a regular speaker in the architecture and agile communities, presenting papers and talks, and facilitating workshops for both national and international audiences. Keeling is a two-time winner of the SEI/IEEE Software “Architecture in Practice” Best Presentation Award for talks given at the 2012 and 2014 SATURN conferences. A full list of his talks and workshops are available on his website: http://www.neverletdown.net/p/speaking-and-writing.html.

Contents

Microservices Assessing Architecture Conformance to Coupling-Related Patterns and Practices in Microservices . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Evangelos Ntentos, Uwe Zdun, Konstantinos Plakidas, Sebastian Meixner, and Sebastian Geiger

3

Formal Software Architectural Migration Towards Emerging Architectural Styles . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Nacha Chondamrongkul, Jing Sun, and Ian Warren

21

Monolith Migration Complexity Tuning Through the Application of Microservices Patterns . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . João Franscisco Almeida and António Rito Silva

39

Uncertainty, Self-adaptive, and Open System Decentralized Architecture for Energy-Aware Service Assembly . . . . . . . . . . Mauro Caporuscio, Mirko D’Angelo, Vincenzo Grassi, and Raffaela Mirandola

57

Continuous Experimentation for Automotive Software on the Example of a Heavy Commercial Vehicle in Daily Operation . . . . . . . . . . . . . . . . . . Federico Giaimo and Christian Berger

73

Towards Using Probabilistic Models to Design Software Systems with Inherent Uncertainty . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Alex Serban, Erik Poll, and Joost Visser

89

Model-Based Approaches Empowering SysML-Based Software Architecture Description with Formal Verification: From SysADL to CSP. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Fagner Dias, Marcel Oliveira, Thais Batista, Everton Cavalcante, Jair Leite, Flavio Oquendo, and Camila Araújo A Flexible Architecture for Key Performance Indicators Assessment in Smart Cities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Martina De Sanctis, Ludovico Iovino, Maria Teresa Rossi, and Manuel Wimmer

101

118

xx

Contents

Performance and Security Engineering A Multi-objective Performance Optimization Approach for Self-adaptive Architectures . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Davide Arcelli

139

Data Stream Operations as First-Class Entities in Component-Based Performance Models . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Dominik Werle, Stephan Seifermann, and Anne Koziolek

148

Architecture-Centric Support for Integrating Security Tools in a Security Orchestration Platform . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Chadni Islam, Muhammad Ali Babar, and Surya Nepal

165

VisArch: Visualisation of Performance-based Architectural Refactorings . . . . Catia Trubiani, Aldeida Aleti, Sarah Goodwin, Pooyan Jamshidi, Andre van Hoorn, and Samuel Gratzl

182

Architectural Smells and Source Code Analysis An Initial Study on the Association Between Architectural Smells and Degradation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Sebastian Herold Architectural Technical Debt: A Grounded Theory . . . . . . . . . . . . . . . . . . . Roberto Verdecchia, Philippe Kruchten, and Patricia Lago Does BERT Understand Code? – An Exploratory Study on the Detection of Architectural Tactics in Code . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Jan Keim, Angelika Kaplan, Anne Koziolek, and Mehdi Mirakhorli

193 202

220

Education and Training Teaching Students Software Architecture Decision Making. . . . . . . . . . . . . . Rafael Capilla, Olaf Zimmermann, Carlos Carrillo, and Hernán Astudillo The PDEng Program on Software Technology: Experience Report on a Doctorate Level Architecture Training Program . . . . . . . . . . . . . . . . . . Ad T. M. Aerts and Yanja Dajsuren

231

247

Experiences and Learnings from Industrial Case Studies Architectural Concerns for Digital Twin of the Organization. . . . . . . . . . . . . Mauro Caporuscio, Farid Edrisi, Margrethe Hallberg, Anton Johannesson, Claudia Kopf, and Diego Perez-Palacin

265

Contents

xxi

Quick Evaluation of a Software Architecture Using the Decision-Centric Architecture Review Method: An Experience Report . . . . . . . . . . . . . . . . . . Pablo Cruz, Luis Salinas, and Hernán Astudillo

281

The Quest for Introducing Technical Debt Management in a Large-Scale Industrial Company . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Somayeh Malakuti and Sergey Ostroumov

296

Architecting Contemporary Distributed Systems Determining Microservice Boundaries: A Case Study Using Static and Dynamic Software Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Tiago Matias, Filipe F. Correia, Jonas Fritzsch, Justus Bogner, Hugo S. Ferreira, and André Restivo

315

IAS: An IoT Architectural Self-adaptation Framework . . . . . . . . . . . . . . . . . Mahyar T. Moghaddam, Eric Rutten, Philippe Lalanda, and Guillaume Giraud

333

A Comparison of MQTT Brokers for Distributed IoT Edge Computing . . . . . Heiko Koziolek, Sten Grüner, and Julius Rückert

352

Author Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .

369

Microservices

Assessing Architecture Conformance to Coupling-Related Patterns and Practices in Microservices Evangelos Ntentos1(B) , Uwe Zdun1 , Konstantinos Plakidas1 , Sebastian Meixner2 , and Sebastian Geiger2 1

Faculty of Computer Science, Research Group Software Architecture, University of Vienna, Vienna, Austria {Evangelos.Ntentos,Uwe.Zdun,Konstantinos.Plakidas}@univie.ac.at 2 Siemens Corporate Technology, Vienna, Austria {Sebastian.Meixner,Sebastian.Geiger}@siemens.com

Abstract. Microservices are the go-to architectural style for building applications that are polyglot, support high scalability, independent development and deployment, and are rapidly adaptable to changes. Among the core tenets for a successful microservice architecture is high independence of the individual microservices, i.e. loose coupling. A number of patterns and best practices are well-established in the literature, but most actual microservice-based systems do not, as a whole or in part, conform to them. Assessing this conformance manually is not realistically possible for large-scale systems. This study aims to provide the foundations for an automated approach for assessing conformance to coupling-related patterns and practices speciﬁc for microservice architectures. We propose a model-based assessment based on generic, technology-independent metrics, connected to typical design decisions encountered in microservice architectures. We demonstrate and assess the validity and appropriateness of these metrics by performing an assessment of the conformance of real-world systems to patterns through statistical methods.

1

Introduction

Microservice architectures [14,15,22] describe an application as a collection of autonomous and loosely coupled services, typically modeled around a domain. Key microservice tenets are development in independent teams, cloud-native technologies and architectures, polyglot technology stacks including polyglot persistence, lightweight containers, loosely coupled service dependencies, high releasability, and continuous delivery [22]. Many architectural patterns that reﬂect recommended “best practices” in a microservices context have already been published in the literature [18,19,23]. The fact that microservice-based systems are complex and polyglot means that an automatic or semi-automatic assessment of their conformance to these patterns is diﬃcult: real-world systems feature combinations of these patterns, and diﬀerent degrees of violations of the c Springer Nature Switzerland AG 2020 A. Jansen et al. (Eds.): ECSA 2020, LNCS 12292, pp. 3–20, 2020. https://doi.org/10.1007/978-3-030-58923-3_1

4

E. Ntentos et al.

same; and diﬀerent technologies in diﬀerent parts of the system implement the patterns in diﬀerent ways, making the automatic parsing of code and identiﬁcation of the patterns a haphazard process. This work focuses on describing a method for assessing architecture conformance to coupling-related patterns and practices in microservice architectures. Coupling between microservices is caused by existence of dependencies, e.g. whenever one service calls another service to fulﬁll a request or share data. Loose coupling is an established topic in service-oriented architectures [22] but the application to the speciﬁc context of microservice architectures has not, to our knowledge, been examined so far. Strong coupling is conﬂicting with some of the key microservice tenets mentioned above. In particular, releasability, which is a highly desirable characteristic in modern systems due to the emergence of DevOps practices, relies on the rapid and independent release of individual microservices, and is compromised by strong dependencies between them. For the same reason, development in independent teams becomes more diﬃcult, and independent deployment of individual microservices in lightweight containers is also impeded. This work covers three broad coupling aspects: Coupling through Databases, resulting from reliance on commonly accessed data via shared databases; Coupling through Synchronous Invocations, resulting from synchronous communication between individual services; and Coupling through Shared Services, which arises through the dependence on common shared services (for details see Sect. 3). In reality, of course, no microservice system can support all microservice tenets well at the same time. Rather the architectural decisions for or against the use of speciﬁc patterns and practices must reﬂect a trade-oﬀ between ensuring the desired tenets and other important quality attributes [12,22]. From these considerations, this paper aims to study the following research questions: – RQ1 How can we automatically assess conformance to loose coupling-related patterns and practices in the context of microservice architecture decision options? – RQ2 How well do measures for assessing coupling-related decision options and their associated tenets perform? – RQ3 What is a set of minimal elements needed in a microservice architecture model to compute such measures? In pursuing of these questions, we surveyed the relevant literature (Sect. 2) and gathered knowledge sources about established architecture practices and patterns, their relations and tenets in form of a qualitative study on microservice architectures. This enabled us to create a meta-model for the description of microservice architectures, which was veriﬁed and reﬁned through iterative application in modelling a number of real-world systems, as outlined in Sect. 4. We manually assessed all models and model variants on whether each decision option is supported, thereby deriving an objective ground truth (Sect. 5). As the basis for an automatic assessment, we deﬁned a number of generic, technology-independent metrics to measure architecture conformance to the decision options, i.e. at least one metric per major decision option (Sect. 6).

Assessing Architecture Conformance

5

These metrics (and combinations thereof) were applied on the models and model variants to derive a numeric assessment, and then compared to the ground truth assessment via an ordinal regression analysis (Sect. 7). Section 8 discusses the results of our approach, as well as its limitations and potential threats to validity. Finally, in Sect. 9 we draw our conclusions and discuss options for future work.

2

Related Work

Many studies focus on best practices for microservice architectures. Richardson [18] has published a collection of microservice patterns related to major design and architectural practices. Patterns related to microservice APIs have been introduced by Zimmermann et al. [23], while Skowronski [19] collected best practices for event-driven microservice architectures. Microservice fundamentals and best practices are also discussed by Fowler and Lewis [14], and are summarized in a mapping study by Pahl and Jamshidi [16]. Taibi and Lenarduzzi [20] study microservice “bad smells”, i.e. practices that should be avoided (which would correspond to violations in our work). Many software metrics-related studies for evaluating the system architecture and individual architectural components exist, but most of them are not speciﬁc to the microservices domain. Allen et al. [1,2] study component metrics for measuring a number of quality attributes, e.g. size, coupling, cohesion, dependencies of components, and the complexity of the architecture. Additional studies for assessing quality attributes related to coupling and cohesion have been proposed and validated in the literature [3,4,6,11]. Furthermore, a small number of studies [5,17,21] propose metrics speciﬁcally for assessing microservice-based software architectures. Although these works study various aspects of architecture, design metrics, and architecture-relevant tenets such as coupling and independent deployment, their approach is usually generic. None of the works covers all the related software aspects for measuring coupling in a microservice context: the use of databases, system asynchronicity, and shared components. This is the overarching perspective of our work, and the chief contribution of this paper.

3

Decisions

In this section, we brieﬂy introduce the three coupling-related decisions along with their decision options (i.e. the relevant patterns and practices) which we study in this paper. We also discuss the impact on relevant microservice tenets, which we later on use as an argumentation for our manual ground truth assessment in Sect. 5. Inter-Service Coupling Through Databases. One important decision in microservice-based systems is data persistence, which needs to take into account qualities such as reliability and scalability, but also adhere to microservicespeciﬁc best practices, which recommend that each microservice should be

6

E. Ntentos et al.

loosely coupled and thus able to be developed, deployed, and scaled independently [14]. At one extreme of the scale, one option is No Persistent Data Storage, which is applicable only for services whose functions are performed on transient data. Otherwise, the most recommended option is the Database per Service pattern [18]: each service has its own database and manages its own data independently. Another option, which negatively aﬀects loose coupling, is to use a Shared Database [18]: a service writes its data in a common database and other services can read these data when required. There are two diﬀerent ways to implement this pattern: in Data Shared via Shared Database multiple services share the same table, resulting in a strongly coupled system, whereas in Databased Shared but no Data Sharing each service writes to and reads from its own tables, which has a lesser impact on coupling. Inter-Service Coupling Through Synchronous Invocations. Service integration is another core decision when building a microservice-based system. A theoretically optimal system of independent microservices would feature no communication between them. Of course, services need to communicate in reality, and so the question of integrating them so as to not result in tight inter-service coupling becomes paramount. The recommended practice is that communication between the microservices should be, as much as possible, asynchronous. This can be achieved through several patterns which are widely implemented in typical technology stacks: the Publish/Subscribe [13] pattern, in which services can subscribe to a channel to which other services can publish; the use of a Messaging [13] middleware, which decouples communication by using a queue to store messages sent by the producer until they are received by the consumer; the Data Polling [18] pattern, in which services periodically poll other services for data changes; and the Event Sourcing [18] pattern, that ensures that all changes to application state are stored as a sequence of events; Asynchronous Direct Invocation technique, in which services communicate asynchronously via direct invocations. Applying these patterns ensures loose coupling (to diﬀerent degrees), and increases the system reliability. Inter-Service Coupling Through Shared Services. Many of the microservice patterns focus on system structure, i.e. avoiding services sharing other services altogether, or at least not in a strongly coupled way. An optimal system in terms of architecture quality should not have any shared service. In reality, this is often not feasible, and in larger systems service sharing leads to chains of transitive dependencies between services. This is problematic when a service is unaware of its transitive dependencies, and of course for the shared service itself, where the needs of its dependents must always be taken into account during its evolution. We deﬁne three cases: a Directly Shared Service is a microservice which is directly linked to and required by more than one other service; a Transitively Shared Service is a microservice which is linked to other services via at least one intermediary service; and a Cyclic Dependency [10] which is formed when there is a direct or transitive path that leads back to its origin, i.e. that allows a service to be ultimately dependent on itself after a number of intermediary services. Cyclic dependencies often emerge inadvertently through increasing complexity

Assessing Architecture Conformance

7

over the system’s lifecycle, and require extensive refactoring to resolve. All three cases are inimical to the principle of loose coupling as well as to system qualities such as performance, modiﬁability, reusability, and scalability.

4

Research and Modeling Methods

In this section, we summarize the research and modeling methods applied in our study. The code and models used in and produced as part of this study have been made available online for reproducibility1 . 4.1

Research Method

Figure 1 shows the research steps from initial data collection to ﬁnal data analysis. For the data collection phase we conducted a multi-vocal literature study using web resources, public repositories, and scientiﬁc papers as sources [9]. We then analyzed the data collected using qualitative methods based on Grounded Theory [7] coding methods, such as open and axial coding, and extracted the three core architectural decisions described in the previous section along with their corresponding decision drivers and impacts. As data for our further research we used generated models taken from the Model Generation process, described below. We deﬁned a rating scheme for systematic assessment based on support or violation of core practices and tenets. From these we derived a ground truth for our study (the ground truth and its calculation rules are described in Sect. 5) as well as a set of metrics for automatically calculating conformance to each individual pattern or practice per decision. We then used the ground truth data to assess how well the hypothesized metrics can possibly predict the ground truth data by performing an ordinal regression analysis. Ordinal regression is a widely used method for modeling an ordinal response’s dependence on a set of independent predictors, which is applicable in a variety of domains. For the ordinal regression analysis we used the lrm function from the rms package in R [8]. 4.2

Model Generation

We began by performing an iterative study of a variety of microservice-related knowledge sources, and we gradually reﬁned a meta-model which contains all the required elements to help us reconstruct existing microservice-based systems. In order to investigate the ontology, and to evaluate the meta-model’s eﬃciency, we gathered a number of microservice-based systems, summarized in Table 1. Each is either a system published by practitioners (on GitHub and/or practitioner blogs) or a system variant adapted from a published example according to discussions in the relevant literature in order to explore the possible decision space. Apart from the speciﬁc variations described in Table 1 all other system aspects remained the same as in the base models. 1

https://bit.ly/2WmFP3N.

8

E. Ntentos et al. Data Collection Phase

Data Analysis

Architectural Design Decisions

Formulation of Core Decisions

Repositories Extract Ontology (Data Types) from best practices and tenets

Definition of Decision Options: Patterns and practices

Research Papers

Data Analysis: Qualitative study based on Grounded Theory method

Web Resources

Determine Decision Impacts for each option

Metrics Definition

Statistical Evaluation

Metrics Evaluation: Regression analysis on ability of the metrics to predict the ground truth assessment

Extraction of Decision Drivers: Quality attributes/Tenets

Objective Assessment

Define Metrics for quantifying extent of support/violation of each pattern/practice and tenet

Establish a Rating Scheme for System Assessment: Ordinal scale based on support or violation of patterns/practices and tenets

Automatic Calculation of generated metrics based on the system component models

Ground Truth Definition: Manual assessment of system model according to rating scheme

Model Generation

Static Code Analysis

Codeable Models Generator

System Component Model

Model Visualisation

Fig. 1. Overview diagram of the research method followed in this study

The systems were taken from 9 independent sources in total. They were developed by practitioners with microservice experience, and they are representative of the best practices summarized in Sect. 3. We performed a fully manual static code analysis for those models where the source code was available (7 of our 9 sources; two were modeled from documentation published by the practitioners). To create our models, we used our existing modeling tool CodeableModels2 , a Python implementation for the precise speciﬁcation of meta-models, models, and model instances in code. Based on CodeableModels, we speciﬁed meta-models for components, connectors and relationships. We then manually created model instances for each of the systems in Table 1. In addition, we realized automated constraint checkers and PlantUML code generators to generate graphical visualizations of all meta-models and models. The result is a set of precisely modeled component models of the examined software systems (modeled using the techniques described below). This resulted in a total of 27 models summarized in Table 1. We assume that our evaluation systems are, or reﬂect, real-world practical examples of microservice architectures. As many of them are open source systems with the purpose of demonstrating practices or technologies, they are at most of medium size and modest complexity, though. 2

https://github.com/uzdun/CodeableModels.

Assessing Architecture Conformance

4.3

9

Methods for Modeling Microservice Component Architectures

From an abstract point of view, a microservice-based system is composed of components and connectors, with a set of component types for each component and a set of connector types for each connector. For modeling microservice architectures we followed the method reported in our previous work [21].

5

Ground Truth Calculations

In this section, we present and describe the calculation of the ground truth assessment for each of the decisions from Sect. 3. The results of those assessments are reported in Table 2. The assessment begins with a manual evaluation by the authors on whether each of the relevant patterns (decision options) is either Supported, Partially Supported, or Not Supported (S, P, N in Table 2). Based on this and informed by the description of the impacts of the various decision options in Sect. 3, we combined the outcome of all decision options to derive an ordinal assessment on how well the decision as a whole is supported in each model, using the ordinal scale: [ ++: very well supported, +: well supported, o: neutral, -: badly supported, --: very badly supported]. This was done according to best practices documented in literature. For instance, following the ordinal scale the assessment for the model BM1 is +: well supported, since a) option Database per Service is not supported, b) some services have a shared database, but c) they do not share data via the shared database. For the Inter-Service Coupling through Databases decision, we derive the following scoring scheme for our ground truth assessment: • ++: All services (which require data persistence) have individual databases Database per Service. • +: Some services have Shared Databases and no Data Shared via the Shared Databases. • o: All services have Shared Databases and no Data Shared via the Shared Databases. • -: Some services have Shared Databases and Data Shared via the Shared Databases. • --: All services have Shared Databases and Data Shared via the Shared Databases. From the Inter-Service Coupling through Synchronous Invocations decision, we derive the following scoring scheme for our ground truth assessment: • ++: All services communicate asynchronously via Message Brokers or Publish/Subscribe or Stream Processing • +: All services communicate asynchronously via API Gateway or HTTP Polling or Direct Asynchronous calls, or (some) via Message Brokers or Publish/Subscribe or Stream Processing. • o: None or some services communicate asynchronously and all other services communicate asynchronously via Data Sharing (e.g. Shared DB).

10

E. Ntentos et al. Table 1. Overview of modelled systems (size, details, and sources) Model ID

Model Size

Description / Source

BM1

10 components 14 connectors

Banking-related application based on CQRS and event sourcing (from https:// github.com/cer/event-sourcing-examples).

BM2

8 components 9 connectors

Variant of BM1 which uses direct RESTful completely synchronous service invocations instead of event-based communication.

BM3

8 components 9 connectors

Variant of BM1 which uses direct RESTful completely asynchronous service invocations instead of event-based communication.

CO1

8 components 9 connectors

The common component model E-shop application implemented as microservices directly accessed by a Web frontend (from https://github.com/cocomecommunity-case-study/cocome-cloud-jee-microservices-rest).

CO2

11 components 17 connectors

Variant of CO1 using a SAGA orchestrator on the order service with a message broker. Added support for Open Tracing. Added an API gateway.

CO3

9 components 13 connectors

Variant of CO1 where the reports service does not use inter-service communication, but a shared database for accessing product and store data. Added support for Open Tracing.

CI1

11 components 12 connectors

Cinema booking application using RESTful HTTP invocations, databases per service, and an API gateway (from https://codeburst.io/build-a-nodejscinema-api-gateway-and-deploying-it-to-docker-\part-4703c2b0dd269).

CI2

11 components 12 connectors 10 components 11 connectors 11 components 12 connectors

Variant of CI1 routing all interservice communication via the API gateway.

EC1

10 components 14 connectors

E-commerce application with a Web UI directly accessing microservices and an API gateway for service-based API (from https://microservices.io/patterns/ microservices.html).

EC2

11 components 14 connectors 8 components 11 connectors 20 components 36 connectors

Variant of EC1 using event-based communication and event sourcing internally.

ES2

14 components 35 connectors

Variant of ES1 using RESTful communication via the API gateway instead of eventbased communication and one shared SQL DB for all 6 of the services using DBs. No service interaction via the shared database occurs.

ES3

16 components 35 connectors

Variant of ES1 using RESTful communication via the API gateway instead of eventbased communication and one shared database for all 4 of the services using SQL DB in ES1. However, no service interaction via the shared database occurs.

FM1

15 components 24 connectors

Simple food ordering application based on entity services directly linked to a Web UI (from https://github.com/jferrater/Tap-And-EatMicroServices).

FM2

14 components 21 connectors

Variant of FM1 which uses the store service as an API composition and asynchronous interservice communication. Added Jaeger-based tracing per service.

FM3

13 components 15 connectors

Variant of FM1 which demonstrates a cyclic dependency case, uses the store service as an API composition and asynchronous inter-service communication

HM1

13 components 25 connectors

Hipster shop application using GRPC interservice connection and OpenCensus monitoring & tracing for all but one services as well as on the gateway. (from https: //github.com/GoogleCloudPlatform/microservices-demo).

HM2

14 components 26 connectors

Variant of HM1 that uses publish/subscribe interaction with event sourcing, except for one service, and realizes the tracing on all services.

RM1

11 components 18 connectors

Restaurant order management application based on SAGA messaging and domain event interactions. Rudimentary tracing support (from https://github.com/ microservices-patterns/ftgo-application).

RM2

14 components 14 connectors

Variant of RM1 which contains transitively shared services, API Gateway for client services communication, database per service and direct communication between service.

RM3

14 components 15 connectors

Variant of RM1 which demonstrates a cyclic dependency case, API Gateway for client services communication, database per service and direct communication between service.

RS

18 components 29 connectors

Robot shop application with various kinds of service interconnections, data stores, and Instana tracing on most services (from https://github.com/instana/robotshop).

TH1

14 components 16 connectors

Taxi hailing application with multiple frontends and databases per services from (https://www.nginx.com/blog/introduction-to-microservices/).

TH2

15 components 18 connectors

Variant of TH1 that uses publish/subscribe interaction with event sourcing for all but one service interactions.

CI3 CI4

EC3 ES1

Variant of CI1 using direct client to service invocations instead of the API gateway. Variant of CI1 with a subsystem exposing services directly to the client and another subsystem routing all traffic via the API gateway.

Variant of EC1 with a shared database used to handle all but one service interactions. E-shop application using pub/sub communication for event-based interaction, a middleware-triggered identity service, databases per service (4 SQL DBs, 1 Mongo DB, and 1 Redis DB), and backends for frontends for two Web app types and one mobile app type (from https://github.com/dotnet-architecture/ eShopOnContainers).

P

N

+

Data Shared via Shared DB

Assessments

N

N

S

N

N

S

N

N

S P

P

P

++ ++ ++ ++ -

N

N

S

N

N

N

++ --

Asynch Inter-communication via API GW

Shared Database Interconnections

Messaging Interconnections

Assessments

-

N

N

S

N

N

--

N

N

N

N

N

N

N

N

N

N

N

N

++ ++ ++ ++ ++ ++ o

P

N

N

Assessments

N

Cyclic Dependencies

N

N

N

N

N

N

N

--

N

N

N

N

N

Direct Service Sharing N

N

N

N

S N

N

S N

N

S N

N

S N

N

S S

S

N

N

N

P

--

N

N

N

N

N

++ o

N

N

N

--

N

N

N

N

N

N

N

N

--

N

N

N

N

N

N

S

N

N

N

N

N

N

N

N

N

++ o

N

N

N

S

N

N

S

N

N

N

P

+

N

N

N

P

N

o

N

N

P

--

N

N

N

N

N

++ o

N

N

S

++ ++ ++ ++ o

N

N

N

--

N

N

N

N

N

++ ++ ++ ++ ++ ++ --

N

N

S

Transitively Shared Services

N

N

P

N

N

N

++ -

S

N

N

N

N

Inter-service coupling through shared services

N

N

N

N

S

PubSub/Event Sourcing Interconnections

N

N

Asynchronous Direct Interconnections

Inter-service coupling through synchronous invocations

N

Database per Service

Shared Database

Database-based inter-service coupling

o

N

N

P

--

N

N

N

N

N

+

N

P

P N

N

S N

N

S N

N

S

S N

N

N N

S

o

N

N

P

--

N

N

N

N

N

o

N

N

P

+

N

N

N

N

S

--

S

P

P

+

N

N

N

N

S

o

N

N

P

-

N

N

N

N

P

P

N

N

P

N

N

N

N

S

S N

N

N N

S

N

S

N

--

N

N

N

N

N

--

S

N

N

--

N

N

N

N

N

o

N

N

P

o

P

N

N

N

P

++ ++ o

N

N

++ ++ o

N

N

N

++ +

N

N

N

S

P

++ ++ ++ ++ ++ o

N

N

S

S N

N

o

N

N

P

--

N

N

N

N

N

++

N

N

N

+

N

N

N

P

N

++ ++

N

N

S

BM1 BM2 BM3 CO1 CO2 CO3 CI1 CI2 CI3 CI4 EC1 EC2 EC3 ES1 ES2 ES3 FM1 FM2 FM3 HM1 HM2 RM1 RM2 RM3 RS TH1 TH2

Table 2. Ground truth assessment results

Assessing Architecture Conformance 11

12

E. Ntentos et al.

• -: None or some services communicate asynchronously, none or some communicate asynchronously via Data Sharing, some services communicate synchronously. • --: All services communicate synchronously. Finally, from the Inter-Service Coupling through Shared Services decision, we derive the following scoring scheme for our ground truth assessment: • ++: None of the services is a Directly Shared Service or Transitively Shared Service and no Cyclic Dependencies exist. • +: Some of the services are Transitively Shared Services, but none are Directly Shared Services and no Cyclic Dependencies exist. • o: Some or none of the services are Transitively Shared Services and some are Directly Shared Services, but no Cyclic Dependencies exist. • -: Some of the services are Transitively Shared Services and all other services are Directly Shared Services, but no Cyclic Dependencies exist. • --: There are Cyclic Dependencies or all the services are Transitively Shared Components and all the services are Directly Shared Components.

6

Metrics

In this section, we describe the metrics we have hypothesized for each of the decisions described in Sect. 3. All metrics, unless otherwise noted, are a continuous value with range from 0 to 1, with 1 representing the optimal case where a set of patterns is fully supported, and 0 the worst-case scenario where it is completely absent. 6.1

Metrics for Inter-Service Coupling Through Databases Decision

Database Type Utilization (DTU) Metric. This metric returns the number of the connectors from Services to Individual Databases in relation to the total number of Service-to-Database connectors. This way, we can measure how many services are using individual databases. DT U =

Database per Service Links T otal Service-to-Database Links

Shared Database Interactions (SDBI) Metric. Although a Shared Database is considered as an anti-pattern in microservices, there are many systems that make use of it either partially or completely. To measure its presence in a system, we count the number of interconnections via a Shared Database compared to the total number of service interconnections. SDBI =

Service Interconnections with Shared Database T otal Service Interconnections

Assessing Architecture Conformance

6.2

13

Metrics for Inter-Service Coupling Through Synchronous Invocations Decision

Service Interaction via Intermediary Component (SIC) Metric. We deﬁned this metric to measure the proportion of service interconnections via asynchronous relay architectures such as Message Brokers, Publish/Subscribe, or Stream Processing. These represent the best current practices, and are not exhaustive; should any new architectures emerge, these should be added to this list. SIC =

Service Interconnections via [M essage Brokers | P ub/Sub | Stream] T otal Service Interconnections

Asynchronous Communication Utilization (ACU) Metric. This metric measures the proportion of the sum of asynchronous service interconnections (via API Gateway/HTTP Polling/Direct calls/Shared Database) to the total number of service interconnections. ACU =

6.3

Asynchronous Service Interconnections via [AP I | P olling | Direct Calls | Shared DB] T otal Service Interconnections

Metrics for Inter-Service Coupling Through Shared Services Decision

Direct Service Sharing (DSS) Metric. For measuring DSS we count all the directly shared services and set this number in relation to the total number of system services. To this add all the shared services connectors in relation to the total number of services interconnections. This gives us the proportion of the directly shared elements in the system. Shared Services Shared Services Connectors + T otal Services T otal Service Interconnections DSS = 2

Transitively Shared Services (TSS) Metric. For measuring TSS we count all the transitively shared services and set this number in relation to the total number of system services. To this we add all the transitively shared service connectors in relation to the total number of service interconnections. This gives us the proportion of the transitively shared elements in the system. T ransitively Shared Services T ransitively Shared Services Connectors + T otal Services T otal Service Interconnections T SS = 2

14

E. Ntentos et al.

Cyclic Dependencies Detection (CDD) Metric. Let SG = (S, C) be the service graph, S the set of service nodes, and C the set of connector edges in a microservice model. Based on the generic deﬁnition of closed paths, we deﬁne a closed service path in SG as a sequence of services s1 , s2 , . . . , sn (each service ∈ S) such that (sn , sn + 1) ∈ C is a directed connector between services for i = 1, 2, . . . , n and s1 = sn . A service cycle is a closed service path in which no service node is repeated except the ﬁrst and last, and which contains at least two distinct service nodes. Let ServiceCycles() return the set of all service cycles in a service graph. CDD returns 1 (True) if there is at least one cyclic dependency in the model: CDD =

6.4

1: 0:

if |ServiceCycles(SG)| = 0 otherwise

Metrics Calculation Results

We note that for the Inter-Service Coupling through Shared Services decision as well as SDBI metric, our metrics scale is reversed in comparison to the other two decisions, because here we detect the presence of an anti-pattern: the optimal result of our metrics is 0, and 1 is the worst-case result. The metrics results for each model per decision metric are presented in Table 3.

7

Ordinal Regression Analysis Results

The dependent outcome variables are the ground truth assessments for each decision, as described in Sect. 5 and summarized in Table 2. The metrics deﬁned in Sect. 6 and summarized in Table 3 are used as the independent predictor variables. The ground truth assessments are ordinal variables, while all the independent variables are measured on a scale from 0.0 to 1.0. The objective of the analysis is to predict the likelihood of the dependent outcome variable for each of the decisions by using the relevant metrics for each decision. Each resulting regression model consists of a baseline intercept and the independent variables multiplied by coeﬃcients. There are diﬀerent intercepts for each of the value transitions of the dependent variable (≥Badly Supported, ≥Neutral, ≥Well Supported, ≥Very Well Supported ), while the coeﬃcients reﬂect the impact of each independent variable on the outcome. For example, a positive coeﬃcient, such as +5, indicates a corresponding ﬁve-fold increase in the dependent variable for each unit of increase in the independent variable; conversely, a coeﬃcient of −30 would indicate a thirty-fold decrease. The statistical signiﬁcance of each regression model is assessed by the p-value; the smaller the p-value, the stronger the model. A p-value smaller than 0.05 is

Assessing Architecture Conformance

15

Table 3. Metrics calculation results Metrics BM1 BM2 BM3 CO1 CO2 CO3 CI1

CI2

CI3

CI4

EC1 EC2 EC3

Database-based inter-service coupling DTU

0.33 1.00 1.00 1.00 1.00 0.60 1.00 1.00 1.00 1.00 1.00 1.00 0.00

SDBI

0.00 0.00 0.00 0.00 0.00 1.00 0.00 0.00 0.00 0.00 0.00 0.00 1.00

Inter-service coupling through synchronous invocations SIC

1.00 0.00 0.00 0.00 1.00 0.00 0.00 0.00 0.00 0.00 0.00 1.00 0.00

ACU

0.00 0.00 1.00 0.00 0.00 1.00 0.00 0.00 0.00 0.00 0.00 0.00 1.00

Inter-service coupling through shared services DSS

0.00 0.00 0.00 0.00 0.00 0.00 0.20 0.00 0.38 0.00 0.00 0.00 0.00

TSS

0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

CDD

0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

Metrics ES1 ES2 ES3 FM1 FM2 FM3 HM1 HM2 RM1 RM2 RM3 RS

TH1 TH2

Database-based inter-service coupling DTU

1.00 0.00 0.33 1.00 1.00 1.00 1.00 1.00 0.00 1.00 1.00 0.66 1.00 1.00

SDBI

0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

Inter-service coupling through synchronous invocations SIC

0.60 0.00 0.00 0.00 0.00 0.00 0.00 0.80 1.00 0.00 0.00 0.11 0.00 0.60

ACU

0.00 0.00 0.00 0.00 1.00 0.08 0.50 0.20 0.00 0.00 0.00 0.11 0.00 0.00

Inter-service coupling through shared services DSS

0.27 0.34 0.34 0.62 0.47 0.55 0.52 0.00 0.00 0.00 0.00 0.36 0.33 0.00

TSS

0.00 0.00 0.00 0.00 0.00 0.18 0.00 0.00 0.00 0.18 0.16 0.00 0.00 0.00

CDD

0.00 0.00 0.00 0.00 0.00 1.00 0.00 0.00 0.00 0.00 1.00 0.00 0.00 0.00

generally considered statistically signiﬁcant. In Table 4, we report the p-values for the resulting models, which in all cases are very low, indicating that the sets of metrics we have deﬁned are able to predict the ground truth assessment for each decision with a high level of accuracy. Table 4. Regression analysis results Intercepts/coeﬃcients

Value

Model p-value

Database-based inter-service coupling Intercept (≥Badly Supported)

2.6572

Intercept (≥Neutral)

0.8789

Intercept (≥Well Supported)

−1.3820

1.706019e−06

Intercept (≥Very Well Supported) −3.1260 Metric Coeﬃcient (DTU)

6.4406

Metric Coeﬃcient (SDBI)

−3.7048 (continued)

16

E. Ntentos et al. Table 4. (continued) Inter-service coupling through synchronous invocations Intercept (≥Badly Supported)

−2.6973

Intercept (≥Neutral)

−4.4087

Intercept (≥Well Supported)

−5.8513

6.705525e−11

Intercept (≥Very Well Supported) −15.3677 Metric Coeﬃcient (SIC)

17.3520

Metric Coeﬃcient (ACU)

6.5520

Inter-service coupling through shared services Intercept (≥Neutral)

59.4089

1.625730e−10

Intercept (≥Very Well Supported) 9.7177

8

Metric Coeﬃcient (DSS)

−82.4474

Metric Coeﬃcient (TSS)

−122.2583

Metric Coeﬃcient (CDD)

−57.4650

Discussion

In this section, we ﬁrst discuss what we have learned in our study that helps to answer the research questions and then discuss potential threats to validity. 8.1

Discussion of Research Questions

To answer RQ1 and RQ2, we proposed a set of generic, technology-independent metrics for each coupling-related decision, and to each decision option corresponds at least one metric. We objectively assessed for each model how well patterns and/or practices are supported for establishing the ground truth, and extrapolated this to how well the broader decision is supported. We formulated metrics to numerically assess a pattern’s implementation in each model, and performed an ordinal regression analysis using these metrics as independent variables to predict the ground truth assessment. Our results show that every set of decision-related metrics can predict our objectively evaluated assessment with high accuracy. This suggests that automatic metrics-based assessment of a system’s conformance to the tenets embodied in each design decision is possible with a high degree of conﬁdence. Here, we make the assumption that the source code of a system can be mapped to the models used in our work. To enable this, we used rather simplistic modeling means, which can rather easily be mapped from a speciﬁc source code to the system models. However, it should be noted that full automation of this mapping is an additional eﬀort that needs to be considered and is the subject of ongoing work on our part.

Assessing Architecture Conformance

17

Regarding RQ3, we consider that existing modeling practices can be easily mapped to our microservice meta-model and there is no need for major extensions. More speciﬁcally, for completing the modeling of our evaluation system set, we needed to introduce 25 component types and 38 connector types, ranging from general notions such as the Service component type, to very technologyspeciﬁc classes such as the RESTful HTTP connector, which is a subclass of Service Connector. Our study shows that for each pattern and practice embodied in each decision, and the proposed metrics, only a small subset of the meta-model is required. The decisions Inter-Service Coupling through Databases and Inter-Service Coupling through Shared Services require to model at least the Service and the Database component types and the technology-related connector types (e.g. Database Connector, RESTful HTTP and Asynchronous Connector) and the read/write process which explicitly modeled in the Database Connector type. The Inter-Service Coupling through Synchronous Invocations decision requires a number of additional components (e.g. Event Sourcing, Stream Processing, Messaging, PubSub) and the respective connectors (e.g. Publisher, Subscriber, Message Consumer, Messages Producer, RESTful HTTP and Asynchronous Connector) to be modeled. 8.2

Threats to Validity

We deliberately relied on third-party systems as the basis for our study to increase internal validity, thus avoiding bias in system composition and structure. It is possible that our search procedures introduced some kind of unconscious exclusion of certain sources; we mitigated this by assembling an author team with many years of experience in the ﬁeld (including substantial industry experiences), and performing very general and broad searches. Given that our search was not exhaustive, and that most of the systems we found were made for demonstration purposes, i.e. relatively modestly sized, this means that some potential architecture elements were not included in our meta-model. In addition, this raises a possible threat to external validity of generalization to other, and more complex, systems. We nevertheless feel conﬁdent that the systems documented are a representative cross-cut of current practices in the ﬁeld, as the points of variance between them were limited and well attested in the literature. Another potential threat is the fact that the variant systems were derived by the author team. However, this was done according to best practices documented in literature. We carefully made sure only to change speciﬁc aspects in a variant and keep all other aspects stable. That is, while the variants do not represent actual systems, they are reasonable evolutions of the original designs. The modeling process is also considered as source of internal validity threat. The models of the systems were repeatedly and independently cross-checked by the author team that has considerable experience in similar methods, but the possibility of some interpretative bias remains: other researchers might have coded or modeled diﬀerently, leading to diﬀerent models. As a mitigation, we also oﬀer the whole models and the code as open access artifacts for review. Since

18

E. Ntentos et al.

we aimed only to ﬁnd one model that is able to specify all observed phenomena, and this was achieved, we consider this threat not to be a major issue for our study. The ground truth assessment might also be subject to diﬀerent interpretations by diﬀerent practitioners. For this purpose, we deliberately chose only a three-step ordinal scale, and given that the ground truth evaluation for each decision is fairly straightforward and based on best practices, we do not consider our interpretation controversial. Likewise, the individual metrics used to evaluate the presence of each pattern were deliberately kept as simple as possible, so as to avoid false positives and enable a technology-independent assessment. As stated previously, generalization to more complex systems might not be possible without modiﬁcation. But we consider that the basic approach taken when deﬁning the metrics is validated by the success of the regression models.

9

Conclusions and Future Work

Our approach considered that it is achievable to develop a method for automatically assessing coupling related tenets in microservice decisions based on a microservice system’s component model. We have shown that this is possible for microservice decision models that contain patterns and practices as decision options. In this work, we ﬁrst modeled the key aspects of the decision options using a minimal set of component model elements. These could be possibly automatically extracted from the source code. Then we derived at least one metric per decision option and used a small reference model set as a ground truth. We then used ordinal regression analysis for deriving a predictor model for the ordinal variable. The statistical analysis shows that each decision related metrics are quite close to the manual, pattern-based assessment. There are many studies related on metrics for component model and other architectures so far, but speciﬁcally for microservice architectures and their coupling related tenets have not been studied. Based on our discussion in Sect. 2, assessing microservice architectures using general metrics it is not very helpful. Our approach is one of the ﬁrst that studies a metrics-based assessment of coupling-related tenets in the microservices domain. We aim to a continuous assessment, i.e. we envision an impact on continuous delivery practices, in which the metrics are assessed with each delivery pipeline run, indicating improvement, stability, or deterioration in microservice architecture conformance. With small changes, our approach could also be applied, for instance, during early architecture assessment. As future work, we plan to study more decisions, tenets, and related metrics. We also plan to create a larger data set, thus better supporting tasks such as early architecture assessment in a project. Acknowledgments. This work was supported by: FFG (Austrian Research Promotion Agency) project DECO, no. 846707; FWF (Austrian Science Fund) project APIACE: I 4268.

Assessing Architecture Conformance

19

References 1. Allen, E.B., Gottipati, S., Govindarajan, R.: Measuring size, complexity, and coupling of hypergraph abstractions of software: an information-theory approach. Softw. Qual. J. (2), 179–212. https://doi.org/10.1007/s11219-006-9010-3 2. Allen, E.B., Gottipati, S., Govindarajan, R.: Measuring size, complexity, and coupling of hypergraph abstractions of software: an information-theory approach. Softw. Qual. J. 15, 179–212 (2006) 3. Bansiya, J., Davis, C.G.: A hierarchical model for object-oriented design quality assessment. IEEE Trans. Softw. Eng. 28(1), 4–17 (2002) 4. Basili, V.R., Briand, L.C., Melo, W.L.: A validation of object-oriented design metrics as quality indicators. IEEE Trans. Softw. Eng. 22(10), 751–761 (1996) 5. Bogner, J., Wagner, S., Zimmermann, A.: Towards a practical maintainability quality model for service-and microservice-based systems, pp. 195–198, September 2017 6. Chidamber, S.R., Kemerer, C.F.: A metrics suite for object oriented design. IEEE Trans. Softw. Eng. 20(6), 476–493 (1994) 7. Corbin, J., Strauss, A.L.: Grounded theory research: procedures, canons, and evaluative criteria. Qual. Sociol. 13, 3–20 (1990) 8. Frank, E., Harrell, J.: Regression Modeling Strategies: With Applications to Linear Models, Logistic and Ordinal Regression, and Survival Analysis, 2nd edn. Springer, Heidelberg (2013) 9. Garousi, V., Felderer, M., M¨ antyl¨ a, M.V.: Guidelines for including the grey literature and conducting multivocal literature reviews in software engineering. CoRR 10. Goldstein, M., Moshkovich, D.: Improving software through automatic untangling of cyclic dependencies. In: Association for Computing Machinery, New York, NY, USA (2014) 11. Harrison, R., Counsell, S.J., Nithi, R.V.: An evaluation of the mood set of objectoriented software metrics. IEEE Trans. Softw. Eng. 24(6), 491–496 (1998) 12. Haselb¨ ock, S., Weinreich, R., Buchgeher, G.: Decision models for microservices: design areas, stakeholders, use cases, and requirements. In: Lopes, A., de Lemos, R. (eds.) ECSA 2017. LNCS, vol. 10475, pp. 155–170. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-65831-5 11 13. Hohpe, G., Woolf, B.: Enterprise Integration Patterns. Addison-Wesley, Boston (2003) 14. Lewis, J., Fowler, M.: Microservices: a deﬁnition of this new architectural term, March 2004. http://martinfowler.com/articles/microservices.html 15. Newman, S.: Building Microservices: Designing Fine-Grained Systems. O’Reilly, Sebastapol (2015) 16. Pahl, C., Jamshidi, P.: Microservices: a systematic mapping study. In: 6th International Conference on Cloud Computing and Services Science, pp. 137–146 (2016) 17. Pautasso, C., Wilde, E.: Why is the web loosely coupled?: a multi-faceted metric for service design. In: 18th International Conference on World Wide Web, pp. 911–920. ACM (2009) 18. Richardson, C.: A pattern language for microservices (2017). http://microservices. io/patterns/index.html 19. Skowronski, J.: Best practices for event-driven microservice architecture (2019). https://hackernoon.com/best-practices-for-event-driven-microservice-architectur e-e034p21lk

20

E. Ntentos et al.

20. Taibi, D., Lenarduzzi, V.: On the deﬁnition of microservice bad smells. IEEE Softw. 35(3), 56–62 (2018) 21. Zdun, U., Navarro, E., Leymann, F.: Ensuring and assessing architecture conformance to microservice decomposition patterns. In: Maximilien, M., Vallecillo, A., Wang, J., Oriol, M. (eds.) ICSOC 2017. LNCS, vol. 10601, pp. 411–429. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-69035-3 29 22. Zimmermann, O.: Microservices tenets. Comput. Sci. - Res. Dev., 301–310 (2016). https://doi.org/10.1007/s00450-016-0337-0 23. Zimmermann, O., Stocker, M., Zdun, U., Luebke, D., Pautasso, C.: Microservice API patterns (2019). https://microservice-api-patterns.org

Formal Software Architectural Migration Towards Emerging Architectural Styles Nacha Chondamrongkul(B) , Jing Sun, and Ian Warren School of Computer Science, University of Auckland, Auckland, New Zealand [email protected], {jing.sun,i.warren}@auckland.ac.nz

Abstract. Software systems are evolved over time to enhance various qualities of the software system by adopting new technologies and principles. The architecture design is usually required to be migrated from one architectural style to another to support this adoption, while the key functionalities still need to be preserved. This paper presents a formal approach that supports architectural migration. Our approach automates refactoring the architectural design to support the exploitation of emerging technologies such as microservices and blockchain. With our approach, the refactored architectural design can be verified to ensure that the essential functional requirements are still preserved, and the design has complied with the behavioural constraints of new architectural styles. We have evaluated the accuracy and performance of our approach. The results prove that it performs reasonably well. Keywords: Software architecture Microservice · Blockchain

1

· Architectural migration ·

Introduction

Emerging technologies and principles have been applied to the software system to enhance the system qualities such as maintainability, scalability and security. Software engineers have been facing challenges to modernise the software system by migrating the existing legacy system to take advantage of new technologies and principles [1], such as continuous software engineering, microservices, containerisation and blockchain. Architectural patterns [13,15,19] have been proposed to give structures of how software architecture should be designed to support new technologies and principles. At the migration, software architecture usually needs to be refactored to convert from an architectural pattern to another [6], while the key functionalities need to be preserved [10]. This paper focuses on the architectural migration that modernises the software system to support emerging technologies [18]. As discussed in Sect. 5, some works [2,7,20] have been proposed to provide guidelines and patterns that support architectural migration. A number of tools [3,9,11,14,17] have been proposed to support this task. However, existing approaches focus either refactoring at the implementation level or supporting particular architectural styles. c Springer Nature Switzerland AG 2020 A. Jansen et al. (Eds.): ECSA 2020, LNCS 12292, pp. 21–38, 2020. https://doi.org/10.1007/978-3-030-58923-3_2

22

N. Chondamrongkul et al.

Moreover, most of the existing tools have not addressed how to ensure that the refactored design would preserve the system functionalities. There is still a lack of approaches that generally support the architectural migration by both refactoring and verifying the architectural design in an automatic manner. This paper presents a formal approach that supports architectural migration by refactoring and verifying the architecture design before the actual system migration can be performed. Our approach applies ontology reasoning and model checking technique. The contribution of this work can be summarised as follows: 1) The formal modelling of software architecture design is proposed to describe the structural and behavioural aspect of architectural design based on architectural styles. 2) A set of generic rules is presented to automate the architectural design refactoring towards new architectural styles that support emerging technologies and principles. 3) The veriﬁcation approach helps to ensure that key functional requirements are preserved in the refactored design, and the design has complied with the desired architectural styles. 4) We have evaluated the accuracy and performance of our approach with the architectural design of four real-world software systems. The rest of this paper is organised in the following sections. Section 2 shows how to model and refactor software architecture design. Section 3 presents the veriﬁcation approach and migration beneﬁts. Section 4 presents the evaluation result and the discussion. Section 5 discusses the related approaches in comparison to our approach. This paper is concluded in Section 6.

2

Formal Architectural Migration

The overall process of our approach is shown in Fig. 1. Firstly, the model of current architecture is analysed to identify parts of the design that require changes according to the refactoring rules. The refactoring rules help to select design entities such as connector and component that need to be changed according to the desired architectural styles to be applied. Secondly, the refactoring steps in the rules are performed to create the model of target architecture design. Lastly, the veriﬁcation has been performed on the model of target architecture design to ensure that the functional requirements are still preserved, and the target architecture design behaves according to the architectural styles. 2.1

Architecture Modelling

As this work focuses on refactoring logical architecture design for the architectural migration, our architecture model is based on Connector and Connector (C&C) view [8]. Both structure and behaviour of architecture design are signiﬁcant to refactor and verify the architecture design to support desired architectural styles [20]. Therefore, we apply ontology representation in Ontology Web Language (OWL) to capture the structural aspect of architecture design, according to the modelling proposed in [4]. The ontology model is expressive to

Formal Software Architectural Migration

23

1. Identify entities to refactor Current Architecture Design

Architecture Redesigner

Target Architecture Design

Refactoring Rules

2. Perform design refactoring Design Verifier

Verification Result

Functional Requirement

Pattern Constrainst

3. Verify target architecture

Fig. 1. Overall migration process

describe the concepts and their relationship in the knowledge domain. Architectural Description Language (ADL) is used to capture the behavioural aspect of architecture design according to Wright# language proposed in [5]. Wright# is expressive to describe the dynamic behaviour of components and connectors. In the ontology model, the design entities in C&C view such as component, connector, role and port are deﬁned as classes and their relationships are deﬁned as object properties. The components have ports representing its interface. The connector have roles representing parties involved in the interaction. To form the connectivities, ports are attached to one or more roles. The architectural patterns are formally deﬁned to support architectural styles based on this structure. The deﬁnition of architectural patterns describes speciﬁc structural and behavioural characteristics of diﬀerent types of component and connector1 . We have formally deﬁned basic architectural patterns such as Client-Server (CS), Publish-Subscribe (PS), and Data Repository (RP). To support modern architectural styles, architectural patterns such as Event-Sourcing (ES) and Command Query Responsibility Segregation (CQRS - CR for command and QR for query) [13] are also deﬁned for event-driven architecture that is commonly used in microservices. Oracle (OR) and Reverse-Oracle (RO) pattern are deﬁned for the blockchain-based architecture [19]. The structure of these architectural patterns can be found in the refactoring rules presented in Sect. 2.2. In this section, we use LifeNet as an illustrative example to present our approach. Figure 2 shows the current architecture design of LifeNet. The components are represented as rectangles and connectors are represented as rounded edged rectangles. LifeNet is a software system that assists patients with Alzheimer’s disease and related dementia. The system allows patients to send an emergency request from his wristband (Lifeband ) that notiﬁes the mobile application for medical staﬀ (Lifeguard ) and caregiver (Lifecare) to help. These requests are veriﬁed and handled by SOSGateway and RequestDispatcher component. There are data storage components such as EMCenter and PatientRecord, represented by cylinder box, to allow other components to query and update data. This model 1

The formal architectural patterns can be found at https://bit.ly/2vGv6qi.

24

N. Chondamrongkul et al.

applies three architectural patterns. Client-Server patterns are used for connectors such as soswire and dispatchwire. Publish-Subscribe patterns are used for notiwire connector. Data Repository patterns are used for connectors that query and update data such as emwire and emupwire. Port

responder

requester

Role

LifeBand

writestorage

emupdate

emupwire

soswire

alert

storage

emaccess

pgwire

BackOffice

storage

pgupdate

Hospital Mgmt

publisher notiwire

subscriber writer ptupwire

update writestorage

accept

pgaccess

Practitioner

pgupwire writestorage

responder

Request Dispatcher

reader

writer

dispatchwire

reader

writer

update

requester

emwire

EMCenter

SOS Gateway

sos

Patient Record

gnotify Lifeguard

ptupdate ptaccess

cnotify Lifecare

reader ptwire

storage

Fig. 2. LifeNet’s current design

The structural model of a software system are semantically deﬁned as ontology individuals based on the architectural pattern that are predeﬁned as the ontology class. Due to the page limit, we present some part of the LifeNet model2 . Listing 1.1 shows the deﬁnition of Patient component and its ports: ptaccess and ptupdate. ptaccess takes readstorage role of ptwire connector, which allows querying data. ptupdate takes writestorage role to allow updating data. Listing 1.1. LifeNet’s current structural model Individual(ex: Patient value(ex: hasPort ex: ptaccess, ptupdate)) Individual(ex: ptaccess value(ex: hasAttachment ex: readstorage)) Individual(ex: ptupdate value(ex: hasAttachment ex: writestorage)) Individual(ex: ptwire value(ex: hasRole ex: reader,readstorage) 2

The complete model in OWL of LifeNet can be found at https://bit.ly/3bhsCgO.

Formal Software Architectural Migration

25

The behavioural model are deﬁned to describe how the components are executed at runtime to serve the system functionalities. The description of diﬀerent connector type are deﬁned for the interactive behaviour of architectural patterns. Listing 1.2 shows some part of behavioural model of LifeNet3 . The CSConnector is the deﬁnition of a connector type for Client-Server. The components and their port are deﬁned such as LifeBand and SOSGateway. The system scope deﬁnes how component are interacting with each other. The connector such as soswire is declared, and used later in the attach statement to deﬁne what roles are attached to the component’s port. The execute statement deﬁnes what ports are executed during runtime. More details about Wright# can be found in [5]. Listing 1.2. LifeNet’s current behavioural model connector CSConnector { role requester(j) = request -> req!j -> res?j -> process -> Skip; role responder() = req?j -> invoke -> process -> res!j -> responder(); } component LifeBand { port alert() = onalert->alert();} component SOSGateway { port sos() = acknowledge->alert->sos();} system lifenet { declare soswire = CSConnector; attach LifeBand.alert() = soswire.requester(49); attach SOSGateway.sos() = soswire.responder() dispatchwire.requester(19); execute LifeBand.alert() || SOSGateway.sos() ...

2.2

Refactoring Rules

To create the target architecture design, we deﬁnes a set of refactoring rules to automate revising the current design. Each refactoring rule consists of ontology deﬁnition and refactoring routine. The ontology deﬁnition captures the logic to ﬁnd component or connector that needs to be changed. The refactoring routine is a sequence of steps that can programmatically change the design conﬁguration to support architectural patterns of desired architectural styles. These refactoring rules are generic and can be applied to the architectural design of any software system. The rules presented in this paper focuses on the migration to eventbased and blockchain-based architecture style. They represents the possible rules that cover most well-known cases and are frequently used for migrating basic architectural styles to modern styles. These refactoring rules are as follows: Event Centre. The connector that helps to process events should be changed to Event-Sourcing pattern. The ontology deﬁnition aims to select the PublishSubscribe connector (PSConnector) that has an inbound role subscribed to 3

The complete model in ADL of LifeNet can be found at https://bit.ly/2zhLbnN.

26

N. Chondamrongkul et al.

events to process. InboundRole represents any role that waits to be triggered. Below is the ontology deﬁnition followed by the refactoring routine. EventCentre ≡ PSConnector ∃ hasRole (InboundRole ∃ isAttachOf (Port EventProcessor )) a) Convert the type of connector to Event-Sourcing pattern (ESConnector) by roles mapping shown in Fig. 3; publisher is changed to event publisher and subscriber is changed to event subscriber. b) Add an component of type EventStore to logs incoming events. c) Connect the connector to the component in step b).

Publisher

Event Source

Event Store

publish

publisher

Subscriber

PSConnector

subscribe

event publisher

evwrite

evpublish event logger

ESConnector

Event Consumer

listen

event subscriber

subscriber

Fig. 3. Mapping of event centre

According to these steps, the LifeNet model can be refactored as shown in Fig. 4 a), the notiwire connector has been converted to Event-Sourcing connector and the RequestLog component is added as an event store component. accept

Request Dispatcher

eventpublisher

notiwire eventsubscriber

gnotify Lifeguard

eventstore

cnotify

a) Event Centre for LifeNet

logem

EMCenter

emaccess

read storage

log

Request Log

Lifecare

emupdate

EMCenter Command

commandstore

emupwire commander BackOffice

pgwire

reader

readstorage Practitioner

update

pgaccess

b) Command & Query for LifeNet

Fig. 4. LifeNet’s refactoring for event-based patterns

Event-Based Command. The connector that transfers the event to persist on a data storage should be converted to a commander in CQRS pattern. The ontology deﬁnition selects the repository writing connector (WRConnector) that

Formal Software Architectural Migration

27

is attached to a component, where the events are created and written to the data storage. EventCommand ≡ WRConnector ∃ hasRole (Writer ∃ isAttachOf (Port EventProcessor )) a) Convert the type of connector to CQRS’s command (CRConnector) as shown in Fig. 5; writer and storage role is changed to commander and command store, respectively. b) Add an component of type ViewDB to be a read-only data storage. c) Connect the connector to the component in step b).

Repository Pattern (Write)

CQRS Pattern (Command)

CQRS Pattern (Query)

Repository Pattern (Read)

Command Endpoint Read Endpoint DataReader DataWriter

read write read

issue reader commander data reader read writer

store

QRConnector REConnector WRConnector

readstore CRConnector storage update

store query storage log

command

store Repository

View DB access Command DB

Fig. 5. Mapping of Event-based CQRS

The LifeNet model can be refactored as shown in Fig. 4 b). The emupwire has been converted to a command connector for CQRS pattern and EMCenterCommand has been added as an command store. Event-Based Query. The connector that queries the events persisted on the data storage should be converted to a query processor of CQRS pattern. The ontology deﬁnition captures the connector that has an outbound role to query events on a data storage. EventQuery ≡ REConnector ∃ hasRole (Reader ∃ isAttachOf (Port EventProcessor )) a) Convert the type of connector to CQRS’s query (QRConnector) as shown in Fig. 5; data reader and storage role is changed to reader and query role of QRConnector respectively. b) Reroute the connector to the ViewDB component that is a read-only data storage connected to the command connector.

28

N. Chondamrongkul et al.

According to these steps, the LifeNet model can be refactored as shown in Fig. 4 b). The pgwire connector has been converted to query connector for CQRS pattern. This connector links EMCenter as a ViewDB component. Secure Data Writing. The repository component storing data that need to be tamper-proof can be converted to apply blockchain technology. The ontology deﬁnition selects the repository component (Repository) that its inbound port InboundPort) is data tampering-proof and write data to the storage. InboundPort represents any port that is attached to the inbound role. SecureDataWriter ≡ Repository ∃ hasPort(InboundPort DataTamperingProofPort ∃ hasAttachmentWriteStorage) a) b) c) d) e) f)

Add an oracle component (Oracle). Attach WriteStorage role to a port of oracle in step a). Add a connector of type IOConnector, which is an Oracle connector. Attach extsupplier role of connector in step c) to the port added in step b). Add a blockchain component (Blockchain). Connect the connector added in step c) to the blockchain component.

According to these steps, the LifeNet model can be refactored as shown in Fig. 6. MedRecord is added as an oracle that connects to a blockchain called MedChain with medupwire connector.

ptupwire

writestorage

extsupplier

medup wire

block storage

medfetch

MedRecord

seclog

Patient Record

blocksupplier MedChain

medshare

medwire

ptaccess

extqurier

Fig. 6. LifeNet’s refactoring to blockchain

Secure Data Reading. The blockchain should be applied at the component that allow data to be queries securely with tampering-proof control applied. The ontology deﬁnition selects the repository component (Repository) that its inbound port is data tampering-proof and allow data to be queried. SecureDataReader ≡ Repository ∃ hasPort(InboundPort DataTamperingProofPort ∃ hasAttachmentReadStorage) a) Convert the repository to a reverse oracle component (ReverseOracle) as shown in Fig. 7; access port is changed to blockquery port.

Formal Software Architectural Migration

29

b) Add a connector of type ROConnector, which is a Reverse-Oracle connector. c) Attach a port converted in step a) to the connector in step b). d) Connect the connector added in step b) to the blockchain component.

DataReader

reader read

storage

access

ROConnector extquerier

blocksupplier

Blockchain

blockquery

Repository

blocksupply

REConnector

Reverse Oracle

Repository Pattern

Reverse Oracle Pattern

Fig. 7. Mapping of secure data reading

As shown in Fig. 6 After these steps are applied, the PatientRecord is converted to a reverse oracle. The medwire is added to connect the Medchain that stores patient records on the blockchain.

3

Justification of Architectural Migration

After the architecture design has been refactored, it is veriﬁed to ensure that the functional requirements are preserved, and the target design is compiled according to the desired architectural patterns. The model checking technique has been applied to achieve this veriﬁcation. We also address beneﬁts to the system qualities that can be gained when new architectural styles are applied. 3.1

Verifying Functional Requirements

As the behavioural model has been expressed in ADL, we use Linear Temporal Logic (LTL) to deﬁne liveness properties that describe the key functional requirements of the software system. Liveness properties are generally used in the model checking technique to determine whether certain states occur. In this work, we use the liveness property to determine the sequence of events that occurred in the software system. The property (a → ♦b) check whether every event a eventually leads to event b). According to the modelling proposed in [5], the events triggered from the ports are labelled as [Comp.Prt.Evt], where Comp is name of a component, Prt is name of a port and Evt is name of event. The events triggered by the roles are labelled as [Comp.Conn.Rle.Evt], where Conn is name of a connector, Rle is name of a role and Evt is name of event. The following shows the functional properties that are deﬁned for LifeNet.

30

N. Chondamrongkul et al. (1) (SOSGateway.sos.acknowledge → ♦LifeBand.alert.onalert) (2) (RequestDispatcher .accept.dispatched → ♦SOSGateway.sos.acknowledge) (3) (RequestDispatcher .accept.dispatched → ♦EMCenter .emaccess.emaccessed) (4) (Lifeguard.gnotify.acknowledge → ♦Patient.ptaccess.ptaccessed) (5) (Lifecare.cnotify.acknowledge → ♦Patient.ptaccess.ptaccessed)

These properties can be explained as follows: (1) When the emergency is requested, the request is acknowledged and the wristband’s status is switched to alert mode, (2) After the emergency request is dispatched to process, the request is acknowledged by the system, (3) After the request is dispatched to process, the system must ﬁnd the emergency center in the proximity of patient location, (4) After the emergency staﬀ acknowledges the emergency request (through Lifeguard application), the patient record can be fetched and veriﬁed, (5) After the caregiver acknowledges the emergency request (through Lifecare application), the patient record can be fetched and veriﬁed. These functional requirement properties describe how the system should respond to the emergency request. They are speciﬁc to the software system and have been proved valid on the model of current architecture design. The same set of properties are used to verify the model of target architecture design. If the veriﬁcation results are proved valid, the key functional requirements are preserved. We used PAT [16] as a model checker to make a veriﬁcation in this work. Below are some part of the veriﬁcation results. All properties above are proved valid after we veriﬁed them against the behavioural model of the target architecture of LifeNet. In other words, the target architecture design is proved to support all key functionalities that the current design has. ∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗ Verification Result ∗ ∗ ∗ ∗ ∗ ∗ ∗ ∗ The Assertion (lifenet() |= 2(SOSGateway sos ... is VALID. The Assertion (lifenet() |= 2(RequestDispatcher accept ... is VALID.

3.2

Checking Architectural Pattern Constraints

The liveness properties are deﬁned to describe the behavioural constraints of the architectural pattern. Each refactoring rules has a corresponding constraint deﬁned as the following formulas. (1) ([EventPublisher ].[Prt].[Event] → ♦[EventStore].[EventCentre].eventstore.persist) (2) ([EventCommander ].[Prt].[Event] → ♦[CommandStore].[EventCommand].commandstore.process) (3) ([EventReader ].[Prt].[Event] → ♦[ViewDB].[EventQuery].readstore.process) (4) ([Oracle].[Prt].[Event] → ♦[Blockchain].[Conn].blockstorage.stored) (5) ([Blockchain].[Conn].blocksupplier .process → ♦[ReverseOracle].[Prt].[Event])

Formal Software Architectural Migration

31

Formula (1) for the Event Centre rule checks the behaviour of an EventSourcing connector EventCentre. This constraint ensures that every event is logged in the event store. EventPublisher is a component that its port (Prt) is attached to the event publisher role. EventStore represent the component that connects to the EventCentre connector. The eventstore.persist are role and event predeﬁned for the architectural pattern. Formula (2) is for the Event-based Command rule. It ensures that every incoming command logged in the command store. EventCommander is a component that its port (Prt) is attached to the event commander role. CommandStore represents a component that connects to the EventCommand connector. Formula (3) is for the Event-based Query rule. It checks if the event can be queried from the view storage. EventReader is a component that its port is attached to the reader role. ViewDB is read-only storage. Formula (4) is for the Secure Data Writing rule. It ensures that the data can be stored on the blockchain according to Oracle pattern. Oracle is an oracle component added to the system. Blockchain represents a blockchain component and Conn represents a Oracle connector. Formula (5) is for Secure Data Writing rule. Blockchain represents a blockchain component and Conn represent a Reverse-Oracle connector. ReverseOracle is a Reverse-Oracle component. To verify the target architecture design of LifeNet, below are properties that are deﬁned according to the formulas, respectively. After we have veriﬁed these properties deﬁned for the architectural constraints, the results are given as valid for all properties. These results prove that the target architecture design has complied with the desired architectural patterns. (1) (RequestDispatcher .accept.dispatched → ♦RequestLog.reqwire.eventstore.persist) (2) (BackOﬃce.updateproﬁle.updating → ♦EmCenterCommand .emupwire.commandstore.process) (3) (EmCenter .emaccess.emaccessed → ♦Practitioner .pgwire.readstore.process) (4) (MedRecord .seclog.ptcommit → ♦MedChain.medupwire.blockstorage.stored ) (5) (MedChain.medwire.blocksupplier .process → ♦Patient.ptaccess.ptaccessed )

3.3

Benefits of the Migration

After the architecture designs of subject systems have been migrated to support new architectural styles, the target architecture designs generally gain the following attributes. Maintainability. The event-based architecture style has been applied to mitigate cohesive interdependencies among components that are in the current architecture. This style changes component’s reliance on service of another to the events. In the target architecture designs, components can, therefore, be independently modiﬁed, deployed and restored. In LifeNet, after Event-Sourcing pattern has applied to notiwire connector as shown in Fig. 4 a), RequestDispatcher

32

N. Chondamrongkul et al.

chronologically stores all requests on the RequestLog. If the RequestDispatcher failed or went oﬀ-line, the restoration of system state can always be done by restoring the component’s execution based on the requests logged in the RequestLog. Scalability. As the target design embrace the loosely-coupled principle, each component can be self-contained in the container that allows the component to be deployed and executed individually. A prominence implementation of containers is Docker. Docker container is supported by tools like Kubernetes that can automatically scale the containers. In LifeNet, after the design has been refactored according to CQRS pattern as shown in Fig. 4 b), the query and update operation performs on separate storages; EMCenterCommand for logging update operation and EMCenter for query only. This enables us to optimise the read operation on EMCenter by caching or clustering to support higher loads. This optimisation can be achieved with less computational resources than traditional data repository. Availability. The data repositories become decentralised as the data are replicated dispersedly on blockchain nodes. The decentralisation enhances the availability of data access, as there is no single point of failure. When a node fails, the data can be accessible through other available nodes. The transactions on data are also transparent, as the users with provided authority can always view the transaction. In the refactored design of LifeNet shown in Fig. 6, as the patient records are stored on MedChain, the data are replicated on every nodes joining in the blockchain network. If a node managed by a hospital fails, the data can be fetched or recovered from other nodes. Also, the patient records can be fetched and traced by any hospital that joins this blockchain. Data-Tampering Proof. In term of security, the data structure in the blockchain is append-only, so data cannot be deleted or altered. The consensus of all the ledger participants is required to record data in the blockchain. As the target architecture design applies the architectural patterns that support the interaction to the blockchain, suﬃcient security control is required to secure the components that have direct communication with the blockchain network. In the refactored design of LifeNet, updating data requires a consensus among hospitals in the networks, so data can only be updated through nodes in the blockchain network by MedRecord component. Therefore, MedRecord must apply security controls such as authorisation and ﬁrewall to prevent malicious traﬃc.

4

Evaluation

This evaluation aims to assess the accuracy and performance of our approach. As there is no benchmark set to evaluate our approach, the architectural design of four real-world software systems are selected to create the models used in this evaluation, namely AgriDigital (AD), Life Net (LN), Sock Shop (SS), Supply Chain (SC). The details of their current architecture design are shown in Table 1. The Cur.Ptn. Column denotes the architectural patterns that are applied in the

Formal Software Architectural Migration

33

current architecture designs. Tgt. Ptn. denotes the patterns that of the target architecture designs to apply. The C&C column shows the number of component and connector. The Req column shows the number of key functional requirements that have been used to verify the design. The current architecture design of these systems is based on the monolithic architecture that needs the migration to the event-based microservice architecture to embrace loosely-coupling principle. Also, some data are to be securely persisted on the blockchain. Table 1. Subject systems System

Cur. Ptn.

Agri Digital Agriculture asset tracking system

CS, PS, RP OR, RO, ES

20

5

Life Net Medical emergency system

CS, PS, RP OR, RO, ES, CR, QR 19

5

Sock Shop E-Commerce System

CS, RP

CR, QR

22

5

Supply Chain Business process management

CS, RP

OR, RO

18

6

4.1

Tgt. Ptn.

C&C Req

Experiment Setup

This evaluation aims at assessing the completeness and soundness of the refactoring rules, and has been conducted according to the following process. Firstly, we deﬁned the models of current architecture design and functional requirements using ArchModeller4 . ArchModeller is a graphical user interface that allows users to seamlessly draw a graphical diagram for software architecture design and make a veriﬁcation. The tool can automatically converts graphical diagrams into the models in OWL and ADL for the veriﬁcation. Secondly, all ﬁve refactoring rules have been implemented in the tool to refactor the current architecture model and create the target architecture model5 . Thirdly, the functional requirement properties have been used to verify the target architecture model. Fourthly, we deﬁned the pattern constraint properties to verify the target architecture model. Lastly, we have gathered the veriﬁcation results to determine the completeness and soundness of refactoring rules. Each rule is assessed by the combination of veriﬁcation result of its pattern constraints property and all functional requirement properties. If the functional requirements properties have any invalid result, the refactoring routine alters the system in the way that it incorrectly performs functionalities. If the pattern-constraint property is proved invalid, the refactoring routine may consist of steps that incorrectly refactor the design, according to the architectural pattern. The properties that are proved valid are considered as true-positive (Tpos ), and invalid properties are considered as false-positive 4 5

Arch Modeller can be found at http://bit.ly/2m3LITT. The implementation of refactoring rules can be found at https://bit.ly/2WEHaTw.

34

N. Chondamrongkul et al.

(Fpos ). We also manually veriﬁed the target architecture design to ﬁnd the falsenegative (Fneg ) result that is any part of the design that requires refactoring but not covered by the refactoring rules. Below is how we calculated the measurements, namely precision and recall that were used to evaluate the soundness and completeness of our approach. Precision =

Tpos Tpos + Fpos

Recall =

Tpos Tpos + Fneg

To assess the performance of our approach, we gathered the performance statistic of the properties veriﬁcation against the models of target architecture design. The performance statistics include processing time in milliseconds and the number of visited states. This performance evaluation was carried out using an Intel Core i7-7500U CPU @ 2.7 GHz with 8.00 GB Ram computer. 4.2

Evaluation Result

The result of accuracy evaluation6 has been summarised in Table 2. This table classiﬁes the results according to the rules presented in Sect. 2.2. The ptn denotes the veriﬁcation result of pattern constraint properties. The fnc denotes the veriﬁcation result of functional requirement properties. The numbers that are shown in ptn and fnc column are the precision rate; - means that the rule is not used in that system, as not all patterns are applied to the design. The msn column indicates the false-negative result as the number of the entities that need refactoring but not yet refactored. Overall, all refactoring rules have the precision rate of 100%. The recall rates are also 100% as we have not found any false-negative result. Table 2. Evaluation results Rule

AD

LN

SS

SC

ptn fnc msn ptn fnc msn ptn fnc msn ptn fnc msn #1 Event Centre

1.0 1.0 0

1.0 1.0 0

–

–

–

–

–

1.0 1.0 0

1.0 1.0 0

–

–

–

1.0 1.0 0

1.0 1.0 0

–

–

–

1.0 1.0 0

–

–

–

1.0 1.0 0

–

–

–

1.0 1.0 0

#2 Event Command –

–

–

#2 Event Query

–

–

–

#4 Secure Write

1.0 1.0 0

#5 Secure Read

1.0 1.0 0

1.0 1.0 0

–

The result of the performance evaluation is presented in Fig. 8. This graph illustrates the veriﬁcation performance of LifeNet model, as it applies all ﬁve refactoring rules. The properties prop1 to prop5 are functional requirement properties, while const1 to const5 are the pattern constraint properties. The 6

The models and evaluation results can be found at https://bit.ly/3ft3gQv.

Formal Software Architectural Migration

35

veriﬁcation generally took less than 0.3 s. The processing times for the veriﬁcation of functional requirements have no signiﬁcant diﬀerence. The const2 that was used to verify the event command of CQRS pattern took the most processing time, while the const5 that was used to verify the Reverse-Oracle pattern took the least processing time. We have found the same trend in the other models. 4.3

Discussion

According to the evaluation result, the refactoring rules have been proved to be soundness as they have been used to refactor the subject systems, while their key functional requirements are still preserved. As all design entities that require changes are refactored, our refactoring rules are proved to be complete. However, this evaluation focuses on the migration to only event-based microservice and blockchain-based architecture, so the migration to other architectural styles requires more refactoring rules to be deﬁned by software architect. The veriﬁcation performance is acceptable for the size of model used in this evaluation, however more comprehensive evaluation needs to be conducted to better understand how it performs with diﬀerent sizes of model.

Fig. 8. Performance result

Other advantages of our approach are with the formal modelling of software architecture design. As the formal model facilitates the application of formal techniques, the ontology reasoning and model checking can be extended to support other aspects of veriﬁcation. For example, the ontology deﬁnition can be deﬁned to represent the security characteristics that measure and compare security between current and target architecture designs. Also, the model checking technique can be used to trace diﬀerent execution scenarios to determine security vulnerability or design ﬂaws.

5

Related Work

Fowler [7] presented a principle of how we can refactor the source code to eliminate design ﬂaws. Most of these design ﬂaws pose diﬃculties in maintaining the software system. Luca et al. [14] and Yun et al. [11] proposed automated

36

N. Chondamrongkul et al.

approaches that suggest refactoring plans using search-based algorithm. However, these tools are performed on the source code and rely on the user feedback, unlike our approach that focuses at the architectural design level and rely on the formal logics. Tanhaei et al. [17] presented a framework for refactoring the architecture of software product lines (SPLs). Their approach aims to keep SPLs consistent with the feature model consisted of a set of similarities and diﬀerences. However, this approach requires a custom algorithm to reason and check the models, unlike our approach that is based on standard techniques such as ontology reasoning and model checking. Some guidelines such as Zimmermann [20], [2] have been proposed to support the migration to modern architecture style such as microservice and cloud-based software system. Holmes and Zdun [9] proposed an automated architectural refactoring approach for security and availability requirements. Their approach has applied formalised architectural knowledge in model veriﬁcation and transformation. Their work focuses particularly on the infrastructure of cloud application, while our approach generally focuses on the logical view of architecture design with diﬀerent architectural styles. Nunes et al. [12] proposed a complete workﬂow to support the migration to microservice architecture. Their approach applies the static analysis of the source code that is limited to a development framework. Bucchiarone et al. [3] proposed a model-driven approach for the automatic migration to microservice architecture. A meta-programming framework has been applied to transform the source code and generate the container conﬁguration; however, it does not address how to ensure that the migrated system would correctly operate. Overall, none of the existing works proposes a set of generic rules that support restructuring and verifying the design based on the architectural style. The architectural styles allow technologies to be exploited in the system implementation to take various advantages, as discussed in Sect. 3.3. Also, none has proposed to apply the formal technique to verify the target architecture design, so it is diﬃcult to extend the veriﬁcation to check other properties as the existing approaches rely on custom algorithms hard-coded in the tools. The veriﬁcation in our approach can be extended by deﬁning liveness property in LTL to describe other functional requirements or architectural pattern constraints.

6

Conclusion

In this paper, we introduce the formal approach to support architectural migration towards emerging architectural styles. A set of generic rules has been proposed to automate the refactoring and checking the architectural design according to the functional requirements and architectural styles. The veriﬁcation ensures that the key functional requirements are still preserved after the design has been changed. Also, it ensures that the migrated system behaves according to the desired architectural styles. With the new architectural styles applied, new technologies can be exploited at the system implementation to enhance the system’s qualities. The evaluation has proved that the refactoring rules are valid and practical to perform architectural migration for the subject systems.

Formal Software Architectural Migration

37

As architectural migration is a part of system migration, we will extend this approach to automatically plan incremental migration steps that support system migration towards the target architecture.

References 1. Balalaie, A., Heydarnoori, A., Jamshidi, P.: Migrating to cloud-native architectures using microservices: an experience report. In: Celesti, A., Leitner, P. (eds.) ESOCC Workshops 2015. CCIS, vol. 567, pp. 201–215. Springer, Cham (2016). https://doi. org/10.1007/978-3-319-33313-7 15 2. Balalaie, A., Heydarnoori, A., Jamshidi, P., Tamburri, D., Lynn, T.: Microservices migration patterns. Softw.: Pract. Exp. 48, 2019–2042 (2018) 3. Bucchiarone, A., Soysal, K., Guidi, C.: A model-driven approach towards automatic migration to microservices. In: Bruel, J.-M., Mazzara, M., Meyer, B. (eds.) DEVOPS 2019. LNCS, vol. 12055, pp. 15–36. Springer, Cham (2020). https://doi. org/10.1007/978-3-030-39306-9 2 4. Chondamrongkul, N., Sun, J., Warren, I.: Ontology-based software architectural pattern recognition and reasoning. In: 30th International Conference on Software Engineering and Knowledge Engineering (SEKE), pp. 25–34, June 2018 5. Chondamrongkul, N., Sun, J., Warren, I.: PAT approach to architecture behavioural verification. In: 31th International Conference on Software Engineering and Knowledge Engineering, pp. 187–192, July 2019 6. Ford, N., Parsons, R., Kua, P.: Building Evolutionary Architectures: Support Constant Change. O’Reilly Media Inc., Sebastopol (2017) 7. Fowler, M.: Refactoring: Improving the Design of Existing Code. A Martin Fowler Signature Book. Addison-Wesley, Boston (2019) 8. Garlan, D., et al.: Documenting Software Architectures: Views and Beyond, 2nd edn. Addison-Wesley Professional, Boston (2010) 9. Holmes, T., Zdun, U.: Refactoring architecture models for compliance with custom requirements. In: Proceedings of the 21th ACM/IEEE MODELS 2018, pp. 267– 277. Association for Computing Machinery, New York (2018) 10. Klettke, M., Thalheim, B.: Evolution and migration of information systems. In: Embley, D., Thalheim, B. (eds.) Handbook of Conceptual Modeling, pp. 381–419. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-15865-0 12 11. Lin, Y., Peng, X., Cai, Y., Dig, D., Zheng, D., Zhao, W.: Interactive and guided architectural refactoring with search-based recommendation. In: Proceedings of the 2016 24th ACM SIGSOFT International Symposium on Foundations of Software Engineering FSE 2016, pp. 535–546. Association for Computing Machinery, New York (2016) 12. Nunes, L., Santos, N., Rito Silva, A.: From a monolith to a microservices architecture: an approach based on transactional contexts. In: Bures, T., Duchien, L., Inverardi, P. (eds.) ECSA 2019. LNCS, vol. 11681, pp. 37–52. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-29983-5 3 13. Richards, M.: Software Architecture Patterns. O’Reilly Media Inc., Sebastopol (2015) 14. Rizzi, L., Fontana, F.A., Roveda, R.: Support for architectural smell refactoring. In: Proceedings of the 2nd International Workshop on Refactoring IWoR 2018, pp. 7–10. Association for Computing Machinery, New York (2018) 15. Stopford, B.: Designing Event-Driven Systems. O’Reilly, Sebastopol (2018)

38

N. Chondamrongkul et al.

16. Sun, J., Liu, Y., Dong, J.S., Chen, C.: Integrating specification and programs for system modeling and verification. In: Proceedings of the Third IEEE International Symposium on Theoretical Aspects of Software Engineering (TASE 2009), pp. 127–135. IEEE Computer Society (2009) 17. Tanhaei, M., Habibi, J., Mirian-Hosseinabadi, S.H.: A feature model based framework for refactoring software product line architecture. J. Comput. Sci. Technol. 31, 951–986 (2016) 18. Ulrich, W.: Introduction to architecture-driven modernization. In: Ulrich, W.M., Newcomb, P.H. (eds.) Information Systems Transformation, pp. 3–34. The MK/OMG Press, Morgan Kaufmann, Boston (2010) 19. Xu, X., Pautasso, C., Zhu, L., Lu, Q., Weber, I.: A pattern collection for blockchainbased applications. In: Proceedings of the 23rd European Conference on Pattern Languages of Programs EuroPLoP 2018. Association for Computing Machinery, New York (2018) 20. Zimmermann, O.: Architectural refactoring for the cloud: a decision-centric view on cloud migration. Computing 99, 129–145 (2017)

Monolith Migration Complexity Tuning Through the Application of Microservices Patterns Jo˜ao Franscisco Almeida and Ant´ onio Rito Silva(B) INESC-ID/Department of Computer Science and Engineering, Instituto Superior T´ecnico, Av. Rovisco Pais 1, 1049-001 Lisbon, Portugal {joao.santos.almeida,rito.silva}@tecnico.ulisboa.pt

Abstract. The microservices architecture has become mainstream for the development of business applications because it supports the adaptation of scalability to the type of demand, but, most importantly, because it fosters an agile development process based on small teams focused on the product. Therefore, there is the need to migrate the existing monolith systems to microservices. Current approaches to the identification of candidate microservices in a monolith neglect the cost of redesigning the monolith functionality due to the impact of the CAP theorem. In this paper we propose a redesign process, guided by a set of complexity metrics, that allows the software architect to analyse and redesign the monolith functionality given a candidate decomposition. Both, the redesign process and the metrics are evaluated in the context of candidate decompositions of two monolith systems. Keywords: Microservices architecture · Monolith migration Complexity metrics · Microservices patterns

1

·

Introduction

Microservices architecture emerged due to the need to have highly available and scalable systems that can be developed by multiple teams in an agile environment. This is achieved through the deﬁnition of independently deployable distributed systems, implemented around business capabilities. As the monolith application size increases, it imposes several drawbacks as the lack of agility, modiﬁability and deployablity. As a consequence there is the need to migrate monolith systems to a microservices architecture. However, this transition imposes a cost because the application cannot preserve the behavior that existed in the monolith. This is due to the introduction of distributed transactions, as the monolith functionalities will be implemented through multiple independent microservices (transactions). Therefore, transaction management is more complex in a microservice architecture because transactions cannot be executed according to the ACID (Atomicity, Consistency, Isolation, Durability) properties, which introduces extra complexity for developers c Springer Nature Switzerland AG 2020 A. Jansen et al. (Eds.): ECSA 2020, LNCS 12292, pp. 39–54, 2020. https://doi.org/10.1007/978-3-030-58923-3_3

40

J. F. Almeida and A. R. Silva

to handle. This extra complexity is explained by the CAP theorem [6], where the decision to maintain the same level of consistency as in a monolith can only be achieved through the application of a two-phase commit protocol, which does not scale with many local transactions. To solve this problem, the use of sagas [5] was suggested, in the context of the microservices architecture [11,13], as the main alternative to the two-phase commit protocol to handle distributed transactions. On the other hand, the API Gateway pattern has been proposed [11,13] to implement queries in a distributed system. The SAGA pattern can be applied to functionalities that create or update data, and consist in dividing a transaction in multiple local transactions, where each local transaction is executed inside a single service following the ACID properties. A saga can have two diﬀerent structures: (1) choreography where the decision and sequencing is distributed through the saga participants, or (2) orchestration, where the decision and sequencing is decided in one orchestractor class, inside a cluster. Independently of the structure, the usage of sagas can guarantee the properties atomicity, consistency, and durability but cannot ensure the isolation property [13]. The lack of isolation can generate anomalies such as: (1) lost updates - when a saga overwrites data without reading changes performed by others sagas, (2) dirty reads - when a saga reads data changed by others sagas that have not been committed, (3) nonrepeatable reads - a saga reads the same data twice and gets diﬀerent results. To correct this anomalies there is a set of countermeasures that can be applied. One of them is the semantic lock that corrects these errors by creating intermediate states as application-level locks that indicate if an entity was written by one saga, alerting others concurrent sagas to these events. It can be integrated with the SAGA pattern typically indicating the current saga state. Because now all the functionalities must be aware of the semantic lock, this imposes extra complexity to the functionalities design and implementation. It also adds complexity to queries, that have to handle the possible combination of semantic locks of the data that they are integrating. Due to the lack of isolation, the saga local transactions can be of three types: (1) pivot - a transaction that if succeeds then the saga is going to succeed; (2) retriable - transactions that occur after the pivot transactions, do not rollback; and (3) compensatable - the transactions that may have to rollback. In a saga there is at most one pivot transaction, and all transactions that are not retriable nor the pivot transaction, are compensatable. In previous work [12,14], a tool was developed that collects information from monolith systems and, based on similarity measures, suggests a microservice candidate decomposition. Its level of complexity can be assessed through a complexity metric that determines the decomposition complexity based on the mean of its functionalities complexity. This metric calculates the impact that the relaxing of atomic transactional behaviour has on the redesign and implementation of the functionalities. In this paper we leverage on the previous work by, given a decomposition and a complexity value, support the redesign of the functionalities and queries, applying the SAGA and the API Gateway patterns, while

Monolith to Microservices

41

the complexity value is tuned, because there is more precise information on the decomposition. With our approach we intend to answer the following questions: – RQ1. What set of operations can be provided to the architect such that the functionalities can be redesigned by applying the SAGA pattern? – RQ2. Is it possible to reﬁne the complexity value associated with the monolith migration when there is additional information about the functionalities redesign? This paper makes two contributions. First, we deﬁne a set of operations that the architect can apply to the initial execution ﬂow of a monolith functionality, such that it can be transformed in a microservices execution ﬂow based on the SAGA pattern. Second, we deﬁne new metrics that provide a more precise value on the cost of the migration, due to the inclusion of the information about the application of the SAGA pattern. In the next section we present the set of operations used in the redesign. In Sect. 3 we present the new complexity metrics. In Sect. 4 we evaluate our work in the context of two monolith systems. Section 5 addresses the related work and Sect. 6 the conclusions.

2

Functionality Redesign

Definition: Monolith. A monolith is a pair (F, E), where F represents its set of functionalities, the functionalities are represented with lower case f , and E represents its set of domain entities, which are accessed by the functionalities, the domain entities are represented with lower case e. The entities are accessed by the functionalities in two modes, read and write. Therefore, M = {r, w} represents the access modes in a monolith, and an access is a pair domain entity access mode, represented by (e, m). The accesses of a functionality f are represented as a sequence of accesses s, where S represents all the sequences of accesses done in the monolith by its functionalities to the domain entities, f.sequence denotes the sequence of access of functionality f , s.entities denotes the entities accessed in sequence s. It is also deﬁned the auxiliary function entities(s : S, m : M ) : 2E , as entities(s, m) = {e ∈ E : (e, m) ∈ s}, which returns the entities accessed in s in mode m. When a monolith is decomposed into a set of candidate microservices, each candidate microservice is a cluster of domain entities. Definition: Monolith Decomposition. A monolith decomposition into a set of candidate microservices is deﬁned by a set of clusters C of the monolith domain entities, where each cluster c represents a candidate microservice and c.entities denotethe domain entities in cluster c, such that all domain entities are in a cluster, c∈C c.entities = E, and in a single one, ∀ci =cj ∈C ci .entities ∩ cj .entities = ∅.

42

J. F. Almeida and A. R. Silva

Given e ∈ E and a decomposition C, e.cluster denotes the entity’s cluster, and given a set of entities E ⊆ E, E .cluster = {c ∈ C : ∃e∈E e ∈ c.entities} denotes its set of entities clusters. Given a monolith candidate decomposition, the monolith functionalities are decomposed into a set of local transactions, where each local transaction corresponds to the ACID execution of part of the functionality domain entity accesses in the context of a candidate microservice. Definition: Functionality Decomposition. A monolith functionality f is decomposed, in the context of a candidate decomposition C, by a sequence of sequences of access to domain entities, denoted by f.subsequences, where all domain entities in a subsequence are in the same cluster, ∀s∈f.subsequences ∃c∈C : s.entities ⊆ c.entities, two consecutive subsequences occur in diﬀerent clusters, ∀0≤i