An exploratory study of the evolution of communicated information about the execution of large software systems (bibtex)
by Shang, Weiyi, Jiang, Zhen Ming, Adams, Bram, Hassan, Ahmed E., Godfrey, Michael W., Nasser, Mohamed and Flora, Parminder
Abstract:
Substantial research in software engineering focuses on understanding the dynamic nature of software systems in order to improve software maintenance and program comprehension. This research typically makes use of automated instrumentation and profiling techniques after the fact, that is, without considering domain knowledge. In this paper, we examine another source of dynamic information that is generated from statements that have been inserted into the code base during development to draw the system administrators' attention to important run-time phenomena. We call this source communicated information (CI). Examples of CI include execution logs and system events. The availability of CI has sparked the development of an ecosystem of Log Processing Apps (LPAs) that surround the software system under analysis to monitor and document various run-time constraints. The dependence of LPAs on the timeliness, accuracy and granularity of the CI means that it is important to understand the nature of CI and how it evolves over time, both qualitatively and quantitatively. Yet, to our knowledge, little empirical analysis has been performed on CI and its evolution. In a case study on two large open source and one industrial software systems, we explore the evolution of CI by mining the execution logs of these systems and the logging statements in the source code. Our study illustrates the need for better traceability between CI and the LPAs that analyze the CI. In particular, we find that the CI changes at a high rate across versions, which could lead to fragile LPAs. We found that up to 70% of these changes could have been avoided and the impact of 15% to 80% of the changes can be controlled through the use of robust analysis techniques by LPAs. We also found that LPAs that track implementation-level CI (e.g. performance analysis) and the LPAs that monitor error messages (system health monitoring) are more fragile than LPAs that track domain-level CI (e.g. workload modelling), because the latter CI tends to be long-lived. Copyright \$\backslash$copyright\ 2013 John Wiley & Sons, Ltd.
Reference:
An exploratory study of the evolution of communicated information about the execution of large software systems (Shang, Weiyi, Jiang, Zhen Ming, Adams, Bram, Hassan, Ahmed E., Godfrey, Michael W., Nasser, Mohamed and Flora, Parminder), In Journal of software: Evolution and Process, volume 26, 2014.
Bibtex Entry:
@article{SMR:SMR1579,
abstract = {Substantial research in software engineering focuses on understanding the dynamic nature of software systems in order to improve software maintenance and program comprehension. This research typically makes use of automated instrumentation and profiling techniques after the fact, that is, without considering domain knowledge. In this paper, we examine another source of dynamic information that is generated from statements that have been inserted into the code base during development to draw the system administrators' attention to important run-time phenomena. We call this source communicated information (CI). Examples of CI include execution logs and system events. The availability of CI has sparked the development of an ecosystem of Log Processing Apps (LPAs) that surround the software system under analysis to monitor and document various run-time constraints. The dependence of LPAs on the timeliness, accuracy and granularity of the CI means that it is important to understand the nature of CI and how it evolves over time, both qualitatively and quantitatively. Yet, to our knowledge, little empirical analysis has been performed on CI and its evolution. In a case study on two large open source and one industrial software systems, we explore the evolution of CI by mining the execution logs of these systems and the logging statements in the source code. Our study illustrates the need for better traceability between CI and the LPAs that analyze the CI. In particular, we find that the CI changes at a high rate across versions, which could lead to fragile LPAs. We found that up to 70{\%} of these changes could have been avoided and the impact of 15{\%} to 80{\%} of the changes can be controlled through the use of robust analysis techniques by LPAs. We also found that LPAs that track implementation-level CI (e.g. performance analysis) and the LPAs that monitor error messages (system health monitoring) are more fragile than LPAs that track domain-level CI (e.g. workload modelling), because the latter CI tends to be long-lived. Copyright {\{}$\backslash$copyright{\}} 2013 John Wiley {\&} Sons, Ltd.},
author = {Shang, Weiyi and Jiang, Zhen Ming and Adams, Bram and Hassan, Ahmed E. and Godfrey, Michael W. and Nasser, Mohamed and Flora, Parminder},
doi = {10.1002/smr.1579},
isbn = {9780769545820},
issn = {20477481},
journal = {Journal of software: Evolution and Process},
keywords = {Communicated information,Execution log analysis,Reverse engineering,Software evolution,cocome_lit-review},
mendeley-tags = {cocome_lit-review},
number = {1},
pages = {3--26},
pmid = {67195556},
title = {{An exploratory study of the evolution of communicated information about the execution of large software systems}},
url = {http://dx.doi.org/10.1002/smr.1579},
volume = {26},
year = {2014}
}
Powered by bibtexbrowser