An Empirical Study on Parallelism in Modern Open-Source Projects
-
Tagung:
Konferenzartikel
-
Autoren:
Marc Aurel Kiefer
Daniel Wurzel
Walter F. Tichy -
Summary
Writing parallel programs is hard, especially for inexperienced programmers. Parallel language features are still being added on a regular basis to most modern object-oriented languages and this trend is likely to continue. Being able to support developers with tools for writing and optimizing parallel programs requires a deep understanding of how programmers approach and implement parallelism.
We present an empirical study of 135 parallel open-source projects in Java, C# and C++ ranging from small (< 1000 lines of code) to very large (> 2M lines of code) codebases. We examine the projects to find out how language features, synchronization mechanisms, parallel data structures and libraries are used by developers to express parallelism. We also determine which common parallel patterns are used and how the implemented solutions compare to typical textbook advice.
The results show that similar parallel constructs are used equally often across languages, but usage also heavily depends on how easy to use a certain language feature is. Patterns that do not map well to a language are much rarer compared to other languages. Bad practices are prevalent in hobby projects but also occur in larger projects.
-
Jahr:
2015
- Links:
Titel Vorname Nachname |
---|
Prof. em. Dr. Walter F. Tichy |
M.Sc. Marc Aurel Kiefer |
Bibtex
@inproceedings{,
author={Marc Aurel Kiefer, Daniel Warzel, Walter F. Tichy},
title={An Empirical Study on Parallelism in Modern Open-Source Projects},
year=2015,
month=October,
booktitle={Second Workshop on Software Engineering for Parallel Systems (SEPS)},
url={https://ps.ipd.kit.edu/downloads/},
abstract={Writing parallel programs is hard, especially for inexperienced programmers. Parallel language features are still being added on a regular basis to most modern object-oriented languages and this trend is likely to continue. Being able to support developers with tools for writing and optimizing parallel programs requires a deep understanding of how programmers approach and implement parallelism.
We present an empirical study of 135 parallel open-source projects in Java, C# and C++ ranging from small (< 1000 lines of code) to very large (> 2M lines of code) codebases. We examine the projects to find out how language features, synchronization mechanisms, parallel data structures and libraries are used by developers to express parallelism. We also determine which common parallel patterns are used and how the implemented solutions compare to typical textbook advice.
The results show that similar parallel constructs are used equally often across languages, but usage also heavily depends on how easy to use a certain language feature is. Patterns that do not map well to a language are much rarer compared to other languages. Bad practices are prevalent in hobby projects but also occur in larger projects.},
}