Upload
mircealungu
View
501
Download
1
Embed Size (px)
DESCRIPTION
The slides of the presentation I delivered at SCAM 2014 in Victoria.
Citation preview
Analysing !Multi-Language Corpora
Pangea
Andrea Caracciolo, Andrei Chis, Boris Spasojević,!Mircea Lungu!
Software Composition Group University of Bern
SwitzerlandSeptember, 2014
with
Pangea
streamlines empirical studies on (multi-language) software corpora.
http://scg.unibe.ch/research/pangea
FAMIX meta-model !gives !
language-independence
–Andrea Caracciolo
“Average class hierarchy height (AHH) is the same across programming languages.”
https://github.com/caracciolo/pangea/blob/master/demo/hierarchy-depth.st
–Boris Spasojević
“The usage of Thread API is very skewed towards a few heavily used
methods.”
https://github.com/caracciolo/pangea/blob/master/demo/thread-stats.st
–Mircea Lungu
“Method names are often long multi-word sentences.”
https://github.com/caracciolo/pangea/blob/master/demo/method-length-distibution.st
JavaCorpusSources
Smalltalk CorpusSources
…
Executable Object Model Snapshots
AnalysisDesign
4
1
Java CorpusModels
…
…
5Deployment
Tool
2
Pangea Tools
3
Smalltalk CorpusModels
More about the architecture in the SCAM 2014 paper
“It is a capital mistake to
theorise before one has data”
http://scg.unibe.ch/research/pangea