On the Statistical Distribution of Object-Oriented System Properties

Herraiz Tabernero, Israel and Rodriguez, Daniel and Harrison, Rachel (2012). On the Statistical Distribution of Object-Oriented System Properties. In: "3rd International Workshop on Emerging Trends in Software Metrics (WETSoM 2012)", 03/06/2012 - 03/06/2012, Zurich, Suiza.

Description

Title: On the Statistical Distribution of Object-Oriented System Properties
Author/s:
  • Herraiz Tabernero, Israel
  • Rodriguez, Daniel
  • Harrison, Rachel
Item Type: Presentation at Congress or Conference (Article)
Event Title: 3rd International Workshop on Emerging Trends in Software Metrics (WETSoM 2012)
Event Dates: 03/06/2012 - 03/06/2012
Event Location: Zurich, Suiza
Title of Book: Proceedings of 3rd International Workshop on Emerging Trends in Software Metrics (WETSoM 2012)
Date: May 2012
Subjects:
Faculty: E.T.S.I. Caminos, Canales y Puertos (UPM)
Department: Matemática e Informática Aplicadas a la Ingeniería Civil [hasta 2014]
Creative Commons Licenses: Recognition

Full text

[thumbnail of paper.pdf]
Preview
PDF - Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (215kB) | Preview

Abstract

The statistical distributions of different software properties have been thoroughly studied in the past, including software size, complexity and the number of defects. In the case of object-oriented systems, these distributions have been found to obey a power law, a common statistical distribution also found in many other fields. However, we have found that for some statistical properties, the behavior does not entirely follow a power law, but a mixture between a lognormal and a power law distribution. Our study is based on the Qualitas Corpus, a large compendium of diverse Java-based software projects. We have measured the Chidamber and Kemerer metrics suite for every file of every Java project in the corpus. Our results show that the range of high values for the different metrics follows a power law distribution, whereas the rest of the range follows a lognormal distribution. This is a pattern typical of so-called double Pareto distributions, also found in empirical studies for other software properties.

More information

Item ID: 10701
DC Identifier: https://oa.upm.es/10701/
OAI Identifier: oai:oa.upm.es:10701
Official URL: http://agile.diee.unica.it/wetsom2012/
Deposited by: Israel Herraiz
Deposited on: 17 Apr 2012 09:37
Last Modified: 20 Apr 2016 18:56
  • Logo InvestigaM (UPM)
  • Logo GEOUP4
  • Logo Open Access
  • Open Access
  • Logo Sherpa/Romeo
    Check whether the anglo-saxon journal in which you have published an article allows you to also publish it under open access.
  • Logo Dulcinea
    Check whether the spanish journal in which you have published an article allows you to also publish it under open access.
  • Logo de Recolecta
  • Logo del Observatorio I+D+i UPM
  • Logo de OpenCourseWare UPM