Full text
Preview |
PDF
- Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (531kB) | Preview |
Dieste Tubio, Oscar ORCID: https://orcid.org/0000-0002-3060-7853, Fernández, Enrique, García Martínez, Ramón and Juristo Juzgado, Natalia
ORCID: https://orcid.org/0000-0002-2465-7141
(2011).
Comparative analysis of meta-analysis methods: when to use which?.
In: "15th Annual Conference on Evaluation & Assessment in Software Engineering, EASE 2011", 11/04/2011 - 12/04/2011, Durham, UK. ISBN 978-1-84919-509-6. pp. 36-45.
https://doi.org/10.1049/ic.2011.0005.
Title: | Comparative analysis of meta-analysis methods: when to use which? |
---|---|
Author/s: |
|
Item Type: | Presentation at Congress or Conference (Unspecified) |
Event Title: | 15th Annual Conference on Evaluation & Assessment in Software Engineering, EASE 2011 |
Event Dates: | 11/04/2011 - 12/04/2011 |
Event Location: | Durham, UK |
Title of Book: | Proceedings of 15th Annual Conference on Evaluation & Assessment in Software Engineering, EASE 2011 |
Date: | 2011 |
ISBN: | 978-1-84919-509-6 |
Subjects: | |
Faculty: | Facultad de Informática (UPM) |
Department: | Lenguajes y Sistemas Informáticos e Ingeniería del Software |
Creative Commons Licenses: | Recognition - No derivative works - Non commercial |
Preview |
PDF
- Requires a PDF viewer, such as GSview, Xpdf or Adobe Acrobat Reader
Download (531kB) | Preview |
Background: Several meta-analysis methods can be used to quantitatively combine the results of a group of experiments, including the weighted mean difference, statistical vote counting, the parametric response ratio and the non-parametric response ratio. The software engineering community has focused on the weighted mean difference method. However, other meta-analysis methods have distinct strengths, such as being able to be used when variances are not reported. There are as yet no guidelines to indicate which method is best for use in each case. Aim: Compile a set of rules that SE researchers can use to ascertain which aggregation method is best for use in the synthesis phase of a systematic review. Method: Monte Carlo simulation varying the number of experiments in the meta analyses, the number of subjects that they include, their variance and effect size. We empirically calculated the reliability and statistical power in each case Results: WMD is generally reliable if the variance is low, whereas its power depends on the effect size and number of subjects per meta-analysis; the reliability of RR is generally unaffected by changes in variance, but it does require more subjects than WMD to be powerful; NPRR is the most reliable method, but it is not very powerful; SVC behaves well when the effect size is moderate, but is less reliable with other effect sizes. Detailed tables of results are annexed. Conclusions: Before undertaking statistical aggregation in software engineering, it is worthwhile checking whether there is any appreciable difference in the reliability and power of the methods. If there is, software engineers should select the method that optimizes both parameters.
Item ID: | 11606 |
---|---|
DC Identifier: | https://oa.upm.es/11606/ |
OAI Identifier: | oai:oa.upm.es:11606 |
DOI: | 10.1049/ic.2011.0005 |
Deposited by: | Memoria Investigacion |
Deposited on: | 13 Jul 2012 08:17 |
Last Modified: | 20 Apr 2016 19:37 |