Questionable Research Practices (QRPs, John et al., 2012)
Publication bias
“Publish or perish!” (Callard, 2022)
Researchers degrees of freedom (Simmons et al., 2011)
Open science
Pre-registration
Registered Reports
Multiverse of analytical scenarios
(data collection, coding and analysis)
⬇
Analysis and presentation of results
from every plausible scenario
↙ ↘
Explorative methods
Inferential methods (PIMA)
Innovative permutation-based method
Strong Family Wise Error Rate (FWER) control
Good statistical power
p-value adjustment for multiple comparisons (maxT)
Applies to Generalized Linear Models (GLMs)
Distribution-Free:
\(\rightarrow\) No assumptions about data normality (non-parametric)
Handles Dependencies:
\(\rightarrow\) Accounts for correlations between multiverse specifications
Coin Toss Example:
What is the probability of getting at least one Head?
The probability of getting what we want approaches 100%!
In Psychology, our coin has a probability of Head of \(5\%\)
Then we can keep tossing this coin until we win
We eventually get our significant \(p < .05^*\)
The problem is that Head = False Positives…
Easy fix for keeping the false positives rate \(< 5\%\)
\[ \alpha_{adj} = \frac{\alpha_{standard}}{\text{number of tries (k)}} \]
Example: if we do ten different tests on different data
Now we are safe from false positives…
…but it’s incredibly hard to find true effects!
Bonferroni assumes tests are independent (like coin tosses)
But Multiverse specifications are highly correlated:
\(\rightarrow\) similar tests on the same data
Bonferroni ignores the correlations
\(\rightarrow\) Massive Power Loss (High Type II Error)
max-T adjustment:
1. empirically models the correlation structure via permutations
2. corrects for multiplicity without killing statistical power
Courtesy of Dr. Gambarota.
RCTs on psychotherapy effectiveness for depression (Plessen et al., 2023)
n of primary studies = 124
Population = adults
| Scenarios | Model | Therapy | Format | Bias | Diagnosis |
|---|---|---|---|---|---|
| \(m_1\) | EE | CBT | Individual | High | Clinical |
| \(m_2\) | RE | Non-CBT | Group | Low | Cut-off |
| … | … | … | … | … | … |
| \(m_{1920}\) | RE | All | All | All | All |
Compute Meta-Analysis for each scenario (\(m_i\))
Include Meta-Analyses with at least 10 studies (\(k \geq 10\))
\(y_k\) = effect size estimate from study k
\(v_k\) = variance of study k
\(\tau_0^2\) = between-study variance under the null (\(H_0\))
| \(m_1\) | \(m_2\) | \(m_i\) | \(m_{1144}\) | |
|---|---|---|---|---|
| \(k_1\) | 0.34 | 0.28 | … | 0.00 |
| \(k_2\) | - 0.25 | 0.00 | … | -0.25 |
| … | … | … | … | … |
| \(k_{124}\) | 0.00 | 0.52 | … | 0.48 |
Null values (= 0.00) indicate study \(k_i\) was not included in meta-analysis \(m_i\)
Significant Meta-Analyses
Never = 8 (0.7%)
Before correction = 1136 (99.3%)
After correction = 1030 (90%)
Mdn = 0.59
\(\boldsymbol{\bar{x}}\) = 0.63
Min-Max = [0.28-1.61]
Clinical Significance \(\geq\) 0.24
(Cuijpers et al., 2014)
Inferential Multiverse Meta-Analysis
Addressing selective reporting and p-hacking
Relative stability of findings on the effectiveness of psychotherapies for depression
Simplification of dataset and analyses
No multilevel meta-analyses
No quantitative assessment of publication bias
PIMMA to consolidate knowledge and evidence
in psychology
Extend the method to multilevel and/or multivariate meta-analyses
Theory comes first!
Be parsimonious
Be exhaustive
https://osf.io/zj7mt/
For info: matteo.manente.3@phd.unipd.it