Lifespan Development
The effects of pubertal timing on externalizing behaviors in
adolescence and early adulthood: A meta-analytic review
Laura M. Dimler *, Misaki N. Natsuaki
University of California, Riverside, CA, United States
article info
Article history:
Available online 3 October 2015
Keywords:
Meta-analysis
Pubertal timing
Externalizing
Moderator
Adolescence
abstract
Using a meta-analytic approach, this investigation examines the association between early
pubertal timing and externalizing behaviors in adolescence and early adulthood. The
findings showed that the effect size of early pubertal maturation on externalizing be-
haviors wasr¼0.180. This small, yet significant effect size is consistent with the models of
early pubertal maturation in that early maturation is associated with higher levels of
externalizing behaviors. Using contrast analyses, we examined three potential moderators
of this association: sex, the concurrent versus long-term effect of early puberty, and types
of puberty assessments. Neither sex nor type of pubertal timing assessment moderated the
effect significantly. However, results indicated that the effect was stronger for studies that
measured pubertal timing and externalizing behaviors concurrently rather than longitu-
dinally (i.e., examining prospective effect of pubertal timing on later externalizing be-
haviors). Thefindings are discussed in terms of implications for future research.
©2015 The Foundation for Professionals in Services for Adolescents. Published by Elsevier
Ltd. All rights reserved.
Early puberty is a risk factor for the development of adolescent psychopathologies, including externalizing behaviors such
as classroom disruptions, aggression, delinquency, and social deviancy. Although there have been many literature reviews on
this topic (e.g.,Ge&Natsuaki, 2009; Graber, 2013; Mendle&Ferrero, 2012; Mendle, Turkheimer,&Emery, 2007; Negriff&
Susman, 2011; Rudolph, 2014), no meta-analysis has been conducted to date. Meta-analysis on pubertal timing and exter-
nalizing behaviors would help further advance thefield for the following reasons. First, while there are many studies that do
report a positive association between early puberty and externalizing problems, there are also many studies thatdo notfind
early puberty to predict adolescent externalizing behaviors (e.g.,Carter, Caldwell, Matusko, Antonucci,&Jackson, 2011;
Obeidallah, Brennan, Brooks-Gunn,&Earls, 2004; Stattin, Kerr,&Skoog, 2011). Therefore, quantification of the pubertal
timing effect using a systematic meta-analysis would further the literature by providing a general overview of the statistical
associations between early pubertal maturation and externalizing behaviors. Second, quantification of the effect size can
inform applied science. It can provide clues to the question of whether targeting early maturing youths is an effective and
efficient strategy for interventions and prevention of externalizing behaviors. If the effect of early pubertal maturation is
found to be robust in the meta-analysis, it is especially alarming because in the context of secular trend of puberty, the age of
pubertal onset has been steadily decreasing in both sexes in the last 25 years (Anderson&Must, 2005; Golub et al., 2008;
Sørenson et al., 2012). For these reasons, the overarching aim of this study is tofill this critical gap in the literature by
conducting a meta-analytic review to further our understanding of the strength of the associations between timing of
*Corresponding author.
E-mail address:[email protected](L.M. Dimler).
Contents lists available atScienceDirect
Journal of Adolescence
journal homepage:www.elsevier.com/locate/jado
http://dx.doi.org/10.1016/j.adolescence.2015.07.021
0140-1971/©2015 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.
Journal of Adolescence 45 (2015) 160e17 0 pubertal maturation and externalizing behavior. We do this by analyzing effect sizes and multiple moderators on more than
36,000 adolescents.
Although previous studies have shown the association between early pubertal maturation and externalizing behaviors,
the strength of this link varies greatly among studies (Ge&Natsuaki, 2009; Ge, Natsuaki, Jin,&Biehl, 2011). In this report, we
sought to examine whether the following potential moderators contribute to the mixed results in the literature: sex of the
adolescent (Graber, Lewinsohn, Seeley,&Brooks-Gunn, 1997; Najman et al., 2009); whether the researchers assessed a
concurrent or long-term effect of pubertal timing (Graber et al., 1997); and types of pubertal timing measurement (Brooks-
Gunn, Warren, Rosso,&Gargiulo, 1987; Dorn&Biro, 2011; Shirtcliff, Dahl,&Pollak, 2009). Thus, an additional aim of this
study is to examine whether these potential moderators alter the effect size of pubertal timing on externalizing behaviors.
Early puberty and externalizing behaviors
It is well documented that adolescents who experience pubertal maturation earlier than their same-age, same-sex peers
are more likely to have negative developmental outcomes, including externalizing behaviors and affiliation with deviant
peers, especially in adolescence (e.g.,Caspi, Lynam, Moffitt,&Silva, 1993; Felson&Haynie, 2002; Ge, Brody, Conger, Simons,&
Murry, 2002; Lynne, Graber, Nichols, Brooks-Gunn,&Botvin, 2007; Mensah et al., 2013; Stattin&Magnusson, 1990). For
instance, in a longitudinal study with more than 500 young adults, early maturers tended to have significantly higher rates of
disruptive behavior disorders (i.e., conduct disorder and oppositional defiant disorder) and antisocial personality traits in
early adulthood compared with on-time maturers (Graber, Seeley, Brooks-Gunn,&Lewinsohn, 2004). More recently,Mrug
et al. (2014)echo similar results, but add that early pubertal timing, compared to on-time or late pubertal timing, is
related to a wide range of externalizing behaviors including delinquency, physical aggression, relational aggression, and
nonphysical aggression.
Several hypotheses have been put forth to explain this association between early pubertal maturation and externalizing
behaviors (seeGe&Natsuaki, 2009for a review). One possible explanation is that the hormonal changes associated with
puberty increases the risk for developing externalizing problems by heightening an adolescent's novelty-seeking behaviors
and other biological systems such as stress reactivity and brain development (Rudolph, 2014). Another hypothesis suggests
that the gap between physical and psychological (i.e., cognitive and emotional) maturities places early physical maturers at
risk for developing externalizing behaviors. This maturation disparity model asserts that early maturers' externalizing be-
haviors are reflections of misalignment of their slow-developing neural and cognitive development and their body's fast-
paced development, lending to a host of social and emotional demands for which the adolescent is not yet cognitively or
emotionally equipped (Ge&Natsuaki, 2009; Moffitt, 1993). Another hypothesis proposes that early puberty places an
adolescent into a demanding transition that is novel, uncertain, and ambiguous, which may exacerbate pre-existing vul-
nerabilities including pre-pubertal behavioral problems (Caspi&Moffitt, 1991; Giletta et al., 2015; Graber, 2013; Hamilton
et al., 2014; Rudolph, 2014). Lastly,Ge and Natsuaki (2009)outline the contextual amplification hypothesis, which states
that an adverse situation (e.g., family conflict, peer challenges) amplifies the effects of the rapid hormonal and biological
changes occurring in an adolescent's body, exacerbating developing psychopathologies.
Despite the theoretical elaboration of the aforementioned mechanisms, the empirical literature has not demonstrated the
consistent effects of early pubertal maturation. Some studies report significant effect sizes that are rather small (e.g.,r¼0.09,
Burt, McGue, DeMarte, Krueger,&Iacono, 2006), while other studies have reported larger effects (e.g.,r¼0.55,Storvoll&
Wichstrøm, 2002). Therefore, because of the theoretical implications and the statistical inconsistencies in the literature, it
is important to compute the aggregated effect size across studies, and the variation in effect sizes, if any, that needs to be
explained.
Moderators of the pubertal timing effect
To explain the potential heterogeneity in effect sizes, we focused on three possible moderators: sex of the adolescent,
short- vs. long-term effects of pubertal timing, and assessments of pubertal maturation.
Sex
Sex differences in the effect of pubertal timing is discussed extensively in the domain of internalizing psychopathology;
early maturing girls tend to have higher rates of depression, anxiety, and other internalizing symptoms during adolescence
and adulthood while the evidence on adolescent boys' internalizing symptoms and pubertal maturation are mixed
(Fernandez-Castelao&Kroner-Herwig, 2014; Kaltiala-Heino, Marttunen, Rantanen,&Rimpel€
a, 2003; Mendle&Ferrero,
2012; Natsuaki, Biehl,&Ge, 2009). Stronger effects of early maturation in female internalizing symptoms are theorized that
features of female pubertal development (e.g., curvy body, breast development, menarche) can elicit social and emotional
challenges, and the stress associated with these challenges deplete yet-developed early maturing girls' coping and resources
to deal with the pressures, which in turn can contribute to depression and anxiety (Natsuaki, Samuels,&Leve, 2014). For
males, on the other hand, earlier work suggests that features associated with male puberty such as voice change, facial hair,
and growth spurt bring social advantages, such that early maturing boys can enjoy potential dominance, power, and lead-
ership in peer relationships (McCabe&Ricciardelli, 2004; Simmons&Blyth, 1987). However, recent empirical research
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 0161 advises that this thinking ought to be revised because this early pubertal transition is associated with elevated levels of
internalizing psychopathology for young males (Ge, Brody, Conger,&Simons, 2006; Graber, 2013; Mendle&Ferrero, 2012).
While existing reviews note that the effect of early maturation on externalizing psychopathology also depends on sex of
the child, the literature is not as clear-cut as it is for internalizing symptoms. Although a fairly consistentfinding is that early
maturing boys have higher levels of externalizing behaviors than late maturing counterparts (Felson&Haynie, 2002; Mendle
&Ferrero, 2012; Susman et al., 2007), the results are rather mixed for girls. For example, multiple recent studies on females
suggest early menarche does not have an impact on externalizing behaviors in females (e.g.,Boden, Fergusson,&Horwood,
2011; Carter et al., 2011), while there seem to be just as many recent studies suggesting that entering into puberty earlier than
most peers is a risk factor for future delinquency for girls (e.g.,Mendle et al., 2007; Mrug et al., 2014). In attempt to resolve this
confusion, the present meta-analysis includes sex as a moderator of the association between early pubertal maturation and
externalizing psychopathology.
Concurrent vs. longitudinal effect of pubertal timing
Another potential moderator of the association between early pubertal timing and externalizing behaviors is the short-
versus long-term effect of early puberty. Pubertal timing is associated with externalizing behaviors concurrently, as evidenced
by a number of cross-sectional studies (e.g.,Flannery, Rowe,&Gulley, 1993; Graber et al., 1997; Sontag, Graber,&Clemans,
2011). However, we have limited knowledge regarding the long-term effect of early puberty. Although there are a compar-
atively small number of studies that have investigated the long-term effects of early pubertal timing, available evidence
suggests that the adverse effect of early maturation is rather short-lived and is contained within the bounds of adolescence
(Boden et al., 2011; Natsuaki et al., 2009; Obeidallah et al., 2004). Among studies that have examined this longitudinal effect,
there is evidence that early pubertal timing in boys, who were measured once a year for four years, consistently resulted in
more externalized hostile feeling in late adolescence than on-time or late-maturing boys (Ge, Conger,&Elder, 2001), but did
not result in major psychopathology in adulthood (Graber et al., 2004). Similarly, adult females who were formerly early
maturers in adolescence were more likely than formerly on-time maturers to have antisocial behavior and conduct behavior
disorders in early adulthood, but not after the age of 21 (Boden et al., 2011; Graber et al., 2004). As such, this quantitative
review aims to combine the results of cross-sectional as well as longitudinal studies to examine whether the concurrent vs.
longitudinal effect of puberty moderates the strength of the association between early puberty and externalizing behaviors.
Pubertal timing assessment
The third potential moderator that may explain mixedfindings in this area of research concerns the type of pubertal
maturation assessments used in each study. Although there are many different and nuanced ways to measure the complex
phenomenon of puberty than are outlined in this meta-analysis (e.g., hormone concentrations, bone age, and gonadal ul-
trasound), we discuss thefive most common forms of assessing pubertal timing that are found within the literature of
externalizing behaviors (seeDorn&Biro, 2011; Dorn, Dahl, Woodward,&Biro, 2006for comprehensive reviews on puberty
assessment). Conventional measures of puberty in psychological research include the Pubertal Development Scale (PDS;
Petersen, Crockett, Richards,&Boxer, 1988), the Tanner Stages (i.e., self-report and/or clinician-report;Marshall&Tanner,
1969, 1970), perceived pubertal timing (i.e., subjective assessment of whether one feels mature earlier than their peers;
seeMendle, 2014a), and self-reported age of menarche in females.
The PDS is a subjective, noninvasivefive-item questionnaire that assesses the status of puberty-related physical devel-
opment (breast development and menarche for girls; voice change and facial hair for boys; skin change, body hair, and growth
spurt for both sexes) at the time of questionnaire administration. Adolescents, parents, and clinicians respond to a four-point
scale, indicating how far along the adolescent's physical maturation has advanced. The Tanner Stages assessment uses an
illustrated scale of primary and secondary sex characteristics in which the adolescent, parent, and/or clinician select one of
thefive pictures that describes the adolescent's current physical development. Research demonstrates that both the PDS and
the Tanner Stages are highly correlated with puberty-related hormones (i.e., dehydroepiandrosterone, testosterone, and
estradiol;Shirtcliff et al., 2009), indicating that the subjective measures are valid and practical when hormonal profiling and
other biological or objective measures of pubertal timing are unfeasible to attain. The clinician-reported Tanner Stages are
known to be more accurate when purposefully isolating an off-time puberty sample (Brooks-Gunn et al., 1987; Dorn et al.,
2006; Shirtcliff et al., 2009). To compute pubertal timing scores from the pubertal status scores based on the PDS and/or
Tanner Stages, researchers typically either 1) standardize the scores within age and sex to create a continuous measure of
pubertal timing; and/or 2) classify the sample into three groups of early, on-time, and late-maturers using the score distri-
bution generated from the given sample (peer-normative comparison).
Perceived pubertal timing is another method that assesses an adolescent's perception of pubertal timing synchrony with
peers, usually by asking adolescents if they feel their physical changes are occurring before, at the same time, or after the rest
of their peers (Mendle, 2014a). Recently, studies have indicated that adolescents' subjective assessment of their timing status
among their peers may be an important factor that uniquely contributes to externalizing behaviors (Lynne et al., 2007;
Storvoll&Wichstrøm, 2002).Mendle (2014a)calls for greater awareness of the importance of the subjective synchrony of
pubertal timing within peers as these self-perceptions of feeling‘different’from peers may result from thoughts and actions
that correspond with the adolescents' impression of their physical appearance. For example, in the context of externalizing
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 0 16 2 behaviors, feeling physically‘different’from peers may accentuate the desire to be accepted by peers, resulting in adolescents
associating with older peers and engaging in“adult-like”behaviors that are not age-appropriate for them (i.e., driving a car at
age 14).
Lastly, self-reported recall of age at menarche is the most frequently used measure of female pubertal timing because of its
ease of accessibility and the fact that menarche is a clearly timed event in a female's life (Dorn&Biro, 2011). However, this
type of measurement has its limitations: menarche occurs late in the pubertal process and is not the most accurate measure of
pubertal timing in terms of reliability of self-reported recall, as the reported age of menarche has been shown to vary within
longitudinal studies (Dorn et al., 2006).
Given the heterogeneity in the assessment of pubertal timing across studies, it may be the case that the strength of
observed associations between early pubertal maturation and externalizing behaviors in previous work may depend on the
types of pubertal timing measures and the reporter of pubertal timing, but this possibility has never been systematically
examined. Previous studies have shown that the potential effect of pubertal timing measurement on an adolescent's psy-
chopathology differ by continuous and categorical forms of measurement (Negriff, Fung,&Trickett, 2008) and stage-
normative and peer-normative forms of measurement (i.e.,Cance, Ennett, Morgan-Lopez,&Foshee, 2012), thus, this meta-
analysis further expands these studies by examiningfive types of pubertal timing measures in the context of externalizing
psychopathology.
Present study
The goal of the present study was two-fold: (1) to conduct a meta-analysis on pubertal timing and externalizing behaviors
to quantify the magnitude of the pubertal timing effect; and (2) to explore factors that potentially moderate the link between
pubertal timing and externalizing psychopathology. Based on the previous literature outlined above, we hypothesized that:
(1) early pubertal timing would lead to more externalizing behaviors; (2) the link between pubertal timing and externalizing
behaviors would be stronger for boys than for girls; and (3) this association would be stronger when pubertal timing and
externalizing behaviors are measured concurrently, as opposed to longitudinally. We did not form specific hypotheses
regarding the pubertal timing assessments.
This meta-analytic review advances previous literature reviews in two ways. First, meta-analyses compliment theoretical
reviews by focusing on effect sizes, providing a statistical tool for describing average effect sizes, and allowing for more
powerful methods of evaluating variability offindings across studies and its sources (Rosenthal, 1991). Second, this meta-
analysis allows for testing for moderation via contrast analysis.
Methods
Selection of studies
In this study, the operationalization of externalizing behaviors was based on the Diagnostic and Statistical Manual, Fifth
Edition (DSM-5) (American Psychiatric Association, 2013). According to the DSM-5, disruptive, impulse control, and conduct
disorders include symptoms that involve problems in the self-control of emotions and behaviors. While other disorders in
DSM-5 may also involve problems in emotional and/or behavioral regulation (i.e., substance use and risky sexual behaviors),
the disorders in this cluster are unique in that these problems are manifested in behaviors that violate the rights of others (e.g.
aggression, destruction of property) and/or bring the individual into significant conflict with societal norms or authority
figures (American Psychiatric Association, 2013).
Peer-reviewed studies were obtained through (a) searching computerized literature in the PsycINFO and Academic Search
Complete databases using all combinations of the keywords:adolescent,externalizing,antisocial,aggression,delinquency,
conduct disorder,conduct problems,behavior problems,behavior disorder,puberty,pubertal timing,adolescence,menarche,
spermarche, andoigarche; (b) going through the authors' personal collection of pertinent studies on puberty; (c) using the
ancestry method by examining references cited in prior reviews and empirical articles; and (d) communicating with those in
thefield who have conducted research in this specific area.
Studies were included in this quantitative review if they met the following criteria: (a) they used a correlational model to
assess an association between pubertal timing and externalizing behaviors; (b) they used externalizing behaviors as an
outcome variable, predicted from pubertal timing; (c) the studies were available in peer-reviewed English language journals;
and (d) the studies reported effect sizeror enough statistical information to reconstruct this effect size (i.e., means and
standard deviations).
The following studies were not included in the meta-analysis: (a) Studies that focus exclusively on risky sexual behaviors
(n¼50); (b) studies that assessed only substance/alcohol use (n¼23); and (c) studies that reported odds ratios or relative risk
ratios (n¼3), as there is no process of which to reconstitute a reliable effect sizer(Rosenthal&Rosnow, 2008; Rosenthal,
Rosnow,&Rubin, 2000). Studies with exclusive focus on sexual behaviors and substance use were excluded because these
behaviors tend to be measured as their own constructs and have their own validity, as opposed to being nested under
externalizing behaviors (e.g.,American Psychiatric Association, 2013; Timmermans, van Lier,&Koot, 2008; Winters,
Stinchfield, Latimer,&Stone, 2008). Readers who are specifically interested in the effect of pubertal timing and risky sex-
ual behaviors should refer to a recent meta-analysis byBaams, Dubas, Overbeek, and van Aken (2015).
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 016 3 Because we assessed sex as a potential moderator, we extracted two effect sizes from a study if it reported separate
statistical analyses for males and females. Under these criteria, 34 publications consisting of over 36,000 adolescents
(N¼36,641) and 40 effect sizes were included in the current investigation.
Extraction of information from studies
The following information was extracted from each study: sample size, correlation (or regression coefficient), sex of the
sample, the type of pubertal timing measurement, the ages of the participants when the pubertal timing and externalizing
measures were completed, and the duration of the study. It is important to note that when multiple time points were used to
assess for externalizing symptoms (e.g., pubertal timing's effects on externalizing behavior in the seventh grade, tenth grade,
and twelfth grade), the furthest time point from the pubertal timing assessment was used (e.g., in a four-year longitudinal
study, the ninth grade puberty assessment and the twelfth grade externalizing assessment would be used). This was done so
that the longitudinal effect would be as long-term as possible, within each article's confines.
Coding of moderators
This study focused on three moderators: participants' sex, concurrent vs. long-term effect of pubertal timing, and the type
of pubertal timing assessment.Sexwas dummy coded with males as the reference group. Forconcurrent vs. long-term effectof
pubertal timing, we dummy coded with concurrent¼1 and long-term¼0. Long-term was defined by a longitudinal design
with at least one year between the times at which puberty and externalizing psychopathology were assessed. We identified
16 effect sizes that corresponded with longitudinal studies that met the aforementioned criterion, with an average time
interval of 4.6 years (SD¼4.5) between puberty and externalizing behavior assessments. Lastly,pubertal timing measurement
was divided intofive subgroups: (a) PDS; (b) medical professional-reported Tanner Stages; (c) parent- or self-reported Tanner
Stages; (d) perceived pubertal timing; and (e) age of menarche (for female-only samples). Each type of pubertal timing
assessment was dummy coded (1¼the measure of interest, 0¼all others).
Statistical analyses
Effect size calculations
The effect sizerwas the statistical basis for this meta-analysis becauserillustrates both the strength and direction of the
association between variables (Durlak, 2009; Ozer, 1985; Rosenthal&DiMatteo, 2001; Rosenthal&Rubin, 1982). As noted
earlier, studies reporting only odds ratios or relative risk ratios (and did not report descriptive statistics on the dataset) were
omitted from the meta-analysis due to difficulty in the reconstituted effect size (Rosenthal&Rosnow, 2008; Rosenthal et al.,
2000), unless the corresponding author was available and willing to release the data needed to computer.
For studies reporting effect sizes other thanr, the information in the published article was computed into the desired effect
size through other information within the article such as group means and standard deviations, Hedge'sg, Cohen'sd, an exact
p-value, chi-square, or anF-test with one degree of freedom in the numerator (Rosenthal, 1991; Rosenthal&Rosnow, 2008;
Rosenthal et al., 2000). If a study did not report an exactp-value for a significant result, the one-tailedp-value was
conservatively assumed to be 0.025. Effect sizes were transformed via Fisher'sZrtransformation, and back-transformed tor's
for intuitive interpretability. Four effect sizes from the same sample in separate papers (Negriff et al., 2008; Negriff, Ji,&
Trickett, 2011; Negriff, Susman,&Trickett, 2011; Negriff&Trickett, 2010) were ensemble adjusted into one overall effect
size (thus, using afixed effects approach since all effect sizes were drawn from the same population;Borenstein, Hedges,
Higgins,&Rothstein, 2009; Rosenthal&Rubin, 1983). In such cases, we transformed eachrinto Fisher'sZ
r, averaged the
effect sizes, transformed back to anr, and gave the one averaged effect size (Achenbach, McConaughy,&Howell, 1987;
Borenstein et al., 2009; Rosenthal, 1991; Rosenthal&DiMatteo, 2001; Rosenthal&Rubin, 1983; Smith&Glass, 1977; see
Appendix A). By doing so, we avoided violations of independence assumptions made when testing significance (Card, 2010;
Rosenthal, 1991). It is important to note that when a publication separately reported male and female results, we reported two
separate effect sizes from the study.
Combination statistics
For each analysis, the unweighted mean and median effect size were computed, along with confidence intervals. Using the
unweighted means and medians allows us to have a fuller picture of the data and the psychological phenomenon at hand. The
random effects model was used to compute and combine effect size statistics, using the unweighted meanrbased onk(the
total number of effect sizes included). This model allows the results to be generalized beyond the samples included in the
meta-analysis (Borenstein et al., 2009). In order to address thefile-drawer problem and the potential number of new, un-
published, or otherwise unretrieved studies that would need to show no effect at thep<0.05 level in order to negate our
results, we computed the tolerance level (N¼210) and fail-safeN(N¼3053.47) values (Rosenthal&Rosnow, 2008). All
statistical analyses were conducted using SPSS 22.0 and Excel Professional Plus 2013.
Moderator analyses
A chi square test for heterogeneity wasfirst computed to assess for heterogeneous results within the sample (Hunter&
Schmidt, 2000; Rosenthal, 1991). The chi-square statistic was used because the Type I error rate is lowest when ther-
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 0 16 4 transformed-to-Fisher's-Zstatistic is used in conjunction with the chi-square test for heterogeneity (S
anchez-Meca&Marín-
Martínez, 1997). The moderators were tested in random effects models in order to assess significant differences between
effect sizes as a function of potential moderator variables at the most generalizable level. The random effects (unweighted)
approach to analyzing moderators allows for generalization to studies that are not identical to the study sample, but are
others of the same ilk (Hedges&Vevea, 1998;Hunter&Schmidt, 2000;Rosenthal&DiMatteo, 2001). Following the guideline
by other meta-analytic studies (e.g.,Card, 2010; Connell&Goodman, 2002) that moderator analyses with less thanfive effect
sizes in each category are considered to be less reliable, we only ran moderator analyses with groups that containedfive or
more effect sizes. For moderator analyses, we computed independent samplet-tests and reportedrfor each group for
respective comparisons.
Results
In total, 40 effect sizes (34 publications) were examined in this investigation. The sample sizes for these effects ranged
from 52 to 5700, with a median sample size of 345, mean sample size of 916, and a total sample size of 36,641 adolescents.
Thirteen effect sizes did not support the association between early pubertal timing and externalizing behaviors. Fourteen
effect sizes were for males, 24 for females, and two examined both sexes without reporting separate results for each sex in the
original publication. Sixteen effect sizes measured the effect of early pubertal timing longitudinally. Eleven of the 40 effect
sizes used the PDS as the pubertal timing measure,five used clinician-report Tanner Stages, six used parent- or self-report
Tanner Stages, seven used a perceived pubertal timing assessment, and 12 effect sizes used age of menarche as the puber-
tal timing measure. For a more detailed and comprehensive summary of the included studies (with effect sizer, the signif-
icance, conclusion, and moderators for each study), please seeAppendix A.
Across the 40 effect sizes, the median correlation wasr¼0.123, with an average correlation ofr¼0.180 (95% CI: 0.109,
0.250;t(39)¼140.81,p<0.0001). These results suggest that early maturing adolescents tend to engage in more externalizing
behaviors than their on-time or late-time maturing peers. Thesefindings would be disputed only if there were over 3053 (i.e.,
fail-safeN) studies showing that there is in fact no association between early pubertal timing and higher levels of exter-
nalizing behaviors; this well exceeds the tolerance level ofN¼210.
The chi square test for heterogeneity was not significant at the conventionalp<0.05 levelðc
2
ð39Þ ¼47:023Þ. Although the
chi-square test is not significant by the conventional alpha level of 0.05, it has been shown that it is not necessary to have
significantly heterogeneous results (Borenstein et al., 2009; Hall&Rosenthal, 1991) and is still important to examine for
potential moderators (Borenstein et al., 2009; Rosenthal&DiMatteo, 2001).
Results from a series oft-tests for moderation analysis are shown inTable 1. Results from these random effects analyses
showed that there was only one significant moderator (concurrent vs. long-term effect of pubertal timing). The concurrent
effect (r¼0.226) of pubertal maturation on externalizing behaviors was found to be stronger than its longitudinal effect
(r¼0.100) (t
(38) ¼2.366,p¼0.023). Sex did not moderate the effect of early maturation on externalizing behaviors
(t
(36) ¼0.991,p¼0.328), indicating that there was no gender difference in the effect of early pubertal maturation on
externalizing behaviors. Early puberty effect on externalizing behaviors did not significantly differ across thefive types of
assessment tools.
Table 1
Moderator analyses.
Moderating variable Categories No. of ES in each grouprtsignificance test (t
(df),p,r)
Sex a Male 14 0.250t (36) ¼0.991,p¼0.328,r¼0.156
Female 24 0.148
Timing effect Concurrent 24 0.226t
(38) ¼2.366,p¼0.023,r¼0.359*
Long-term 16 0.100
PDS PDS 11 0.241t
(38) ¼ 1.394,p¼0.171,r¼0.221
Other 29 0.154
Clinician-report tanner stages Clinician tanner 5 0.322t
(38) ¼0.615,p¼0.542,r¼0.101
Other 35 0.166
Self-/parent-report tanner stages Self/parent tanner 6 0.105t
(38) ¼0.579,p¼0.566,r¼0.094
Other 34 0.192
Perceived timing Perceived timing 7 0.208t
(38) ¼ 0.607,p¼0.548,r¼0.098
Other 33 0.172
Age of menarche
b Age of menarche 12 0.102t (22) ¼0.988,p¼0.334,r¼0.206
Other 12 0.212
Note: PDS¼pubertal development scale.
*p<0.05.
aTwo effect sizes did not examine sex differences.b24 studies examined females.
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 016 5 Discussion
The aims of the present study were (1) to quantify the magnitude of early pubertal timing effect on externalizing be-
haviors, and (2) to explore potential variables that may moderate this association. Results confirm our hypothesis that early
pubertal timing is a significant predictor of adolescent externalizing behaviors at the effect size of 0.180, which explains
3.24% of the variance in externalizing behaviors. Although this effect size is considered small in accordance withCohen's
(1988)guidelines, it would be hasty to conclude that small to moderate effect sizes are meaningless (Rosenthal&
Rosnow, 2008). In addition, any single predictor of highly complex psychological phenomena, such as externalizing
behaviors, is necessarily small, given that multiple predictors work in a synergetic manner (Ahadi&Deiner, 1989; Durlak,
2009).
Our moderator analysis indicated that sex did not moderate the effect of early maturation on externalizing behaviors,
suggesting that the adverse effect of early pubertal maturation on externalizing behaviors does not differ by sex of the child.
This null result may be due to the fact that there is different developmental timing of puberty for boys versus girls and/or
the different methods used to measure puberty for each sex. We may also see this result because externalizing behaviors
rise for both sexes throughout adolescence, which results in a narrower, and statistically insignificant, sex gap in exter-
nalizing behaviors than at any other point in development (Ge et al., 2006; Mendle, 2014b; Najman et al., 2009). However, it
is important to note that, as with any nullfindings, this interpretation must be treated with caution because a statistical null
finding is not conclusive evidence for the absence of an effect (Natsuaki, 2015; Nickerson, 2000). Replication of thisfinding
is needed in future research for further corroboration. Future longitudinal studies ought to also test whether adolescent
externalizing psychopathology differs by sex in relation to the physical and social developmental differences between the
sexes.
Finally, in support of our second hypothesis on concurrent vs. longitudinal effect, the effect of early pubertal timing was
more strongly related to externalizing behaviors while adolescents go through puberty, as opposed to after they complete
pubertal transition. Thisfinding coincides with the current literature, indicating that early pubertal timing has an effect on
adolescent externalizing behaviors only during adolescence (Obeidallah et al., 2004). This lends credence to the idea that
there is relatively little continuity in externalizing behaviors from adolescence into early adulthood and once early adulthood
is reached and puberty is complete for all normally-developing individuals, any differences in behaviors due to timing of
pubertal entrance dissipate over time (Natsuaki et al., 2009; Obeidallah et al., 2004).
The present study is one of thefirst to directly contrastfive types of puberty assessments in a meta-analytic framework
(see alsoCance et al., 2012). We did notfind evidence of moderation by any specific type of pubertal timing measure.Cance
et al. (2012)reported that different types of pubertal timing assessment may tap onto different facets of pubertal develop-
ment. In their empirical study, they found that adolescents who perceived themselves maturing earlier than peers when using
the one-item perceived pubertal timing scale were not necessarily identified as early maturers when assessed by the PDS. The
current meta-analysis expands their results by suggesting that although different measures may describe different processes
of pubertal maturation, these differences do not seem to qualify the link between pubertal timing and externalizing be-
haviors. However, readers are reminded that the observed null effect may be due to the small number of studies in each
category of puberty assessments, generating too low of power to properly detect an effect. Future studies should revisit this
moderation when enough studies have accumulated.
Limitations and future directions
Thefindings reported here need to be interpreted within the context of the study limitations. First, although multiple
literature search strategies were employed in this study, as with any meta-analysis, it is quite possible that some empirical
articles were not identified and excluded unintentionally. For instance, thefile drawer problem is well known, where studies
with statistically significantfindings may have had greater likelihood of publication than those with nullfindings, and
therefore, the effect size can be overestimated. Despite this potential drawback, the large fail-safeN(the number of
nonsignificant studies that would be necessary to reduce the effect size to a nonsignificant value) in this report made it highly
unlikely that any missed or unpublished studies would dramatically change the current results.
Second, it is possible that the definition and operationalization of externalizing behaviors may have been diverse among
the included studies. In this current study, we attempted to conduct a comprehensive search by including studies using such
terms such astheft,conduct problems,delinquency,behavior problems, andantisocial tendencies. However, each construct may
have tapped into unique components of broadband externalizing behaviors. Although this study was not able to test the types
of externalizing behaviors as a moderator due to small sample size for each category, it is imperative that future researchers
conduct a larger meta-analysis and separately analyze by types of externalizing behaviors.
Third, we also wish to remind readers that a small number of studies included in some of the pubertal timing mea-
surement groups (e.g., Clinician-reported Tanner exam,n¼5) may have resulted in statistical challenges infinding effects.
Additionally, as noted earlier, measurement and computation/classification of pubertal timing used in thefield is quite
diverse and there are only few studies that use the exact same measure and classification system. To test the types of pubertal
timing measures as moderators more fully, more studies in each measurement category are needed. Also, future investigators
should carefully describe how they measure and operationalize pubertal timing in order to aid future meta-analytic work on
this topic.
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 0 16 6 Fourth, we were unable to include moderation by race because there were so few studies (n¼3) that reported necessary
statistics (e.g.,ror means and standard deviations) by racial/ethnic groups. Given that the pubertal timing effect may operate
differently depending on ethnic/cultural background (Anderson, Dallal,&Must, 2003; White, Deardoff, Liu,&Gonzales,
2013), future investigators should specifically examine this potential interaction effect or report their analyses separately
by race. Fifth, readers are reminded that there are more than thefive types of measurements tested here that are used to
measure and analyze pubertal timing (e.g., hormonal profiling, bone density;Dorn&Biro, 2011; Dorn et al., 2006).
Furthermore, even within each puberty measure, pubertal timing scores can be analyzed as a continuous or categorical
variable. It can also be computed by using a cut-off based on national norms (e.g., national average of menarche), by relying on
the distribution of the puberty variable within the study sample (e.g., identifying early maturers as the top 30% of the study
sample in the PDS score), by standardizing by age and sex within the sample, or by using a residualized puberty status score
after controlling for age. Unfortunately, this study was not equipped to address these issues due to low power, but this
nuanced approach to pubertal timing measures is a topic that deserves future attention.
Afinal limitation is inherent in meta-analysis computations: we were not able to include articles that reported odds-ratios
or risk-ratios as effect size indicators. As suggested byRosenthal et al. (2000), future research should include effect size
indicators such as those inranddeven when also reporting odds-ratios or risk-ratios, so that future meta-analyses can
replicate thesefindings and are able to include studies that may have otherwise been excluded for computational reasons.
Despite these limitations, thefindings from this meta-analysis should be considered in terms of the overall effect that early
pubertal timing has on an adolescent. Although puberty is a developmental event that is universal across healthy adolescents,
the timing of its occurrence appears to hold different consequence and meaning for individuals (Natsuaki, 2013). Identifying
the risk factors and behaviors that may be influenced by adolescents' pubertal timing is of both theoretical and clinical in-
terest. By attempting to understand this complex biological and social process, researchers and clinicians can further
delineate the etiology of externalizing behaviors in adolescents.
Acknowledgments
We would like to extend our gratefulness to Dr. Robert Rosenthal for his continual statistical guidance, advice, and un-
matched expertise in meta-analysis. We would also like to thank Tricia Miller and Danielle Samuels for their feedback and
valuable help on this meta-analysis.
Appendix A. Summary of studies that tested for an association between early pubertal timing and externalizing
behaviors.
StudyNr pConclusion aSex bPuberty
measurementAge at puberty
measurementAge at ext.
measurement cTiming
effect
Boden et al. (2011)75 0 0.5 EjNE 1 Age of menarche 13.5 30 Long-term
Burt et al. (2006)688 0.090 0.009 E>NE 1 Age of menarche 14.8 14.8 Long-term
Carter, Jaccard, Silverman,
and Pina (2009)102 0.174 0.04 EjNE 1 Age of menarche 14.7 14.7 Concurrent
Carter et al. (2011)607 0.040 0.163 EjNE 1 Age of menarche 15.2 15.2 Concurrent
Caspi et al. (1993)297 0.168 0.003 EjNE 1 Age of menarche 13 15 Long-term
Copeland et al. (2010)140 0.080 0.17 EjNE 1 Self/parent tanner 16 21 Long-term
Cota-Robles, Neiss, and Rowe (2002)5550 0.120 0.001 E>NE 0 PDS 14 14 Concurrent
Deardorff et al. (2013)263 0.020 0.3732 EjNE 1 PDS 10.5 16 Long-term
Dorn, Susman, and Ponirakis (2003)56 0.279 0.0217 E>NE 0 Clinician tanner 12 13.7 Long-term
Dorn et al. (2003)52 0.226 0.0426 EjNE 1 Clinician tanner 12.7 13 Long-term
Felson and Haynie (2002)5700 0.861 0.00005 E>NE 0 PDS 14.1 14.1 Concurrent
Flannery et al. (1993)(1) 376 0.117 0.011 E>NE 0 Self/parent tanner 13.5 13.5 Concurrent
Flannery et al. (1993)(2) 397 0.209 0.00005 E>NE 1 Self/parent tanner 13.5 13.5 Concurrent
Ge et al. (2002)(1) 379 0.193 0.0001 E>NE 0 PDS 10.5 10.5 Concurrent
Ge et al. (2002)(2) 441 0.163 0.0003 E>NE 1 PDS 10.5 10.5 Concurrent
Ge et al. (2001)170 0.206 0.0031 E>NE 0 PDS 12.7 15.7 Long-term
Graber et al. (1997)(1) 667 0.006 0.4345 EjNE 0 Perceived timing 16.6 16.6 Concurrent
Graber et al. (1997)(2) 801 0.120 0.0004 E>NE 1 Perceived timing 16.6 16.6 Concurrent
Haynie (2003)5477 0.063 0.00005 E>NE 1 PDS 15.1 15.1 Concurrent
Lynne et al. (2007)1366 0.081 0.0068 E>NE 2 Perceived timing 11.7 11.7 Concurrent
Mensah et al. (2013)(1) 1773 0.270 0.001 E>NE 0 PDS 8.5 10.5 Long-term
Mensah et al. (2013)(2) 1718 0.050 0.02 E>NE 1 PDS 8.5 10.5 Long-term
Mrug et al. (2008)330 0.126 0.01 E>NE 1 Age of menarche 11.3 11.3 Concurrent
Mrug et al. (2014)2607 0.053 0.0035 E>NE 1 Age of menarche 11.1 16.1 Long-term
Najman et al. (2009)
(1) 1359 0.005 0.4313 EjNE 0 Self/parent tanner 14 20.6 Long-term
Najman et al. (2009)(2) 1425 0.060 0.0113 E>NE 1 Age of menarche 14 20.6 Long-term
(continued on next page)
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 016 7 References 1
Achenbach, T. M., McConaughy, S. H., & Howell, C. T. (1987). Child/adolescent behavioral and emotional problems: implications of cross-informant cor-
relations for situational specificity.Psychological Bulletin, 101(2), 213e232.
Ahadi, S., & Deiner, E. (1989). Multiple determinants of effect size.Personality and Social Psychology, 56(3), 398e406.
American Psychiatric Association. (2013).Diagnostic and statistical manual of mental disorders(5th ed.). Washington, DC: American Psychiatric Association.
Anderson, S. E., Dallal, G. E., & Must, A. (2003). Relative weight and race influence average age at menarche: results from two nationally representative
surveys of US girls studied 25 years apart.Journal of Pediatrics, 111(4), 844e850.
Anderson, S. E., & Must, A. (2005). Interpreting the continued decline in the average age at menarche: results from two nationally representative surveys of
US girls studied 10 years apart.Journal of Pediatrics, 147(6), 753e760.http://dx.doi.org/10.1016/j.jpeds.2005.07.016.
Baams, L., Dubas, J. S., Overbeek, G., & van Aken, M. A. G. (2015). Transitions in body and behavior: a meta-analytic study on the relationship between
pubertal development and adolescent sexual behavior.Journal of Adolescent Health.http://dx.doi.org/10.1016/j.jadohealth.2014.11.019.
*Boden, J. M., Fergusson, D. M., & Horwood, L. J. (2011). Age of menarche and psychosocial outcomes in a New Zealand birth cohort.Journal of the American
Academy of Child and Adolescent Psychiatry, 50(2), 132e140.http://dx.doi.org/10.1016/j.jaac.2010.11.007. e135.
Borenstein, M., Hedges, L. V., Higgins, J. P. T., & Rothstein, H. R. (2009).Introduction to meta-analysis. West Sussex, UK: John Wiley&Sons, Ltd.
Brooks-Gunn, J., Warren, M. P., Rosso, J., & Gargiulo, J. (1987). Validity of self-report measures of girls' pubertal status.Child Development, 58(3), 829e841.
*Burt, S. A., McGue, M., DeMarte, J. A., Krueger, R. F., & Iacono, W. G. (2006). Timing of menarche and the origins of conduct disorder.Archives of General
Psychiatry, 63(8), 890e896.
Cance, J. D., Ennett, S. T., Morgan-Lopez, A. A., & Foshee, V. A. (2012). The stability of perceived pubertal timing across adolescence.Journal of Youth and
Adolescence, 41(6), 764e775.http://dx.doi.org/10.1007/s10964-011-9720-0.
Card, N. (2010). Antipathetic relationships in child and adolescent development: a meta-analytic review and recommendations for an emerging area of
study.Developmental Psychology, 46(2), 516e529.
*Carter, R., Caldwell, C. H., Matusko, N., Antonucci, T., & Jackson, J. S. (2011). Ethnicity, perceived pubertal timing, externalizing behaviors, and depressive
symptoms among black adolescent girls.Journal of Youth and Adolescence, 40(10), 1394e14 0 6 .http://dx.doi.org/10.1007/s10964-010-9611-9.
*Carter, R., Jaccard, J., Silverman, W. K., & Pina, A. A. (2009). Pubertal timing and its link to behavioral and emotional problems among‘at-risk’African
American adolescent girls.
Journal of Adolescence, 32(3), 467e481.http://dx.doi.org/10.1016/j.adolescence.2008.07.005.
*Caspi, A., Lynam, D., Moffitt, T. E., & Silva, P. A. (1993). Unraveling girls' delinquency: biological, dispositional, and contextual contributions to adolescent
misbehavior.Developmental Psychology, 29(1), 19e30.
Caspi, A., & Moffitt, T. E. (1991). Individual differences are accentuated during periods of social change: the sample case of girls at puberty.Journal of
Personality and Social Psychology, 61(1), 157e16 8.
Cohen, J. (1988).Statistical power for the social sciences. Hillsdale, NJ: Laurence Erlbaum and Associates.
Connell, A. M., & Goodman, S. H. (2002). The association between psychopathology in fathers versus mothers and children's internalizing and externalizing
behavior problems: a meta-analysis.Psychological Bulletin, 128(5), 746e773.http://dx.doi.org/10.1037//0033-2909.128.5.746.
*Copeland, W., Shanahan, L., Miller, S., Costello, E. J., Angold, A., & Maughan, B. (2010). Outcomes of early pubertal timing in young women: a prospective
population-based study.American Journal of Psychiatry, 167(10), 1218e1225.
*Cota-Robles, S., Neiss, M., & Rowe, D. C. (2002). The role of puberty in violent and nonviolent delinquency among Anglo American, Mexican American, and
African American boys.Journal of Adolescent Research, 17(4), 364e376.
*Deardorff, J., Cham, H., Gonzales, N. A., White, R. M., Tein, J. Y., Wong, J. J., et al. (2013). Pubertal timing and Mexican-origin girls' internalizing and
externalizing symptoms: the influence of harsh parenting.Developmental Psychology, 49(9), 1790e1804.http://dx.doi.org/10.1037/a0031016.
Dorn, L. D., & Biro, F. M. (2011). Puberty and its measurement: a decade in review.Journal of Research on Adolescence, 21(1), 180e195.http://dx.doi.org/10.
1111/j.1532-7795.2010.00722.x.
Dorn, L. D., Dahl, R. E., Woodward, H. R., & Biro, F. M. (2006). Defining the boundaries of early adolescence: a user's guide to assessing pubertal status and
pubertal timing in research with adolescents.Applied Developmental Science, 10(1), 30e56.
*Dorn, L. D., Susman, E. J., & Ponirakis, A. (2003). Pubertal timing and adolescent adjustment and behavior: conclusions vary by rater.Journal of Youth and
Adolescence, 32(3), 157e16 7.
(continued)
StudyNr pConclusion
aSex bPuberty
measurementAge at puberty
measurementAge at ext.
measurement cTiming
effect
Negriff et al., ensemble adjusted
d 361 0.090 0.087 EjNE 2 Self/parent tanner 10.9 12.9 Long-term
Obeidallah et al. (2004)121 0.174 0.02675 EjNE 1 Age of menarche 13.5 15.6 Long-term
Skoog, Stattin, Ruiselova,
and Ozdemir (2013)1597 0.135 0.001 E>NE 1 Age of menarche 14.5 14.5 Concurrent
Sontag-Padilla et al. (2012)76 0.287 0.0048 E>NE 1 Clinician tanner 7.5 7.5 Concurrent
Sontag et al. (2011)(1) 98 0.030 0.385 EjNE 0 Perceived timing 12.9 12.9 Concurrent
Sontag et al. (2011)(2) 166 0.220 0.0018 E>NE 1 Perceived timing 12.9 12.9 Concurrent
Stattin et al. (2011)284 0.070 0.12 EjNE 1 Age of menarche 14.5 14.5 Concurrent
Stattin and Magnusson (1990)466 0.135 0.002 E>NE 1 Age of menarche 14.5 14.5 Concurrent
Storvoll and Wichstrøm (2002)(1) 67 0.426 0.0003 E>NE 0 Perceived timing 16 16 Concurrent
Storvoll and Wichstrøm (2002)(2) 67 0.551 0.00005 E>NE 1 Perceived timing 16 16 Concurrent
Susman et al. (2007)(1) 56 0.340 0.0073 E>NE 0 Clinician tanner 11.4 11.4 Concurrent
Susman et al. (2007)(2) 55 0.340 0.00785 E>NE 1 Clinician tanner 10.5 10.5 Concurrent
White et al. (2013)312 0.120 0.01705 E>NE 0 PDS 12.8 15.9 Long-term
Williams and Dunlop (1999)99 0.185 0.0319 E>NE 0 PDS 14.2 14.2 Concurrent
aE¼early pubertal timing; NE¼not early pubertal timing.b0¼Male; 1¼Female; 2¼Both sexes (original author did not separate analyses for the sexes).cAge of participant when responding to an externalizing behavior measure.dThe ensemble adjusted effect size was created from the following studies:Negriff et al. (2008), Negriff, Ji, and Trickett (2011), Negriff, Susman, and
Trickett (2011), Negriff and Trickett (2010).
1References marked with an asterisk indicate studies included in the meta-analysis.
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 0 16 8 Durlak, J. A. (2009). How to select, calculate, and interpret effect sizes.Journal of Pediatric Psychology, 34(9), 917e928.http://dx.doi.org/10.1093/jpepsy/
jsp004.
*Felson, R. B., & Haynie, D. L. (2002). Pubertal development, social factors, and delinquency among adolescent boys.Criminology, 40(4), 967e988.
Fernandez-Castelao, C., & Kroner-Herwig, B. (2014). Developmental trajectories and predictors of externalizing behavior: a comparison of girls and boys.
Journal of Youth and Adolescence, 43(5), 775e789.http://dx.doi.org/10.1007/s10964-013-0011-9.
*Flannery, D. J., Rowe, D. C., & Gulley, B. L. (1993). Impact of pubertal status, timing, and age on adolescent sexual experience and delinquency.Journal of
Adolescent Research, 8(1), 21e40.
Ge, X., Brody, G. H., Conger, R. D., & Simons, R. L. (2006). Pubertal maturation and African American children's internalizing and externalizing symptoms.
Journal of Youth and Adolescence, 35(4), 528e537.http://dx.doi.org/10.1007/s10964-006-9046-5.
*Ge, X., Brody, G. H., Conger, R. D., Simons, R. L., & Murry, V. M. (2002). Contextual amplification of pubertal transition effects on deviant peer affiliation and
externalizing behavior among African American children.Developmental Psychology, 38(1), 42e54.http://dx.doi.org/10.1037//0012-1649.38.1.42.
*Ge, X., Conger, R. D., & Elder, G. H., Jr. (2001). The relation between puberty and psychological distress in adolescent boys.Journal of Research on Adolescence,
11(1), 49e70.
Ge, X., & Natsuaki, M. N. (2009). In search of explanations for early pubertal timing effects on developmental psychopathology.Current Directions in
Psychological Science, 18(6), 327e331.
Ge, X., Natsuaki, M. N., Jin, R., & Biehl, M. C. (2011). A contextual amplification hypothesis: pubertal timing and girls' emotional and behavioral problems. In
M. Kerr, H. Stattin, R. C. Engels, G. Overbeek, & A.-K. Andershed (Eds.),Understanding girls' problem behavior: how girls' delinquency develops in the context
of maturity and health, co-occurring problems, and relationships(Vol. 7). John Wiley&Sons.
Giletta, M., Calhoun, C. D., Hastings, P. D., Rudolph, K. D., Nock, M. K., & Prinstein, M. J. (2015). Multi-level risk factors for suicidal ideation among at-risk
adolescent females: the role of hypothalamic-pituitary-adrenal axis responses to stress.Journal of Abnormal Child Psychology, 43(5), 807e820.http://dx.
doi.org/10.1007/s10802-014-9897-2.
Golub, M. S., Collman, G. W., Foster, P. M. D., Kimmel, C. A., Rajpert-De Meyts, E., Reiter, E. O., et al. (2008). Public health implications of altered puberty
timing.Pediatrics, 121,S218eS230.
Graber, J. A. (2013). Pubertal timing and the development of psychopathology in adolescence and beyond.Hormones and Behavior, 64(2), 262e269.http://
dx.doi.org/10.1016/j.yhbeh.2013.04.003.
*Graber, J. A., Lewinsohn, P. M., Seeley, J. R., & Brooks-Gunn, J. (1997). Is psychopathology associated with the timing of pubertal development?Journal of the
American Academy of Child and Adolescent Psychiatry, 36(12), 1768e17 76.
Graber, J. A., Seeley, J. R., Brooks-Gunn, J., & Lewinsohn, P. M. (2004). Is pubertal timing associated with psychopathology in young adulthood?Journal of the
American Academy of Child and Adolescent Psychiatry, 43(6), 718e726.http://dx.doi.org/10.1097/01.chi.0000120022.14101.11.
Hall, J. A., & Rosenthal, R. (1991). Testing for moderator variables in meta-analysis: Issues and methods.Communication Monographs, 58(4), 437e448.
Hamilton, J. L., Stange, J. P., Kleiman, E. M., Hamlat, E. J., Abramson, L. Y., & Alloy, L. B. (2014). Cognitive vulnerabilities amplify the effect of early pubertal
timing on interpersonal stress generation during adolescence.Journal of Youth and Adolescence, 43(5), 824e833.http://dx.doi.org/10.1007/s10964-013-
0015-5.
*Haynie, D. L. (2003). Contexts of risk? Explaining the link between girls' pubertal development and their delinquency involvement.Social Forces, 82(1),
355e397.
Hedges, L. V., & Vevea, J. L. (1998). Fixed- and random-effects models in meta-analysis.Psychological Methods, 3(4), 486e504.
Hunter, J. E., & Schmidt, F. L. (2000). Fixed effects vs. random effects meta-analysis models: Implications for cumulative research knowledge.International
Journal of Selection and Assessment, 8(4), 275e292.
Kaltiala-Heino, R., Marttunen, M., Rantanen, P., & Rimpel€
a, M. (2003). Early puberty is associated with mental health problems in middle adolescence.Social
Science&Medicine, 57(6), 1055e10 6 4 .http://dx.doi.org/10.1016/s0277-9536(02)00480-x.
*Lynne, S. D., Graber, J. A., Nichols, T. R., Brooks-Gunn, J., & Botvin, G. J. (2007). Links between pubertal timing, peer influences, and externalizing behaviors
among urban students followed through middle school.Journal of Adolescent Health, 40(2), 181.e7e181.e13.http://dx.doi.org/10.1016/j.jadohealth.2006.
09.008.
Marshall, W. A., & Tanner, J. M. (1969). Variations in the pattern of pubertal changes in girls.Archives of Disease in Childhood, 44(235), 291e303.
Marshall, W. A., & Tanner, J. M. (1970). Variations in the pattern of pubertal changes in boys.Archives of Disease in Childhood, 45(13), 13e23.
McCabe, M. P., & Ricciardelli, L. A. (2004). A longitudinal study of pubertal timing and extreme body change behaviors among adolescent boys and girls.
Adolescence, 39(153), 144e16 6.
Mendle, J. (2014a). Beyond pubertal timing: new directions for studying individual differences in development.Current Directions in Psychological Science,
23(3), 215e219.http://dx.doi.org/10.1177/0963721414530144.
Mendle, J. (2014b). Why puberty matters for psychopathology.Child Development Perspectives, 8
(4), 218e222.http://dx.doi.org/10.1111/cdep.12092.
Mendle, J., & Ferrero, J. (2012). Detrimental psychological outcomes associated with pubertal timing in adolescent boys.Developmental Review, 32(1),
49e66.http://dx.doi.org/10.1016/j.dr.2011.11.001.
Mendle, J., Turkheimer, E., & Emery, R. E. (2007). Detrimental psychological outcomes associated with early pubertal timing in adolescent girls.Develop-
mental Review, 27(2), 151e17 1.http://dx.doi.org/10.1016/j.dr.2006.11.001.
*Mensah, F. K., Bayer, J. K., Wake, M., Carlin, J. B., Allen, N. B., & Patton, G. C. (2013). Early puberty and childhood social and behavioral adjustment.Journal of
Adolescent Health, 53,118e124.http://dx.doi.org/10.1016/j.jadohealth2012.12.018.
Moffitt, T. E. (1993). Adolescent-limited and life-course-persistent antisocial behavior: a developmental taxonomy.Psychological Review, 100(4), 674e701.
*Mrug, S., Elliot, M., Gilliland, J., Grunbaum, J. A., Tortolero, S. R., Cuccaro, P., et al. (2008). Positive parenting and early puberty in girls: protective effects
against aggressive behavior.Archives of Pediatric&Adolescent Medicine, 162(8), 781e786.
*Mrug, S., Elliott, M. N., Davies, S., Tortolero, S. R., Cuccaro, P., & Schuster, M. A. (2014). Early puberty, negative peer influence, and problem behaviors in
adolescent girls.Pediatrics, 133(1), 7e14 .http://dx.doi.org/10.1542/peds.2013-0628.
*Najman, J. M., Hayatbakhsh, M. R., McGee, T. R., Bor, W., O'Callaghan, M. J., & Williams, G. M. (2009). The impact of puberty on aggression/delinquency:
adolescence to young adulthood.The Australian and New Zealand Journal of Criminology, 42(3), 369e386.
Natsuaki, M. N. (2013). Puberty in context: toward a more nuanced understanding of early maturation.Journal of Adolescent Health, 53,677e678.http://dx.
doi.org/10.1016/j.jadohealth.2013.09.019.
Natsuaki, M. N. (2015). Intervention effect fades over time: when, how, and for whom?The Lancet Psychiatry.http://dx.doi.org/10.1016/S2215-0366(15)
00158-3.
Natsuaki, M. N., Biehl, M. C., & Ge, X. (2009). Trajectories of depressed mood from early adolescence to young adulthood: the effects of pubertal timing and
adolescent dating.Journal of Research on Adolescence, 19,47e74.
Natsuaki, M. N., Samuels, D., & Leve, L. D. (2014). Puberty, identity, and context: a biopsychosocial perspective on internalizing psychopathologyin early
adolescent girls. In K. C. McLean, & M. Syed (Eds.),The Oxford handbook of identity development(pp. 389e405). New York, NY: Oxford University Press.
*Negriff, S., Fung, M. T., & Trickett, P. K. (2008). Self-rated pubertal development, depressive symptoms and delinquency: measurement issues and
moderation by gender and maltreatment.Journal of Youth and Adolescence, 37(6), 736e
746.http://dx.doi.org/10.1007/s10964-008-9274-y.
*Negriff, S., Ji, J., & Trickett, P. K. (2011). Exposure to peer delinquency as a mediator between self-report pubertal timing and delinquency: a longitudinal
study of mediation.Development and Psychopathology, 23, 293e304.
Negriff, S., & Susman, E. J. (2011). Pubertal timing, depression, and externalizing problems: a framework, review, and examination of gender differences.
Journal of Research on Adolescence, 21(3), 717e746.http://dx.doi.org/10.1111/j.1532-7795.2010.00708.x.
*Negriff, S., Susman, E. J., & Trickett, P. K. (2011). The developmental pathway from pubertal timing to delinquency and sexual activity from early tolate
adolescence.Journal of Youth and Adolescence, 40, 1343e1356.
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 016 9 *Negriff, S., & Trickett, P. K. (2010). The relationship between pubertal timing and delinquent behavior in maltreated male and female adolescents.Journal of
Early Adolescence, 30(4), 518e542.http://dx.doi.org/10.1177/0272431609338180.
Nickerson, R. S. (2000). Null hypothesis significance testing: a review of an old and continuing controversy.Psychological Methods, 5(2), 241e301.
*Obeidallah, D., Brennan, R. T., Brooks-Gunn, J., & Earls, F. (2004). Links between pubertal timing and neighborhood contexts: implications for girls' violent
behavior.Journal of the American Academy of Child and Adolescent Psychiatry, 43(12), 1460e1468.http://dx.doi.org/10.1097/01.chi.0000142667.52062.1e.
Ozer, D. J. (1985). Correlation and the coefficient of determination.Psychological Bulletin, 97(2), 307e315.http://dx.doi.org/10.1037/0033-2909.97.2.307.
Petersen, A. C., Crockett, L. J., Richards, M., & Boxer, A. (1988). A self-report measure of pubertal status: reliability, validity, and initial norms.Journal of Youth
and Adolescence, 17(2), 117e13 3.
Rosenthal, R. (1991).Meta-analytic procedures for social research(Vol. 6). Newbury Park, CA: Sage.
Rosenthal, R., & DiMatteo, M. R. (2001). Meta-analysis: recent developments in quantitative methods for literature reviews.Annual Review of Psychology, 52,
59e82.
Rosenthal, R., & Rosnow, R. L. (2008).Essentials of behavioral research: methods and data analysis(Vol. 3). New York, NY: McGraw-Hill.
Rosenthal, R., Rosnow, R. L., & Rubin, D. B. (2000).Contrasts and effect sizes in behavioral research: a correlational approach. New York, NY: Cambridge
University Press.
Rosenthal, R., & Rubin, D. B. (1982). A simple, general purpose display of magnitude of experimental effect.Journal of Educational Psychology, 74(2), 166e169.
http://dx.doi.org/10.1037/0022-0663.74.2.166.
Rosenthal, R., & Rubin, D. B. (1983). Ensemble-adjustedpvalues.Psychological Bulletin, 94, 540e541.
Rudolph, K. D. (2014). Puberty as a developmental context of risk for psychopathology. In M. Lewis, & K. D. Rudolph (Eds.),Handbook of developmental
psychopathology(pp. 331e354). New York, NY: Springer.
S
anchez-Meca, J., & Marín-Martínez, F. (1997). Homogeneity tests in meta-analysis: a Monte Carlo comparison of statistical power and type I error.Quality&
Quantity, 31, 385e399.
Shirtcliff, E. A., Dahl, R. E., & Pollak, S. D. (2009). Pubertal development: correspondence between hormonal and physical development.Child Development,
80(2), 327e337.
Simmons, R. G., & Blyth, D. A. (1987).Moving into adolescence: the impact of pubertal change and school context. Aldine Transaction.
*Skoog, T., Stattin, H., Ruiselova, Z., & Ozdemir, M. (2013). Female pubertal timing and problem behaviour: the role of culture.International Journal of
Behavioral Development, 37(4), 357e365.http://dx.doi.org/10.1177/0165025413486761.
Smith, L. M., & Glass, G. V. (1977). Meta analysis of psychotherapy outcomes.American Psychologist, 32, 752e760.
*Sontag-Padilla, L. M., Dorn, L. D., Tissot, A., Susman, E. J., Beers, S. R., & Rose, S. R. (2012). Executive functioning, cortisol reactivity, and symptoms of
psychopathology in girls with premature adrenarche.Developmental Psychopathology, 24(1), 211e223.http://dx.doi.org/10.1017/S0954579411000782.
*Sontag, L. M., Graber, J. A., & Clemans, K. H. (2011). The role of peer stress and pubertal timing on symptoms of psychopathology during early adolescence.
Journal of Youth and Adolescence, 40(10), 1371e1382.http://dx.doi.org/10.1007/s10964-010-9620-8.
Sørenson, K., Mouritsen, A., Aksglaede, L., Hagen, C. P., Mogensen, S. S., & Juul, A. (2012). Recent secular trends in pubertal timing: implications for evaluation
and diagnosis of precocious puberty.Hormone Research in Paediatrics, 77,137e145.http://dx.doi.org/10.1159/000336325.
*Stattin, H., Kerr, M., & Skoog, T. (2011). Early pubertal timing and girls' problem behavior: integrating two hypotheses.Journal of Youth and Adolescence, 40,
1271e1287.
*Stattin, H., & Magnusson, D. (1990).Paths through life: pubertal maturation in female development(Vol. 2). Hillsdale, NJ: Lawrence Erlbaum Associates,
Publishers.
*Storvoll, E. E., & Wichstrøm, L. (2002). Do the risk factors associated with conduct problems in adolescents vary according to gender?Journal of Adolescence,
25(2), 183e202.http://dx.doi.org/10.1006/jado.2002.0460.
*Susman, E. J., Dockray, S., Schiefelbein, V. L., Herwehe, S., Heaton, J. A., & Dorn, L. D. (2007). Morningness/eveningness, morning-to-afternoon cortisol ratio,
and antisocial behavior problems during puberty.Developmental Psychology, 43(4), 811e822.http://dx.doi.org/10.1037/0012-1649.43.4.811.
Timmermans, M., van Lier, P. A. C., & Koot, H. M. (2008). Which forms of child/adolescent externalizing behaviors account for late adolescent risky sexual
behavior and substance abuse? In M. Timmermans (Ed.),Antisocial behaviors: courses and consequences from toddlerhood to late adolescence(pp. 65e79).
Duivendrecht, Netherlands: PI Research.
*White, R., Deardoff, J., Liu, Y., & Gonzales, N. A. (2013). Contextual amplification of attenuation of the impact of pubertal timing on Mexican-origin boys'
mental health symptoms.Journal of Adolescent Health, 53, 692e698.
*Williams, J. M., & Dunlop, L. C. (1999). Pubertal timing and self-reported delinquency among male adolescents.Journal of Adolescence, 22,157e17 1.
Winters, K. C., Stinchfield, R. D., Latimer, W. W., & Stone, A. (2008). Internalizing and externalizing behaviors and their association with the treatment of
adolescents with substance use disorder.
Journal of Substance Abuse Treatment, 35(3), 269e278.http://dx.doi.org/10.1016/j.jsat.2007.11.002.
L.M. Dimler, M.N. Natsuaki / Journal of Adolescence 45 (2015) 160e17 0 17 0