Psychometric properties of the Symptom Checklist-90 in adolescent psychiatric inpatients and age- and gender-matched community youth

Rytilä-Manninen, Minna; Fröjd, Sari; Haravuori, Henna; Lindberg, Nina; Marttunen, Mauri; Kettunen, Kirsi; Therman, Sebastian

doi:10.1186/s13034-016-0111-x

Research Article
Open access
Published: 15 July 2016

Psychometric properties of the Symptom Checklist-90 in adolescent psychiatric inpatients and age- and gender-matched community youth

Minna Rytilä-Manninen^1,2,
Sari Fröjd³,
Henna Haravuori^2,4,
Nina Lindberg⁵,
Mauri Marttunen^2,4,
Kirsi Kettunen² &
…
Sebastian Therman⁴

Child and Adolescent Psychiatry and Mental Health volume 10, Article number: 23 (2016) Cite this article

5696 Accesses
53 Citations
1 Altmetric
Metrics details

Abstract

Background

The Symptom Checklist-90 (SCL-90) is a questionnaire that is widely used to measure subjective psychopathology. In this study we investigated the psychometric properties of the SCL-90 among adolescent inpatients and community youth matched on age and gender.

Methods

The final SCL-90 respondents comprised three subsets: 201 inpatients at admission, of whom 152 also completed the instrument at discharge, and 197 controls. The mean age at baseline was 15.0 years (SD 1.2), and 73 % were female. Differential SCL-90 item functioning between the three subsets was assessed with an iterative algorithm, and the presence of multidimensionality was assessed with a number of methods. Confirmatory factor analyses for ordinal items compared three latent factor models: one dimension, nine correlated dimensions, and a one-plus-nine bifactor model. Sensitivity to change was assessed with the bifactor model’s general factor scores at admission and discharge. The accuracy of this factor in detecting the need for treatment used, as a gold standard, psychiatric diagnoses based on clinical records and the Schedule for Affective Disorders and Schizophrenia for School-Age Children—Present and Lifetime (K-SADS-PL) interview.

Results

Item measurement properties were largely invariant across subsets under the unidimensional model, with standardized factor scores at admission being 0.04 higher than at discharge and 0.06 higher than those of controls. Determination of the empirical number of factors was inconclusive, reflecting a strong main factor and some multidimensionality. The unidimensional factor model had very good fit, but the bifactor model offered an overall improvement, though subfactors accounted for little item variance. The SCL-90s ability to identify those with and without a psychiatric disorder was good (AUC = 83 %, Glass’s Δ = 1.4, Cohen’s d = 1.1, diagnostic odds ratio 12.5). Scores were also fairly sensitive to change between admission and discharge (AUC 72 %, Cohen’s d = 0.8).

Conclusions

The SCL-90 proved mostly unidimensional and showed sufficient item measurement invariance, and is thus a useful tool for screening overall psychopathology in adolescents. It is also applicable as an outcome measure for adolescent psychiatric patients. SCL-90 revealed significant gender differences in subjective psychopathology among both inpatients and community youth.

Background

Adolescence is a transitional stage from childhood to adulthood during which the individual undergoes many physiological, psychological, cognitive, and social changes. It is a risk period for the emergence of many psychiatric disorders [1, 2]. The incidence of psychiatric disorders increases from childhood through mid-adolescence, peaking in late adolescence and young adulthood [3], and approximately one adolescent in five suffers from a psychiatric disorder [4]. In Finland, about 3 % of the adolescent population (ages 13–22) is referred to adolescent psychiatric secondary care, and approximately 0.4–0.6 ‰ require psychiatric hospitalization [5].

Symptom inventories provide an economical means of assessing adolescents’ mental disturbance levels and treatment effectiveness. As Symptom Checklists and rating scales provide extensive amounts of clinical information relatively quickly, self-report symptom inventories are commonly used by both clinicians and researchers to gather information on patients’ mental states. Furthermore, self-report questionnaires can be used to monitor the quality of medical and psychological interventions in mental health services, and to screen for symptoms of psychopathology [6]. Because psychiatric comorbidity is typical for adolescents with mental disorders, a growing body of research has supported using multidimensional scales [7]. One such questionnaire is the Symptom Checklist-90 (SCL-90) [8], a widely applied self-assessment tool for individuals with a broad range of mental disorders and symptom intensity. It contains 90 items and takes approximately 12–15 min to administer, yielding nine scores for primary symptom dimensions and three for global distress. The symptom dimensions comprise somatization, obsessive–compulsive behavior, interpersonal sensitivity, depression, anxiety, hostility, phobic anxiety, paranoid ideation, and psychoticism [8]. The main global index of distress is the global severity index (GSI), which is the average of all responses. A time reference of 1–2 weeks is usually used.

The SCL-90 has been tested in different settings, including community [6, 9–13] and psychiatric outpatient [14, 15] and inpatient samples [16–18]. It is commonly used as an indicator of change in symptoms [19, 20] and as a treatment outcome measure [21, 22]. The SCL-90s ability to discriminate patients from non-patients is adequate [13, 14], but correlations with analogous and non-analogous measures have been somewhat controversial [17, 23]. Significant gender differences have also emerged [13, 21, 24]. The main criticism of the instrument, however, has focused on the original 9-factor structure, with substantial difficulties arising in its replication. One general factor accounting for a large proportion of variance has been proposed in some studies with adults [14, 17, 19, 25].

The aim of the present study was to investigate the measurement invariance, factor structure, reliability, and validity of the SCL-90 among adolescents. A new approach is the use of a bifactor model, which according to Reise [26], is effective when modeling construct-relevant multidimensionality. A bifactor model consists of general factor and a number of specific factors, allowing each item to load both on the general factor and specific factor [26, 27]. In this study we compare two groups, inpatients and controls, and also the same patient sample at two time points, namely admission and discharge. As a prerequisite for comparing these two groups and two time points accurately, a measurement invariance analysis was executed. Measurements invariance signifies that the association between the items and the latent factors should not depend on group membership or measurement occasion, but the measurement instrument and the construct being measured are operating in the same way across diverse samples of interest [28].

To the best of our knowledge, this is the first study that examines the dimensionality and viability of the SCL-90 subscale scores in an adolescent sample by applying a bifactor model. In line with recent findings supporting a bifactor model of the SCL-90 with adults [29], we expect that the model with nine specific factors and one general factor of symptoms would be the best fitting solution. Our second aim is to estimate the screening performance of the SCL-90 and to determine optimal cut-off point. To our knowledge, there are no discrimination thresholds for distinguishing between adolescent patients and the general population or between adolescents with a diagnosed mental disorder and those without. An earlier study in a Finnish adult sample [10] has shown that the screening properties of this SCL-90 translation are good.

The findings could provide important information on the best practices for using the SCL-90 questionnaire and interpreting SCL-90 scores among adolescents.

Methods

Participants and procedure

Inpatients

The Kellokoski Hospital Adolescent Inpatient Follow-Up Study (KAIFUS) is a longitudinal naturalistic study on clinical characteristics and impact of treatment in a consecutive sample of adolescent psychiatric inpatients in Southern Finland. The sample comprises 13- to 17-year-old adolescents admitted to Kellokoski Hospital for the first time between September 2006 and August 2010 (N = 395). We excluded adolescents with a treatment period of less than 2 weeks, with intellectual disability, with an age under 13 years, or with a poor knowledge of Finnish language (n = 80, 20 %). Furthermore, 62 adolescents (16 %) declined to participate, 23 (6 %) discontinued their treatment, and 24 (6 %) had incomplete data. The final inpatient admission sample comprised 60 boys (29 %) and 146 girls (71 %) with a mean age of 15.1 years (SD = 1.2). Non-participation was unrelated to age (p = 0.31, two-sided t test), living situation (p = 0.58), socioeconomic status (p = 0.38), or the presence of substance use disorders (p = 0.59), mood disorders (p = 0.92), conduct disorder (p = 0.09), anxiety disorders (p = 0.39), or eating disorders (p = 0.34), but was higher among boys (p = 0.02) and among patients with psychotic disorders (p = 0.02). Patients were diagnostically interviewed with the Schedule for Affective Disorders and Schizophrenia for School-Age Children—Present and Lifetime version [30]. The patients were requested to complete the SCL-90 at the beginning of their stay as well as at discharge. The treatment duration was between 31 and 90 days in 38 % of the cases, 42 % of the patients stayed in hospital for over 90 days, and 20 % of the patients for less than 31 days. For more details, see Rytilä-Manninen et al. [31]. The study was designed to detect clinically meaningful group differences, and the planned sample size of 200 patients and 200 controls is sensitive enough to achieve 80 % power even for small effect sizes (d > 0.28) when α is set to 0.05 on a t test.

Community sample

The control group comprised a random sample of sex- and age-matched students from two secondary, one vocational, and four comprehensive schools, collected from the same geographical area as the inpatients. A total of 473 students were invited; 202 (43 %) refused to participate, and 68 (14 %) failed to complete the self-assessments despite providing consent. The final sample consisted of 55 males (27 %) and 148 females (73 %). All were native Finns, with a mean age of 14.9 years (SD = 1.2). No significant differences were found between adolescents who participated and those who did not with regard to socioeconomic status (p = 0.61) or living situation (p = 0.49). The same interviews and questionnaires were used with the community youth group as with patients. Based on the diagnostic interviews, 21 % of these youths met the criteria for at least one psychiatric disorder. For more details, see Rytilä-Manninen et al. [31].

Ethical aspects

Participation was voluntary, and all participants and their legal guardians were required to provide written informed consent after receiving both verbal and written information about the study. The Ethics Committee of Helsinki University Hospital approved the study protocol. Permission to conduct the study was granted by the authorities of the Helsinki and Uusimaa Hospital District and school administrations. The study was performed in accordance with the Declaration of Helsinki.

Measures

Schedule for affective disorders and schizophrenia for school-age children—present and lifetime version (K-SADS-PL)

Psychiatric diagnoses were assessed based on the K-SADS-PL interview [30]. This is a semi-structured interview with good to excellent test–retest reliability and high concurrent validity and inter-rater agreement between the original and translated versions [30, 32–34]. The Finnish translation has previously been used in studies of both adolescent in- and outpatients [35, 36].

Psychiatrists specialized in treating adolescents assigned the psychiatric diagnoses according to the Axis-I disorders in DSM-IV [37] based on the K-SADS-PL and clinical records. Discrepancies were resolved by consensus between the psychiatrists. The psychiatric diagnoses present at the time of the baseline interview were included in the analyses, here dichotomized as having at least one psychiatric diagnosis present or no psychiatric diagnosis present.

Scl-90

SCL-90 is a self-report measure for persons aged at least 13 years. It consists of 90 items that represent nine factors and seven additional questions that are configure items, primarily concerning disturbances in appetite and sleep patterns, and are not scored collectively as a dimension [8]. Each of the nine symptom dimensions contains 6-13 items. Items are rated on a five-point Likert-scale of distress, ranging from “not at all” (0) to “extremely” (4). The General Severity Index (GSI) is the average score for all responded items and serves as an overall measure of psychiatric distress. In this study, the time of reference for the symptoms was the previous two weeks.

Statistical analyses

Measurement invariance

To establish sufficient measurement invariance across groups and time points, an iterative algorithm was employed to detect differential item functioning (DIF) under Samejima’s graded response model for the full SCL-90, using the lordif package version 0.3–2 [38] for R with default settings (α = 0.01). The algorithm uses items tentatively flagged as invariant as anchors in an iterative process until a stable solution is identified. Patient responses at admission were separately compared with responses at discharge and control group responses. Total item-wise DIF was measured with summed uniform and non-uniform McFadden pseudo-R ².

Optimal number of factors

The multifactoriality of the subsample datasets were investigated with a number of indices for the optimal number of factors to extract: very simple structure (VSS), minimum average partial correlation (MAP), and parallel analysis (PA) [39–41]. These were calculated with the psych package version 1.5.8 in R version 3.2.3, using the polychoric correlation matrix and both weighted least-squares (WLS) and maximum likelihood (ML) estimation. VSS was investigated at complexity one and two, where an item is allowed to load on one or two factors only. In addition, the comparison data approach of Ruscio and Roche [42] was used, as implemented in R code supplied by the authors, using Spearman correlation matrices derived from complete cases.

Factor analyses

After establishing sufficient measurement invariance, the one-dimensional and a priori nine-dimensional model of the SCL-90 was fitted in confirmatory factor analyses (CFA) separately for patients at admission, patients at discharge, and controls.

In addition, in light of the evidence for a strong main factor, a bifactor model was specified with a general factor uncorrelated with the nine subfactors, which correlated with each other. The percentage of common variance attributable to the general factor was expressed with the explained common variance index (ECV) and the usefulness of individual subscales was assessed with McDonald’s omega hierarchical ω_h and omega subscale ω_s [26].

All factor analyses used the weighted least squares mean and variance adjusted (WLSMV) algorithm for categorical indicators in Mplus 7.3 [43], which performs well with skewed ordinal variables [44, 45] and with smaller samples [46]. Three fit indices were employed; for the comparative fit index (CFI) and the root mean square error of approximation (RMSEA) we followed the suggested cut-off values of Hu and Bentler [47] in judging adequacy of fit: >0.95 for CFI and <0.06 for RMSEA; for the weighted root mean square residual (WRMR) Yu [48] has suggested a cut-off of <1.0 under non-normality and small samples. Note that the one-dimensional and bifactor models included the six items not assigned to any of the nine subfactors. Maximum a posteriori factor scores were calculated for the bifactor model general factor.

Criterion validation

The three response sets of patients at admission, patients at discharge, and controls were compared on their SCL-90 general factor scores. As score distributions were approximately normal, Welch’s unequal variances t-test was employed (two-tailed, α = 0.05), and effect sizes were expressed with Glass’s Δ (using control/healthy variance only) and Cohen’s d (pooled variance). Similarly, diagnosed individuals were compared with non-diagnosed individuals in the combined admission and control groups. Gender effects were examined in all three response sets. Receiver operating characteristic (ROC) curves and associated area under the curve (AUC) values with non-parametric confidence intervals were computed with the pROC package [49] version 1.1-2 in R. The optimal cut-off point for discriminating between groups was determined with Youden’s J statistic [50], maximizing the sum of sensitivity and specificity. The overall discriminability at the chosen cut-offs was expressed as diagnostic odds ratios (DOR).

Results

Basic item distribution properties of SCL-90

From admission, discharge, and control sets 0.1, 0.4 and 0.2 % of SCL-90 responses were missing, respectively, with no individual having more than 30 missing responses. All models and scores were therefore estimated using all available data, assuming missingness at random. There was a strong floor effect in response distributions (item-wise skewness averaged 0.7 at admission, 1.6 at discharge, and 2.0 for controls), which in combination with the five-point response scale confirmed the necessity of employing factor analyses suitable for ordered categorical indicators.

Measurement invariance

When investigating the measurement invariance of items between patients and controls in the one-dimensional model, the iterative algorithm converged after 4 rounds, flagging 23 items for DIF, and McFadden R ² values for all items had a mean of 0.8 % and a median of 0.4 %. The highest values were observed for items 15 and 22 at 5.2 and 5.1 %. However, the total effect of the DIF of all items was small, as it was estimated to lead to 0.06 higher normalized latent scores in the patient group. Group-wise test characteristic curves and the impact of DIF are presented in Fig. 1.

When comparing admission and discharge responses of patients, the algorithm also converged after four rounds, flagging 11 items. McFadden R ² values for all items had a mean of 0.5 % and a median of 0.3 %, the highest values being 2.6, 2.5 and 2.3 % for items 32, 15, and 59, respectively. Again, the total effect of DIF was minimal, resulting in 0.04 higher scores at admission.

Optimal number of factors

The empirical number of factors using WLS and ML estimation were almost identical, and only the former results are shown, along with results for the comparison data method, in Table 1. The various indices were highly divergent, with nominated number of factors ranging from one to nine, consistent with a complex factor structure with a strong primary factor.

Table 1 Suggested number of factors by various indices

Full size table

Confirmatory factor analyses

The one-dimensional CFA models had good fit in all three subsamples (Table 2). In contrast, the fit was poor for the a priori nine-dimensional models, and latent factors were very strongly correlated; the median inter-factor correlations were 0.84, 0.88, and 0.86 for the admission, discharge, and control datasets, respectively. The bifactor models had an even better fit than the corresponding one-dimensional models in the same subsamples. However, successfully fitting the bifactor models required leaving out item 15 from the depression subfactor, as the item was almost perfectly correlated with the general factor. Fit statistics of all models are presented in Table 2, and factor loadings, thresholds, and subfactor correlations of the patient admission subsample in Table 3. Total information curves of the general factor in the three subsamples are presented in Fig. 2.

Table 2 Fit statistics for CFA models

Full size table

Table 3 Standardized thresholds and factor loadings of nine-dimensional bifactor model of patient admission responses to SCL-90

Full size table

As sufficient measurement invariance was established, maximum a posteriori factor scores for the general factor were estimated for all groups using the parameters of the patient admission bifactor model, which was the most multi-factorial of the three and had the most stable parameter estimates; the two items (15 and 22) showing a total DIF effect of over 5 % in either analysis were left out. Factor scores were standardized to set the control sample mean to zero and standard deviation to one, and are presented in Table 4. In the combined admission and control sample, the Pearson correlation between the GSI and factor scores was 0.956 and the Spearman correlation was 0.997, indicating very strong agreement with a curvilinear relationship.

Table 4 Score distributions and group comparisons

Full size table

Subscale viability

The ECV of the general factor in the bifactor analyses was 56 % for the admission sample, 76 % at discharge, and 82 % for controls. McDonald’s omega values for the general factor and subscales are shown in Table 5.

Table 5 Viability of subscales in bifactor models

Full size table

Group differences

The GSI scores by group are shown in Table 4. Using the standardized general factor scores from the bifactor model, boys had lower scores than girls in both admission (Welch test p < 0.001, Cohen’s d = 0.8; girls M = 1.7, SD = 1.2; boys M = 0.6, SD = 1.4) and control samples (p < 0.001, d = 0.6; girls M = 0.1, SD = 1.0; boys M = −0.4, SD = 1.0).

In the ROC analyses of the factor scores, adequate discrimination was found between patients at admission and discharge (AUC 72, 95 % CI [66.8, 77.4 %]) as well as between patients at admission and controls (AUC 79 % [75.5, 84.3 %]). Formulated differently, the group difference between patients at admission and controls was statistically highly significant and the effect was large (p < 0.001, Glass’s Δ = 1.4, Cohen’s d = 1.1). Patients’ scores were also significantly lower at discharge than at admission (paired test p < 0.001, d = 0.8). The optimal cut-off point to distinguish between controls and patients at admission was at θ = 1.14, approximately corresponding to a GSI of 0.99, providing 86 % specificity, 63 % sensitivity, and a DOR of 10.5. In the combined admission and control sample, individuals with and without a psychiatric diagnosis were very well separated on the general factor (AUC 83 % [80, 87 %], p < 0.001, Δ = 1.7, d = 1.3), the optimal cut-off being θ = 0.68, approximately corresponding to a GSI of 0.72 (83 % specificity, 72 % sensitivity, DOR 12.5). ROC curves are shown in Fig. 3.

Discussion

In this study we analyzed the psychometric properties of the SCL-90 questionnaire in adolescent inpatients and a community sample. We found the measurement invariance to be satisfactory between patient and control responses and between patients at admission and discharge. We also examined the dimensionality of measurement with methods intended for exploratory factor analysis and via confirmatory factor and bifactor analysis. The explained common variance was estimated for the latter. To better understand the viability of subscales, we also calculated omega-hierarchical and omega-subscale indices. Receiver operating curves were calculated in order to evaluate the SCL-90s ability to distinguish between controls and patients and between individuals with and without a psychiatric diagnosis.

Measurement invariance analyses revealed sufficient measurement invariance across patients and controls and across time points, in line with an earlier clinical and general population study of adults [51]. These findings support using all the items for the GSI or a general factor, though at least one but perhaps a few items show enough DIF in the unidimensional model to be considered for exclusion. The sample sizes were unfortunately too small to formally test structural invariance in multidimensional models.

We calculated estimates of the number of empirically found number of dimensions, which were highly divergent, and therefore limited our factor analyses to confirmatory testing of previously proposed models. The fit of the unidimensional factor model proved adequate, but the nine-factor structure of the SCL-90 proposed by the original author of the scale [8] was not supported, as it showed poor fit and very highly correlated subscales. In contrast, the bifactor model with one general factor of symptoms and the same nine specific factors yielded an excellent fit to the data in all three subsamples (patient admission, patient discharge, and controls). Similar results have been found also by Urbán et al. [29] and Thomas [52].

As in the previous study by Urbán et al. [29] with an adult sample, we observed a strong global distress factor and weaker specific symptom factors in our patient sample, while our control sample data appeared unidimensional. There are some other previous studies that have similar results among adults. For example, Paap et al. [53, 54] have also found that different populations have varying dimensionality results using Mokken scale analysis: while samples of patients with high levels of distress support multidimensionality of the SCL-90 [53], samples characterized by a low level of distress indicate unidimensionality [54]. Lastly, adolescent inpatients usually suffer from comorbid disorders, and symptomatically homogenous groups without symptoms of other mental disorders are rarely found [55], which may explain the strong unidimensionality also in our clinical sample.

The explained common variance (ECV) index reflected the same findings on dimensionality and higher level of distress in our study. In our patient admission subsample, with severe distress, the ECV of the general factor was 56 %, which means that the explained variance is approximately equally spread across general and group factors, while at discharge, the common variance explained by the general factor was 76 %, and the highest ECV was found in the control sample 82 %, which approaches unidimensionality [26]. Interestingly, in the study by Urbán et al. [29] their adult community sample had almost the same ECV index (83 %) as our adolescent controls, which implies continuity across age groups for this measurement property.

Overall, the analysis of general- and domain-specific components yielded strong support for the presence of a general factor of symptoms within the SCL-90 items and, on the other hand, gave limited evidence for the viability of the a priori multidimensional structure even in the inpatient admission sample. The specific symptom factors Phobic Anxiety (ω_s = 0.40) and Hostility (ω_s = 0.32) had the strongest, but still weak, contributions to explaining the variance of the admission responses. These same two subscales had the strongest coefficients also in the patient discharge and control samples. These two factors also stood out in the study by Urbán et al. [29], indicating that these subfactors are more independent or distinct from other subscales of the SCL-90. The weakest reliability coefficients in this study was found for the depression subscale, suggesting that the depression items in the SCL-90 measure general distress addressed by the whole questionnaire, and that the depression scale does not reflect depression specific factor of symptoms. Thus, the nine subscales demonstrated low reliability as estimated by omega subscale coefficients, showing that these subscales comprise too small amount of reliable variance to reliably interpret. The results of the present research suggest that there is limited value in using the very highly correlated SCL-90 subscale scores among adolescents, because they primarily reflect variations in general symptoms.

Summed raw scores correlated extremely well with scores on the general factor, which is expected with a large number of items and a strong general factor, and the association was stable across the score range. Sum scores can thus confidently be used as a proxy for the latent factor. In this study factor score distributions discriminated well between patient at admission, patients at discharge, and controls. The scores of the patient admission sample were clearly higher than the scores of the patient discharge, being lowest in the controls. Our community sample seemed to exhibit somewhat lower SCL-90 GSI scores than those of an Italian community sample of 15- to 19-year-old adolescents [24]. However, the profile of our sample and that of a previous Swedish community sample of adolescents under 20 years of age [13] resembled each other, showing that there may be some cultural differences in the proneness to report symptoms.

The SCL-90s screening properties as investigated with ROC analyses indicated that it adequately discriminates patients from the community sample and individuals with psychiatric diagnosis from those without, a result resembling those of earlier studies among adult patients [6, 10]. Adequate discrimination was found also between patients at admission, who have severe symptoms, and the same patients at discharge who were largely recovered but still symptomatic. This finding supports earlier studies [19] that the SCL-90 is also a sensitive tool to measure changes in symptoms. Interestingly, the overall information yielded by the questionnaire was highest at discharge, perhaps reflecting an improved ability to understand the items.

Strengths and limitations

Strengths of this study include a relatively high number of consecutive inpatients and a sample of community youth matched for age and gender. Almost identical study protocols were used in both groups, and patients were followed prospectively. Furthermore, the psychiatric diagnoses were based on highly reliable and valid K-SADS-PL interviews, supplemented by patient records. The SCL-90 is a widely used and established questionnaire in clinical practice. A limitation of our study is the relatively small participation rate in the comparison group. A partial explanation is that participants had to have written informed consent from their legal guardians, and refusals were thus not necessarily due to the approached individual’s preferences. In addition, participants were asked to take part in a five-year follow-up study, and in this context, give their permissions for researchers to acquire information from official records concerning for example their future criminal records and use of health services.

These expectations may have influenced students’ willingness to participate in the study. However, we ascertained that community sample participants and non-participants did not differ on a number of socioeconomic variables used in matching, showing that our sampling was representative in this respect. The overall sample size was also too small for testing the measurement invariance of multifactorial models.

Conclusions and clinical implications

As the confirmatory bifactor model improved on the unidimensional model in all subsamples on all fit indices, and achieved excellent fit, it can be considered a sufficient description of the data. As most subscales had a very small contribution, however, it would be interesting to perform exploratory bifactor analyses in future studies. Nevertheless, the general factor was dominant, and the SCL-90 can thus be used as a unidimensional index of psychiatric distress, also when using the raw item score average (GSI). As the subscales were poorly distinguishable from the main factor and each other, they should be considered to measure mostly general distress, and their use to assess separate symptom dimensions does not appear warranted. Among adolescents, the SCL-90 appears to be a useful screening tool as well as a valuable instrument for assessing change in average symptom levels within patient populations.

Abbreviations

AUC:: area under the curve
CFA:: confirmatory factor analysis
CFI:: comparative fit index
DIF:: differential item functioning
DOR:: diagnostic odds ratios
ECV:: explained common variance index
GSI:: global severity index
K-SADS-PL:: the Schedule for Affective Disorders and Schizophrenia for School-Age Children—Present and Lifetime
MAP:: minimum average partial correlation
ML:: maximum likelihood
PA:: parallel analysis
RMSEA:: root mean square error of approximation
ROC:: receiver operating characteristic
SCL-90:: the Symptom Checklist-90
VSS:: very simple structure
WLS:: weighted least-squares
WLSMV:: weighted least squares mean and variance adjusted
WRMR:: weighted root mean square residual

References

Kim-Cohen J, Caspi A, Moffit T, Harrington H, Milne B, Pulton R. Prior juvenile diagnoses in adults with mental disorder: developmental follow-back of a prospective-longitudinal cohort. Arch Gen Psychiatry. 2003;60:709–17.
Article PubMed Google Scholar
Kessler R, Berglund P, Demler O, Jin R, Merikangas K, Walters E. Lifetime prevalence and age-of-onset distributions of DSM-IV disorders in the national comorbidity survey replication. Arch Gen Psychiatry. 2005;62:593–602.
Article PubMed Google Scholar
Newman D, Moffit T, Caspi A, Silva P, Stanton W. Psychiatric disorder in a birth cohort of young adults; prevalence, comorbidity, clinical significance, and new case incidence from ages 11 to 21. J Consult Clin Psychol. 1996;64:552–62.
Article CAS PubMed Google Scholar
Costello J, Copeland W, Angold A. Trends in psychopathology across the adolescent years: what changes when children become adolescents, and when adolescents become adults? J Child Psychol Psychiatry. 2011;52:1015–25.
Article PubMed PubMed Central Google Scholar
Pylkkänen K. Quality guidelines for adolescent psychiatry. Psychiatr Fennica. 2013;44:89–94.
Google Scholar
Schmitz N, Hartkamp N, Kiuse J, Franke G, Reister G, Tress W. The Symptom Checklist-90-R (SCL-90-R): a German validation study. Qual Life Res. 2000;9:185–93.
Article CAS PubMed Google Scholar
Vander Stoep A, Adrian M, Rhew I, McCauley E, Herting J, Kraemer H. Identifying comorbid depression and disruptive behavior disorders: comparison of two approaches used in adolescent studies. J Psychiatr Res. 2012;46:873–81.
Article PubMed Google Scholar
Derogatis L, Lipman R, Covi L. SCL-90: an outpatient psychiatric rating scale- preliminary report. Psychopharmacology. 1973;9:13–28.
CAS Google Scholar
Olsen L, Mortensen E, Bech P. The SCL-90 and SCL-90R versions validated by item response models in a Danish community sample. Acta Psychiatr Scand. 2004;110:225–9.
Article CAS PubMed Google Scholar
Holi M. Assessment of psychiatric symptoms using the SCL-90. Academic dissertation. Finland: Department of Psychiatry: Helsinki University; 2003.
Google Scholar
Barker-Collo S. Culture and validity of the Symptom Checklist-90-revised and profile of mood states in a New Zealand student sample. Cultur Divers Ethnic Minor Psychol. 2003;2:185–96.
Article Google Scholar
Essau C. Comorbidity of anxiety disorders in adolescents. Depress Anxiety. 2003;18:1–6.
Article PubMed Google Scholar
Fridell M, Cesarec Z, Johansson M, Thorsen S. SCL-90: Svensk normering, standardisering och validering av symtomskalan. Statens institutions styrelse, Rapport nr 4/2002.
Holi M, Sammallahti P, Aalberg V. A Finnish validation study of the SCL-90. Acta Psychiatr Scand. 1998;97:42–6.
Article CAS PubMed Google Scholar
Steinberg M, Barry D, Sholomskas D, Hall P. SCL-90 symptom patterns: indicators of dissociative disorders. Bull Menninger Clin. 2005;69:237–49.
Article PubMed Google Scholar
Bjørkly S. SCL-90-R profiles in a sample of severely violent psychiatric inpatients. Aggress Behav. 2002;28:446–57.
Article Google Scholar
Bonynge E. Unidimensionality of SCL-90-R scales in adult and adolescent crisis samples. J Clin Psychol. 1993;49:212–5.
Article CAS PubMed Google Scholar
McGough J, Curry J. Utility of the SCL-90-R with depressed and conduct-disordered adolescent inpatients. J Pers Assess. 1992;59:552–653.
Article CAS PubMed Google Scholar
Prinz U, Nutzinger DO, Schulz H, Petermann F, Braukhaus C, Andreas S. Comparative psychometric analyses of the SCL-90-R and its short versions in patients with affective disorders. BMC Psychiatry. 2013;13:104. doi:10.1186/1471-244X-13-104.
Article PubMed PubMed Central Google Scholar
Nickel M, Loew T, Gil F. Aripiprazole in treatment of borderline patients, part II: an 18-month follow-up. Psychofarmacology. 2007. doi:10.1007/s00213-007-0740-0.
Google Scholar
Boon A, Boer B. Drug usage as a treat to the stability of treatment outcome. Eur Child Adolesc Psychiatry. 2007;16:79–86.
Article PubMed Google Scholar
Levy K, Becker D, Grilo C, Mattanah J, Quinlan D, Edell W, et al. Concurrent and predictive validity of the personality disorder diagnosis in adolescent inpatients. Am J Psychiatry. 1999;156:1522–8.
Article CAS PubMed Google Scholar
Dinning D, Evans R. Discriminant and convergent validity of the SCL-90 in psychiatric inpatients. J Pers Assess. 1977;41:304–11.
Article CAS PubMed Google Scholar
Miotto P, De Coppi M, Frezza D, Masala C, Preti A. Suicidal ideation and aggressiveness in school-aged youths. Psychiatr Res. 2003;120:247–55.
Article Google Scholar
Vallejo M, Jordan C, Diaz M, Comeche M, Ortega J. Psychological assessment via the internet: a reliability and validity study of online (vs paper and pencil) versions of the general health questionnaire-28 (GHQ-28) and the symptoms check-list-90—revised (SCL-90-R). J Med Internet Res. 2007;9:e2.
Article PubMed PubMed Central Google Scholar
Reise SP. The rediscovery of bifactor measurement models. Multivar Behav Res. 2012;47:667–96.
Article Google Scholar
Reise SP, Moore TM, Haviland MG. Bifactor models and rotations: exploring the extent to which multidimensional data yield univocal scale scores. J Pers Assess. 2010;92:544–59.
Article PubMed PubMed Central Google Scholar
Van De Schoot R, Schmidt P, De Beuckelaer A, Lek K, Zondervan-Zwijnenburg M. Editorial: measurement invariance. Front Psychol. 2015;6:1064. doi:10.3389/fpsyg.2015.01064.
Google Scholar
Urbán R, Kun B, Farkas J, Paksi B, Kökönyei G, Unoka Z, Felvinczi K, Oláh A, Demetrovics Z. Bifactor structural model of Symptom Checklists: SCL-90-R and brief symptom inventory (BSI) in a non-clinical community sample. Psychiatry Res. 2014;216:146–54.
Article PubMed Google Scholar
Kaufman J, Birmaher B, Brent D, Rao U, Flynn C, Moreci P, et al. Schedule for affective disorders and schizophrenia for school-age children-present and lifetime version (K-SADS-PL): initial reliability and validity data. J Am Acad Child Adolesc Psychiatry. 1997;36:980–8.
Article CAS PubMed Google Scholar
Rytilä-Manninen M, Lindberg N, Haravuori H, Kettunen K, Marttunen M, Joukamaa M, et al. Adverse childhood experiences as risk factors for serious mental disorders and inpatient hospitalization among adolescents. Child Abuse Negl. 2014;38:2021–32.
Article PubMed Google Scholar
Ambrosini P. Historical development and present status of the Schedule for Affective Disorders and Schizophrenia for School-Age Children (K-SADS). J Am Acad Child Adolesc Psychiatry. 2000. doi:10.1097/00004583-200001000-00016.
PubMed Google Scholar
Ghanizadeh A, Mohammadi M, Yazdanshenas A. Psychometric properties of the Farsi translation of the kiddie schedule for affective disorders and schizophrenia-present and lifetime version. BMC Psychiatry. 2006;6:1.
Article Google Scholar
Brazil H, Bordin I. Convergent validity of K-SADS-PL by comparison with CBCL in a Portuguese speaking outpatient population. BMC Psychiatry. 2010. doi:10.1186/1471-244X-10-83.
Google Scholar
Tuisku V, Pelkonen M, Karlsson L, Kiviruusu O, Holi M, Ruuttu T, et al. Suicidal ideation, deliberate self-harm behaviour and suicide attempts among adolescent outpatients with depressive mood disorders and comorbid axis I disorders. Eur Child Adolesc Psychiatry. 2006;15:199–206.
Article PubMed Google Scholar
Mustanoja S, Luukkonen A, Hakko H, Räsänen P, Säävälä H, Riala K. Is exposure to domestic violence and violent crime associated with bullying behaviour among underage adolescent psychiatric inpatients? Child Psychiatry Hum Dev. 2011;42:495–506.
Article PubMed Google Scholar
American Psychiatric Association. Diagnostic and statistical manual of mental disorders. 4th ed. Washington DC: American Psychiatric Association; 1994.
Google Scholar
Choi S, Gibbons L, Crane Lordif P. An R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. J Stat Softw. 2011;39:8. doi:10.18637/jss.v039.i08.
Article Google Scholar
Revelle W, Rocklin T. Very simple structure: an alternative procedure for estimating the optimal number of interpretable factors. Multivar Behav Res. 1979;14:403–14.
Article CAS Google Scholar
Velicer WF. Determining the number of components from the matrix of partial correlations. Psychometrika. 1976;41:321–7.
Article Google Scholar
Horn JL. A rationale and test for the number of factors in factor analysis. Psychometrika. 1965;30:179–85.
Article CAS PubMed Google Scholar
Ruscio J, Roche B. Determining the number of factors to retain in an exploratory factor analysis using comparison data of known factorial structure. Psychol Assess. 2012;24:282–92.
Article PubMed Google Scholar
Muthén LK, Muthén BO. Mplus user’s guide. 7th ed. Los Angeles: Muthén & Muthén; 2012.
Google Scholar
DiStefano C, Morgan GB. A comparison of diagonal weighted least squares robust estimation techniques for ordinal data. Struct Equ Model. 2014;21:425–38.
Article Google Scholar
Li CH. The performance of MLR, USLMV, and WLSMV estimation in structural regression models with ordinal variables (Doctoral dissertation); 2014.
Beauducel A, Herzberg PY. On the performance of maximum likelihood versus means and variance adjusted weighted least squares estimation in CFA. Struct Equ Model. 2006;13:186–203.
Article Google Scholar
Hu L, Bentler PM. Cutoff criteria for fit indexes in covariance structure analysis: conventional criteria versus new alternatives. Struct Equ Model. 1999;6:1–55.
Article Google Scholar
Yu CY. Evaluating cutoff criteria of model fit indices for latent variable models with binary and continuous outcomes (Doctoral dissertation); 2002.
Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J, Müller M. pROC: an open-source package for R and S + to analyze and compare ROC curves. BMC Bioinformatics. 2011;12:1. doi:10.1186/1471-2105-12-77.
Article Google Scholar
Youden WJ. Index for rating diagnostic tests. Cancer. 1950;3:32–5.
Article CAS PubMed Google Scholar
Arrindell WA, Barelds DP, Janssen IC, Buwalda FM, van der Ende J. Invariance of SCL-90-R dimensions of symptom distress in patients with peri partum pelvic pain (PPPP) syndrome. Br J Clin Psychol. 2006;45:377–91.
Article CAS PubMed Google Scholar
Thomas ML. Rewards of bridging the divide between measurement and clinical theory: demonstration of a bifactor model for the brief symptom inventory. Psychol Assess. 2012;24:101–13.
Article PubMed Google Scholar
Paap MC, Meijer RR, Van Bebber J, Pedersen G, Karterud S, Hellem FM, Haraldsen IR. A study of the dimensionality and measurement precision of the SCL-90-R using item response theory. Int J Methods Psychiatr Res. 2011;20:e39–55. doi:10.1002/mpr.347.
PubMed Google Scholar
Paap MC, Meijer RR, Cohen-Kettenis PT, Richter-Appelt H, de Cuypere G, Kreukels BP, Pedersen G, Karterud S, Malt UF, Haraldsen IR. Why the factorial structure of the SCL-90-R is unstable: comparing patient groups with different levels of psychological distress using Mokken scale analysis. Psychiatry Res. 2012;200:819–26.
Article PubMed Google Scholar
Ha C, Balderas JC, Zanarini MC, Oldham J, Sharp C. Psychiatric comorbidity in hospitalized adolescents with borderline personality disorder. J Clin Psychiatry. 2014;75:e457–64. doi:10.4088/JCP.13m08696.
Article PubMed Google Scholar

Download references

Authors’ contributions

MRM, KK, and HH collected the data. MRM wrote the initial manuscript draft. ST and MRM analyzed the data and drafted the added methods and results. All authors participated in the writing process. All authors read and approved the final manuscript.

Acknowledgements

This study was funded by the Helsinki and Uusimaa Hospital District, which, however, had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests

The authors declare that they have no competing interests.

Author information

Authors and Affiliations

Hospital District of Helsinki and Uusimaa, Kellokoski Hospital, 04500, Kellokoski, Finland
Minna Rytilä-Manninen
Adolescent Psychiatry, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
Minna Rytilä-Manninen, Henna Haravuori, Mauri Marttunen & Kirsi Kettunen
School of Health Sciences, University of Tampere, Tampere, Finland
Sari Fröjd
Department of Health, Mental Health Unit, National Institute for Health and Welfare, Helsinki, Finland
Henna Haravuori, Mauri Marttunen & Sebastian Therman
Forensic Psychiatry, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
Nina Lindberg

Authors

Minna Rytilä-Manninen
View author publications
You can also search for this author in PubMed Google Scholar
Sari Fröjd
View author publications
You can also search for this author in PubMed Google Scholar
Henna Haravuori
View author publications
You can also search for this author in PubMed Google Scholar
Nina Lindberg
View author publications
You can also search for this author in PubMed Google Scholar
Mauri Marttunen
View author publications
You can also search for this author in PubMed Google Scholar
Kirsi Kettunen
View author publications
You can also search for this author in PubMed Google Scholar
Sebastian Therman
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Minna Rytilä-Manninen.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Rytilä-Manninen, M., Fröjd, S., Haravuori, H. et al. Psychometric properties of the Symptom Checklist-90 in adolescent psychiatric inpatients and age- and gender-matched community youth. Child Adolesc Psychiatry Ment Health 10, 23 (2016). https://doi.org/10.1186/s13034-016-0111-x

Download citation

Received: 03 January 2016
Accepted: 29 June 2016
Published: 15 July 2016
DOI: https://doi.org/10.1186/s13034-016-0111-x

Psychometric properties of the Symptom Checklist-90 in adolescent psychiatric inpatients and age- and gender-matched community youth

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Participants and procedure

Inpatients

Community sample

Ethical aspects

Measures

Schedule for affective disorders and schizophrenia for school-age children—present and lifetime version (K-SADS-PL)

Scl-90

Statistical analyses

Measurement invariance

Optimal number of factors

Factor analyses

Criterion validation

Results

Basic item distribution properties of SCL-90

Measurement invariance

Optimal number of factors

Confirmatory factor analyses

Subscale viability

Group differences

Discussion

Strengths and limitations

Conclusions and clinical implications

Abbreviations

References

Authors’ contributions

Acknowledgements

Competing interests

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Child and Adolescent Psychiatry and Mental Health

Contact us