Next Article in Journal
Mucosal Melanoma of the Hard Palate: Surgical Treatment and Reconstruction
Next Article in Special Issue
The Male Experience of Suicide Attempts and Recovery: An Interpretative Phenomenological Analysis
Previous Article in Journal
Understanding the Global Challenges to Accessing Appropriate Wheelchairs: Position Paper
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Which PHQ-9 Items Can Effectively Screen for Suicide? Machine Learning Approaches

1
Department of Psychiatry, Hanyang University Medical Center, Seoul 04763, Korea
2
Department of Nursing, College of Nursing and Health, Kongju National University, Kognju 32588, Korea
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2021, 18(7), 3339; https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph18073339
Submission received: 21 February 2021 / Revised: 21 March 2021 / Accepted: 22 March 2021 / Published: 24 March 2021
(This article belongs to the Special Issue Suicide Prevention: A Public Health Imperative)

Abstract

:
(1) Background: The Patient Health Questionnaire-9 (PHQ-9) is a tool that screens patients for depression in primary care settings. In this study, we evaluated the efficacy of PHQ-9 in evaluating suicidal ideation (2) Methods: A total of 8760 completed questionnaires collected from college students were analyzed. The PHQ-9 was scored in combination with and evaluated against four categories (PHQ-2, PHQ-8, PHQ-9, and PHQ-10). Suicidal ideations were evaluated using the Mini-International Neuropsychiatric Interview suicidality module. Analyses used suicide ideation as the dependent variable, and machine learning (ML) algorithms, k-nearest neighbors, linear discriminant analysis (LDA), and random forest. (3) Results: Random forest application using the nine items of the PHQ-9 revealed an excellent area under the curve with a value of 0.841, with 94.3% accuracy. The positive and negative predictive values were 84.95% (95% CI = 76.03–91.52) and 95.54% (95% CI = 94.42–96.48), respectively. (4) Conclusion: This study confirmed that ML algorithms using PHQ-9 in the primary care field are reliably accurate in screening individuals with suicidal ideation.

1. Introduction

Most individuals who die by suicide have visited mental health services providers within one year before death (rates of contact average 32%, range = 16–46%) [1,2]. These medical visits are an opportunity for clinicians to identify and refer patients at risk of suicide; however, patients often talk about their physical problems without discussing suicidal ideations and plans unless asked directly [3,4]. It can be easy to overlook suicide risk assessments even when the risk is high if the patient denies a suicide attempt or refuses a psychiatric evaluation.
Because depression and suicide are often associated in primary care, clinicians use depression tests as a suicide risk detection tool. However, depression screening tests were not designed to evaluate suicide-related risk factors [5]. In addition, the cost-effectiveness of the suicide risk assessment scale as a first-line screening strategy is limited because of the relatively low prevalence of suicide among primary care patients [6,7].
The Patient Health Questionnaire (PHQ-9) is an effective screening tool for major depressive disorders and is a reliable and useful evaluation tool for moderate symptoms of depression in various clinical environments [8,9,10]. This tool has been verified for screening for depression in the general population and various groups outside the primary care environment [11,12].
Item 9, the final item of PHQ-9, is generally used in research to determine the existence of suicidal ideation [13,14,15]. Therefore, item 9 is sometimes referred to as the PHQ-9 suicide question, because it specifically evaluates the frequency of passive accidents due to death or self-harm over the preceding two weeks. It is widely used as a single scale for studies on the prevalence of suicidal ideas [13,14,16,17,18]. In addition, item 9 of PHQ-9 can be used as a predictor of suicidal death or risk even when depression severity is reduced [19]. Higher scores in question 9 are better at predicting suicide risk and mortality [20]. Additionally, PHQ-9 has been used to detect patients at risk of suicide in several studies [19,21,22,23,24]. The American Heart Association Science Advisory recommends that all patients who answer “yes” to item 9 of PHQ-9 be immediately assessed for suicide [25].
However, the predictive value of PHQ-9 item 9 on suicide risk remains unclear. Additionally, some studies have argued that item 9 responses yield a high false-positive ratio by evaluating both the thought of passive death and desire for self-harm in a single-response question [13,16,20,23,24,26]. Therefore, studies are being conducted on the predictive value of item 9 of PHQ-9 for suicide or depression [19,26].
Classifying and predicting individuals with high suicide risk using PHQ-9 will assist with suicide prevention and intervention. Item 9 is widely used as a suicide screening tool; however, no studies have assessed the efficacy of PHQ-9 as a suicide screening tool using machine learning (ML) techniques. Machine learning algorithms have been proposed to improve diagnostic and prognostic accuracy and determine treatment options [27]. Various machine learning studies are focused on suicide prediction [28,29,30] and machine learning analysis has an advantage in its accuracy and scalability compared to conventional statistical approaches [29].
These can investigate a wide range of complex associations between many potential factors to generate algorithms that optimize predictions and reliably predict risk [31,32,33]. This study sought to effectively distinguish individuals with suicidal ideations by applying the latest ML algorithms to the PHQ-9 results and reveal suicide risk prediction accuracy.

2. Materials and Methods

This study used a part of the questionnaire dataset conducted at Kongju National University from 2013 to 2015. Informed written consent was obtained from all participants. In total, 8760 responses (4354 males, 4406 females) were analyzed [34]. A total of 742 (8.5%) participants were classified as a suicide group, and the remainder were classified as a control group (8018, 91.5%). This study was approved by the Kongju National University Institutional Review Board (approval No. KNU2015-38).
The Patient Health Questionnaire (PHQ) is a self-report questionnaire designed to help diagnose and detect mental disorders commonly encountered in the primary clinical setting [35]. It consists of 9 questions to help diagnose a major depressive disorder. PHQ-2 is composed of two items, depressed mood and anhedonia, which are the core symptoms among the diagnostic criteria for major depressive disorders in DSM-IV. The items score how often these feelings have occurred in the last two weeks. In this study, four versions were classified according to the combined criteria of the questionnaire items. These were named PHQ-2 using the sum of the first two questions; PHQ-8, incorporating 8 questions and excluding question 9; and PHQ-9, incorporating up to 9 questions. The sum of up to the 10th question (‘how difficult have these problems made it for you’), which is the question about difficulties due to depression symptoms, is referred to as PHQ-10 (Appendix A).
The Mini-International Neuropsychiatric Interview (MINI) was developed in the US and Europe in 1998 for the diagnosis of axis I psychiatric disorders in DSM-IV and the International Classification of Diseases 10th edition (ICD-10). It is used as a structured interview tool in multicenter clinical studies and epidemiological investigations [36]. Suicide risk is evaluated with 6 questions. In this study, participants who responded positively to questions 1−3, which are related to suicidal ideation, were classified into the suicide ideations group. In this study, the standardized Korean version 5.0 was used [37].
Responses to PHQ-9 were input into the ML algorithms to assess which questions influenced actual suicidal ideations. The algorithms used were k-nearest neighbors classification (KNN), linear discriminant analysis (LDA), and random forest classification. The k-nearest neighbors (kNN) is a non-parametric classification method that is simple and effective in many cases [38]. The kNN algorithm collects existing classes and classifies new classes based on the comparison measure. It has been used as a non-parametric technique for pattern recognition and statistical estimation [39]. Linear discriminant analysis (LDA) determines the correlation between a categorical variable and an interrelated variable. In LDA, an entity of interest is presumed to belong to only one of the groups [40]. Random forest algorithm is the grouping of tree predictors, such that each tree is influenced by the values of a random vector experimented both individually and with the same dissemination for all trees in the forest. Random forests are an operational tool in estimation. Introducing the proper type of chance makes them precise classifiers and regressors. Furthermore, random forests analyze the data by considering the strength of discrete analysts and their associations [41].
The abovementioned machine learning (ML) models are based on critical features. ML algorithms are designed to maximize their clinical significance and generalization potential. In fact, ML models have been shown to be able to predict suicide tendencies [27]. Although ML algorithms have improved the accuracy of suicide risk detection; for a number of reasons, perfect predictions using ML are not possible and result in false positives and false positives [31].
Therefore, clinical implications are important to reduce these errors and increase accuracy. We analyzed three machine learning algorithms accordingly. The target variable (y) was binary. We collected 8713 questionnaires, from which 5576 responses were used as training data, 1395 were used as validation data, and 1742 were used as test data. In LDA, 6971 people were used as train data and 1742 were used as test data. Suicide ideation with/without group ML analyses were performed using JASP v0.14.4 (Amsterdam, Netherland). The dependent variable was suicidal ideation and the independent variables were combinations of 10 items of PHQ-9. A Chi-Square test of independence and a t-test were conducted. In addition, we calculated the sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) with a 95% confidence interval (95% CI) by calculating the sum according to each version of PHQ-2, PHQ-8, PHQ-9, and PHQ-10. The calculation consisted of suicide ideation groups with a PHQ-2 cutoff score > 1, PHQ-8 cutoff score > 4, PHQ-9 cutoff score > 5, PHQ-10 cutoff score > 5.
The area under the curve (AUC) of the receiver operating curve (ROC) was measured for each algorithm. An AUC close to 1 indicates a better model. An AUC of 0.5–0.6 indicates a coincidence and 0.6–0.7 indicates a bad model, while 0.7–0.8 is worthless, 0.8–0.9 is good, and 0.9–1.0 is excellent [42]. All statistical analyses were performed using JASP v0.14.4 (Amsterdam, the Netherland) and MedCalc v19.6.1 (MedCalc Software, Mariakerke, Belgium). All p values were obtained using two-tailed tests, and p < 0.05 was considered statistically significant. To control for multiple comparisons, a Bonferroni corrected α was adopted for each item’s score (0.05/10 = 0.005).
This study aimed to differentiate the suicide ideation group using a patient health questionnaire (PHQ)-9 items by applying the latest ML algorithm models. Validation was performed using MINI as the gold standard. We used the subtotal score of PHQ-9 items, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), and area under the receiver operating characteristic curve (AUC) for comparison with the clinical diagnosis of suicide ideation using 95% confidence intervals for each measure [43].

3. Results

3.1. Experimental Results

3.1.1. General Characteristics

Of all participants, 742 (8.5%) were included in the suicidal ideation group. In the suicidal ideation group, the proportion of women was significantly higher than men (X2 = 48.61, p < 0.001, Table 1). The age difference between groups was not significant; however, the total score for each PHQ-9 item and the sum of each scoring method was significantly higher in the suicidal ideation group (Table 1).

3.1.2. PHQ Cutoff Scores

When 5 points were used as a cut-off point in the evaluation using the total score of PHQ-9, the sensitivity and specificity were 71.56% and 76.54%. The AUC was 0.817 (Table 2); however, the PPV was 22.01%. Interestingly, the PPV was low (17.9–22.01%) in all four types of evaluation using the total score. In contrast, the NPV was high (96.68–97.10%). The cut-off threshold score was 5 for the sensitivity and specificity of PHQ-9 (PPV = 22.01, NPV = 96.68).

3.1.3. PHQ Item Combination by ML Technique

The ML analysis revealed an accuracy of >90%, which was excellent in all item combinations. An accuracy of 94.4% was found in the PHQ-10 (Table 3). The AUC was 0.848 in ML with the linear discriminant technique in PHQ-8. When nine items were used, the positive and negative predictive values were 84.95% (95% CI = 76.03–91.52) and 95.54% (95% CI = 94.42–96.48), respectively, which showed good diagnostic performance.

4. Discussion

The purpose of this study was to predict and report suicidal ideations using the ML analyses of a combination of 10 items from the PHQ-9, which is the most used depression assessment tool in primary healthcare. We found differences between ML techniques; however, we confirmed that KNN, LDA, and random forest all predicted suicidal ideations at an excellent level. In particular, all techniques showed a high level of accuracy. This means that the depression screening tool can be used to predict suicide risk using ML without requiring direct confirmation of suicide ideations and attempts.
The ninth item on the PHQ-9 is used to indicate the existence of suicidal ideations. In fact, it is widely used as a single measure of the prevalence of suicidal ideations [13,14,15,16,17,18]. All ML techniques showed that PHQ-9 and PHQ-10 scored a higher accuracy than PHQ-2 and 8, which exclude item 9. The AUC analysis revealed that LDA and random forest analyses were superior (0.816–0.846). In addition, PPV and NPV were 62.71–84.95% and 93.82–95.54%, which were higher than the PPVs of PHQ-2 and 8 (37.14–69.57%). The ninth item asks patients “How often have you been bothered by thoughts that you would be better off dead or of hurting yourself in some way?” This question refers to self-harm and suicidal thoughts. Most people with suicidal ideations or self-harm respond positively to this question [16,44]. In our study, the predicted values of PHQ-9 and PHQ-10—which included this item—were more related to suicide. Therefore, all PHQ-9 items using ML, including item 9, better predict suicidal ideation than the shortened PHQ-2 and PHQ-8.
There are limitations to the single use of the ninth item of PHQ-9 as an effective suicide screening tool. Further studies are assessing which items are the best predictors of suicide risk [16,19,45,46,47]. In this study, PHQ-9 and PHQ-10 were more accurate than PHQ-2 and PHQ-8 even when using a cut-off point on the total number of PHQ questions, and without using ML. This indicated that the inclusion of PHQ-9 item 9 can predict suicidal ideations even while controlling for different statistical techniques. Our study showed similar results to a previous study that found that having positive scores on the depressive symptoms in items 1–8 was less important than item 9 when assessing suicide [19,20]. The PHQ-9 is designed to measure the severity of depression, but if it was specifically focused on suicidal ideation, its prediction of suicidal ideation might be more accurate. In particular, since this study has confirmed the possibility of predicting suicide risk through PHQ-9—including item 9, which measures suicidal ideation— suicide risk can be considered to be an indirect measurement compared with other suicide screening tools. Rather than directly assessing suicidal ideation, indirect measures such as an emotion regulation questionnaire and an anger rumination scale are better predictors of actual suicide attempts [30]. Therefore, PHQ-9— a tool commonly used in clinical settings or various groups— would be more useful and effective in discriminating thoughts about suicide indirectly than a measurement tool that directly screens for suicide in an early screening measure. A previous study sought to predict suicide risk using the Columbia-Suicide Severity Rating Scale (C-SSRS), based on the PHQ-9 cut-off value. They showed high sensitivity and specificity, but low PPV (22.0–30.4%) [45]. In this study, measurements using the cut-off value of the existing statistical method showed similar PPV values (17.90–22.01%); however, the PHQ-9 had the highest PPV (84.95%) score when using the ML technique. Because the PHQ-9 was originally developed to screen for depression and not for suicide, these results indicate that PHQ-9 and PHQ-10 would be very useful for screening suicide risk in a primary care situation.
The accurate prediction of suicide attempts requires a complex combination of hundreds of risk factors; therefore, traditional statistical techniques are not ideal for this analysis. For example, generally strong predictors—such as previous suicidal behavior, depression, despair, or male gender—also appear to be weak predictors of risk [46,47,48]. A recent meta-analysis has reported that predicting suicide attempts using previously known risk factors is not effective due to methodological limitations (AUC = 0.56–0.58). They argue that it is necessary to use a risk algorithm using ML to improve this [49]. These techniques will accurately predict suicidal behavior and actual suicide through longitudinal records of patients [31,33,34], socio-demographic information, psychopathological factors, and control for risk factors [49]. Other ML studies have shown suicidal ideation as a better potential predictor of suicidal risk than suicidal attempts [50]. Suicide ideation is considered to be an important precursor to a later attempted and completed suicide and can be considered a first step on the pathway to suicide [51]. Therefore, the strength of this study is that it analyzed the usefulness of PHQ-9 as a suicide ideation screening tool for the first steps leading to suicide attempts and planning through ML algorithms. Screening tools should be applied differently to determine whether actionable risks exist and identify risk factors that should be ignored. This will enable easy management by front-line staff. Furthermore, it should be highly sensitive to confidently rule out patients who do not show any visible risk [52]. Hence, the PHQ-9 is suitable as a screening tool in the primary care setting, which has been confirmed through this study. However, these algorithmic predictions cannot predict when these people will manifest suicidal risk. Further research is required to assess the correct therapeutic approach for those at risk [29].
There are limitations to this study. First, it is difficult to say that these results are representative of the entire population, as the survey was conducted at one university. Second, as a self-reported study, there is an inherent respondent bias. This survey was conducted for a large number of students, but it was conducted only in the form of a self-report, and we could only use the MINI model of suicidality for comparison. Third, there was no information on clinical diagnosis or psychiatric treatment because this was a study in a non-clinical group. This is a retrospective analysis using existing data; therefore, it was difficult to obtain additional information. Fourth, the MINI suicidality module has not been sufficiently verified. However, the most effective screening method for evaluating suicide is to ask directly and the first three questions of MINI would not have been efficient in evaluating suicidal ideations. Although in this study we have not considered the effect of depression, it may act as a confounding variable. Therefore, future confirmatory studies will need to be conducted on individuals with major depressive disorder.
Nevertheless, this study confirmed that analysis using ML can be used very accurately when screening for people with suicidal ideations using PHQ-9, a simple depression screening tool. The possibility of predicting and evaluating the risk using national prevalence studies or simple evaluation in the primary care field was presented. Justification for population-based screening and prevention would require both evidence that the screening predicts suicidal risk in the general population and evidence that some feasible intervention can reduce that risk [53,54]. Therefore, in the future, if clinical data are added and analyzed through long-term follow-up for a wider range of groups, the value of PHQ-9 in suicide risk assessment will increase.

5. Conclusions

This study confirmed that ML with PHQ-9 provides reliable accuracy in classifying and predicting suicidal ideations of individuals. All ML algorithms indicated that if the PHQ-9 included item 9, then the prediction accuracy was high. AUC analysis revealed that LDA and random forest analyses were superior among all ML algorithms tested. These findings can help clinicians to detect and treat high-risk suicide groups early in the primary care field.

Author Contributions

K.L. and S.K. contributed to the design of the study, interpretation of the results, and preparation of the manuscript. K.L., H.-K.L. coordinated the data acquisition. K.L. and S.K. wrote the draft manuscript. K.L., H.-K.L. and S.K. contributed to the critical revision of the article for important intellectual content. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Basic Science Research Program of the National Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2018R1D1A1B07050245), and the Technology Innovation Program (20012931) funded by the Ministry of Trade, Industry and Energy (MOTIE, Korea). This funding source had no role in the design of this study and will not have any role during its execution, analyses, interpretation of the data, or decision to submit results.

Institutional Review Board Statement

This study was approved by the Kongju National University Institutional Review Board (approval No. KNU2015-38).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author. The data are not publicly available due to ethical restrictions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. Independent variable inputted according to PHQ combinations.
Figure A1. Independent variable inputted according to PHQ combinations.
Ijerph 18 03339 g0a1
Table A1. PHQ-9 items.
Table A1. PHQ-9 items.
No.Question
item 1Little interest or pleasure in doing things
item 2Feeling down, depressed, or hopeless
item 3Trouble falling/staying asleep or sleeping too much
item 4Feeling tired or having little energy
item 5Poor appetite or overeating
item 6Feeling bad about yourself—or that you are a failure or have let yourself or your family down
item 7Trouble concentrating on things, such as reading the newspaper or watching television
item 8Moving or speaking so slowly that other people could have noticed? Or the opposite—being so fidgety or restless that you have been moving around a lot more than usual
item 9Thoughts that you would be better off dead or of hurting yourself in someway
item 10If you checked off any problems, how difficult have these problems made it for you to do your work, take care of things at home, or get along with other people?

References

  1. Gairin, I.; House, A.; Owens, D. Attendance at the accident and emergency department in the year before suicide: Retrospective study. Br. J. Psychiatry 2003, 183, 28–33. [Google Scholar] [CrossRef] [Green Version]
  2. Luoma, J.B.; Martin, C.E.; Pearson, J.L. Contact With Mental Health and Primary Care Providers Before Suicide: A Review of the Evidence. Am. J. Psychiatry 2002, 159, 909–916. [Google Scholar] [CrossRef] [PubMed]
  3. Pan, Y.-J.; Lee, M.-B.; Chiang, H.-C.; Liao, S.-C. The recognition of diagnosable psychiatric disorders in suicide cases’ last medical contacts. Gen. Hosp. Psychiatry 2009, 31, 181–184. [Google Scholar] [CrossRef]
  4. Wintersteen, M.B. Standardized Screening for Suicidal Adolescents in Primary Care. Pediatrics 2010, 125, 938–944. [Google Scholar] [CrossRef] [PubMed]
  5. Recklitis, C.J.; Lockwood, R.A.; Rothwell, M.A.; Diller, L.R. Suicidal Ideation and Attempts in Adult Survivors of Childhood Cancer. J. Clin. Oncol. 2006, 24, 3852–3857. [Google Scholar] [CrossRef] [PubMed]
  6. Callahan, C.M.; Hendrie, H.C.; Nienaber, N.A.; Tierney, W.M. Suicidal Ideation Among Older Primary Care Patients. J. Am. Geriatr. Soc. 1996, 44, 1205–1209. [Google Scholar] [CrossRef]
  7. Kim, Y.A.; Bogner, H.R.; Brown, G.K.; Gallo, J.J. Chronic medical conditions and wishes to die among older primary care patients. Int. J. Psychiatry Med. 2006, 36, 183–198. [Google Scholar] [CrossRef] [Green Version]
  8. Gilbody, S.; Richards, D.; Brealey, S.; Hewitt, C. Screening for Depression in Medical Settings with the Patient Health Questionnaire (PHQ): A Diagnostic Meta-Analysis. J. Gen. Intern. Med. 2007, 22, 1596–1602. [Google Scholar] [CrossRef] [Green Version]
  9. Valuck, R.J.; Anderson, H.O.; Libby, A.M.; Brandt, E.; Bryan, C.; Allen, R.R.; Staton, E.W.; West, D.R.; Pace, W.D. Enhancing Electronic Health Record Measurement of Depression Severity and Suicide Ideation: A Distributed Ambulatory Research in Therapeutics Network (DARTNet) Study. J. Am. Board Fam. Med. 2012, 25, 582–593. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  10. Wittkampf, K.A.; Naeije, L.; Schene, A.H.; Huyser, J.; van Weert, H.C. Diagnostic accuracy of the mood module of the Patient Health Questionnaire: A systematic review. Gen. Hosp. Psychiatry 2007, 29, 388–395. [Google Scholar] [CrossRef] [PubMed]
  11. Arroll, B.; Goodyear-Smith, F.; Crengle, S.; Gunn, J.; Kerse, N.; Fishman, T.; Falloon, K.; Hatcher, S. Validation of PHQ-2 and PHQ-9 to Screen for Major Depression in the Primary Care Population. Ann. Fam. Med. 2010, 8, 348–353. [Google Scholar] [CrossRef] [Green Version]
  12. Van Steenbergen-Weijenburg, K.M.; De Vroege, L.; Ploeger, R.R.; Brals, J.W.; Vloedbeld, M.G.; Veneman, T.F.; Roijen, L.H.-V.; Rutten, F.F.J.; Beekman, A.T.F.; Van Der Feltz-Cornelis, C.M. Validation of the PHQ-9 as a screening instrument for depression in diabetes patients in specialized outpatient clinics. BMC Health Serv. Res. 2010, 10, 1–6. [Google Scholar] [CrossRef] [Green Version]
  13. Bauer, A.M.; Chan, Y.-F.; Huang, H.; Vannoy, S.; Unützer, J. Characteristics, Management, and Depression Outcomes of Primary Care Patients Who Endorse Thoughts of Death or Suicide on the PHQ-9. J. Gen. Intern. Med. 2012, 28, 363–369. [Google Scholar] [CrossRef] [PubMed]
  14. Denneson, L.M.; Corson, K.; Helmer, D.A.; Bair, M.J.; Dobscha, S.K. Mental health utilization of new-to-care Iraq and Afghanistan Veterans following suicidal ideation assessment. Psychiatry Res. 2014, 217, 147–153. [Google Scholar] [CrossRef] [PubMed]
  15. Mackelprang, J.L.; Bombardier, C.H.; Fann, J.R.; Temkin, N.R.; Barber, J.K.; Dikmen, S.S. Rates and Predictors of Suicidal Ideation During the First Year After Traumatic Brain Injury. Am. J. Public Health 2014, 104, e100–e107. [Google Scholar] [CrossRef]
  16. Walker, J.; Hansen, C.H.; Butcher, I.; Sharma, N.; Wall, L.; Murray, G.; Sharpe, M. Thoughts of Death and Suicide Reported by Cancer Patients Who Endorsed the “Suicidal Thoughts” Item of the PHQ-9 During Routine Screening for Depression. J. Psychosom. Res. 2011, 52, 424–427. [Google Scholar] [CrossRef] [PubMed]
  17. Walker, J.; Waters, R.A.; Murray, G.; Swanson, H.; Hibberd, C.J.; Rush, R.W.; Storey, D.J.; Strong, V.A.; Fallon, M.T.; Wall, L.R.; et al. Better Off Dead: Suicidal Thoughts in Cancer Patients. J. Clin. Oncol. 2008, 26, 4725–4730. [Google Scholar] [CrossRef] [PubMed]
  18. Yawn, B.P.; Pace, W.; Wollan, P.C.; Bertram, S.; Kurland, M.; Graham, D.; Dietrich, A. Concordance of Edinburgh Postnatal Depression Scale (EPDS) and Patient Health Questionnaire (PHQ-9) to Assess Increased Risk of Depression among Postpartum Women. J. Am. Board Fam. Med. 2009, 22, 483–491. [Google Scholar] [CrossRef] [Green Version]
  19. Simon, G.E.; Rutter, C.M.; Peterson, D.; Oliver, M.; Whiteside, U.; Operskalski, B.; Ludman, E.J. Does Response on the PHQ-9 Depression Questionnaire Predict Subsequent Suicide Attempt or Suicide Death? Psychiatr. Serv. 2013, 64, 1195–1202. [Google Scholar] [CrossRef] [Green Version]
  20. Louzon, S.A.; Bossarte, R.; McCarthy, J.F.; Katz, I.R. Does Suicidal Ideation as Measured by the PHQ-9 Predict Suicide Among VA Patients? Psychiatr. Serv. 2016, 67, 517–522. [Google Scholar] [CrossRef]
  21. Britton, P.C.; Ilgen, M.A.; Rudd, M.D.; Conner, K.R. Warning signs for suicide within a week of healthcare contact in Veteran decedents. Psychiatry Res. 2012, 200, 395–399. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  22. Brown, G.K.; Beck, A.T.; Steer, R.A.; Grisham, J.R. Risk factors for suicide in psychiatric outpatients: A 20-year prospective study. J. Consult. Clin. Psychol. 2000, 68, 371–377. [Google Scholar] [CrossRef]
  23. Razykov, I.; Hudson, M.; Baron, M.; Thombs, B.D.; Canadian Scleroderma Research Group. Utility of the Patient Health Questionnaire-9 to Assess Suicide Risk in Patients With Systemic Sclerosis. Arthritis Rheum. 2013, 65, 753–758. [Google Scholar] [CrossRef]
  24. Uebelacker, L.A.; German, N.M.; Gaudiano, B.A.; Miller, I.W. Patient health questionnaire depression scale as a suicide screening instrument in depressed primary care patients: A cross-sectional study. Prim. Care Companion CNS Disord. 2011, 13, 10m01027. [Google Scholar] [CrossRef] [Green Version]
  25. Lichtman, J.H.; Bigger Jr, J.T.; Blumenthal, J.A.; Frasure-Smith, N.; Kaufmann, P.G.; Lespérance, F.o.; Mark, D.B.; Sheps, D.S.; Taylor, C.B.; Froelicher, E.S. Depression and coronary heart disease: Recommendations for screening, referral, and treatment: A science advisory from the American Heart Association Prevention Committee of the Council on Cardiovascular Nursing, Council on Clinical Cardiology, Council on Epidemiology and Prevention, and Interdisciplinary Council on Quality of Care and Outcomes Research: Endorsed by the American Psychiatric Association. Circulation 2008, 118, 1768–1775. [Google Scholar] [PubMed] [Green Version]
  26. Razykov, I.; Ziegelstein, R.C.; Whooley, M.A.; Thombs, B.D. The PHQ-9 versus the PHQ-8—Is item 9 useful for assessing suicide risk in coronary artery disease patients? Data from the Heart and Soul Study. J. Psychosom. Res. 2012, 73, 163–168. [Google Scholar] [CrossRef] [PubMed]
  27. Fazel, S.; O’Reilly, L. Machine Learning for Suicide Research–Can It Improve Risk Factor Identification? JAMA Psychiatry 2020, 77, 13–14. [Google Scholar] [CrossRef]
  28. Gradus, J.L.; King, M.W.; Galatzer-Levy, I.; Street, A.E. Gender Differences in Machine Learning Models of Trauma and Suicidal Ideation in Veterans of the Iraq and Afghanistan Wars. J. Trauma. Stress 2017, 30, 362–371. [Google Scholar] [CrossRef] [Green Version]
  29. Linthicum, K.P.; Schafer, K.M.; Ribeiro, J.D. Machine learning in suicide science: Applications and ethics. Behav. Sci. Law 2019, 37, 214–222. [Google Scholar] [CrossRef] [PubMed]
  30. Oh, J.; Yun, K.; Hwang, J.-H.; Chae, J.-H. Classification of Suicide Attempts through a Machine Learning Algorithm Based on Multiple Systemic Psychiatric Scales. Front. Psychiatry 2017, 8, 192. [Google Scholar] [CrossRef] [Green Version]
  31. Walsh, C.G.; Ribeiro, J.D.; Franklin, J.C. Predicting Risk of Suicide Attempts Over Time Through Machine Learning. Clin. Psychol. Sci. 2017, 5, 457–469. [Google Scholar] [CrossRef]
  32. Barak-Corren, Y.; Castro, V.M.; Javitt, S.; Hoffnagle, A.G.; Dai, Y.; Perlis, R.H.; Nock, M.K.; Smoller, J.W.; Reis, B.Y. Predicting Suicidal Behavior From Longitudinal Electronic Health Records. Am. J. Psychiatry 2017, 174, 154–162. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Kessler, R.C.; Warner, C.H.; Ivany, C.; Petukhova, M.V.; Rose, S.; Bromet, E.J.; Brown, M.; Cai, T.; Colpe, L.J.; Cox, K.L. Predicting suicides after psychiatric hospitalization in US Army soldiers: The Army Study to Assess Risk and Resilience in Servicemembers (Army STARRS). J. JAMA Psychiatry 2015, 72, 49–57. [Google Scholar] [CrossRef] [PubMed]
  34. Lee, K.; Lee, H.-K.; Kim, S.; Kim, S.H. The relationship between circadian typology and lifetime experiences of hypomanic symptoms. Psychiatry Res. 2021, 298, 113788. [Google Scholar] [CrossRef]
  35. Kroenke, K.; Spitzer, R.L.; Williams, J.B. The PHQ-9: Validity of a brief depression severity measure. J. Gen. Intern. Med. 2001, 16, 606–613. [Google Scholar] [CrossRef]
  36. Sheehan, D.V.; Lecrubier, Y.; Sheehan, K.H.; Amorim, P.; Janavs, J.; Weiller, E.; Hergueta, T.; Baker, R.; Dunbar, G.C. The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J. Clin. Psychiatry 1998, 59, 22–33. [Google Scholar] [PubMed]
  37. Yoo, S.; Kim, Y.; Noh, J.; Oh, K.; Kim, C.; Namkoong, K.; Chae, J.; Lee, G.; Jeon, S.; Min, K. Validity of Korean version of the mini-international neuropsychiatric interview. Anxiety Mood 2006, 2, 50–55. [Google Scholar]
  38. Hand, D.J. Principles of data mining. Drug Safety 2007, 30, 621–622. [Google Scholar] [CrossRef]
  39. Stork, D.G.; Duda, R.O.; Hart, P.E.; Stork, D. Pattern Classification; Wiley: Hoboken, NJ, USA, 2001. [Google Scholar]
  40. McLachlan, G.J. Discriminant Analysis and Statistical Pattern Recognition, 1st ed.; John Wiley & Sons Inc.: New York, NY, USA, 1992. [Google Scholar]
  41. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  42. Zhu, W.; Zeng, N.; Wang, N. Sensitivity, specificity, accuracy, associated confidence interval and ROC analysis with practical SAS implementations. In Proceedings of the NESUG proceedings: Health care and life sciences, Baltimore, MD, USA, 14–17 November 2010; Volume 19, p. 67. [Google Scholar]
  43. Glas, A.S.; Lijmer, J.G.; Prins, M.H.; Bonsel, G.J.; Bossuyt, P.M.M. The diagnostic odds ratio: A single indicator of test performance. J. Clin. Epidemiol. 2003, 56, 1129–1135. [Google Scholar] [CrossRef]
  44. Corson, K.; Gerrity, M.S.; Dobscha, S.K. Screening for depression and suicidality in a VA primary care setting: 2 items are better than 1 item. Am. J. Manag. Care 2004, 10, 839–845. [Google Scholar]
  45. Na, P.J.; Yaramala, S.R.; Kim, J.A.; Kim, H.; Goes, F.S.; Zandi, P.P.; Voort, J.L.V.; Sutor, B.; Croarkin, P.; Bobo, W.V. The PHQ-9 Item 9 based screening for suicide risk: A validation study of the Patient Health Questionnaire (PHQ)−9 Item 9 with the Columbia Suicide Severity Rating Scale (C-SSRS). J. Affect. Disord. 2018, 232, 34–40. [Google Scholar] [CrossRef] [PubMed]
  46. Huang, X.; Ribeiro, J.D.; Musacchio, K.M.; Franklin, J.C. Demographics as predictors of suicidal thoughts and behaviors: A meta-analysis. PLoS ONE 2017, 12, e0180793. [Google Scholar] [CrossRef] [Green Version]
  47. Ribeiro, J.D.; Huang, X.; Fox, K.R.; Franklin, J.C. Depression and hopelessness as risk factors for suicide ideation, attempts and death: Meta-analysis of longitudinal studies. Br. J. Psychiatry 2018, 212, 279–286. [Google Scholar] [CrossRef] [PubMed]
  48. Ribeiro, J.; Franklin, J.; Fox, K.R.; Bentley, K.; Kleiman, E.M.; Chang, B.; Nock, M.K. Self-injurious thoughts and behaviors as risk factors for future suicide ideation, attempts, and death: A meta-analysis of longitudinal studies. J. Psychol. Med. 2016, 46, 225–236. [Google Scholar] [CrossRef] [PubMed]
  49. Franklin, J.C.; Ribeiro, J.D.; Fox, K.R.; Bentley, K.H.; Kleiman, E.M.; Huang, X.; Musacchio, K.M.; Jaroszewski, A.C.; Chang, B.P.; Nock, M.K. Risk factors for suicidal thoughts and behaviors: A meta-analysis of 50 years of research. Psychol. Bull. 2017, 143, 187–232. [Google Scholar] [CrossRef]
  50. Ryu, S.; Lee, H.; Lee, D.-K.; Park, K. Use of a Machine Learning Algorithm to Predict Individuals with Suicide Ideation in the General Population. Psychiatry Investig. 2018, 15, 1030–1036. [Google Scholar] [CrossRef]
  51. Kachur, S.P.; Potter, L.B.; Powell, K.E.; Rosenberg, M.L. Suicide: Epidemiology, Prevention, and Treatment. Adolesc. Med. 1995, 6, 171–182. [Google Scholar]
  52. Horowitz, L.M.; Ballard, E.D.; Pao, M. Suicide screening in schools, primary care and emergency departments. Curr. Opin. Pediatr. 2009, 21, 620–627. [Google Scholar] [CrossRef] [Green Version]
  53. Gaynes, B.N.; West, S.L.; Ford, C.A.; Frame, P.; Klein, J.; Lohr, K.N. Screening for Suicide Risk in Adults: A Summary of the Evidence for the U.S. Preventive Services Task Force. Ann. Intern. Med. 2004, 140, 822–844. [Google Scholar] [CrossRef]
  54. Mann, J.J.; Apter, A.; Bertolote, J.; Beautrais, A.; Currier, D.; Haas, A.; Hegerl, U.; Lonnqvist, J.; Malone, K.; Marusic, A. Suicide prevention strategies: A systematic review. JAMA 2005, 294, 2064–2074. [Google Scholar] [CrossRef] [PubMed]
Table 1. Differences in response and general characteristics of each group according to suicidal ideation.
Table 1. Differences in response and general characteristics of each group according to suicidal ideation.
VariableDimensionOverall
(n = 8761)
Groups Based on Suicidal IdeationX2 or tp
No Suicidal Ideation
(n = 8019, 91.5%)
With Suicidal Ideation
(n = 742, 8.5%)
sexmale4354 (49.7%)4073 (50.8%)278 (37.5%)48.61<0.001
age 18.25 ± 8.6418.25 ± 9.0118.25 ± 2.410.010.996
each item’s scoreitem10.54 ± 0.640.47 ± 0.591.25 ± 0.7826.35<0.001
item20.59 ± 0.670.54 ± 0.631.16 ± 0.8024.90<0.001
item30.79 ± 0.850.75 ± 0.821.26 ± 0.9813.55<0.001
item40.63 ± 0.780.59 ± 0.751.09 ± 0.9716.77<0.001
item50.13 ± 0.420.11 ± 0.370.43 ± 0.7112.17<0.001
item60.86 ±0.790.81 ± 0.761.43 ± 0.8918.19<0.001
item70.38 ± 0.660.32 ± 0.581.09 ± 0.9322.26<0.001
item80.15 ± 0.430.12 ± 0.380.48 ± 0.7413.10<0.001
item90.03 ± 0.280.01 ± 0.120.55 ± 0.7220.13<0.001
item100.25 ± 0.510.21 ± 0.450.73 ± 0.7727.61<0.001
subtotal scorePHQ-21.12 ± 1.161.00 ± 1.062.40 ± 1.4126.42<0.001
PHQ-94.12 ± 3.623.70 ± 3.178.69 ± 4.8627.44<0.001
PHQ-84.06 ± 3.503.68 ± 3.148.14 ± 4.4826.51<0.001
PHQ-104.36 ± 3.903.89 ± 3.399.40 ± 5.2727.98<0.001
MINI suicidalitytotal0.69 ± 2.860.17 ± 0.826.39 ± 7.3223.16<0.001
Values were presented as mean ± SD or n (%).
Table 2. Diagnostic characteristics of suicidal ideation assessment using PHQ cutoff scores.
Table 2. Diagnostic characteristics of suicidal ideation assessment using PHQ cutoff scores.
VariableCutoffAccuracyAUCSensitivity (%)Specificity (%)PPV (%)NPV (%)
PHQ-2>10.6800.78677.36
(74.17–80.32)
67.16
(66.12–68.19)
17.90
(17.18–18.64)
96.97
(96.56–97.34)
PHQ-8>40.6820.80378.30
(75.16–81.22)
67.29
(66.25–68.31)
18.13
(17.41–18.88)
97.10
(96.69–97.47)
PHQ-9>50.7610.81771.56
(68.17–74.79)
76.54
(75.60–77.46)
22.01
(21.00–23.07)
96.68
(96.29–97.03)
PHQ-10>50.7440.82475.20
(71.93–78.27)
74.31
(73.34–75.26)
21.31
(20.40–22.26)
97.00
(96.62–97.35)
AUC: area under the receiver operative characteristic curve; PPV: positive predictive value; NPV: negative predictive value.
Table 3. Diagnostic characteristics of suicidal ideation assessment using PHQ Item combination by ML technique.
Table 3. Diagnostic characteristics of suicidal ideation assessment using PHQ Item combination by ML technique.
VariableML
Methods
Validation
Accuracy
Test
Accuracy
OOB
Accuracy
AUCSensitivity
(%)
Specificity
(%)
PPV
(%)
NPV
(%)
PHQ-2KNN0.9180.907n/a0.7004.97
(2.17–9.56)
99.43
(98.92–99.74)
47.06
(22.98–72.19)
91.13
(89.69–92.43)
Linear Discriminantn/a0.916n/a0.80330.46
(23.24–38.47)
97.36
(96.45–98.09)
52.27
(41.35–63.04)
93.65
(92.37–94.78)
random forest0.9240.9200.0700.62510.81
(6.31–16.96)
99.56
(99.1–99.82)
69.57
(47.08–86.79)
92.32
(90.96–93.54)
PHQ-8KNN0.9210.920n/a0.73816.18
(10.42–23.46)
98.43
(97.69–98.98)
46.81
(32.11–61.92)
93.22
(91.91–94.37)
Linear Discriminantn/a0.918n/a0.84834.53
(26.68–43.06)
96.79
(95.8–97.6)
48.48
(38.32–58.75)
94.41
(93.19–95.48)
random forest0.9130.9200.1310.7689.56
(5.19–15.79)
98.62
(97.92–99.13)
37.14
(21.47–55.08)
92.73
(91.39–93.93)
PHQ-9KNN0.9310.932n/a0.73631.29
(23.91–39.45)
98.99
(98.36–99.42)
74.19
(61.5–84.47)
93.94
(92.68–95.04)
Linear Discriminantn/a0.936n/a0.81641.61
(33.91–49.64)
98.92
(98.27–99.37)
79.76
(69.59–87.75)
94.28
(93.05–95.36)
random forest0.9510.9430.4090.84151.97
(43.73–60.14)
99.11
(98.51–99.51)
84.95
(76.03–91.52)
95.54
(94.42–96.48)
PHQ-10KNN0.9260.927n/a0.75027.82
(20.4–36.25)
98.51
(97.76–99.07)
62.71
(49.15–74.96)
93.82
(92.51–94.97)
Linear Discriminantn/a0.941n/a0.84643.18
(34.59–52.08)
98.65
(97.92–99.17)
74.03
(62.77–83.36)
95.12
(93.92–96.14)
random forest0.9360.9440.4240.84344.03
(35.47–52.86)
98.99
(98.33–99.43)
79.73
(68.78–88.19)
95.13
(93.93–96.15)
AUC: area under the receiver operative characteristic curve; PPV: positive predictive value; NPV: negative predictive value; n/a: not applicable.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Kim, S.; Lee, H.-K.; Lee, K. Which PHQ-9 Items Can Effectively Screen for Suicide? Machine Learning Approaches. Int. J. Environ. Res. Public Health 2021, 18, 3339. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph18073339

AMA Style

Kim S, Lee H-K, Lee K. Which PHQ-9 Items Can Effectively Screen for Suicide? Machine Learning Approaches. International Journal of Environmental Research and Public Health. 2021; 18(7):3339. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph18073339

Chicago/Turabian Style

Kim, Sunhae, Hye-Kyung Lee, and Kounseok Lee. 2021. "Which PHQ-9 Items Can Effectively Screen for Suicide? Machine Learning Approaches" International Journal of Environmental Research and Public Health 18, no. 7: 3339. https://0-doi-org.brum.beds.ac.uk/10.3390/ijerph18073339

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop