Abstract
Background: Depression is a debilitating disorder that at present lacks a reliable biomarker to aid in diagnosis and early detection. Recent advances in computational analytic approaches have opened up new avenues in developing such a biomarker by taking advantage of the wealth of information that can be extracted from a person’s speech.
Objective: The current review provides an overview of the latest findings in the rapidly evolving field of computational language analysis for the detection of depression. We cover a wide range of both acoustic and content-related linguistic features, data types (i.e., spoken and written language), and data sources (i.e., lab settings, social media, and smartphone-based). We put special focus on the current methodological advances with regard to feature extraction and computational modeling techniques. Furthermore, we pay attention to potential hurdles in the implementation of automatic speech analysis.
Conclusion: Depressive speech is characterized by several anomalies, such as lower speech rate, less pitch variability and more self-referential speech. With current computational modeling techniques, such features can be used to detect depression with an accuracy of up to 91%. The performance of the models is optimized when machine learning techniques are implemented that suit the type and amount of data. Recent studies now work towards further optimization and generalizability of the computational language models to detect depression. Finally, privacy and ethical issues are of paramount importance to be addressed when automatic speech analysis techniques are further implemented in, for example, smartphones. Altogether, computational speech analysis is well underway towards becoming an effective diagnostic aid for depression.
Keywords: Computational speech analysis, natural language processing, machine learning, depression, biomarker, categorization, diagnosis.
[http://dx.doi.org/10.1016/j.biopsych.2012.03.015] [PMID: 22541039]
[http://dx.doi.org/10.31887/DCNS.2018.20.3/fthibaut] [PMID: 30581283]
[http://dx.doi.org/10.1093/jamiaopen/ooz054] [PMID: 32607482]
[http://dx.doi.org/10.1016/S0962-1849(05)80049-6]
[http://dx.doi.org/10.1176/ajp.154.1.4] [PMID: 8988952]
[http://dx.doi.org/10.1097/00005053-192104000-00057]
[http://dx.doi.org/10.1186/s12888-019-2300-7] [PMID: 31615470]
[http://dx.doi.org/10.1109/TAFFC.2020.3035535]
[http://dx.doi.org/10.1002/lio2.354] [PMID: 32128436]
[http://dx.doi.org/10.1016/j.schres.2014.10.032] [PMID: 25464920]
[http://dx.doi.org/10.1109/BIBM.2017.8217971]
[http://dx.doi.org/10.1109/O-COCOSDA46868.2019.9060848]
[http://dx.doi.org/10.24193/jebp.2017.1.7]
[http://dx.doi.org/10.1037/pspp0000187] [PMID: 29504797]
[http://dx.doi.org/10.1177/0261927X15589186]
[http://dx.doi.org/10.3389/fpsyg.2015.01564] [PMID: 26500601]
[http://dx.doi.org/10.1016/j.jrp.2013.01.008]
[http://dx.doi.org/10.2466/02.09.21.28.PR0.109.5.686-700] [PMID: 22238866]
[http://dx.doi.org/10.1002/cpp.2006] [PMID: 26818665]
[http://dx.doi.org/10.1037/0033-2909.128.4.638] [PMID: 12081086]
[http://dx.doi.org/10.1111/j.1745-6924.2008.00088.x] [PMID: 26158958]
[http://dx.doi.org/10.1109/EMBC.2019.8857071]
[http://dx.doi.org/10.1016/j.specom.2015.03.004]
[http://dx.doi.org/10.18653/v1/W17-3101]
[http://dx.doi.org/10.1007/978-981-10-6577-4_6]
[http://dx.doi.org/10.1007/978-3-319-56904-8_29]
[http://dx.doi.org/10.1016/j.jneuroling.2006.04.001] [PMID: 21253440]
[http://dx.doi.org/10.1016/j.jvoice.2021.06.018]
[http://dx.doi.org/10.1155/2018/6508319] [PMID: 30344616]
[http://dx.doi.org/10.1371/journal.pone.0238726] [PMID: 32915846]
[http://dx.doi.org/10.1016/j.bandc.2004.05.003] [PMID: 15380873]
[http://dx.doi.org/10.1145/2502081.2502224]
[http://dx.doi.org/10.1016/j.csl.2017.08.005]
[http://dx.doi.org/10.1109/SAI.2014.6918213]
[http://dx.doi.org/10.1039/D0CP03694C] [PMID: 32935687]
[http://dx.doi.org/10.1021/acs.estlett.9b00476]
[http://dx.doi.org/10.1007/978-3-030-18305-9_47]
[http://dx.doi.org/10.1109/ICASSP.2013.6639227]
[http://dx.doi.org/10.1109/JBHI.2019.2913590] [PMID: 31034426]
[http://dx.doi.org/10.1016/j.specom.2017.04.001]
[http://dx.doi.org/10.1109/JSTSP.2019.2955012]
[http://dx.doi.org/10.1016/j.ymeth.2018.07.007] [PMID: 30099083]
[http://dx.doi.org/10.1109/ACCESS.2020.2970496]
[http://dx.doi.org/10.1109/ICASSP.2019.8683498]
[http://dx.doi.org/10.22159/ajpcr.2018.v11s3.30042]
[http://dx.doi.org/10.1007/s42600-020-00097-1]
[http://dx.doi.org/10.1016/j.neubiorev.2018.06.008] [PMID: 29890179]
[http://dx.doi.org/10.1109/ICIINFS.2017.8300343]
[http://dx.doi.org/10.1007/s10579-018-9423-1]
[http://dx.doi.org/10.1016/j.artmed.2012.06.001] [PMID: 22771201]
[http://dx.doi.org/10.1145/2988257.2988263]
[http://dx.doi.org/10.1016/j.im.2020.103349]
[http://dx.doi.org/10.21437/Interspeech.2020-2819]
[http://dx.doi.org/10.3115/v1/W14-3207]
[http://dx.doi.org/10.3115/v1/W14-3214]
[http://dx.doi.org/10.2196/14199] [PMID: 31250832]
[http://dx.doi.org/10.1016/j.cobeha.2017.07.005]
[http://dx.doi.org/10.3390/ijerph17134752]
[http://dx.doi.org/10.1038/s41746-020-0233-7] [PMID: 32219184]
[http://dx.doi.org/10.1109/ACIIW.2017.8272609]
[http://dx.doi.org/10.1159/000450959] [PMID: 27842303]
[http://dx.doi.org/10.2196/mhealth.5284] [PMID: 27439444]
[http://dx.doi.org/10.17744/mehc.35.4.f85k258620765tj4]
[http://dx.doi.org/10.1109/ICASSP.2019.8682916]
[http://dx.doi.org/10.1109/ICASSP40776.2020.9054323]
[http://dx.doi.org/10.1109/JSTSP.2019.2949419]
[http://dx.doi.org/10.2196/22723] [PMID: 33512325]
[http://dx.doi.org/10.1136/amiajnl-2013-002605] [PMID: 25147247]
[http://dx.doi.org/10.1109/SP.2017.41]
[http://dx.doi.org/10.1038/s41537-020-0108-6] [PMID: 32686681]
[http://dx.doi.org/10.1038/s41537-020-00114-3] [PMID: 32895389]