The Relationship Between Linguistic Features and Psychological States: A Quantitative Approach
Main Article Content
Abstract
Linguistic features are crucial in identifying psychological states. Prior studies have attempted to investigate the influence of psychological states on individual language usage through quantitative analysis, as well as which linguistic features can effectively indicate an individual’s psychological state. Conducting a systematic review of pertinent literature, this study initially outlines the linguistic features related to psychological states extracted quantitatively, encompassing three dimensions: vocabulary, syntax, and emotion. Subsequently, it summarizes the correlations between linguistic features across different levels and psychological states based on quantitative methods. Last, the limitations of existing research are discussed. This study contributes to a deeper understanding of the rationale, significance, and applicability of applying linguistic features in detecting psychological states and provides guidance for future research in this domain.
Article Details
The Medical Research Archives grants authors the right to publish and reproduce the unrevised contribution in whole or in part at any time and in any form for any scholarly non-commercial purpose with the condition that all publications of the contribution include a full citation to the journal as published by the Medical Research Archives.
References
[2] ALLGOOD S M,SEEDALL R B & WILLIAMS R B. Expressive writing and marital satisfaction:A Writing sample analysis [J]. Family Relations, 2020, 69(2): 380-391. https://doi.org/10.1111/fare.12416.
[3] AL-MOSAIWI M & JOHNSTONE T. In an absolute state:Elevated use of absolutist words is a marker specific to anxiety,depression,and suicidal ideation [J]. Clinical Psychological Science, 2018, 6(4):529-542. https://doi.org/10.1177/2167702617747074.
[4] ANTONIOU E,EBONGERS P & JANSEN A. The mediating role of dichotomous thinking and emotional eating in the relationship between depression and BMI [J]. Eating Behaviors, 2017, 26:55-60. https://doi.org/10.1016/j.eatbeh.2017.01.007.
[5] BARNES D H, LAWAL-SOLARIN F W & LESTER D. Letters from a suicide [J]. Death Studies, 2007, 31(7): 671–678. https://doi.org/10.1080/07481180701405212.
[6] BHATIA M S,VERMA S K & MURTY O P. Suicide notes: Psychological and clinical profile [J]. The International Journal of Psychiatry in Medicine, 2006, 36(2): 163–170. https://doi.org/10.2190/5690-CMGX-6A1C-Q28H.
[7] BIBER D. Variation across speech and writing [M]. Cambridge:Cambridge University Press, 1988.
[8] BLACKBURN K G,WANG W,PEDLER R, THOMPSON R & GONZALES D. Linguistic markers in women’s discussions on miscarriage and abortion illustrate psychological responses to their experiences [J]. Journal of Language and Social Psychology,2021, 40(3): 398–411. https://doi.org/10.1177/0261927X20965643.
[9] BOUKIL S,EL ADNANI F,CHERRAT L, et al. Deep learning algorithm for suicide sentiment prediction [A]. In M Ezziyyani (Ed.), Advanced Intelligent Systems for Sustainable Development (AI2SD’2018) (Vol. 914, pp. 261–272) [C], 2019, Springer International Publishing. https://doi.org/10.1007/978-3-030-11884-6_24.
[10] BOYD R L & SCHWARTZ H A. Natural language analysis and the psychology of verbal behavior: The past, present, and future states of the field [J]. Journal of Language and Social Psychology, 2021, 40(1):21–41.
https://doi.org/10.1177/0261927X20967028.
[11] CALZÀ L, GAGLIARDI G, ROSSINI FAVRETTI R, et al. Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia [J]. Computer Speech & Language, 2021, 65, 101113. https://doi.org/10.1016/j.csl.2020.101113.
[12] CAMBRIA E. Affective computing and sentiment analysis [J]. IEEE Intelligent Systems, 2016, 31(2):102-107.
https://doi.org/10.1109/MIS.2016.31.
[13] CHATTERJEE A,GUPTA U, CHINNAKOTLA M K,et al. Understanding emotions in text using deep learning and big data [J]. Computers in Human Behavior, 2019, 93:309–317. https://doi.org/10.1016/j.chb.2018.12.029.
[14] CHEN X, SYKORA M D, JACKSON T W,et al. What about mood swings:identifying depression on twitter with temporal measures of emotions [A]. Companion of the The Web Conference 2018 on The Web Conference 2018 - WWW ’18 [C], 2018, 1653–1660. https://doi.org/10.1145/3184558.31916244.
[15] CHENG Q, LI T M, KWOK, C-L, et al. Assessing suicide risk and emotional distress in chinese social media:A text mining and machine learning study [J]. Journal of Medical Internet Research,2017, 19(7):e243.
https://doi.org/10.2196/jmir.7276.
[16] COHN M A, MEHL M R & PENNEBAKER J W. Linguistic markers of psychological change surrounding September 11, 2001 [J]. Psychological Science, 2004, 15(10):687-693.
https://doi.org/10.1111/j.0956-7976.2004.00741.x.
[17] COPPERSMITH G, DREDZE M, HARMAN C, et al. From ADHD to SAD:Analyzing the language of mental health on Twitter through self-reported diagnoses [A]. Proceedings of the 2nd Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality [C]. Denver, Colorado. Association for Computational Linguistics, 2015, 1–10. https://doi.org/10.3115/v1/W15-1201.
[18] CROSSLEY S A & MCNAMARA D S. Understanding expert ratings of essay quality:Coh-Metrix analyses of first and second language writing [J]. International Journal of Continuing Engineering Education and Life-Long Learning, 2011, 21(2/3):170. https://doi.org/ 10.1504/IJCEELL.2011.040197.
[19] CUMMINGS L. Pragmatic disorders in the twenty-first century [A]. In L. Cummings (Ed.), Handbook of Pragmatic Language Disorders (pp. 1–22)[C]. Springer International Publishing, 2021. https://doi.org/10.1007/978-3-030-74985-9_1.
[20] D’ANDREA A, FERRI F, GRIFONI P, et al. Approaches, tools and applications for sentiment analysis implementation [J]. International Journal of Computer Applications, 2015, 125(3):26-33. https://doi.org/10.5120/ijca2015905866.
[21] DE BEER C, WARTENBURGER I, HUTTENLAUCH C & HANNE S. A systematic review on production and comprehension of linguistic prosody in people with acquired language and communication disorders resulting from unilateral brain lesions [J]. Journal of Communication Disorders, 2023, 101:106298.
https://doi.org/10.1016/j.jcomdis.2022.106298.
[22] DE CHOUDHURY M, COUNTS S & HORVITZ E. Predicting postpartum changes in emotion and behavior via social media [A]. Proceedings of the SIGCHI Conference on Human Factors in Computing Systems - CHI ’13, 3267 [C]. 2013. https://doi.org/10.1145/2470654.2466447.
[23] DEMIRAY Ç K & GENÇÖZ T. Linguistic reflections on psychotherapy:Change in usage of the first person pronoun in information structure positions [J]. Journal of Psycholinguistic Research, 2018, 47(4):959–973.
https://doi.org/10.1007/s10936-018-9569-4.
[24] DESMET B & HOSTE V. Emotion detection in suicide notes [J]. Expert Systems with Applications, 2013, 40(16):6351–6358. https://doi.org/10.1016/j.eswa.2013.05.050.
[25] DU,X. Lexical Features and psychological states: A quantitative linguistic approach [J]. Journal of Quantitative Linguistics, 2023,1-23. https://doi.org/10.1080/09296174.2023.2256211.
[26] DU X, SUN Y. Linguistic features and psychological states:A machine-learning based approach [J]. Frontiers in Psychology, 2022, 12.
[27] EICHSTAEDT J C, SMITH R J, MERCHANT R M, et al. Facebook language predicts depression in medical records [A]. Proceedings of the National Academy of Sciences [C], 2018, 115(44):11203-11208. https://doi.org/10.1073/pnas.1802331115.
[28] GRAESSER A C, MCNAMARA D S, LOUWERSE M M, et al. Coh-Metrix: Analysis of text on cohesion and language. Behavior Research Methods, Instruments, & Computers, 2004, 36(2): 193–202. https://doi.org/10.3758/BF03195564.
[29] GREGORY A. The decision to die:The psychology of the suicide note [A]. In D Canter & L Alison (Eds.), Interviewing and deception (pp. 127-156) [C]. Aldershot, UK: Ashgate, 1999.
[30] HANDELMAN L D & LESTER D. The content of suicide notes from attempters and completers [J]. Crisis, 2007, 28(2):102-104. https://doi.org/10.1027/0227-5910.28.2.102.
[31] HOLMES D, ALPERS G W, ISMAILJI T, et al. Cognitive and emotional processing in narratives of women abused by intimate partners [J]. Violence Against Women, 2007, 13(11): 1192-1205.
https://doi.org/10.1177/1077801207307801.
[32] HOMAN C, JOHAR R, LIU T, et al. Toward macro-insights for suicide prevention: Analyzing fine-grained distress at scale [A]. In P. Resnik, R. Resnik & M. Mitchell (Eds.), Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality (pp. 107-117) [C]. Association for Computational Linguistics. 2014. https://doi.org/10.3115/v1/W14-3213.
[33] HOU R,YANG J & JIANG M. A study on Chinese quantitative stylistic features and relation among different styles based on text clustering [J]. Journal of Quantitative Linguistics, 2014, 21(3): 246- 280. https://doi.org/10.1080/09296174.2014.911508.
[34] HU M and LIU B. Mining and summarizing customer reviews [A]. In Proceedings of the 10th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining (KDD '04) [C]. Association for Computing Machinery, New York, NY, USA, 2004, 168-177. https://doi.org/10.1145/1014052.1014073.
[35] JAKOVLJEV I & MILIN P. The relationship between thematic, lexical, and syntactic features of written texts and personality traits [J]. Psihologija, 2017, 50(1):67-84. https://doi.org/10.2298/PSI161012006J.
[36] JI S, YU C P, FUNG S, et al. Supervised learning for suicidal ideation detection in online user content [J]. Complexity, 2018, 1-10. https://doi.org/10.1155/2018/6157249.
[37] JONES L S, ANDERSON E, LOADES M, et al. Can linguistic analysis be used to identify whether adolescents with a chronic illness are depressed? [J]. Clinical Psychology & Psychotherapy, 2020, cpp.2417. https://doi.org/10.1002/cpp.2417.
[38] JUOLA P, MIKROS G K & VINSICK S. Correlations and potential cross-linguistic indicators of writing style [J]. Journal of Quantitative Linguistics, 2019, 26(2):146-171. https://doi.org/10.1080/09296174.2018.1458395.
[39] KIM K,CHOI S,LEE J,et al. Differences in linguistic and psychological characteristics between suicide notes and diaries [J]. The Journal of General Psychology, 2019, 146(4):391-416. https://doi.org/10.1080/00221309.2019.1590304.
[40] KIM M & CROSSLEY S A. Modeling second language writing quality: A structural equation investigation of lexical, syntactic, and cohesive features in source-based and independent writing [J]. Assessing Writing, 2018,37:39-56. https://doi.org/10.1016/j.asw.2018.03.002.
[41] KOTU V & DESHPANDE B. Predictive analytics and data mining: Concepts and practice with RapidMiner [M]. Elsevier/Morgan Kaufmann, Morgan Kaufmann is an imprint of Elsevier, 2015.
[42] LE X, LANCASHIRE I, HIRST G, et al. Longitudinal detection of dementia through lexical and syntactic changes in writing:A case study of three British novelists [J]. Literary and Linguistic Computing, 2011, 26(4):435-461. https://doi.org/10.1093/llc/fqr013.
[43] LEENAARS,A. A. Suicide notes: Predictive clues and patterns [M]. New York: Human Sciences Press, 1988.
[44] LESTER, D. Bereavement after suicide: A study of memorials on the internet [J]. OMEGA - Journal of Death and Dying, 2012, 65(3):189-194. https://doi.org/10.2190/OM.65.3.b.
[45] LITVINOVA T, ZAGOROVSKAYA O, LITVINOVA O, et al. Profiling a set of personality traits of a text’s author: A corpus-based approach [A]. In A Ronzhin, R Potapova & G Nemeth (Eds.), Speech and Computer: Proceedings of the 18th International Conference, SPECOM 2016 (pp. 555-562) [C]. Springer International Publishing.
[46] LYONS M, AKSAYLI N D & BREWER G. Mental distress and language use: Linguistic analysis of discussion forum posts [J]. Computers in Human Behavior, 2018, 87:207-211. https://doi.org/10.1016/j.chb.2018.05.035.
[47] LU X. Automatic analysis of syntactic complexity in second language writing [J]. International Journal of Corpus Linguistics, 2010,15(4):474–496. https://doi.org/10.1075/ijcl.15.4.02lu.
[48] MOHAMMAD S M, AND TURNEY P D. Crowdsourcing a word-emotion association lexicon [J]. Comput. Intell, 2013, 29:436–465. doi: 10.1111/j.1467-8640.2012. 00460.x.
[49] MORALES M R & LEVITAN R. Speech vs. text: A comparative analysis of features for depression detection systems [A]. 2016 IEEE Spoken Language Technology Workshop (SLT) [C], 2016, 136-143.
https://doi.org/10.1109/SLT.2016.7846256.
[50] ÖZCAN VURAL A & KURUOĞLU G. Nominal and verbal predicate use in schizophrenia [J]. PSYCHOLINGUISTICS, 2020, 27(2):213-228. https://doi.org/10.31470/2309-1797-2020-27-2-213-228.
[51] PENNEBAKER J W, CHUNG C K, FRAZEE J, et al. When small words foretell academic success:The case of college admissions essays [J]. PLoS ONE, 2014, 9(12):e115844. https://doi.org/10.1371/journal.pone.0115844.
[52] PENNEBAKER J W & STONE L D. Words of wisdom: Language use over the life span [J]. Journal of Personality and Social Psychology, 2003,85(2), 291–301. DOI:10.1037/0022-3514.85.2.291.
[53] PULVERMAN C S, LORENZ T A & MESTON C M. Linguistic changes in expressive writing predict psychological outcomes in women with history of childhood sexual abuse and adult sexual dysfunction [J]. Psychological Trauma: Theory, Research, Practice, and Policy, 2015, 7(1):50-57. https://doi.org/10.1037/a0036462.
[54] RAMÍREZ-ESPARZA N & CHUNG C, KACEWICZ E, et al. The psychology of word use in depression forums in English and in Spanish: testing two text analytic approaches [A]. ICWSM 2008 - Proceedings of the 2nd International Conference on Weblogs and Social Media [C]. ICWSM 2008, Seattle, WA.
[55] RUDE S, GORTNER E-M & PENNEBAKER J. Language use of depressed and depression-vulnerable college students [J]. Cognition & Emotion, 2004, 18(8):1121-1133. https://doi.org/10.1080/02699930441000030.
[56] SAILUNAZ K, DHALIWAL M, ROKNE J, et al. Emotion detection from text and speech: A survey [J]. Social Network Analysis and Mining, 2018, 8(1) :28. https://doi.org/10.1007/s13278-018-0505-2.
[57] SCHOENE A M, TURNER A, DE MEL G R, et al. Hierarchical multiscale recurrent neural networks for detecting suicide Notes [J]. IEEE Transactions on Affective Computing, 2021, 1-1.
https://doi.org/10.1109/TAFFC.2021.3057105.
[58] SHEFFLER J L, JOINER T E & SACHS-ERICSSON N J. The interpersonal and psychological impacts of COVID-19 on risk for late-life suicide [J]. The Gerontologist, 2021, 61(1):23-29. https://doi.org/10.1093/geront/gnaa103.
[59] TAUSCZIK Y R & PENNEBAKER J W. The psychological meaning of words:LIWC and computerized text analysis methods [J]. Journal of Language and Social Psychology, 2010, 29(1):24-54. https://doi.org/10.1177/0261927X09351676.
[60] TEN THIJ M, BATHINA K, RUTTER L A, et al. Depression alters the circadian pattern of online activity [J]. Scientific Reports, 2020, 10(1):17272. https://doi.org/10.1038/s41598-020-74314-3.
[61] TØLBØLL K B. Linguistic features in depression:A meta-analysis [J]. Jounal of Language Works, 2019, 4:22.
[62] TSUGAWA S, KIKUCHI Y, KISHINO F, et al. Recognizing depression from Twitter activity [A]. Proceedings of the 33rd Annual ACM Conference on Human Factors in Computing Systems - CHI ’15 [C], 2015,3187–3196.
https://doi.org/10.1145/2702123.2702280.
[63] XIAO W & SUN S. Dynamic lexical features of PhD theses across disciplines:A text mining approach [J]. Journal of Quantitative Linguistics, 2020, 27(2):114–133. https://doi.org/10.1080/09296174.2018.1531618.
[64] ZINKEN J, ZINKEN K, WILSON J C, et al. Analysis of syntax and word use to predict successful participation in guided self-help for anxiety and depression [J]. Psychiatry Research, 2010, 179(2): 181- 186. https://doi.org/10.1016/j.psychres.2010.04.011.
[65] ZÖRNIG P & ALTMANN G. A sequential activity measure for texts and speeches [J]. Glottotheory, 2016, 7(2).
https://doi.org/10.1515/glot-2016-0015.