Published on in Vol 27 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/72959, first published .
What Does Text Mining of Reddit Forums Reveal About Factors Surrounding Mental Health in Singapore?

What Does Text Mining of Reddit Forums Reveal About Factors Surrounding Mental Health in Singapore?

What Does Text Mining of Reddit Forums Reveal About Factors Surrounding Mental Health in Singapore?

1Division of Computational and Data Sciences, Washington University in St. Louis, 1 Brookings Drive, St Louis, MO, United States

2Institute of Data Science, National University of Singapore, Singapore, Singapore

3Imaging Science Program, Washington University, St Louis, MO, United States

4Department of Social Work, Faculty of Arts and Social Sciences, National University of Singapore, Singapore, Singapore

Corresponding Author:

Charles Alba, MS


Text mining of mental health-related posts from Singapore-based Reddit communities uncovered 10 themes, with loneliness, access to affordable mental health care, and education challenges emerging as the most prevalent, alongside rising discussions on social isolation and emotional struggles, as well as access to mental health support and services.

J Med Internet Res 2025;27:e72959

doi:10.2196/72959

Keywords



A 2023 national survey reported a sharp rise in poor mental health among young Singaporean adults compared to 2017 [1], while another found that 46% of Singaporeans regarded mental health as the top health challenge [2], highlighting the need for effective interventions and policies.

In response, a recent white paper [3] recommended stronger policies, improved stakeholder collaboration, and greater public awareness. However, these stemmed from household surveys and interviews that struggle to capture real-time needs due to factors like stigma.

As online forums provide a complementary, timely, and open setting, this study aims to leverage Reddit forums to identify key factors and trends that shape Singapore’s mental health discourse, with the goal of helping practitioners and policymakers refine strategies and disseminate targeted recommendations.


Ethical Considerations

Our study was not subject to the Internal Review Board (IRB) federal definitions, as confirmed by Washington University’s IRB board (#202509029). Our study is not considered to meet federal definitions under the jurisdiction of an IRB and falls outside the purview of the Human Research Protect Office as the data was obtained from social media posts consistent with the Reddit terms of use applicable at the time. All the secondary data used were anonymous.

Data

Reddit posts spanning January 2015 to December 2022 were retrieved from four Singapore-based subreddits – r/askSingapore, r/singapore, r/singaporeR, and r/singaporeRaw.

Methods Used

Chain-of-thought prompting [4] first identified posts relevant to mental health. Dynamic BERTopic [5] uncovered topics and tracked their evolution over time, allowing us to observe topical shifts in discussions of mental health across different years. BERTopic embeds each post using language models, clusters semantically similar posts, and represents each cluster as keywords to form coherent topics. Finally, Linguistic Inquiry and Word Count analysis [6] generated psychological insights from posts among each identified topic. Prompts and parameters are detailed in Multimedia Appendix 1.


In total, 2,783/379,787 (0.73%) posts were identified as mental health-related. BERTopic analysis uncovered 10 distinct key topics (Table 1), which can be grouped into several categories: (1) social isolation and emotional struggles, such as loneliness (Topic 0) and relationship challenges (Topic 8); (2) access and support to services, including affordability (Topic 1) and assistance in dealing with depressive symptoms (Topic 5); (3) career and education pressures (Topics 2, 3, and 6); and (4) tragedies from personal and global events, such as assaults (Topic 7) or COVID-19 (Topic 4).

Trends-wise, Figure 1 suggests that loneliness (Topic 0) has exponentially risen since 2019, while assistance with affordable care (Topic 1) has steadily increased in the same period. The counts are normalized as(# documents in topics)t(# documents across all sub-reddits)t where t represents the year, to account for changes in Reddit’s popularity among Singaporeans over time. Conversely, discussions surrounding COVID-19 (Topic 4) declined after 2020, stemming from a return to normalcy. School-and career-related struggles (Topics 2 and 3) have fluctuated since 2015 but appear to rise since 2019.

Table 1. Topics identified through BERTopic analysis of posts classified as relevant to mental health using chain-of-thought prompting across four Singapore-based subreddits (r/askSingapore, r/singapore, r/singaporeR, and r/singaporeRaw) spanning January 2015 to December 2022.
TopicCountTopic keywordsInterpretationRepresentative textCategories
0241loneliness, like, sad, destinedLoneliness“Is it normal to feel unlovable… was previously ok w making friends w my friends… but … feeling lonely and depressed”Social Isolation & Emotional Struggles
1229imha, help, affordable, bipolarAccess to affordable mental health care“... anyone know where I can find affordable therapy in Singapore? I would prefer private therapists or NGOs, not referrals from polyclinics or IMH… I’m looking for help with anxiety …”Access & Support to Services
2136poly, gpa, secondary, freshbSchool-related strugglesHow to transfer to another secondary school … I don’t feel accepted here… My grades are mediocre… My score is miserable… I really hate my life…Career & Education Pressures
3126internship, sg, just, resignationCareer-related struggles“Anxiety and stressed about work… I get so anxious and stressed about working, whether I am able to do what I am told or also able to understand and deliver on my own… ”Career & Education Pressures
4126coronavirus, migrantc, measures, PfizerCOVID-19“Who here is feel burn out and slightly depressed because of covid...”Tragedies from Personal & Global Events
586attacks, eating, ptsd, antidepressantsSymptoms of depression“Anxiety attacks support… looking for someone to talk to about anxiety attack”Access & Support to Services
682pes, nsf, bunk, depressiondStruggles with military (national) service“Downing PES status… been in the army for about six months now… been undergoing bouts of depression… been having suicidal ideations … should i tell my commanders/MO…”Career & Education Pressures
736heard, shes, assaulted, familiesFamily assault[Traumatic personal experience of assault]Tragedies from Personal & Global Events
824breakup, attachment, unavailable, sensitiveRelationship challenges“How to deal with breakup… Just broke up with my gf recently, it hurts so bad … my mechanism with dealing with such depressing problem is actually unhealthy…”Social Isolation & Emotion Struggles
910scam, valuing, transactions, ocbceScam incidents“Just got scammed.... but too scared to go to the police to get help... I have been in a weirdly long manic episode…”Tragedies from Personal & Global Events
-11687man, help, singaporeansOutlierOutlierOutlier

aInstitute of Mental Health (IMH) is the only tertiary hospital in Singapore specializing in psychiatry.

bPoly indicates Polytechnic; Secondary indicates Secondary School.

cReferring to COVID-19 outbreaks that took place in migrant dormitories.

dPhysical employment standard (PES) is used to determine an individual’s military service vocation based their medical fitness and condition; full-time national serviceman (NSF) refers to individuals serving mandatory 2-year military service.

eOverseas-Chinese banking corporation (OCBC) is one of the largest local banks in Singapore.

Figure 1. Temporal trends of the most frequent topics (top) and topic categories (bottom) identified through BERTopic analysis of Singapore-based Reddit discussions on mental health from 2015 to 2022.

Among categories, topics related to “Social Isolation and Emotional Struggles” and “Access and Support to Services” are on the rise, with the former increasing exponentially since 2017.

Linguistic Inquiry and Word Count analysis revealed homogeneous distributions across topics in terms of psychological states and motives, with most attributes falling at or below the average levels observed in the broader Reddit corpus (Multimedia Appendix 2). “Allure” remains an exception with a divergence observed; topics like COVID-19 (Topic 4), symptoms of depression (Topic 5), and national service (Topic 6) fall substantially below the Reddit average, while other topics display slightly elevated levels.

Among the most prevalent topics, loneliness (Topic 0) and access to affordable mental health care (Topic 1) show sustained increases. Categorization of all topics (Table 1) indicates that discussions related to social isolation and emotional struggles, as well as access and support to services, have risen consistently.


Our findings reveal that loneliness, access to affordable care, and school-and career-related struggles are most prominent, with the former two on the rise. Our results can complement existing efforts, like the recently launched white paper [3], providing insights for practitioners and policymakers. Rising loneliness (Topic 0) and access to affordable services (Topic 1) highlight the need for awareness campaigns on social isolation, cost-effective treatment, technology-enabled care, and stronger family and community support. At the micro-level, addressing affordability could reduce previous findings of treatment gaps [7], while targeting loneliness could help counselors tailor interventions. Similarly, Linguistic Inquiry and Word Count analysis suggests that above-average allure across most topics may reflect societal pressures related to attractiveness.

Reddit data are limited by demographic bias and difficulty capturing cultural nuance. Yet its anonymous discussions reveal needs overlooked in surveys, remaining prone to social desirability bias.

Future research encompasses integrating this Reddit analyses with surveys and other sources for comprehensive recommendations on Singapore’s mental health needs. Our study demonstrates how digital data captures lived experiences that complement demographic trends, enabling more responsive and inclusive strategies.

Acknowledgments

This manuscript was proofread with the assistance of ChatGPT. These includes correcting any grammatical, spelling, punctuation or typological errors. The author(s) reviewed and edited the content as needed and take(s) full responsibility for the content of the publication.

This work is funded, in part by, the National University of Singapore (NUS) Development Grant and the Danforth Scholarship of the Washington University in St Louis. The funders had no involvement in the study design, data collection, analysis, interpretation, or the writing of the manuscript. Any opinions, findings, conclusions, or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National University of Singapore or Washington University in St Louis.

Data Availability

The datasets generated or analyzed during this study are publicly available in the Open Science Framework repository [8]. The codes used for analysis are available on GitHub [9].

Authors' Contributions

C.A. performed the conceptualization, methodology, software, validation, formal analysis, investigation, data curation, writing - original draft, visualization and funding acquisition. V.A. performed the visualization and writing - review & editing. A.A. performed the conceptualization and writing - review & editing. G.C. provided the conceptualization, resources and writing - review & editing.

Conflicts of Interest

None declared.

Multimedia Appendix 1

Chain-of-thought evaluation, promts and parameters, and BERTopic parameters.

DOCX File, 36 KB

Multimedia Appendix 2

Linguistic Inquiry and Word Count analysis results.

DOCX File, 3018 KB

  1. National population health survey 2023 report. Ministry of Health Singapore. 2023. URL: https://isomer-user-content.by.gov.sg/3/d93ac4ca-205c-4afc-85de-cb8eccf02923/nphs-2023-report.pdf [Accessed 2025-10-21]
  2. Ho T. Singaporeans deem mental health as the biggest health problem. Ipsos. 2023. URL: https://www.ipsos.com/en-sg/singaporeans-deem-mental-health-biggest-health-problem [Accessed 2025-01-19]
  3. Ong A, Ng J, Tan R. PROJECT HAYAT: national suicide prevention strategy white paper. SG Mental Health Matters. Sep 10, 2024. URL: https://sgmentalhealthmatters.com/suicide-prevention [Accessed 2025-02-02]
  4. Wei J, Wang X, Schuurmans D, et al. Chain-of-thought prompting elicits reasoning in large language models. 2022. Presented at: Proceedings of the 36th International Conference on Neural Information Processing Systems (NeurIPS 2022. URL: https://dl.acm.org/doi/10.5555/3600270.3602070 [Accessed 2025-10-22]
  5. Grootendorst M. BERTopic: neural topic modeling with a class-based TF-IDF procedure. arXiv. Preprint posted online on Mar 11, 2022. [CrossRef]
  6. Boyd RL, Ashokkumar A, Seraj S, Pennebaker JW. The development and psychometric properties of LIWC-22. University of Texas at Austin; Mar 31, 2022. URL: https://www.liwc.app/ [Accessed 2025-02-02]
  7. Subramaniam M, Abdin E, Vaingankar JA, et al. Minding the treatment gap: results of the Singapore Mental Health Study. Soc Psychiatry Psychiatr Epidemiol. Nov 2020;55(11):1415-1424. [CrossRef] [Medline]
  8. Alba C, Ang ABH, Abbasian V, Chung G. Data for "Trends in factors surrounding mental health in singapore: an observational text mining study from reddit forums ". Open Science Framework. Jan 15, 2025. URL: https://doi.org/10.17605/OSF.IO/XBMN5
  9. Codes for “trends in factors surrounding mental health in singapore: an observational text mining study from reddit forums”. GitHub. URL: https://github.com/cja5553/Mental_Health_SG_forums [Accessed 2025-10-22]


IRB: Internal Review Board


Edited by Amaryllis Mavragani; submitted 21.Feb.2025; peer-reviewed by Paul Matthews, Zisu Wang; final revised version received 25.Sep.2025; accepted 13.Oct.2025; published 31.Oct.2025.

Copyright

© Charles Alba, Abel Beng Heng Ang, Vahid Abbasian, Gerard Chung. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 31.Oct.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research (ISSN 1438-8871), is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.