JMIR

J Med Internet Res

Journal of Medical Internet Research

1438-8871

JMIR Publications

Toronto, Canada

v24i10e39676

36191167

10.2196/39676

Original Paper

Tracking the Impact of COVID-19 and Lockdown Policies on Public Mental Health Using Social Media: Infoveillance Study

Leung

Tiffany

Pal

Anjan

Ruiyu

Jia

Minghui Li, BA; affiliations 1, 2; https://orcid.org/0000-0003-3673-5925

Yining Hua, BA; affiliations 3, 4; https://orcid.org/0000-0001-7779-1208

Yanhui Liao, MD; affiliation 5; https://orcid.org/0000-0003-4735-3252

Li Zhou, MD, PhD; affiliations 3, 4; https://orcid.org/0000-0003-3874-4833

Xue Li, PhD; affiliations 1, 2; https://orcid.org/0000-0001-6880-2577

Ling Wang, PhD; affiliation 6; https://orcid.org/0000-0002-0059-2232

Jie Yang, PhD; affiliation 1; https://orcid.org/0000-0001-5696-363X

Department of Big Data in Health Science, School of Public Health, Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, 866 Yuhangtang Road, Hangzhou, 310058, China; 86 19157731185; 86 571 87077982; jieynlp@gmail.com

1 Department of Big Data in Health Science, School of Public Health, Center of Clinical Big Data and Analytics of The Second Affiliated Hospital, Zhejiang University School of Medicine, Hangzhou, China

2 The Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province, Hangzhou, China

3 Department of Biomedical Informatics, Harvard Medical School, Boston, MA, United States

4 Division of General Internal Medicine and Primary Care, Department of Medicine, Brigham and Women’s Hospital, Boston, MA, United States

5 Department of Psychiatry, Sir Run Run Shaw Hospital, School of Medicine, Zhejiang University, Hangzhou, China

6 Florence Nightingale Faculty of Nursing, Midwifery & Palliative Care, King’s College London, London, United Kingdom

Corresponding Author: Jie Yang jieynlp@gmail.com

10 2022

13 10 2022

24 10

e39676

18 5 2022 15 6 2022 21 7 2022 30 9 2022

©Minghui Li, Yining Hua, Yanhui Liao, Li Zhou, Xue Li, Ling Wang, Jie Yang. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 13.10.2022.

2022

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.

Background

The COVID-19 pandemic and its corresponding preventive and control measures have increased the mental burden on the public. Understanding and tracking changes in public mental status can facilitate optimizing public mental health intervention and control strategies.

Objective

This study aimed to build a social media–based pipeline that tracks changes in public mental health and to use it to understand public mental health status during the pandemic.

Methods

This study used COVID-19–related tweets posted from February 2020 to April 2022. The tweets were downloaded using unique identifiers through the Twitter application programming interface. We created a lexicon of 4 mental health problems (depression, anxiety, insomnia, and addiction) to identify mental health–related tweets and developed a dictionary for identifying health care workers. We analyzed temporal and geographic distributions of public mental health status during the pandemic and further compared distributions among health care workers versus the general public, supplemented by topic modeling on their underlying foci. Finally, we used interrupted time series analysis to examine the statewide impact of a lockdown policy on public mental health in 12 states.

Results

We extracted 4,213,005 tweets related to mental health and COVID-19 from 2,316,817 users. Of these tweets, 2,161,357 (51.3%) were related to “depression,” whereas 1,923,635 (45.66%), 225,205 (5.35%), and 150,006 (3.56%) were related to “anxiety,” “insomnia,” and “addiction,” respectively. Compared to the general public, health care workers had higher risks of all 4 types of problems (all P<.001), and they were more concerned about clinical topics, whereas the general public was more concerned about everyday issues (eg, “students’ pressure,” “panic buying,” and “fuel problems”). Finally, the lockdown policy had significant associations with public mental health in 4 out of the 12 states we studied, among which Pennsylvania showed a positive association, whereas Michigan, North Carolina, and Ohio showed the opposite (all P<.05).

Conclusions

The impact of COVID-19 and the corresponding control measures on the public’s mental status is dynamic and shows variability among different cohorts regarding disease types, occupations, and regional groups. Health agencies and policy makers should primarily focus on depression (reported by 51.3% of the tweets) and insomnia (which has had an ever-increasing trend since the beginning of the pandemic), especially among health care workers. Our pipeline tracks and analyzes public mental health changes in a timely manner, which is especially useful when primary studies and large-scale surveys are difficult to conduct.

Keywords: COVID-19; mental health; social media; Twitter; topic model; health care workers

Introduction

The global COVID-19 pandemic has drastically changed people’s daily lives since the first confirmed case in December 2019 [1]. It has led to high hospitalization and fatality rates and has negatively impacted public mental health [2,3]. Mental health problems have affected a wide range of populations during the pandemic. The causes include but are not limited to the infection and death of relatives and friends, fear of illness, isolation brought by quarantine [4,5], and stress from unemployment [6]. At the same time, specific subpopulations such as children and adolescents [7,8], students [9,10], patients with COVID-19 [11], and health care workers [12,13] are particularly vulnerable to psychological disorders during the pandemic.

Studies have pointed out that health care workers in the United States experience psychological distress, facing high levels of anxiety, depression, and burnout during the pandemic [14]. The underlying reasons could be higher risks of exposure to the virus and an overwhelming workload [15,16]. Although the mental health status of health care workers during the pandemic has been studied, existing research consists primarily of retrospective cross-sectional studies [13,14,16-19]. Therefore, it is necessary to study the dynamic characteristics of their mental status, identify general concerns, and provide timely support [20,21].

Owing to their large scale, immediacy, and comprehensive coverage, social media platforms (such as Twitter, Facebook, and Weibo) have become vital data sources for analyzing public perceptions in a timely manner when primary studies and large-scale surveys are difficult to conduct. For example, Chew et al [22] used Twitter to study misinformation during the 2009 H1N1 pandemic, and Masri et al [23] found that new case trends can be predicted 1 week ahead based on related tweets for the 2015 Zika epidemic. Similarly, numerous studies have used social media to monitor public perceptions on topics such as enforced remote work [24], vaccines [25,26], drug use [27], and mask wearing [28]. Meanwhile, Berry et al [29] showed, using both quantitative and qualitative approaches, that people are willing to discuss mental health problems on Twitter for varied reasons, including the sense of community and Twitter being a safe space for expression, coping, and empowerment. However, existing literature on public mental health during the pandemic using Twitter data [30-33] either has short study periods and small sample sizes or does not focus on subtypes of mental health problems and subgroup prevalence. More granular study designs and more comprehensive data are needed for such studies.

Finally, there is inconsistency in studying the effect of lockdown policies—one of the most highly debated topics related to mental health during the pandemic. Das et al [34] found that “state lockdown policies precede greater mental health symptoms.” In contrast, Adams-Prassl et al [35] found that “the lockdown measures lowered mental health by 0.083 standard deviations.”

To fill in these research gaps and potentially resolve the inconsistency, this study aimed to use related data from February 1, 2020—the beginning of the pandemic—to April 30, 2022, to analyze public mental status, problem types, their temporal and geographic distributions during COVID-19, as well as the effects of lockdown policies on public mental health across states (Figure S1 in Multimedia Appendix 1). In detail, we used this study to answer the 4 following research questions:

1. What types of mental health problems were the most frequent?

2. What mental health–related topics were the public the most concerned about, and how did relevant discussions change over time?

3. Are there differences in mental health concerns between the general population and health care workers?

4. How did lockdown policies impact public mental health?

To answer question 1, two mental health experts from our team curated a mental health lexicon for Twitter that categorizes related tweets into 4 common mental health problems: anxiety, depression, insomnia, and addiction. Based on this lexicon, we extracted related tweets and visualized their distributions by week and state. To answer questions 2 and 3, we built a pipeline to identify potential health care workers, used a topic model to summarize related tweets into 16 topics, and compared the topic distributions between health care workers and the general population. To answer question 4, we identified tweets related to mental health issues and compared their proportions before and after lockdown policies across different US states.

Methods

Data Collection

We downloaded COVID-19–related tweets posted from February 1, 2020, to April 30, 2022, through Twitter’s application programming interface using the unique tweet IDs provided by an open-source COVID-19 tweet database [36]. The downloaded data contained full tweet texts and the corresponding metadata, including creation time, user information, and tweet status. We further filtered out non–English-language tweets and retweets and kept 471,371,477 tweets. Our data collection process strictly followed Twitter’s privacy and data use policies. This study followed the Strengthening the Reporting of Observational Studies in Epidemiology reporting guidelines.

Ethics Approval

This study was conducted with approval by the Institutional Review Board of Zhejiang University (ZGL202201-2).

Data Preprocessing and Filtering

We removed tweets that contained URLs because such tweets often included only summaries or quotations of the original contents (169,660,346 tweets remained). A psychiatrist and a psychologist curated a mental health lexicon with 231 keywords. The keywords were categorized into 4 subgroups: anxiety, depression, insomnia, and addiction (Table S1 in Multimedia Appendix 1). We used this lexicon to extract mental health–related tweets through keyword matching against the preprocessed tweets and identified 4,460,203 tweets. To reduce the impact of spam and misinformation tweets, we removed data from users who posted more than 1000 mental health–related tweets during the study period. The final data set contained 4,213,005 tweets. Figure 1 shows an overview of the data preprocessing process.

Figure 1

Data collection and preprocessing.
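The three filtering steps described above (URL removal, lexicon matching, and the per-user cap) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' released code: the tiny `LEXICON`, the `user_id` and `text` field names, and the whole-word matching rule are all hypothetical, and the actual lexicon contains 231 keywords (Table S1 in Multimedia Appendix 1).

```python
import re
from collections import Counter

# Hypothetical mini-lexicon for illustration only; the study's lexicon
# has 231 keywords in the same 4 subgroups.
LEXICON = {
    "anxiety": ["anxiety", "worried", "panic"],
    "depression": ["depressed", "hopeless"],
    "insomnia": ["insomnia", "can't sleep"],
    "addiction": ["addicted", "binge drinking"],
}

URL_PATTERN = re.compile(r"https?://\S+|www\.\S+", re.IGNORECASE)

# One compiled pattern per subgroup, matching whole words or phrases
# (an assumed matching rule; the source only says "keyword matching").
CATEGORY_PATTERNS = {
    category: re.compile(
        r"(?<!\w)(?:" + "|".join(re.escape(k) for k in keywords) + r")(?!\w)",
        re.IGNORECASE,
    )
    for category, keywords in LEXICON.items()
}


def match_categories(text):
    """Return the set of subgroups whose keywords appear in the text."""
    return {c for c, p in CATEGORY_PATTERNS.items() if p.search(text)}


def filter_tweets(tweets, max_per_user=1000):
    """Filter a list of dicts that carry 'user_id' and 'text' keys."""
    # Step 1: drop tweets that contain URLs.
    no_urls = [t for t in tweets if not URL_PATTERN.search(t["text"])]
    # Step 2: keep tweets that match at least one subgroup.
    matched = []
    for t in no_urls:
        categories = match_categories(t["text"])
        if categories:
            matched.append({**t, "categories": categories})
    # Step 3: drop all tweets from users above the posting threshold.
    per_user = Counter(t["user_id"] for t in matched)
    return [t for t in matched if per_user[t["user_id"]] <= max_per_user]
```

Because a tweet can match several subgroups at once, each kept tweet carries a set of categories rather than a single label.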

Geographic Information Extraction

The geographic information of users was collected from 2 fields of the tweets: (1) the “place” field in tweet metadata and (2) the “location” variable nested in the “user” field of tweet metadata. The “place” information was chosen as the primary evidence of the users’ geographic information, since it is generated from GPS data and is, therefore, more accurate than the information from the self-reported “location” field. We used a list of US state names to extract users’ geographic information (“Methods” in Multimedia Appendix 1 [37-39]). Tweets from users associated with more than 1 state were removed in this step.
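The priority rule described above (use the “place” field first, fall back to the self-reported “location,” and drop users linked to more than 1 state) could look roughly like the sketch below. It assumes tweets are dicts shaped like Twitter API objects with a `place.full_name` string and a `user.location` string; the 4-state table and the matching rules are hypothetical stand-ins for the full state list in Multimedia Appendix 1.

```python
import re
from collections import defaultdict

# Hypothetical subset of the state list, for illustration only.
STATE_ABBREVIATIONS = {
    "Michigan": "MI",
    "North Carolina": "NC",
    "Ohio": "OH",
    "Pennsylvania": "PA",
}


def find_states(text):
    """Return the set of states named in a free-text location string."""
    found = set()
    if not text:
        return found
    for name, abbr in STATE_ABBREVIATIONS.items():
        # Full state name, matched as a whole word or phrase.
        if re.search(r"(?<!\w)" + re.escape(name) + r"(?!\w)", text, re.IGNORECASE):
            found.add(name)
        # Abbreviation only when it follows a comma, as in "Columbus, OH".
        elif re.search(r",\s*" + abbr + r"(?!\w)", text):
            found.add(name)
    return found


def tweet_states(tweet):
    """Prefer the 'place' field; fall back to the user's 'location'."""
    place = tweet.get("place") or {}
    states = find_states(place.get("full_name"))
    if states:
        return states
    user = tweet.get("user") or {}
    return find_states(user.get("location"))


def assign_states(tweets):
    """Keep tweets from users linked to exactly 1 state and label them."""
    per_user = defaultdict(set)
    for t in tweets:
        per_user[t["user"]["id"]] |= tweet_states(t)
    located = []
    for t in tweets:
        states = per_user[t["user"]["id"]]
        if len(states) == 1:
            located.append({**t, "state": next(iter(states))})
    return located
```

Pooling states at the user level is one reading of “users associated with more than 1 state”; a per-tweet rule would be an equally plausible alternative.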

Topic Model Analysis

The Latent Dirichlet Allocation model [39] was used to summarize the main topics of mental health–related tweets. To create the corpora for topic modeling, we removed all stop words [40] as well as numbers and symbols. The topic model was implemented using the LdaModel function of the Gensim package [40]. We selected the number of topics, a model hyperparameter, based on perplexity and topic coherence (“Methods” in Multimedia Appendix 1 [37-39]).
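For readers unfamiliar with the perplexity criterion mentioned above, the standard definition for topic models is given below. This is the textbook form and is offered as background; the exact computation used in the study is described in Multimedia Appendix 1 and may differ in detail. Here $M$ is the number of held-out tweets, $\mathbf{w}_d$ the words of tweet $d$, and $N_d$ its length in tokens.

```latex
\mathrm{perplexity}(D_{\text{held-out}})
  = \exp\!\left( - \frac{\sum_{d=1}^{M} \log p(\mathbf{w}_d)}{\sum_{d=1}^{M} N_d} \right)
```

Lower perplexity indicates that the model assigns higher probability to unseen text. Because perplexity often keeps decreasing as topics are added, it is usually weighed against a coherence score, which rewards topics whose top words tend to co-occur.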

Health Care Worker Identification

To identify health care workers, we built a health care worker identification lexicon, whose keywords can be roughly divided into 3 groups: occupation, degree, and the title of the association (“Methods” in Multimedia Appendix 1 [37-39]). The dictionary contained 47 keywords, such as “doctor,” “MD,” “Doctor of Medicine,” “FACP,” etc (Table S2 in Multimedia Appendix 1). We used this lexicon to filter the user’s description and extracted 49,307 tweets from health care workers.
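Filtering user descriptions with such a lexicon might be sketched as follows. The 4 keywords are the examples quoted above; the remaining 43 are in Table S2 in Multimedia Appendix 1. The matching rules (whole words only, case-sensitive for all-uppercase abbreviations) and the `user.description` field layout are assumptions made for illustration, not details confirmed by the source.

```python
import re

# Example keywords quoted in the text; the full dictionary has 47 entries.
HCW_KEYWORDS = ["doctor", "MD", "Doctor of Medicine", "FACP"]


def _compile(keyword):
    # Assumed rule: abbreviations such as "MD" must match case exactly,
    # so that "Md." (Maryland) is not treated as a medical degree.
    flags = 0 if keyword.isupper() else re.IGNORECASE
    return re.compile(r"(?<!\w)" + re.escape(keyword) + r"(?!\w)", flags)


HCW_PATTERNS = [_compile(k) for k in HCW_KEYWORDS]


def is_health_care_worker(description):
    """Return True if a profile description contains a lexicon keyword."""
    if not description:
        return False
    return any(p.search(description) for p in HCW_PATTERNS)


def health_care_worker_tweets(tweets):
    """Keep tweets whose author's description matches the lexicon."""
    return [
        t for t in tweets
        if is_health_care_worker((t.get("user") or {}).get("description"))
    ]
```

Keyword matching of this kind will still produce false positives (for example, a fan account mentioning a fictional doctor), so the resulting group is best read as potential health care workers.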

Statistical Analysis

We applied standard descriptive statistics, including medians and IQRs, to summarize the proportions of the 4 types of mental health–related tweets. The Wilcoxon matched-pairs signed-ranks test was used to compare differences between health care workers and the general population. Interrupted time series analysis [41] was applied to analyze the lockdown policy’s effects on public mental health (see detailed information in “Methods” in Multimedia Appendix 1 [37-39]). We used Python software (version 3.8) to conduct the statistical analyses and set the threshold for statistical significance at a P value of .05.

Results

Collected Data Set

Data preprocessing yielded 4,213,005 mental health–related tweets from 2,316,817 users (Figure 1). Among these tweets, 2,161,357 (51.3%) were in the “depression” group, 1,923,635 (45.66%) were in the “anxiety” group, 225,205 (5.35%) were in the “insomnia” group, and 150,006 (3.56%) were in the “addiction” group. The sum of the 4 proportions was larger than 100% because some tweets included keywords that belong to more than one mental health subgroup. Additionally, geographic information was extracted for 789,967 (18.75%) tweets, and health care workers posted 49,307 (1.17%) tweets (from 21,963 users).

Temporal Distribution of Mental Health–Related Tweets

The trends of the weekly numbers of new COVID-19 cases and mental health–related tweets in the 4 subgroups are shown in Figure S2 in Multimedia Appendix 1. The number of tweets about mental health problems reached its first peak from February 29 to April 4, 2020. We calculated the proportions of mental health–related tweets among all COVID-19–related tweets and visualized them in Figure 2. The proportion curve of anxiety-related tweets had 3 dominant peaks in March 2020, October 2020, and September 2021. The curve of insomnia-related tweets continually increased during the study period, whereas no specific trends were observed in the curves of depression and addiction.

Figure 2

Trends of 4 types of mental health symptom–related tweets by the proportion of tweets.
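The proportion plotted in such a curve is, for each week, the number of mental health–related tweets divided by the number of all COVID-19–related tweets in that week. A minimal sketch of this calculation is shown below; the function names are hypothetical, and the Monday week start is an assumption because the source does not state how weeks were delimited.

```python
from collections import Counter
from datetime import date, timedelta


def week_start(day):
    """Return the Monday of the week that contains the given date."""
    return day - timedelta(days=day.weekday())


def weekly_proportions(all_dates, mental_health_dates):
    """Map each week to (mental health tweets) / (all COVID-19 tweets).

    all_dates: posting dates of every COVID-19-related tweet.
    mental_health_dates: posting dates of the lexicon-matched subset.
    """
    totals = Counter(week_start(d) for d in all_dates)
    hits = Counter(week_start(d) for d in mental_health_dates)
    return {week: hits[week] / totals[week] for week in sorted(totals)}
```

Normalizing by the weekly total separates changes in mental health discussion from changes in overall COVID-19 tweet volume, which is why the proportion curves can differ in shape from the raw counts.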

Geographic Distribution of Mental Health–Related Tweets in the United States

Figure 3 shows the proportion of mental health–related tweets among all COVID-19–related tweets in each US state from February 1, 2020, to April 30, 2022, and visualizes the monthly tweet proportion for all the 50 US states (concrete proportions and 95% CIs are listed in Multimedia Appendix 2). Vermont, Oregon, and Utah were the 3 states with the highest proportions of mental health–related tweets, whereas Mississippi, Hawaii, and Louisiana had the lowest proportions. The first 2 months had a more substantial proportion of mental health–related tweets than the following months across most states.

Figure 3

Proportion distribution of mental health–related tweets in the United States.

Topics of Mental Health–Related Tweets

The most frequent terms in mental health–related tweets included “people,” “worried,” “shame,” “panic,” “lockdown,” “anxiety,” and “mask” (Figure S3 in Multimedia Appendix 1). We chose 16 as the number of topics based on perplexity and coherence (“Methods” and Figure S4 in Multimedia Appendix 1 [37-39]). Topics and the corresponding top 20 most probable unigrams and bigrams are displayed in Table S3 in Multimedia Appendix 1. We assigned each topic a name based on its keywords. For example, a topic having the keywords “college,” “student,” “stress,” and “exam” indicates that tweets on this topic were likely to have been focused on “students’ pressure.” Apart from issues related to COVID-19 itself, such as “COVID-19 news,” “test results,” and “mask wearing,” the public also showed particular interest in topics such as “economic collapse,” “panic buying,” and “fuel problems.” The 16 topics were then categorized into 6 topic groups: “COVID-19 pandemic,” “preventive measures,” “economic,” “people,” “education,” and “mental health.” Figure 4 shows the dynamic distributions of the investigated topics in relative tweet proportions. The topic “lockdown days” occupied a dominant position for most of the pandemic. “COVID-19 news” was frequently mentioned at the beginning of the pandemic but returned to an average level after June 2020. The proportion of the topic “panic buying” fluctuated notably over the study period and was relatively large from February to March 2020 and from August to October 2021.

Figure 4

Dynamic characteristics of topic proportions.

Mental Health of Health Care Workers

We assessed the differences in the proportions of 4 mental health symptom–related tweets between health care workers and the general population and showed the results in Table 1. Statistical results showed that the proportions of anxiety-, depression-, insomnia-, and addiction-related tweets were significantly higher in health care workers than in the general public (all P<.001). Figure 5A shows the average number of tweets per user on different topics. “Lockdown days” is the top topic discussed by both health care workers and the general population. To visualize the difference in topic distribution between health care workers and the general population, we visualized the ratios of the average number of tweets by topic for the 2 groups in Figure 5B. It demonstrates that health care workers discussed more on 13 topics, especially clinical-related topics such as “hospital situations,” “COVID-19 symptoms,” and “mask wearing.” Conversely, the general population focused on topics such as “fuel problems,” “students’ pressure,” and “panic buying.”

Table 1

Comparison of proportions of mental health–related tweets between health care workers and the general population.

Mental health symptom	Health care workers (% tweets), median (IQR^a)	General population (% tweets), median (IQR^a)	W	P value
Anxiety	1.103 (1.02-1.187)	1.025 (0.956-1.094)	2120	<.001
Depression	1.519 (1.396-1.642)	1.255 (1.171-1.339)	26	<.001
Insomnia	0.251 (0.175-0.328)	0.131 (0.093-0.17)	7	<.001
Addiction	0.139 (0.114-0.164)	0.086 (0.079-0.094)	185	<.001

^aIQR and Wilcoxon matched-pairs signed-ranks test were applied to compare the differences between the 2 groups.

Figure 5

The distribution of tweets in topics for health care workers and the general population. (A) Average number of tweets per user in each topic. (B) Logarithmic ratio of the average number of tweets between health care workers and the general population on each topic. The ratio equals the average number of tweets per user among health care workers divided by the average number of tweets among the general population.

Impacts of Lockdown Policies

We selected 12 states with more than 20,000 related tweets during the study period to explore the effect of lockdown policies on public mental status. We report the significant results found in Michigan, Pennsylvania, North Carolina, and Ohio (analysis results of the other 8 states are displayed in Figure S5 in Multimedia Appendix 1). Sensitivity analysis was applied to verify the stability of the results (Table S4 in Multimedia Appendix 1). Figure 6 shows that the proportions of the 4 types of mental health–related tweets changed after the lockdown policy in Pennsylvania but not in the other 3 states. Table 2 lists the results of the interrupted time series analyses [41] of the lockdown policy on public mental health. The coefficient of “policy,” meaning the change of intercept, was significant in the model of Pennsylvania (P=.007), and the coefficient of the interaction term indicated that the change of slope was significant in the models of both Michigan (P=.03) and Pennsylvania (P=.04).

Figure 6

Daily proportion of mental health–related tweets before and after lockdown policies.

Table 2

The impact of lockdown policies on public mental health.

State	Date	Intercept	P value	Time^a	P value	Policy^b	P value	Time*policy^c	P value	F statistic	P value
Michigan	March 24, 2020	0.0528	<.001	–0.0021	.003	–0.0214	.17	0.002	.03	4.669	.009
North Carolina	March 30, 2020	0.0461	<.001	–0.0015	.04	–0.0228	.16	0.0017	.08	2.509	.08
Ohio	March 23, 2020	0.0429	<.001	–0.0013	.03	–0.0117	.39	0.0012	.14	2.078	.13
Pennsylvania	April 1, 2020	0.0254	<.001	0.0002	.63	0.0288	.007	–0.0012	.04	3.033	.046

^aTime: a continuous variable encoding the number of days in the research period (15 days before and after lockdown).

^bPolicy: a binary variable, encoded as 0 before the lockdown policy and 1 after the policy.

^cTime*policy: the interaction term of time and policy.
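The columns of Table 2 correspond to the terms of a segmented regression, the usual model behind an interrupted time series analysis. The form below is a sketch inferred from the table columns and footnotes; the exact specification (for example, how time is coded or whether autocorrelation is modeled) is given in Multimedia Appendix 1 and is not reproduced here. $Y_t$ denotes the daily proportion of mental health–related tweets.

```latex
Y_t = \beta_0 + \beta_1\,\mathrm{time}_t + \beta_2\,\mathrm{policy}_t
      + \beta_3\,(\mathrm{time}_t \times \mathrm{policy}_t) + \varepsilon_t

% Before the lockdown (policy_t = 0):
\mathbb{E}[Y_t] = \beta_0 + \beta_1\,\mathrm{time}_t

% After the lockdown (policy_t = 1):
\mathbb{E}[Y_t] = (\beta_0 + \beta_2) + (\beta_1 + \beta_3)\,\mathrm{time}_t
```

Under this reading, $\beta_2$ is the change of intercept and $\beta_3$ the change of slope. Using the Table 2 estimates, the post-lockdown slope is $0.0002 + (-0.0012) = -0.0010$ per day in Pennsylvania and $-0.0021 + 0.0020 = -0.0001$ per day in Michigan, so the pre-lockdown decline in Michigan nearly flattened, whereas the trend in Pennsylvania turned from roughly flat to declining after an upward shift in level.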

Discussion

Principal Findings

We investigated public mental status over more than 2 years (February 2020 to April 2022) since the beginning of the pandemic by analyzing topics of Twitter discussions, examining potential differences between health care workers and the general population, and studying the impacts of statewide lockdown policies. We found that anxiety and depression problems were frequently mentioned on Twitter during the study period, and the proportion of insomnia discussions increased continuously. The content analysis of mental health–related tweets revealed potential reasons, including control measures, economic collapse, and pressure from unemployment. Based on Twitter mentions, we found that all 4 mental health problems studied in this paper (addiction, anxiety, depression, and insomnia) were significantly more prevalent among health care workers than in the general population. Finally, lockdown policies had different influences on public mental health status in different states. Among the 12 states studied, the negative effect of lockdown policies on public mental health was significant in Pennsylvania but not in the other states.

Comparison to Prior Works

Consistent with research on similar topics, we found that COVID-19 has severely impacted public mental health and that its influence is dynamic [30,42]. In addition, we found that the proportion of anxiety-related tweets increased to a substantial peak in March 2020 and then remained low but stable for several months. A possible explanation is that the outbreak of COVID-19 caused various social problems, such as the shortage of necessities and unemployment, in the initial stage. These problems raised an intense but temporary public fear. As the pandemic continued, public concerns fell to normal levels as the early-stage issues were mitigated. Another possible explanation is that public emotional response diminishes as the pandemic intensifies, which is consistent with findings from Dyer and Kolic [43]. The remaining 2 peaks of anxiety-related tweets occurred during the presidential election (November 2020) and the fuel price surge (September 2021). The proportion of insomnia also increased during the study period. This observation is consistent with Shi et al [44], who reported a higher prevalence of insomnia in the follow-up period (from July 8 to August 8, 2020) than in the baseline period (from February 28 to March 11, 2020).

The topic analysis shows that the public was concerned about the pandemic, its prevention, and the economic and educational problems caused by COVID-19. Topics such as “social distancing,” “test results,” “world pandemic,” “COVID-19 news,” and “economic collapse” were observed both in our work and in previous studies [32,45-49], which only analyzed tweets during the early stage of the pandemic (mainly from January to August 2020). With its longer study period, our study found 2 additional topics: “fuel problems” and “students’ pressure.” These topics correspond to the literature and observations: students (especially children and adolescents) are more vulnerable to psychological disorders [50], and fuel prices frequently fluctuated during COVID-19 [51].

Unlike previous studies that only compare the prevalence of mental health symptoms between health care workers and the general population [52], we also analyzed the topics they focused on. We confirmed that health care workers were more concerned by all the studied mental problems: anxiety, depression, insomnia, and addiction. Particularly, higher proportions of insomnia among health care workers have been extensively reported in the literature [53-57]. These increased problems may be attributed to higher risks of infection [15] and more intense environmental pressure (eg, increased workload, lack of medical supplies, etc) that they face. Health care professionals were more focused on discussing the virus and more interested in sharing news or experiences related to the pandemic, demonstrating a high level of concern about the pandemic, which may be associated with an increased rate of mental disorders.

Lockdown policies had various effects on mental health discussions across US states. In Pennsylvania, they showed a positive association with mental health discussions. However, an opposite association was observed in Michigan, North Carolina, and Ohio. The literature also suggests geographically different associations between local lockdown policies and public mental health. For example, Mittal et al [58] found that most Twitter users shared positive opinions toward lockdown policies in related tweets from March 22 to April 6, 2020, whereas another study focusing on Twitter users in Massachusetts found increased anxiety expression after the enforcement of the Massachusetts State of Emergency and US State of Emergency [59]. Notably, Wang et al [60] found that public sentiment toward lockdown policies was positive in most states (such as Michigan, North Carolina, and Pennsylvania) and negative in only a few states, including Ohio, which also demonstrates geographic variations of public reactions to lockdown policies.

Strengths and Limitations

Previous work on the same topic has either not focused on the subtypes of mental health problems or studied them over short periods. Our work fills these research gaps by focusing on more granular types of mental health problems over a more extended study period. We built a comprehensive pipeline, including temporal, geographic, and discussion topic analyses; comparisons of trends and topics of concern between groups; and the impact of lockdown policies. On top of the analyses, we released the code and contributed 2 lexicons that can be used to identify mental health issues and health care professionals from tweets.

We also acknowledge the following limitations. First, the evaluation of public mental health on social media is inevitably biased due to the underlying population distribution of social media users. For example, older adults and people with low socioeconomic status may have less access to social media. As a result, this study may not reflect accurate attributes of such subpopulations. However, given the sheer number of people on Twitter, the results of this study are helpful and valuable in tracking public mental health during the pandemic. Additionally, future work could consider sampling according to users’ age to avoid this problem. Second, professional psychologists must make precise diagnoses of mental health problems following official heuristics. Therefore, identifying patients using lexicons based on their tweets can introduce false cases. To validate the reliability of the lexicon, we had professional psychiatrists curate the lexicon based on sampled tweets. Third, tweets that contain keywords do not always reflect the user’s mental health status as they can instead be comments on the news or from other people. To reduce this noise, we removed tweets containing URLs in our preprocessing step, as these tweets were usually summarizations or quotes of different information sources.

Future Work

The proposed pipeline can be applied to study other public mental health problems, such as suicidal thoughts, posttraumatic stress disorder, paranoia, and so on. It can also be applied to studying characteristics of other cohorts, such as sex minority groups, college students, etc. Regarding the analyses, more data sources (eg, surveys and interviews) could be introduced to validate the conclusions of this research.

Conclusions

This study developed a comprehensive pipeline to use social media for tracking and analyzing public mental status during a pandemic. It also contributed 2 lexicons that could be used in future studies. We found that the impact of COVID-19 and the corresponding control measures on the public’s mental status is dynamic and shows variability among different cohorts regarding disease types, occupations, and regional groups. Health agencies and policy makers should primarily focus on depression (reported by 51.3% of the tweets) and insomnia (which has had an ever-increasing trend since the beginning of the pandemic), especially among health care workers. Our approach works efficiently, especially when primary studies and large-scale surveys are difficult to conduct. It can be extended to track the mental status of other cohorts (eg, sex minority groups and adolescents) or during different pandemic periods.

Multimedia Appendix 1

Supplementary methods, pictures, and tables.

Multimedia Appendix 2

The proportion and 95% CIs of mental health–related tweets in each state by month.

Acknowledgments

JY was partially supported by the Key Laboratory of Intelligent Preventive Medicine of Zhejiang Province (2020E10004). The funders had no role in the design and conduct of the study.

Data Availability

The data and code supporting the study’s findings are available at https://github.com/zjumh/mental-health-during-COVID.

Authors' Contributions

ML and JY designed the study and drafted the manuscript. YH prepared the data, provided feedback on the study design, and helped draft and revise the manuscript. ML performed data and statistical analysis. YL and LW built the lexicon of mental health keywords. YL, LZ, and XL provided critical reviews. All authors reviewed the manuscript. ML takes responsibility for the integrity of the work.

Conflicts of Interest

None declared.

References

1. Zhu, Zhang, Wang, Yang, Song, Zhao, Huang, Shi, Niu, Zhan, Wang, Gao, Tan; China Novel Coronavirus Investigating and Research Team. A novel coronavirus from patients with pneumonia in China, 2019. N Engl J Med 2020 Feb 20;382(8):727-733. doi: 10.1056/NEJMoa2001017. PMID: 31978945. PMCID: PMC7092803.

2. Moreno, Wykes, Galderisi, Nordentoft, Crossley, Jones, Cannon, Correll, Byrne, Carr, Chen EYH, Gorwood, Johnson, Kärkkäinen Hilkka, Krystal, Lee, Lieberman, López-Jaramillo Carlos, Männikkö Miia, Phillips, Uchida, Vieta, Vita, Arango. How mental health care should change as a consequence of the COVID-19 pandemic. Lancet Psychiatry 2020 Sep;7(9):813-824. doi: 10.1016/S2215-0366(20)30307-2. PMID: 32682460. Publisher ID: S2215-0366(20)30307-2. PMCID: PMC7365642.

3. Vigo, Patten Scott, Pajer Kathleen, Krausz Michael, Taylor Steven, Rush Brian, Raviola Giuseppe, Saxena Shekhar, Thornicroft Graham, Yatham Lakshmi N. Mental health of communities during the COVID-19 pandemic. Can J Psychiatry 2020 Oct;65(10):681-687. doi: 10.1177/0706743720926676. PMID: 32391720. PMCID: PMC7502878.

Wang

Shi

Que

Liu

Sun

Meng

Yuan

Ran

Bao

Shi

The impact of quarantine on mental health status among general population in China during the COVID-19 pandemic

Mol Psychiatry 2021 09 26 9 4813 4822

10.1038/s41380-021-01019-y

33483692

10.1038/s41380-021-01019-y

PMC7821451

Xin

Luo

She

Wang

Tao

Zhang

Zhao

Zhang

Lin

Wang

Cai

Wang

You

Lau

Negative cognitive and psychological correlates of mandatory quarantine during the initial COVID-19 outbreak in China

Am Psychol 2020 75 5 607 617

10.1037/amp0000692

32673008

2020-51214-002

Witteveen

Velthorst

Economic hardship and mental health complaints during COVID-19

Proc Natl Acad Sci U S A 2020 11 03 117 44 27277 27284

10.1073/pnas.2009609117

33046648

2009609117

PMC7959574

Golberstein

Wen

Miller

Coronavirus disease 2019 (COVID-19) and mental health for children and adolescents

JAMA Pediatr 2020 09 01 174 9 819 820

10.1001/jamapediatrics.2020.1456

32286618

2764730

Leeb

Bitsko

Radhakrishnan

Martinez

Njai

Holland

MMWR Morb Mortal Wkly Rep 2020 11 13 69 45 1675 1680

10.15585/mmwr.mm6945a3

33180751

PMC7660659

Copeland

McGinnis

Bai

Adams

Nardone

Devadanam

Rettew

Hudziak

Impact of COVID-19 pandemic on college student mental health and wellness

J Am Acad Child Adolesc Psychiatry 2021 01 60 1 134 141.e2

10.1016/j.jaac.2020.08.466

33091568

S0890-8567(20)31988-2

PMC8173277

Wang

Hegde

Son

Keller

Smith

Sasangohar

Investigating mental health of US college students during the COVID-19 pandemic: cross-sectional survey study

J Med Internet Res 2020 09 17 22 9 e22817

10.2196/22817

32897868

v22i9e22817

PMC7505693

Naidu

Shah

Saigal

Smith

Brill

Goldring

Hurst

Jarvis

Lipman

Mandal

The high mental health burden of "Long COVID" and its association with on-going physical and respiratory symptoms in all adults discharged from hospital

Eur Respir J 2021 06 57 6 2004364

10.1183/13993003.04364-2020

33795319

13993003.04364-2020

PMC8015645

Lin

Yang

Luo

Liu

Huang

Majeed

Lee

Lui

LMW

Mansur

Nasri

Subramaniapillai

Rosenblat

Liu

McIntyre

The mental health effects of COVID-19 on health care providers in China

Am J Psychiatry 2020 07 01 177 7 635 636

10.1176/appi.ajp.2020.20040374

32605443

Bryant-Genevier

Rao

Lopes-Cardozo

Kone

Rose

Thomas

Orquiola

Lynfield

Shah

Freeman

Becker

Williams

Gould

Tiesman

Lloyd

Hill

Byrkit

Symptoms of depression, anxiety, post-traumatic stress disorder, and suicidal ideation among state, tribal, local, and territorial public health workers during the COVID-19 pandemic - United States, March-April 2021

MMWR Morb Mortal Wkly Rep 2021 12 03 70 48 1680 1685

10.15585/mmwr.mm7048a6

34855723

PMC8641565

Firew

Sano

Lee

Flores

Lang

Salman

Greene

Chang

Protecting the front line: a cross-sectional survey analysis of the occupational factors contributing to healthcare workers' infection and psychological distress during the COVID-19 pandemic in the USA

BMJ Open 2020 10 21 10 10 e042752

10.1136/bmjopen-2020-042752

33087382

bmjopen-2020-042752

PMC7580061

Rudberg

Havervall

Månberg

Anna

Jernbom Falk

Aguilera

Gabrielsson

Salomonsson

Hanke

Murrell

McInerney

Olofsson

Andersson

Hellström

Cecilia

Bayati

Bergström

Sofia

Sjöberg

Ronald

Tegel

Hedhammar

Phillipson

Nilsson

Hober

Thålin

Charlotte

SARS-CoV-2 exposure, symptoms and seroprevalence in healthcare workers in Sweden

Nat Commun 2020 10 08 11 1 5064

10.1038/s41467-020-18848-0

33033249

10.1038/s41467-020-18848-0

PMC7544689

Manzano García

Guadalupe

Ayala Calvo

The threat of COVID-19 and its influence on nursing staff burnout

J Adv Nurs 2021 02 77 2 832 844

10.1111/jan.14642

33155716

Amin

Sharif

Saeed

Durrani

Jilani

COVID-19 pandemic- knowledge, perception, anxiety and depression among frontline doctors of Pakistan

BMC Psychiatry 2020 09 23 20 1 459

10.1186/s12888-020-02864-x

32967647

10.1186/s12888-020-02864-x

PMC7509498

Chew

NWS

Lee

GKH

Tan

BYQ

Jing

Goh

Ngiam

NJH

Yeo

LLL

Ahmad

Ahmed Khan

Napolean Shanmugam

Sharma

Komalkumar

Meenakshi

Shah

Patel

Chan

BPL

Sunny

Chandra

Ong

JJY

Paliwal

Wong

LYH

Sagayanathan

Chen

Ying Ng

Teoh

Tsivgoulis

Sharma

A multinational, multicentre study on the psychological outcomes and associated physical symptoms amongst healthcare workers during COVID-19 outbreak

Brain Behav Immun 2020 08 88 559 565

10.1016/j.bbi.2020.04.049

32330593

S0889-1591(20)30523-7

PMC7172854

Pappa

Ntella

Giannakas

Giannakoulis

Papoutsi

Katsaounou

Prevalence of depression, anxiety, and insomnia among healthcare workers during the COVID-19 pandemic: a systematic review and meta-analysis

Brain Behav Immun 2020 08 88 901 907

10.1016/j.bbi.2020.05.026

32437915

S0889-1591(20)30845-X

PMC7206431

Rimmer

COVID-19: drop the hero narrative and support doctors' mental health, says charity

BMJ 2021 02 04 372 n337

10.1136/bmj.n337

33541858

Rimmer

COVID-19: Two fifths of doctors say pandemic has worsened their mental health

BMJ 2020 10 27 371 m4148

10.1136/bmj.m4148

33109530

Chew

Cynthia

Eysenbach

Gunther

Pandemics in the age of Twitter: content analysis of Tweets during the 2009 H1N1 outbreak

PLoS One 2010 11 29 5 11 e14118

10.1371/journal.pone.0014118

21124761

PMC2993925

Masri

Jia

Zhou

Lee

Yan

Use of Twitter data to improve Zika virus surveillance in the United States during the 2016 epidemic

BMC Public Health 2019 06 14 19 1 761

10.1186/s12889-019-7103-8

31200692

10.1186/s12889-019-7103-8

PMC6570872

Zhang

Marin

Exploring public sentiment on enforced remote work during COVID-19

J Appl Psychol 2021 06 106 6 797 810

10.1037/apl0000933

34138587

2021-56704-001

Xie

Wang

Jiang

Chen

Huang

Anand

Ajay

Dongmei

Public perception of COVID-19 vaccines on Twitter in the United States

medRxiv Preprint posted online on October 18, 2021

10.1101/2021.10.16.21265097

34704100

PMC8547532

Melton

Olusanya

Ammar

Shaban-Nejad

Public sentiment analysis and topic modeling regarding COVID-19 vaccines on the Reddit social media platform: a call to action for strengthening vaccine confidence

J Infect Public Health 2021 10 14 10 1505 1512

10.1016/j.jiph.2021.08.010

34426095

S1876-0341(21)00228-8

PMC8364208

Hua

Jiang

Lin

Yang

Plasek

Bates

Zhou

Using Twitter data to understand public perceptions of approved versus off-label use for COVID-19-related medications

J Am Med Inform Assoc 2022 09 12 29 10 1668 1678

10.1093/jamia/ocac114

35775946

6625661

PMC9278189

Sanders

White

Severson

McQueen

Alcântara Paulo

Haniel C

Zhang

Erickson

Bennett

Unmasking the conversation on masks: natural language processing for topical sentiment analysis of COVID-19 Twitter discourse

AMIA Jt Summits Transl Sci Proc 2021 2021 555 564

34457171

3478340

PMC8378598

Berry

Lobban

Belousov

Emsley

Nenadic

Bucci

#WhyWeTweetMH: understanding why people use Twitter to discuss mental health problems

J Med Internet Res 2017 04 05 19 4 e107

10.2196/jmir.6173

28381392

v19i4e107

PMC5399219

Zhang

Sun

Zhang

Liu

Anand

Xie

The COVID-19 pandemic and mental health concerns on Twitter in the United States

Health Data Sci 2022 02 17 2022 1 9

10.34133/2022/9758408

Zhang

Lyu

Liu

Zhang

Wang

Luo

Monitoring depression trends on Twitter during the COVID-19 pandemic: observational study

JMIR Infodemiology 2021 07 16 1 1 e26769

10.2196/26769

34458682

v1i1e26769

PMC8330892

Xue

Chen

Zheng

Zhu

Twitter discussions and emotions about the COVID-19 pandemic: machine learning approach

J Med Internet Res 2020 11 25 22 11 e20550

10.2196/20550

33119535

v22i11e20550

PMC7690968

Koh

Liew

How loneliness is talked about in social media during COVID-19 pandemic: text mining of 4,492 Twitter feeds

J Psychiatr Res 2022 01 145 317 324

10.1016/j.jpsychires.2020.11.015

33190839

S0022-3956(20)31074-8

PMC8754394

Das

Singh

Bruckner

State lockdown policies, mental health symptoms, and using substances

Addict Behav 2022 01 124 107084

10.1016/j.addbeh.2021.107084

34507184

S0306-4603(21)00269-0

PMC8358101

Adams-Prassl

Boneva

Golin

Rauh

The impact of the coronavirus lockdown on mental healthvidence from the United States

Econ Policy 2022 01 09 37 109 139 155

10.1093/epolic/eiac002

Chen

Lerman

Ferrara

Tracking social media discourse about the COVID-19 pandemic: development of a public coronavirus Twitter data set

JMIR Public Health Surveill 2020 05 29 6 2 e19273

10.2196/19273

32427106

v6i2e19273

PMC7265654

Newman

Lau

Grieser

Baldwin

Automatic evaluation of topic coherence

2010 06

Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

June 2-4, 2010

Los Angeles, CA

100 108

Bernal

James Lopez

Cummins

Steven

Gasparrini

Antonio

Interrupted time series regression for the evaluation of public health interventions: a tutorial

Int J Epidemiol 2017 02 01 46 1 348 355

10.1093/ije/dyw098

27283160

dyw098

PMC5407170

Blei

Jordan

Latent Dirichlet allocation

J Mach Learn Res 2003 01 03 3 993 1022

Rehurek

Sojka

Software framework for topic modelling with large corpora

2010

LREC 2010: New Challenges for NLP Frameworks

May 22, 2010

Valletta, Malta

46 50

Biglan

Ary

Wagenaar

The value of interrupted time-series experiments for community intervention research

Prev Sci 2000 03 1 1 31 49

10.1023/a:1010024016308

11507793

PMC4553062

Valdez

ten Thij

Marijn

Bathina

Rutter

Bollen

Social media insights into US mental health during the COVID-19 pandemic: longitudinal analysis of Twitter data

J Med Internet Res 2020 12 14 22 12 e21418

10.2196/21418

33284783

v22i12e21418

PMC7744146

Dyer

Kolic

Public risk perception and emotion on Twitter during the COVID-19 pandemic

Appl Netw Sci 2020 12 16 5 1 99

10.1007/s41109-020-00334-7

33344760

334

PMC7739810

Shi

Que

Huang

Liu

Zheng

Liu

Ran

Yuan

Yan

Sun

Shi

Kosten

Bao

Long-term impact of COVID-19 on mental health among the general public: a nationwide longitudinal study in China

Int J Environ Res Public Health 2021 08 20 18 16 8790

10.3390/ijerph18168790

34444539

ijerph18168790

PMC8393580

Wang

Xue

Zhao

Zhu

The impact of COVID-19 epidemic declaration on psychological consequences: a study on active Weibo users

Int J Environ Res Public Health 2020 03 19 17 6 2032

10.3390/ijerph17062032

32204411

ijerph17062032

PMC7143846

Xue

Chen

Zheng

Zhu

Public discourse and sentiment during the COVID 19 pandemic: using Latent Dirichlet Allocation for topic modeling on Twitter

PLoS One 2020 09 25 15 9 e0239441

10.1371/journal.pone.0239441

32976519

PONE-D-20-11036

PMC7518625

Abd-Alrazaq

Alhuwail

Househ

Hamdi

Shah

Top concerns of tweeters during the COVID-19 pandemic: infoveillance study

J Med Internet Res 2020 04 21 22 4 e19016

10.2196/19016

32287039

v22i4e19016

PMC7175788

Chandrasekaran

Mehta

Valkunde

Moustakas

Topics, trends, and sentiments of tweets about the COVID-19 pandemic: temporal infoveillance study

J Med Internet Res 2020 10 23 22 10 e22624

10.2196/22624

33006937

v22i10e22624

PMC7588259

Kwon

Park

Understanding user responses to the COVID-19 pandemic on Twitter from a terror management theory perspective: cultural differences among the US, UK and India

Comput Human Behav 2022 03 128 107087

10.1016/j.chb.2021.107087

34744298

S0747-5632(21)00410-6

PMC8558263

Singh

Roy

Sinha

Parveen

Sharma

Joshi

Impact of COVID-19 and lockdown on mental health of children and adolescents: a narrative review with recommendations

Psychiatry Res 2020 11 293 113429

10.1016/j.psychres.2020.113429

32882598

S0165-1781(20)31725-X

PMC7444649

Weekly retail gasoline and diesel prices

U.S. Energy Information Administration 2022-10-05

https://www.eia.gov/dnav/pet/pet_pri_gnd_dcus_nus_w.htm

Sakib

Akter

Zohra

Bhuiyan

AKMI

Mamun

Griffiths

Fear of COVID-19 and depression: a comparative study among the general population and healthcare professionals during COVID-19 pandemic crisis in Bangladesh

Int J Ment Health Addict 2021 02 19 1 17

10.1007/s11469-020-00477-9

33642957

477

PMC7894229

Huang

Zhao

Generalized anxiety disorder, depressive symptoms and sleep quality during COVID-19 outbreak in China: a web-based cross-sectional survey

Psychiatry Res 2020 06 288 112954

10.1016/j.psychres.2020.112954

32325383

S0165-1781(20)30607-7

PMC7152913

Chen

Hong

Sun

Dai

Basta

Tang

Qin

Insomnia symptoms during the early and late stages of the COVID-19 pandemic in China: a systematic review and meta-analysis

Sleep Med 2022 03 91 262 272

10.1016/j.sleep.2021.09.014

34732293

S1389-9457(21)00494-9

PMC8479411

Jia

Shi

Niu

Yin

Xie

Wang

Prevalence of mental health problems during the COVID-19 pandemic: a systematic review and meta-analysis

J Affect Disord 2021 02 15 281 91 98

10.1016/j.jad.2020.11.117

33310451

S0165-0327(20)33051-2

PMC7710473

Cénat

Jude Mary

Blais-Rochette

Kokou-Kpolou

Noorishad

Mukunzi

McIntee

Dalexis

Goulet

Labelle

Prevalence of symptoms of depression, anxiety, insomnia, posttraumatic stress disorder, and psychological distress among populations affected by the COVID-19 pandemic: a systematic review and meta-analysis

Psychiatry Res 2021 01 295 113599

10.1016/j.psychres.2020.113599

33285346

S0165-1781(20)33260-1

PMC7689353

Jahrami

BaHammam

Bragazzi

Saif

Faris

Vitiello

Sleep problems during the COVID-19 pandemic by population: a systematic review and meta-analysis

J Clin Sleep Med 2021 02 01 17 2 299 313

10.5664/jcsm.8930

33108269

PMC7853219

Mittal

Ahmed

Mittal

Aggarwal

Twitter users’ coping behaviors during the COVID-19 lockdown: an analysis of tweets using mixed methods

Inf Discov Deliv 2021 05 15 49 3 193 202

10.1108/idd-08-2020-0102

Thorpe Huerta

Hawkins

Brownstein

Hswen

Exploring discussions of health and risk and public sentiment in Massachusetts during COVID-19 pandemic mandate implementation: a Twitter analysis

SSM Popul Health 2021 09 15 100851

10.1016/j.ssmph.2021.100851

34355055

S2352-8273(21)00126-9

PMC8325089

Wang

Fan

Palacios

Chai

Guetta-Jeanrenaud

Obradovich

Zhou

Zheng

Global evidence of expressed sentiment alterations during the COVID-19 pandemic

Nat Hum Behav 2022 03 6 3 349 358

10.1038/s41562-022-01312-y

35301467

10.1038/s41562-022-01312-y