Published on in Vol 24, No 7 (2022): July

Preprints (earlier versions) of this paper are available at, first published .
Tweets Related to Motivation and Physical Activity for Obesity-Related Behavior Change: Descriptive Analysis

Tweets Related to Motivation and Physical Activity for Obesity-Related Behavior Change: Descriptive Analysis

Tweets Related to Motivation and Physical Activity for Obesity-Related Behavior Change: Descriptive Analysis

Authors of this article:

Albert Park1 Author Orcid Image

Original Paper

Department of Software and Information Systems, College of Computing and Informatics, University of North Carolina-Charlotte, Charlotte, NC, United States

Corresponding Author:

Albert Park, PhD

Department of Software and Information Systems

College of Computing and Informatics

University of North Carolina-Charlotte

Woodward 310H

9201 University City Blvd

Charlotte, NC, 28223

United States

Phone: 1 704 687 8668

Fax:1 704 687 1652


Background: Obesity is one of the greatest modern public health problems, due to the associated health and economic consequences. Decreased physical activity is one of the main societal changes driving the current obesity pandemic.

Objective: Our goals are to fill a gap in the literature and study whether users organically utilize a social media platform, Twitter, for providing motivation. We examine the topics of messages and social network structures on Twitter. We discuss social media’s potential for providing peer support and then draw insights to inform the development of interventions for long-term health-related behavior change.

Methods: We examined motivational messages related to physical activity on Twitter. First, we collected tweets related to physical activity. Second, we analyzed them using (1) a lexicon-based approach to extract and characterize motivation-related tweets, (2) a thematic analysis to examine common themes in retweets, and (3) topic models to understand prevalent factors concerning motivation and physical activity on Twitter. Third, we created 2 social networks to investigate organically arising peer-support network structures for sustaining physical activity and to form a deeper understanding of the feasibility of these networks in a real-world context.

Results: We collected over 1.5 million physical activity–related tweets posted from August 30 to November 6, 2018. A relatively small percentage of the tweets mentioned the term motivation; many of these were made on Mondays or during morning or late morning hours. The analysis of retweets showed that the following three themes were commonly conveyed on the platform: (1) using a number of different types of motivation (self, process, consolation, mental, or quotes), (2) promoting individuals or groups, and (3) sharing or requesting information. Topic models revealed that many of these users were weightlifters or people trying to lose weight. Twitter users also naturally forged relations, even though 98.12% (2824/2878) of these users were in different physical locations.

Conclusions: This study fills a knowledge gap on how individuals organically use social media to encourage and sustain physical activity. Elements related to peer support are found in the organic use of social media. Our findings suggest that geographical location is less important for providing peer support as long as the support provides motivation, despite users having few factors in common (eg, the weather) affecting their physical activity. This presents a unique opportunity to identify successful motivation-providing peer support groups in a large user base. However, further research on the effects in a real-world context, as well as additional design and usability features for improving user engagement, are warranted to develop a successful intervention counteracting the current obesity pandemic. This is especially important for young adults, the main user group for social media, as they develop lasting health-related behaviors.

J Med Internet Res 2022;24(7):e15055



Obesity is continually increasing around the world [1]. The annual economic cost of obesity is a substantial burden [2], because it is associated with type 2 diabetes, heart disease, stroke, arthritis, high blood pressure, and various cancers [1,3]. For example, total obesity-related medical expenditures are estimated at $116 billion in the United States [4]. Despite obesity’s serious health and economic consequences, two-thirds of US adults and nearly a third of children aged 6 to 19 years are either at risk for obesity or are obese [3,5]. Society-level behavior changes, including reduced physical activity, have been determined to be among the main contributors driving the current obesity pandemic [6]. To counteract this modern public health problem, we not only need to initiate society-level behavior changes (eg, increase physical activity), but also need to ensure adherence to the changed behaviors [7].

Studies have demonstrated long-term (ie, more than 18 months) exercise adherence challenges for obese individuals [8], even for individuals who had near-death experiences [9,10], due to issues such as dropout [11]. Peer support has been successful in sustaining changed behaviors, especially in the context of the Alcoholics Anonymous group [12,13]. Peer support has also been suggested to be more effective than financial incentives [14] or nurse care management [15] for people with diabetes. The concept has been successfully applied in mental health [16], acute substance abuse [17], diabetes control [14,15], and efforts to maintain physical activity [18]. However, not all peer support systems are successful [8]. One major contributing factor for developing an effective peer support system is to identify proper peer support pairs [19] or workers [16].

Social media has become an integral part of daily life in the United States [20]. Individuals use social media to freely express their thoughts and to interact with geographically dispersed like-minded individuals. Given the increasing use of social media, health-related research utilizing these platforms is also increasing. For instance, social media data have been applied in improving public health outcomes [21], conducting disease surveillance [22-24], enhancing personal health care experiences [19,25,26], examining the daily struggles of living with mental health conditions [27], predicting adverse drug reactions [28], and understanding user responses during a global pandemic [29]. Despite evidence suggesting that social media has the potential to influence individuals [30-37] and to identify exercise partners [38], we lack knowledge on how users naturally utilize social media for peer support, especially how social media is used to provide motivation for exercise adherence in the context of obesity and weight loss.

Many studies have examined social media or incorporated it into their design to investigate attempts to change health-related behaviors [30], such as by increasing physical activity [32-34], losing weight [35], and adhering to exercise programs [36,37]. However, substantial differences have been demonstrated in the use of social media, and these studies have not delivered long-term impacts in reducing the prevalence of obesity. Moreover, effective design elements are unknown; thus, many studies primarily focus on designing weight loss or weight management interventions that are delivered via social media [39-47], or they investigate relevant elements for higher user engagement [48]. To develop an effective intervention, it is imperative to first understand how users are organically utilizing the platform to provide or gain motivation and peer support, defined here as the exchange of motivational tweets for exercise in general, and then draw insights to inform the development of an intervention that provides peer support for long-term exercise adherence. In this study, we characterize the naturally occurring social network structures of peer support pairs or groups for motivating physical activity via a mixed methods approach. More specifically, we examine messages related to physical activity and motivation on a social media platform (Twitter). This study seeks to answer 2 research questions (RQs) to fill the gap in the literature. RQ1: Do Twitter users utilize the platform to provide or gain motivation for physical activity? RQ2: Do individuals use Twitter to form peer support pairs or groups for motivating physical activity? We restricted our analysis to publicly available discussion content.

Data Collection and Social Media Site

Twitter is a popular, free-to-use microblogging social media platform on which users can instantly broadcast short messages. These short messages are called tweets. Tweets are limited to 280 characters (this changed from 140 characters in November 2017) and are meant to be broadcasted to the world. Twitter users, however, can also send out directed, conversational messages by using the @ symbol followed by a Twitter user’s ID. Twitter does not have topically focused groups, but Twitter users can use the # symbol to label and categorize their tweets with searchable keywords. We employed the Python library Tweepy [49] and used hashtags from a previous study [50], including #weightloss, #diet, #fitness, and #health, to collect tweets related to physical activity and weight loss. We then extracted motivation-related tweets to answer RQ1 and reciprocal conversational interactions to answer RQ2. In this study, we only examined tweets in English.

RQ1: Do Twitter Users Utilize the Platform to Provide or Gain Motivation for Physical Activity?

We first used a lexicon-based approach, which is a method to identify related text passages based on a set of words or phrases, to extract and examine motivation-related tweets from the overall physical activity tweets. A number of different motivation-related terms were considered, but for this study, we only used a single term, motivation, for the following three reasons. First, Twitter users frequently use acronyms and abbreviations due to the 280-character length limitation. However, in an online environment, these terms have been shown to change over time [51]. The same acronyms and abbreviations can also be used differently in different communities, which have specific cultures [52]. Second, we placed a higher priority on precision than recall in order to accurately observe user behavior and reduce noise. Third, many popular motivation-related hashtags contain the term motivation (eg, #fitnessmotivation and #mondaymotivation). Thus, we filtered for tweets that contained the keyword motivation. Specifically, we first preprocessed the entire data set to convert the text to lowercase. Then, to extract tweets containing our key terms from the entire data set, we employed a lexicon-based approach and extracted tweets and data from their “user” and “entities” fields, including user names, screen names, locations, timestamps, and user mentions. We extracted and included any partial matches in this process to cover a wide variation in terms, such as #mondaymotivation.

To understand temporal trends in motivation-related messages, we first analyzed the data by week and by time of day. In this analysis, we included data from between September 1, 2018, and November 2, 2018, because the tweets in our original data had an uneven number of days of the week and were collected at different starting and ending time points. The hours of the day were grouped into eight periods: (1) early morning (4:00 AM to 6:59 AM), (2) morning (7:00 AM to 9:59 AM), (3) late morning (10:00 AM to 12:59 PM), (4) afternoon (1:00 PM to 3:59 PM), (5) evening (4:00 PM to 6:59 PM), (6) late evening (7:00 PM to 9:59 PM), (7) night (10:00 PM to 12:59 AM), and (8) late night (1:00 AM to 3:59 AM).

To characterize and understand these messages, we first qualitatively examined retweets—reposted or forwarded tweets—related to motivation. Retweets are important in Twitter, because they have a higher chance of reaching a large number of users and can represent messages that embody the collective ideation of the platform [53]. Thus, we qualitatively analyzed the general themes of retweeted tweets following the thematic analysis process [54,55]. A prespecified threshold of at least 50 retweets was applied for a tweet to be analyzed, due to the size of the data set. All of the quotes in this paper have been deidentified and slightly modified to protect the privacy of users following the guidelines suggested by a previous paper [56].

To quantitatively expand the retweet analysis, we created a topic model using latent Dirichlet allocation (LDA) [57] to identify main topics from the larger data set. Topic modeling is an effective method to discover topics in a large data set and can identify a set of topics from a given set of documents based on document-level word co-occurrences. In our study, each tweet was considered a single document. First, we preprocessed the data set by removing URLs and tweets with fewer than 5 words. Similar exclusion criteria were applied to reduce noise in a previous study that used social media data [58]. Then, nouns were extracted using the Python natural language toolkit library [59] and used in the topic modeling process. LDA requires a predetermined number of topics, which can be a limitation. After experimenting with varying numbers of topics, we chose 10 topics related to motivation and physical activity on Twitter. An LDA topic model was generated by the Python library genism [60], followed by manual examination and labeling of the identified topics. To visually review these topics and occurrences of associated terms, we then visualized the topic model (ie, the main topics and their top 50 associated words) as a word cloud using the Python library wordcloud (Multimedia Appendix 1) [61]. Word cloud visualization is simple and user-friendly, yet scalable and preferred by users [62,63].

RQ2: What Types of Reciprocal Conversational Social Network Structures Are Formed on Twitter for Motivating Physical Activity?

We generated 2 social network visualizations to verify whether social media enables organically formed peer support pairs or groups for motivating physical activity and to examine their social network structure. The first Twitter social network visualization represents a social structure of reciprocal, conversational interactions that are captured when users direct their tweets using the @ symbol to other Twitter IDs. These interactions are dyadic ties among Twitter users (determined by Twitter ID), who are the social actors. We first extracted all tweets directed toward other users by checking if the @ symbol was placed in conjunction with a Twitter ID in a tweet. We then use Gephi [64], a popular network visualization tool, to visualize the social network and apply the OpenOrd [65] layout to gain an overview of the network structure. The edge weight is determined by the number of interactions among Twitter users.

To examine Twitter’s real-world potential for connecting physical activity partners [38], we generated a second social network visualization of users who disclosed their location information. The purpose of this social network visualization was to visually examine the prevalence of social interactions that are generated from the same physical location and to assess the relationship between users’ current practices and the platform’s potential utility for connecting physical activity partners. Twitter users can opt to share their location information in their user profile. The Twitter API also provides a mechanism to track the geocode of a tweet; however, a previous study suggests that only 1.5% of tweets are geotagged [66]. Thus, we extracted users’ location information from their profile for the second social network visualization and also recorded the number of geotagged tweets and users who opted to disclose this information. We removed social actors without location information, and then estimated peer-supporting physical activity partners (ie, socially and physically connected users). We denoted this information using a coloring schema in the second social network visualization. We used the Fruchterman Reingold graph layout algorithm [67] for the second visualization, which had a network structure with considerably fewer social actors and dyadic ties.

Ethics Statement

The study was determined to not have human subjects by the University of North Carolina-Charlotte's Institutional Review Board (Ethics Committee).

Data Set

We collected over 1.5 million physical activity–related tweets made between August 30, 2018, and November 6, 2018. Table 1 summarizes the overall data set and the data sets for each research question.

Table 1. Summary of data set.
Tweet typePhysical activity tweets (N=1,528,439), n (%)Unique Twitter IDs (N=474,402), n (%)
Motivation (research question 1)39,703 (2.6)13,561 (2.86)
Directed motivation (research question 2)15,720 (1.03)6558 (1.38)

RQ1: Do Twitter Users Utilize the Platform to Provide or Gain Motivation for Physical Activity?


Of the 1.5 million physical activity–related tweets, we found that 2.6% (39,703/1,528,439) contained the term motivation. This is a relatively small proportion of the overall physical activity–related tweets, but it still amounted to 39,703 tweets made by 13,561 unique Twitter IDs. Of these 39,703 motivational messages, the most were sent out on Mondays (7070 tweets) and during morning (5668 tweets) or late morning (5748 tweets) hours (7:00 AM to 9:59 AM and 10:00 AM to 12:59 PM, respectively). Other days had substantially fewer tweets (from Tuesday to Sunday, in order: 5278, 5487, 5374, 5492, 5819, and 5220 tweets). Similarly, after late morning, motivational messages gradually decreased until the morning of the next day (from afternoon to early morning of the next day, in chronological order: 5247, 5284, 4955, 4930, 3866, and 4042 tweets). Within the morning and late morning hours, the most tweets were sent around lunchtime, from 11:00 AM to 11:59 AM (1987 tweets), followed by the 7:00 AM to 7:59 AM period (1949 tweets). Specific topics related to these tweets are further examined and described via qualitative analysis of retweets and topic modeling of the entire motivation data set in the following section.

In our data set, we found that 13,268 of the 39,703 (33.42%) overall motivation tweets were retweets. These 13,268 retweets retweeted 5915 (44.58%) original tweets. Of these 5915 original tweets, 18 were retweeted more than a prespecified threshold of 50 times. These 18 tweets were retweeted 1562 times, comprising 11.77% of the total number of retweets. Qualitative analysis of these 18 tweets identified the following 3 themes: motivation, promoting, and information.

Providing Motivation (Self, Process, Consolation, Mental, Quotes)

Providing motivation was the most common theme. However, a number of different subthemes were identified within this theme, including self-motivation, motivation for continuing the physical activity journey/process, self-consolation, mental aspects of motivation, and use of quotes. We categorized 12 of the 18 retweets as motivation. The following quote is a slightly modified example of a motivation retweet: “Finding motivation to workout doesn’t need to be a difficult process....”

Promoting (Individuals or Groups)

We categorized 4 of the 18 retweets into the promoting theme. Two types of promoting were identified, promoting individuals and promoting groups, for physical activity–related matters. One promoting method was to tweet out the names of individuals; thus, we do not show those tweets. Another promoting method was done using the following format in our data set: “Follow us for...”

Information (Sharing or Requesting Information)

We categorized 2 of the 18 retweets into the information theme. In these retweets, Twitter users voluntarily shared health tips or health-related information. It is also important to note that some retweets could have been made to promote products or brands, a practice known as “astroturfing.” Astroturfing is when sponsors or organizations mask themselves as Twitter users to promote their products [68,69]. We categorized these tweets as information because of the subtle nature of astroturfing on Twitter and a lack of knowledge regarding the motives for sharing such information. The following quote is a modified example of information sharing: “The Truth About The...”

Our topic modeling analysis expanded the findings of the qualitative analysis of retweets. Topic modeling revealed that the providing motivation topic had different types. Users often revealed their personal motivation for physical activities, which included tweets with the topics of appearance, weight loss, lifestyle, health, wealth, mental, and personal motivation. We also identified workout inspiration and daily motivation as subtopics that were part of the providing motivation topic.

Topic modeling also identified topics related to sharing or requesting information, specifically information related to the nutrition-diet topic. Compared to the retweets in this category, the tweets contained a greater number of products, tips, and information. Additional topics were identified in this process, mainly different types of workouts, including the muscular training and bodybuilding topics, presumably used by weightlifters. Topic modeling, however, did not identify a topic related to promoting. A total of 10 topics were identified via topic modeling; Table 2 shows sample terms from each of the identified topics. Multimedia Appendix 1 summarizes and provides an overview of all the identified topics and their related terms.

Table 2. Ten topics and samples of associated terms found in motivational tweets identified via topic modeling.
Topic labelSelected sample terms
Workout inspirationmotivation, inspiration, fitness, love, goals, fitnessmotivation, workout, quotes, music, mood, determination, fitnessquotes
Appearancefitbeauty, bikinibody, beautycoolgym, beautycoolmotivation, ass, erossensual, look, amazing, weightloss
Muscular trainingfocus, workout, exercise, muscle, body, power, coach, hardwork, mindset, grind, strength, nutrition, passion, trainer, strong, bodygoals
Bodybuildinggym, motivation, bodybuildingmotivation, gymlife, tip, abs, men, bodybuilder, goal, women, protein, squats, physique
Weight lossweightloss, weightlossmotivation, weightlossjourney, tips, facts, resources, weightlosstransformation, hotbodiedsnaps
Lifestylebody, health, mind, lifestyle, healthyfood, enjoy, fitnessgirl, gymshark, pain, hope, momlife, bodytransformation, obesity
Nutrition-dietnutrition, diet, healthy, health, food, fashion, truth, coffee, ketogenic, weightwatchers, travel, science, supplements, foods, deal
Health; wealth; mentalhealth, fitness, muscle, excuses, gold, pushpullgrind, dreams, goals wealth, mind, healthylife, money, grindout, business
Personal motivationweight, cardio, bodybuilding, weightloss, fitmotivation, fitnessmodel, fitnessaddict, keto, hormones, strength, girls, beach, girl
Daily motivationworkout, mondaymotivation, fitnessmotivation, day, week, time, darebees, days, Tuesday, work, session, mondaymood, saturdaymotivation

RQ2: What Types of Reciprocal Conversational Social Network Structures Are Formed on Twitter for Motivating Physical Activity?

The overall social network of directed tweets consisted of 14,028 dyadic ties among 10,889 social actors (determined by Twitter ID). It is important to note that some tweets contained multiple indications of where they directed their tweets, included self-mentioning, or were directed to Twitter IDs that were not found in our data set. This resulted in a greater number of social actors than the total number of Twitter IDs but a lesser number of dyadic ties than the total number of tweets shown in Table 1 and Table 3. Table 3 summarizes the directed tweets and the creators of those tweets (determined by Twitter ID) in our data set. About 68% (4459/6558) of the Twitter users disclosed their location in their profile; however, users rarely disclosed their exact location by providing geocode information. Similar to a previous study [66], our data set consisted of about 1% (94/6558) geotagged tweets and Twitter users, resulting in less than 100 users who opted to disclose their exact location.

Figure 1 shows an overview of the Twitter social network in our data set. Twitter users formed hundreds of small groups that sent motivation-related tweets to each other, while a small number of larger groups with a higher number of dyadic ties and social actors were formed. Figure 2 shows a social network of location-based Twitter user interactions. After removing social actors without location information, self-mentioning tweets, and interactions with only one-sided location information, the social network structure consisted of 2878 dyadic ties among 1112 social actors, a smaller number of social actors and dyadic ties than shown in Table 3. Of these 2878 social interactions, 1.88% (54/2878) were made from the same physical location.

Table 3. Summary of directed motivation tweet data set.
Tweet typeTotal tweets (N=15,720), n (%)Unique Twitter IDs (N=6558), n (%)
Directed at Twitter ID with location in the profile ID10,533 (67)4459 (67.99)
Directed tweet with geocode159 (1.01)94 (1.43)
Figure 1. A social network of Twitter users sending directed tweets.
View this figure
Figure 2. A social network of location-based Twitter user interactions. Twitter users from the same location are denoted in blue, whereas Twitter users from different locations are denoted in green.
View this figure

This paper characterizes how Twitter users organically use a social media platform for motivating physical activity. To better understand Twitter’s potential as a long-term intervention platform, we specifically investigated and report elements that were related to peer support and motivation for physical activity. We also discuss the practical implications of our work and future directions.

Practical Implications and Future Directions

Tweeting for Motivation

Twitter users use the platform to discuss and broadcast messages about motivation and physical activity. We extracted and analyzed a large number of motivation-related tweets (39,703 tweets from 13,561 unique Twitter IDs); however, these tweets were made by a small percentage of individuals (13,561/474,402, 2.86%) who tweeted about physical activity. Similarly, motivation-related tweets made up a small percentage (39,703/1,528,439, 2.6%) of overall physical activity–related tweets. In our extraction process, we purposely chose precision over recall, thus potentially reducing the overall number of motivation-related tweets. However, it is apparent that motivation was not the main focus of a large portion of the physical-activity tweets. A future study should identify other topics related to physical activity and examine how to further encourage and increase peer support for positive behavior change in social media. Despite this, messages by users to motivate themselves, as well as others, were typically sent on Mondays and in the morning and late afternoon hours. This could indicate that Twitter users need the most motivational support and exercise during these time periods. A hybrid peer-support intervention augmented with an automatic motivational feature may be able to automatically provide encouraging messages when motivation is most needed (eg, Mondays and during morning and late afternoon hours), if peer supporters cannot. A future study should investigate the effectiveness of a similar hybrid intervention to verify whether automatically generated messages can help sustain and encourage physical activity among users.


Among retweets, we identified the following 3 themes: motivation, promoting, and information. As individuals promote themselves on social media, they gain more influence over other users. The characteristics of health information provided by credentialed experts and individuals have been suggested to be different [70,71]. Capitalizing on the influence of these individuals could provide another avenue for changing health-related behavior. However, a better understanding of the role of these individuals and their messages is needed. For example, although individuals’ experience-based information provides many benefits, it could also be misleading without proper contextual information. Twitter, with its 280-character limit, could mislead many individuals, and intermediary monitoring of the quality of health information may thus be required in health-behavior interventions that use the Twitter platform. Another interesting finding was that Twitter users use the platform for physical activity self-motivation. The rationale behind this phenomenon is largely unknown, and this could be another mechanism for sustaining physical activity.

Main User Demographics

The topic model revealed new information about the users. For example, our model suggests that a large portion of these individuals were weightlifters (suggested by the muscular training and bodybuilding topics) or trying to lose weight (suggested by the weight loss and nutrition-diet topics). Our analysis also suggests that individuals need reminders and inspiration to stay physically active (suggested by the workout inspiration, personalmotivation, and daily motivation topics). Other motivation-related factors for physical activity included lifestyle choices, appearance, health, and wealth.

Other Findings

Previous literature suggests that individuals do not want to share their problems using individually identifying social media accounts, such as on Facebook [72]. Our analysis revealed a few terms that described users’ physical ability with negative terminologies (eg, grind and pain), although they were not prevalent. Future work should further identify struggles and concerns that may be harmful for individuals or that can be improved through better design.

Another theme was sharing information voluntarily. Qualitative analysis of retweets led us to suspect that this theme may have arisen from astroturfing, but further analysis is warranted to confirm this observation. Despite the potential misuse of social media, we also found that Twitter users positively motivate themselves and other users by sharing quotes, comforting themselves and others, and discussing the process of their journey and mental aspects of staying physically active. These findings are design elements that need to be considered for developing a successful intervention using social media to increase physical activity.

Peer Support

Online peer support systems, in which peer-to-peer interactions can scale up to arbitrary values of n to n, hold promise for identifying successful peer support groups. Features for identifying physical activity partners exist on a number of online social health activity networks [38], which suggests the need and preference for peer support concerning physical activity. Forging long-lasting connections among a large user base remains a research challenge, although past work has suggested that written communication style and personality matching are important in the success of online peer support systems [19,73]. Despite these challenges, we found empirical evidence that Twitter users use the platform to motivate individuals from different physical locations (2824/2878, 98.12%) to increase their physical activity. However, we do not know how Twitter users organically created these peer-to-peer connections or whether these individuals already knew each other prior to Twitter communication.

Twitter users forge connections using the @ symbol and directly provide or gain motivation for achieving their physical activity goals. In our social network analysis, we verified the existence of a few large, highly interconnected groups as well as hundreds of small groups with fewer interactions. Investigating linguistic differences in topics and communication style and differences in personality matching between the large and small groups could show individuals’ preferences and needs for varying sizes of motivation groups. Because our study provides a snapshot of existing peer-to-peer interactions, we are uncertain how Twitter users form these interactions, find peer support partners, and join groups, and we do not know the long-term effects. Future work could ask Twitter users about their experiences related to forming and continuing peer support systems on Twitter by using surveys and interviews to gain a deeper understanding of long-term behavior changes via social media. Similarly, understanding the effects of strong and weak ties in large groups is important in utilizing social factors for this purpose.

Our findings suggest that Twitter has not been connecting local users and that Twitter users are more engaged with geographically dispersed individuals. Less than 2% (54/2878) of directed tweets were made from the same physical location. Our findings indicate that geographic location is of low importance in providing peer support; as long as the support provides motivation, it can succeed despite the peers having few common factors (eg, the weather) affecting their physical activity. This presents a unique opportunity to identify successful peer support groups that provide motivation in large user bases.

Online peer support systems were observed in our study, but we do not know their effects in a long-term, real-world context, which should be considered in related future work. For example, the effects of online peer support related to (1) fostering positive behavior change and (2) sustaining long-term engagement need to be further examined. A previous study suggests that higher engagement with a group leads to more weight loss [74], but sustaining continuous interaction is a prominent challenge for many online social groups [75-80]. Our analysis of retweets also highlights potential commercial influence, or astroturfing, in Twitter networks. The extent of astroturfing and its influence over social structures and user behavior are unconfirmed and warrant further investigation to control commercial and astroturfing effects.

User Privacy

Although Twitter data are public, and tweets are meant to be broadcasted to the world, we discovered consistent user behaviors aimed at protecting user privacy. For example, we found that almost all users opted out of disclosing geocode information (ie, their exact location), as shown in Table 3. Also, even though the majority of users opted to disclose their location in their profile, many did not provide granular information (the limitation section has further details). Research that uses public social media data, including this study, is typically granted exemption from review by Institutional Review Boards in the United States. User behavior in our study indicates that ethical considerations are still needed, especially with regard to privacy [81,82]. In this paper, we did not present user-identifiable information (eg, Twitter IDs) and modified the example quotations to protect user anonymity.


A relatively short timeframe and a lack of seasonal coverage are limitations of our study. Though we did not examine seasonal trends, it is possible that posting frequency could change with the season. Similarly, previous work has shown that social media usage can vary temporally and spatiotemporally [83]. The time period in our study may also have affected trends in posting. Our study time period included a change of seasons, from late summer to fall, which may have been associated with changes in activity and exercise. Similarly, we manually examined nearly 11.77% (1562/13,268) of the total retweets, but the number of retweeted original tweets in our qualitative analysis was relatively small (18/5915). Although we used a prespecified threshold to select original tweets to be qualitatively examined, investigating a larger number of original tweets could have led to more conclusive findings. Moreover, future studies should examine long-term engagement with content among group members to understand the formation, evolution, and collapse of peer networks for the purpose of encouraging physical activities.

Twitter has a diverse user group compared to many other social media platforms, although it is still used more by young adults [84]. This could have created a bias toward younger Twitter users, allowing us to identify particular topics of interest only to this group. We do not know how our findings would apply to other demographics or social media sites. Similarly, our analysis focused on English tweets, and our findings may not be generalizable to non–English speaking countries. Our findings, however, have significant implications for young adults, the main user group of social media, and the health of future society, as they develop lasting health-related behaviors.

Another selection bias was present in the location-based Twitter user interaction analysis (Figure 2). Although the majority of users opted to disclose their location information in their profile (4459/6558, 67.99%), this analysis was conducted with users who self-selected to disclose this information. In the same analysis, an additional limitation was discovered in our manual validation check. We believe that the different location figure (2824/2878, 98.12%) was inflated due to the granularity of information provided by users. For example, in our data set, we found a few users who had identified their location as “Earth” or “America.” Moreover, some users disclosed their information at the state level (eg, North Carolina) and some users disclosed this information at the city level (eg, Charlotte). Although it is possible that they such users were in fact at the same physical location, we were not able to determine this without additional information. In our study, they were thus treated as being in different locations.

In this study, we used hashtags that were identified in a previous study that collected tweets related to physical activity for the purpose of weight loss. However, we did not incorporate postprocessing to include a wider variation in physical activities. The collected tweets were then processed to extract motivation-related tweets for RQ1. Initial key terms are important in collecting a relevant data set, and different sets of terms could have resulted in different findings, especially related to the prevalence of these tweets. Similarly, for RQ1, we used a single keyword, motivation, to identify motivation-related tweets. While this approach can identify related tweets with high precision, we should in future extend these study results to retrieve a greater number of relevant messages by developing a classifier. Moreover, major differences, such as communication style, user demographic, and the platform’s intended purpose, exist among social media platforms. To better characterize the landscape of peer support and motivating messages for physical activity in social media, replicating a similar study on a different popular social media platform, such as Facebook or Reddit, is warranted.


Peer support through social media could be a potential means of encouraging positive behavior changes and sustaining that changed behavior. This study examines whether individuals organically use social media to encourage and sustain physical activity. The organic exchange of motivational messages and peer support are found among Twitter users, although this support is more commonly used by individuals from different physical locations. Our findings suggest that geographical location is less important in providing peer support, as long as the support provides motivation, despite the users having few factors in common affecting their physical activity (eg, the weather). Based on existing interaction patterns, we consider that successful motivation-providing peer support groups or interventions could be delivered through social media. However, further research on the effects in a real-world context, as well as additional design and usability features improving user engagement, are warranted to develop a successful intervention to counteract the current obesity pandemic. This is especially important for young adults, the main user group of social media, as they develop lasting health-related behaviors.


This work was supported, in part, by funds provided by the University of North Carolina at Charlotte.

Conflicts of Interest

None declared.

Multimedia Appendix 1

A word cloud overview of topic modeling: visual representation of main topics and frequently occurring associated terms.

PNG File , 268 KB

  1. Hruby A, Hu FB. The Epidemiology of Obesity: A Big Picture. Pharmacoeconomics 2015 Jul;33(7):673-689 [FREE Full text] [CrossRef] [Medline]
  2. Tremmel M, Gerdtham U, Nilsson PM, Saha S. Economic Burden of Obesity: A Systematic Literature Review. Int J Environ Res Public Health 2017 Apr 19;14(4):435 [FREE Full text] [CrossRef] [Medline]
  3. Ogden CL, Carroll MD, Kit BK, Flegal KM. Prevalence of obesity in the United States, 2009-2010. NCHS Data Brief 2012 Jan(82):1-8 [FREE Full text] [Medline]
  4. Wang YC, Pamplin J, Long MW, Ward ZJ, Gortmaker SL, Andreyeva T. Severe Obesity In Adults Cost State Medicaid Programs Nearly $8 Billion In 2013. Health Aff (Millwood) 2015 Nov;34(11):1923-1931. [CrossRef] [Medline]
  5. Hedley AA, Ogden CL, Johnson CL, Carroll MD, Curtin LR, Flegal KM. Prevalence of overweight and obesity among US children, adolescents, and adults, 1999-2002. JAMA 2004 Jun 16;291(23):2847-2850. [CrossRef] [Medline]
  6. Hill JO, Peters JC. Environmental contributions to the obesity epidemic. Science 1998 May 29;280(5368):1371-1374. [CrossRef] [Medline]
  7. Ory MG, Lee Smith M, Mier N, Wernicke MM. The science of sustaining health behavior change: the health maintenance consortium. Am J Health Behav 2010;34(6):647-659 [FREE Full text] [CrossRef] [Medline]
  8. Perri MG, McAdoo WG, McAllister DA, Lauer JB, Jordan RC, Yancey DZ, et al. Effects of peer support and therapist contact on long-term weight loss. J Consult Clin Psychol 1987 Aug;55(4):615-617. [CrossRef] [Medline]
  9. van Elderen-van Kemenade T, Maes S, van den Broek Y. Effects of a health education programme with telephone follow-up during cardiac rehabilitation. Br J Clin Psychol 1994 Sep;33(3):367-378. [CrossRef] [Medline]
  10. Bock BC, Albrecht AE, Traficante RM, Clark MM, Pinto BM, Tilkemeier P, et al. Predictors of exercise adherence following participation in a cardiac rehabilitation program. Int J Behav Med 1997;4(1):60-75. [CrossRef] [Medline]
  11. Kelders SM, Van Gemert-Pijnen JEWC, Werkman A, Nijland N, Seydel ER. Effectiveness of a Web-based intervention aimed at healthy dietary and physical activity behavior: a randomized controlled trial about users and usage. J Med Internet Res 2011 Apr 14;13(2):e32 [FREE Full text] [CrossRef] [Medline]
  12. Tonigan JS, Toscova R, Miller WR. Meta-analysis of the literature on Alcoholics Anonymous: sample and study characteristics moderate findings. J Stud Alcohol 1996 Jan;57(1):65-72. [CrossRef] [Medline]
  13. Tonigan JS, Miller WR, Connors GJ. Project MATCH Client Impressions About Alcoholics Anonymous. Alcohol Treat Q 2008 Sep 23;18(1):25-41. [CrossRef]
  14. Long JA, Jahnle EC, Richardson DM, Loewenstein G, Volpp KG. Peer mentoring and financial incentives to improve glucose control in African American veterans: a randomized trial. Ann Intern Med 2012 Mar 20;156(6):416-424 [FREE Full text] [CrossRef] [Medline]
  15. Heisler M, Vijan S, Makki F, Piette JD. Diabetes control with reciprocal peer support versus nurse care management: a randomized trial. Ann Intern Med 2010 Oct 19;153(8):507-515 [FREE Full text] [CrossRef] [Medline]
  16. Repper J, Carter T. A review of the literature on peer support in mental health services. J Ment Health 2011 Aug;20(4):392-411. [CrossRef] [Medline]
  17. Galanter M, Dermatis H, Egelko S, De Leon G. Homelessness and mental illness in a professional- and peer-led cocaine treatment clinic. Psychiatr Serv 1998 Apr;49(4):533-535. [CrossRef] [Medline]
  18. Davison KK, Jago R. Change in parent and peer support across ages 9 to 15 yr and adolescent girls' physical activity. Med Sci Sports Exerc 2009 Sep;41(9):1816-1825 [FREE Full text] [CrossRef] [Medline]
  19. Hartzler AL, Taylor MN, Park A, Griffiths T, Backonja U, McDonald DW, et al. Leveraging cues from person-generated health data for peer matching in online communities. J Am Med Inform Assoc 2016 May 05;23(3):496-507 [FREE Full text] [CrossRef] [Medline]
  20. Perrin A, Atske S. About three-in-ten U.S. adults say they are ‘almost constantly’ online. Pew Research Center. 2021 Mar 26.   URL: https:/​/www.​​fact-tank/​2021/​03/​26/​about-three-in-ten-u-s-adults-say-they-are-almost-constantly-online; Archived at: https:/​/web.​​web/​20220703224341/​https:/​/www.​​fact-tank/​2021/​03/​26/​about-three-in-ten-u-s-adults-say-they-are-almost-constantly-online [accessed 2022-07-13]
  21. Dredze M. How Social Media Will Change Public Health. IEEE Intell Syst 2012 Jul;27(4):81-84 [FREE Full text] [CrossRef]
  22. Brownstein JS, Freifeld CC, Madoff LC. Digital disease detection--harnessing the Web for public health surveillance. N Engl J Med 2009 May 21;360(21):2153-5, 2157 [FREE Full text] [CrossRef] [Medline]
  23. Park A, Conway M. Tracking Health Related Discussions on Reddit for Public Health Applications. AMIA Annu Symp Proc 2017;2017:1362-1371 [FREE Full text] [Medline]
  24. Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature 2009 Feb 19;457(7232):1012-1014. [CrossRef] [Medline]
  25. Friedman C, Elhadad N. Natural Language Processing in Health Care and Biomedicine. In: Shortliffe E, Cimino J, editors. Biomedical Informatics. London, UK: Springer; 2014:255-284.
  26. Hartzler AL, McDonald DW, Park A, Huh J, Weaver C, Pratt W. Evaluating health interest profiles extracted from patient-generated data. AMIA Annu Symp Proc 2014;2014:626-635 [FREE Full text] [Medline]
  27. Park A, Conway M, Chen AT. Examining Thematic Similarity, Difference, and Membership in Three Online Mental Health Communities from Reddit: A Text Mining and Visualization Approach. Comput Human Behav 2018 Jan;78:98-112 [FREE Full text] [CrossRef] [Medline]
  28. Chee BW, Berlin R, Schatz B. Predicting adverse drug events from personal health messages. AMIA Annu Symp Proc 2011;2011:217-226 [FREE Full text] [Medline]
  29. Kwon S, Park A. Understanding user responses to the COVID-19 pandemic on Twitter from a terror management theory perspective: Cultural differences among the US, UK and India. Comput Human Behav 2022 Mar;128:107087 [FREE Full text] [CrossRef] [Medline]
  30. Laranjo L, Arguel A, Neves AL, Gallagher AM, Kaplan R, Mortimer N, et al. The influence of social networking sites on health behavior change: a systematic review and meta-analysis. J Am Med Inform Assoc 2015 Jan;22(1):243-256 [FREE Full text] [CrossRef] [Medline]
  31. Park A, Conway M. Longitudinal Changes in Psychological States in Online Health Community Members: Understanding the Long-Term Effects of Participating in an Online Depression Community. J Med Internet Res 2017 Mar 20;19(3):e71 [FREE Full text] [CrossRef] [Medline]
  32. Davies CA, Spence JC, Vandelanotte C, Caperchione CM, Mummery WK. Meta-analysis of internet-delivered interventions to increase physical activity levels. Int J Behav Nutr Phys Act 2012 Apr 30;9:52 [FREE Full text] [CrossRef] [Medline]
  33. Napolitano MA, Fotheringham M, Tate D, Sciamanna C, Leslie E, Owen N, et al. Evaluation of an internet-based physical activity intervention: a preliminary investigation. Ann Behav Med 2003;25(2):92-99. [CrossRef] [Medline]
  34. Vandelanotte C, Spathonis KM, Eakin EG, Owen N. Website-delivered physical activity interventions a review of the literature. Am J Prev Med 2007 Jul;33(1):54-64. [CrossRef] [Medline]
  35. Beleigoli AM, Andrade AQ, Cançado AG, Paulo MN, Diniz MDFH, Ribeiro AL. Web-Based Digital Health Interventions for Weight Loss and Lifestyle Habit Changes in Overweight and Obese Adults: Systematic Review and Meta-Analysis. J Med Internet Res 2019 Jan 08;21(1):e298 [FREE Full text] [CrossRef] [Medline]
  36. Watson A, Bickmore T, Cange A, Kulshreshtha A, Kvedar J. An internet-based virtual coach to promote physical activity adherence in overweight adults: randomized controlled trial. J Med Internet Res 2012 Jan 26;14(1):e1 [FREE Full text] [CrossRef] [Medline]
  37. Richardson CR, Buis LR, Janney AW, Goodrich DE, Sen A, Hess ML, et al. An online community improves adherence in an internet-mediated walking program. Part 1: results of a randomized controlled trial. J Med Internet Res 2010 Dec 17;12(4):e71 [FREE Full text] [CrossRef] [Medline]
  38. Nakhasi A, Shen AX, Passarella RJ, Appel LJ, Anderson CA. Online social networks that connect users to physical activity partners: a review and descriptive analysis. J Med Internet Res 2014 Jun 16;16(6):e153 [FREE Full text] [CrossRef] [Medline]
  39. Baghaei N, Freyne J, Kimani S, Smith G, Berkovsky S, Bhandari D, et al. SOFA: An Online Social Network for Engaging and Motivating Families to Adopt a Healthy Lifestyle. In: OZCHI '09: Proceedings of the 21st Annual Conference of the Australian Computer-Human Interaction Special Interest Group: Design: Open 24/7. 2009 Presented at: OZCHI '09: Proceedings of the 21st conference of the computer-human interaction special interest group of Australia on Computer-human interaction: design; Nov 23-27, 2009; Melbourne, Australia p. 269-272. [CrossRef]
  40. Jansen A, Van Mechelen M, Slegers K. Personas and Behavioral Theories: A Case Study Using Self-Determination Theory to Construct Overweight Personas. In: CHI '17: Proceedings of the 2017 CHI Conference on Human Factors in Computing Systems. 2017 Presented at: CHI '17: CHI Conference on Human Factors in Computing Systems; May 6-11, 2017; Denver, CO p. 2127-2136. [CrossRef]
  41. Colineau N, Paris C, Marendy P, Bhandari D, Shu Y. Supporting family engagement in weight management. In: CHI EA '09: CHI '09 Extended Abstracts on Human Factors in Computing Systems. 2009 Presented at: CHI '09: CHI Conference on Human Factors in Computing Systems; Apr 4-9, 2009; Boston, MA p. 3991-3996. [CrossRef]
  42. Chuah M, Jakes G, Qin Z. WiFiTreasureHunt: a mobile social application for staying active physically. In: UbiComp '12: Proceedings of the 2012 ACM Conference on Ubiquitous Computing. 2012 Presented at: Ubicomp '12: The 2012 ACM Conference on Ubiquitous Computing; Sep 5-8, 2012; Pittsburgh, PA p. 631-632. [CrossRef]
  43. Kim KK, Logan HC, Young E, Sabee CM. Youth-centered design and usage results of the iN Touch mobile self-management program for overweight/obesity. Pers Ubiquit Comput 2014 Aug 14;19(1):59-68. [CrossRef]
  44. Patrick K, Marshall SJ, Davila EP, Kolodziejczyk JK, Fowler JH, Calfas KJ, et al. Design and implementation of a randomized controlled social and mobile weight loss trial for young adults (project SMART). Contemp Clin Trials 2014 Jan;37(1):10-18 [FREE Full text] [CrossRef] [Medline]
  45. Chuah M, Sample S. Fitness Tour: a mobile application for combating obesity. In: MobileHealth '11: Proceedings of the First ACM MobiHoc Workshop on Pervasive Wireless Healthcare. 2011 Presented at: MobiHoc '11: The Twelfth ACM International Symposium on Mobile Ad Hoc Networking and Computing; May 16, 2011; Paris, France p. 1-5. [CrossRef]
  46. Hammersley ML, Jones RA, Okely AD. Time2bHealthy - An online childhood obesity prevention program for preschool-aged children: A randomised controlled trial protocol. Contemp Clin Trials 2017 Oct;61:73-80. [CrossRef] [Medline]
  47. Napolitano MA, Whiteley JA, Mavredes MN, Faro J, DiPietro L, Hayman LL, et al. Using social media to deliver weight loss programming to young adults: Design and rationale for the Healthy Body Healthy U (HBHU) trial. Contemp Clin Trials 2017 Sep;60:1-13 [FREE Full text] [CrossRef] [Medline]
  48. Edney S, Bogomolova S, Ryan J, Olds T, Sanders I, Maher C. Creating Engaging Health Promotion Campaigns on Social Media: Observations and Lessons From Fitbit and Garmin. J Med Internet Res 2018 Dec 10;20(12):e10911 [FREE Full text] [CrossRef] [Medline]
  49. Tweepy Documentation. Tweepy.   URL: [accessed 2022-07-13]
  50. Turner-McGrievy GM, Beets MW. Tweet for health: using an online social network to examine temporal trends in weight loss-related posts. Transl Behav Med 2015 Jun;5(2):160-166 [FREE Full text] [CrossRef] [Medline]
  51. Danescu-Niculescu-Mizil C, West R, Jurafsky D, Leskovec J, Potts C. No country for old members: user lifecycle and linguistic change in online communities. In: WWW '13: Proceedings of the 22nd international conference on World Wide Web. 2013 Presented at: WWW '13: 22nd International World Wide Web Conference; May 13-17, 2013; Rio de Janeiro, Brazil p. 307-318. [CrossRef]
  52. Park A, Hartzler AL, Huh J, McDonald DW, Pratt W. Automatically Detecting Failures in Natural Language Processing Tools for Online Community Text. J Med Internet Res 2015 Aug 31;17(8):e212 [FREE Full text] [CrossRef] [Medline]
  53. So J, Prestin A, Lee L, Wang Y, Yen J, Chou WS. What Do People Like to "Share" About Obesity? A Content Analysis of Frequent Retweets About Obesity on Twitter. Health Commun 2016;31(2):193-206. [CrossRef] [Medline]
  54. Guest G, MacQueen KM, Namey EE. Applied Thematic Analysis. New York, NY: SAGE Publications; 2012.
  55. Strauss AL, Corbin JM. Basics of qualitative research. Newbury Park, CA: Sage Publications; 1990.
  56. Bruckman A. Teaching Students to Study Online Communities Ethically. J Infor Ethics 2006 Nov 1;15(2):82-98 [FREE Full text]
  57. Blei D, Ng A, Jordan M. Latent Dirichlet Allocation. J Mach Learn Res 2003:993-1022 [FREE Full text]
  58. Park A, Conway M. Harnessing Reddit to Understand the Written-Communication Challenges Experienced by Individuals With Mental Health Disorders: Analysis of Texts From Mental Health Communities. J Med Internet Res 2018 Apr 10;20(4):e121 [FREE Full text] [CrossRef] [Medline]
  59. Bird S. NLTK: the natural language toolkit. In: COLING-ACL '06: Proceedings of the COLING/ACL on Interactive presentation sessions. 2006 Presented at: COLING/ACL 2006 Interactive Presentations; July 17-18, 2006; Sydney, Australia p. 69-72. [CrossRef]
  60. Rehurek R, Sojka P. Software Framework for Topic Modelling with Large Corpora. In: Proceedings of the LREC 2010 Workshop on New Challenges for NLP Frameworks. 2010 Presented at: LREC 2010 Workshop on New Challenges for NLP Frameworks; May 22, 2010; Valletta, Malta p. 45-50.
  61. Wordcloud: Project Description. Python Software Foundation.   URL: [accessed 2022-07-13]
  62. Hartzler AL, Weis B, Cahill C, Pratt W, Park A, Backonja U, et al. Design and Usability of Interactive User Profiles for Online Health Communities. ACM Trans Comput Hum Interact 2016 Jul 21;23(3):1-33. [CrossRef]
  63. Chen H, Kelliher A. Conversational Lives: Visualizing Interpersonal Online Social Interactions. In: Online Communities and Social Computing. 2011 Presented at: OCSC 2011: Online Communities and Social Computing: 4th International Conference; Jul 9-14, 2011; Orlando, FL p. 241-250. [CrossRef]
  64. Bastian M, Heymann S, Jacomy M. Gephi: An Open Source Software for Exploring and Manipulating Networks. In: Proceedings of the International AAAI Conference on Web and Social Media. 2009 Presented at: Third International AAAI Conference on Weblogs and Social Media; May 17-20, 2009; San Jose, CA p. 361-362. [CrossRef]
  65. Martin S, Brown WM, Klavans R, Boyack KW. OpenOrd: an open-source toolbox for large graph layout. Visualization and Data Analysis.   URL: [accessed 2022-07-14]
  66. Gore RJ, Diallo S, Padilla J. You Are What You Tweet: Connecting the Geographic Variation in America's Obesity Rate to Twitter Content. PLoS One 2015;10(9):e0133505 [FREE Full text] [CrossRef] [Medline]
  67. Fruchterman TMJ, Reingold EM. Graph drawing by force-directed placement. Softw Pract Exper 1991 Nov;21(11):1129-1164. [CrossRef]
  68. Harris JK, Moreland-Russell S, Choucair B, Mansour R, Staub M, Simmons K. Tweeting for and against public health policy: response to the Chicago Department of Public Health's electronic cigarette Twitter campaign. J Med Internet Res 2014 Oct 16;16(10):e238 [FREE Full text] [CrossRef] [Medline]
  69. Ratkiewicz J, Conover M, Meiss M, Gonçalves B, Patil S, Flammini A, et al. Truthy: Mapping the Spread of Astroturf in Microblog Streams. In: Proceedings of the 20th international conference companion on World wide web. 2011 Presented at: WWW '11: 20th International World Wide Web Conference; Mar 28-Apr 1, 2011; Hyderabad, India p. 249-252. [CrossRef]
  70. Hartzler A, Pratt W. Managing the personal side of health: how patient expertise differs from the expertise of clinicians. J Med Internet Res 2011 Aug 16;13(3):e62 [FREE Full text] [CrossRef] [Medline]
  71. Oh S, Worrall A. Health answer quality evaluation by librarians, nurses, and users in social Q&A. Libr Inf Sci Res 2013 Oct;35(4):288-298. [CrossRef]
  72. Newman MW, Lauterbach D, Munson SA, Resnick P, Morris ME. It’s not that I don’t have problems, I’m just not putting them on Facebook: Challenges and opportunities in using online social networks for health. In: CSCW 11 Proceedings of ACM 2011 Conference on Computer Supported Cooperative Work. 2011 Presented at: ACM 2011 Conference on Computer Supported Cooperative Work; Mar 19-23, 2011; Hangzhou, China p. 341-350. [CrossRef]
  73. Shpigelman C, Weiss PLT, Reiter S. E-Mentoring for All. Comput Human Behav 2009 Jul;25(4):919-928. [CrossRef]
  74. Hwang KO, Ning J, Trickey AW, Sciamanna CN. Website usage and weight loss in a free commercial online weight loss program: retrospective cohort study. J Med Internet Res 2013 Jan 15;15(1):e11 [FREE Full text] [CrossRef] [Medline]
  75. Millen D, Patterson J. Stimulating social engagement in a community network. In: Proceedings of the 2002 ACM conference on Computer supported cooperative work. 2002 Presented at: CSCW02: Computer Supported Cooperative Work; Nov 16-20, 2002; New Orleans, LA p. 306-313. [CrossRef]
  76. Joyce E, Kraut RE. Predicting Continued Participation in Newsgroups. J Comp Mediated Comm 2006 Apr;11(3):723-747. [CrossRef]
  77. Wang Y, Kraut R, Levine J. To stay or leave?: the relationship of emotional and informational support to commitment in online health support groups. In: Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work. 2012 Presented at: CSCW '12: Computer Supported Cooperative Work; Feb 11-15, 2012; Seattle, WA p. 833-842. [CrossRef]
  78. Arguello J, Butler B, Joyce L, Kraut R, Ling K, Wang X. Talk to me: foundations for successful individual-group interactions in online communities. In: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 2006 Presented at: CHI06: CHI 2006 Conference on Human Factors in Computing Systems; Apr 22-27, 2006; Montréal, QC p. 959-968. [CrossRef]
  79. Nonnecke B, Preece J. Lurker demographics: counting the silent. In: Proceedings of the SIGCHI conference on Human Factors in Computing Systems. 2000 Presented at: CHI00: Human Factors in Computing Systems; Apr 1-6, 2000; The Hague, Netherlands p. 73-80. [CrossRef]
  80. Park A, Hartzler AL, Huh J, McDonald DW, Pratt W. Homophily of Vocabulary Usage: Beneficial Effects of Vocabulary Similarity on Online Health Communities Participation. AMIA Annu Symp Proc 2015;2015:1024-1033 [FREE Full text] [Medline]
  81. Vayena E, Salathé M, Madoff LC, Brownstein JS. Ethical challenges of big data in public health. PLoS Comput Biol 2015 Feb;11(2):e1003904 [FREE Full text] [CrossRef] [Medline]
  82. Mikal J, Hurst S, Conway M. Ethical issues in using Twitter for population-level depression monitoring: a qualitative study. BMC Med Ethics 2016 Apr 14;17:22 [FREE Full text] [CrossRef] [Medline]
  83. Padilla JJ, Kavak H, Lynch CJ, Gore RJ, Diallo SY. Temporal and spatiotemporal investigation of tourist attraction visit sentiment on Twitter. PLoS One 2018;13(6):e0198857 [FREE Full text] [CrossRef] [Medline]
  84. Smith A, Anderson M. Social Media Use in 2018. Pew Research Center.   URL:; [accessed 2022-07-10]

LDA: latent Dirichlet allocation.

Edited by A Mavragani; submitted 15.06.19; peer-reviewed by MM Amini, ORCID: 0000-0002-5279-213X, J Mikal, R Gore, E Fernandes de Mello Araujo, J Wright; comments to author 08.08.20; revised version received 04.01.21; accepted 23.06.22; published 20.07.22


©Albert Park. Originally published in the Journal of Medical Internet Research (, 20.07.2022.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on, as well as this copyright and license information must be included.