This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
Online weight loss programs are increasingly popular. However, little is known about outcomes and associations with website usage among members of free online weight loss programs.
This retrospective cohort study examined the association between website usage and weight loss among members of a free commercial online weight loss program (SparkPeople).
We conducted a retrospective analysis of a systematic random sample of members who joined the program during February 1 to April 30, 2008, and included follow-up data through May 10, 2010. The main outcome was net weight change based on self-reported weight. Measures of website usage included log-ins, self-monitoring entries (weight, food, exercise), and use of social support tools (discussion forums, friendships).
The main sample included 1258 members with at least 2 weight entries. They were 90.7% female, with mean (SD) age 33.6 (11.0) and mean (SD) BMI 31.6 (7.7). Members with at least one forum post lost an additional 1.55 kg (95% CI 0.55 kg to 2.55 kg) relative to those with no forum posts. Having at least 4 log-in days, weight entry days, or food entry days per 30 days was significantly associated with weight loss. In the multiple regression analysis, members with at least 4 weight entry days per 30 days reported 5.09 kg (95% CI 3.29 kg to 6.88 kg) more weight loss per 30 days than those with fewer weight entry days. After controlling for weight entry days, the other website usage variables were not associated with weight change.
Weekly or more frequent self-monitoring of weight is associated with greater weight loss among members of this free online weight loss program.
More than one third of US adults are obese [
A positive association between website usage (program engagement) and weight loss has been demonstrated among participants enrolled in randomized trials and other prospective investigations [
Therefore, we evaluated a naturalistic cohort of members of SparkPeople, which is a free online weight loss program based in the United States. Prior studies described the accuracy of advice [
Membership in the SparkPeople online weight loss program is free and supported chiefly through advertising revenue. Most members are from the United States. The main features are educational content, self-monitoring tools (for weight, diet, and exercise), and social support venues (discussion forums, blogs, and “Friend” relationships similar to general online social networks). Members use the program largely in a self-directed and self-paced manner. They are free to use website features and make weight entries at any time. As of 2009, members could also access components of the website via mobile applications, although data on mobile access were not included in the present dataset.
Approximately 521,000 members joined the program during February 1 – April 30, 2008. Systematic random sampling produced the initial cohort of 26,582 individuals with a baseline weight. This de-identified dataset included all available follow-up data for these members through May 10, 2010.
Members were then excluded from analysis if they reported extreme outlying values for baseline characteristics: age greater than 100 years, weight less than 100 lb (45.4 kg) or greater than 800 lb (362.9 kg) with no follow-up weight entries, or height greater than 10 ft (3.05 m). These extreme outlier values were considered unrealistic, data entry errors, and/or not relevant to the analysis. Exclusion of 469 members with outlier values left an interim cohort of 26,113 members. Although we did not implement a filter for very short heights, the filtering process eliminated members with extreme height values, as the height range was 49-78 inches in the final cohort (n=1258) described below.
Self-reported weights may be inaccurate due to data entry errors, such as omitting, adding, or transposing digits. All 58,574 weight entries were analyzed for potential data entry errors with a series of automated and manual procedures. There were three automated filters (F1, F2, F3) for members with three or more weight entries, and a fourth automated filter (F4) for members with two or more weight entries. The first filter (F1) entailed fitting a second order polynomial regression line on the weight entries and dates; the distance of each point from the expected line (residual) was calculated. Points were flagged for visual evaluation if the residual was more than three times the standard deviation of the residuals for the individual record, and the difference between the actual and expected weight was greater than 10 lb (4.5 kg). The second filter (F2) flagged members that had a change of more than 2% body weight per day (whether over a short or long period of time). The third filter (F3) flagged members who had an absolute change of 50 lb (22.7 kg) or more between any two weight entries. The fourth filter (F4) flagged members with a weight change of 100 lb (45.4 kg) or more over the complete recorded period.
The automated filters identified 301 potentially erroneous weight entries among 248 unique SparkPeople members. Two independent observers (KOH and AWT) manually reviewed these 301 weight entries in the context of the other weight entries for a given individual (the trend) to determine whether the entry was erroneous. The observers demonstrated high interrater reliability, with an overall 95% agreement and Cohen’s Kappa .90. When the observers disagreed about a weight entry, they discussed and reached a consensus determination. After the automated filters and manual review, 73 of 301 weight entries (24%) were deemed erroneous and excluded from analysis. This represents 0.12% of all weights available (73/58,574).
Because the main objective was to evaluate the relationship between website usage and weight loss, 20,518 members with only one weight entry were excluded. The subsequent interim cohort of 5595 members had at least two weight entries. Members with at least two weight entries were younger, with a mean (SD) age 35.3 (11.5) years vs. 37.2 (13.4) years,
Each weight entry and log-in event was date-stamped. Because the data did not include the dates for use of other website features (eg, exercise diary), it was possible that a member used a website feature after his or her last weight entry, thereby complicating the interpretation of associations between website usage and weight change. However, it was possible to determine when a member stopped all website usage activity because each website activity generated a date-stamped log-in. Therefore, the final cohort of n=1258 was defined as those who had at least two weight entries, with the last weight entry on the same day as or after utilization of other website features. In other words, this final cohort consists of members who used the website features between their first and last weight entries. While the study design does not allow definitive analysis of causation, the final cohort at least meets the temporality criteria for causation. This final cohort was analyzed to examine the relationships between website usage and weight change. The flow diagram (
Flow diagram.
Usage of website features included the following variables:
Log-in days: the number of days the member logged into the website at least once, regardless of whether s/he used any other website features while logged in.
Weight entry days: the number of days the member entered weight at least once.
Food entry days: the number of days the member made at least one entry in the food diary, regardless whether those entries were complete or incomplete.
Exercise entry days: the number of days the member made at least one entry in the exercise diary, regardless whether those entries were complete or incomplete.
Exercise minutes: the total number of minutes of exercise the member recorded in the exercise diary during the study period.
SparkPoints: the number of points earned by the member for miscellaneous website activities, such as reading articles, taking quizzes and polls, using food and exercise diaries, posting messages on forums, and making blog entries.
Forum posts: the number of messages the member posted on the discussion forums.
Friends: the number of other members designated as “SparkFriends” by the member (a social network feature).
The weight entry span was defined as the number of days between first and last weight entry. Weight change, defined as the last recorded weight minus baseline weight, was stratified by weight entry span.
Potential confounders of the relationship between website usage and weight change included baseline BMI, age, gender, and weight entry span. Registration source (referred by friend, search engine ad, other) was examined as a potential confounder because it is possible that members who are introduced to the online program by a personal friend may differ in website usage and weight change than those who find the site only after an Internet search. Likewise, geographic location (zip code in midwest, northeast, south, or west region of the United States or other country) was examined as a potential confounder because of possible geographical variation in Internet use and dietary and physical activity factors related to body weight. These potential confounders were examined for association with weight change by using univariable regression analyses. Variables with a significant association (
Regression analysis was conducted with each website usage variable as predictor and net weight change as the outcome with adjustment for the identified confounders (univariable analyses). Multiple regression analysis with backward stepwise selection was conducted to identify the best subset of predictive covariates for net weight change. All covariates were included in the full model before model selection. Variables were included in the final model if
The distribution of website usage was highly skewed, such that many members did not use a given feature before their last weight entry. Therefore, website usage variables were coded as binary variables for the initial analysis (
To further analyze associations between website usage and weight change, one option would be to assume a linear regression model, but the linearity assumption may be not be valid. For example, the change in weight associated with an increase from 3 to 4 forum posts may be different from the weight change associated with an increase from 150 to 151 forum posts. We considered categorizing the website usage variables into tertiles, but the data distributions were too skewed, so that observations in the top tertile may be equivalent to some observations in the middle tertile.
Weekly use of online weight loss program features has been associated with greater perceived social support [
Weight change was defined as (last recorded weight minus baseline weight) divided by 30 day unit of weight entry span. For example, if a member’s last weight was 100 kg and baseline weight was 108 kg, and weight entry span was 60 days, then the weight change was -8 / 2 = -4 kg per 30 days.
Matlab version R2008b (MathWorks, Natick, MA) was used to identify potential outlying values for baseline characteristics and self-reported weights. The R statistical package (R Development Core Team, Version 2.14.1) was used for descriptive and regression analyses.
The study was approved by the Committee for the Protection of Human Subjects at the University of Texas Health Science Center at Houston.
Demographic characteristics of the final cohort are summarized in
Most members did not use the website features frequently. Since values were skewed, they are summarized with medians and interquartile ranges in
Demographic characteristics of final cohort (n=1258).
Characteristics |
|
mean (SD) or n (%) |
|
|
33.6 (11.0) |
|
|
1141 (90.7) |
|
|
31.6 (7.7) |
|
|
|
|
US - Midwest | 291 (23.1) |
|
US - Northeast | 187 (14.9) |
|
US - South | 400 (31.8) |
|
US - West | 210 (16.7) |
|
Other country | 170 (13.5) |
Website usage characteristics of final cohort (n=1258).
|
Minimum | 25th percentile | Median | 75th percentile | Maximum |
Log-in days | 2 | 3 | 7 | 23 | 654 |
Log-in span (number of days between first and last log-in days) | 1 | 54 | 306.5 | 624.8 | 826 |
Weight entry days | 2 | 2 | 2 | 4 | 156 |
Weight entry span (number of days between first and last weight entry days) | 1 | 54 | 306 | 623.8 | 826 |
Food entry days | 0 | 0 | 2 | 9 | 270 |
Exercise entry days | 0 | 0 | 1 | 4 | 258 |
Exercise minutes | 0 | 0 | 40 | 256.5 | 39130 |
Friends | 0 | 0 | 0 | 1 | 74 |
Forum posts | 0 | 0 | 0 | 0 | 2758 |
SparkPoints | 0 | 22.3 | 85 | 330 | 16480 |
Total weight change stratified by weight entry spana among final cohort (n=1258).
Weight entry span | n (%) | Mean (SD) total weight change |
Up to 30 days | 261 (20.7%) | -1.08kg (3.09) |
31-60 days | 69 (5.5%) | -2.00kg (2.68) |
61-90 days | 39 (3.1%) | -1.98kg (3.46) |
91-183 days | 131 (10.4%) | -2.32kg (6.50) |
184-365 days | 210 (16.7%) | -1.97kg (9.27) |
366-548 days | 178 (14.1%) | -0.99kg (8.04) |
549-829 days | 370 (29.4%) | -1.38kg (10.5) |
a Weight entry span is the number of days between first and last weight entry.
Baseline BMI was the only potential confounder significantly associated with weight change. Each additional unit of BMI at baseline was associated with an additional reported weight loss of 0.28 kg (95% CI 0.22 to 0.33 kg,
In the analysis with binary website usage variables, having at least one forum post was the only website variable significantly associated with greater weight loss. Members with at least one forum post reported an additional weight loss of 1.55 kg (95% CI 0.55 kg to 2.55 kg) relative to those with no forum posts (
Associations between website usage variables (binary) and weight changea in the final cohort (n=1258).
N (%) | Coefficient | 95% CI |
|
|
At least one food entry day | 893 (71.0) | -0.35 kg | -1.28 kg to 0.58 kg | .47 |
At least one exercise entry day | 677 (53.8) | -0.67 kg | -1.52 kg to 0.18 kg | .12 |
At least one exercise minute | 741 (58.9) | -0.61 kg | -1.47 kg to 0.25 kg | .16 |
At least one forum post | 291 (23.1) | -1.55 kg | -2.55 kg to -0.55 kg | .002 |
At least one Friend | 547 (43.5) | -0.80 kg | -1.66 kg to 0.06 kg | .07 |
a Website usage variables were coded as binary (at least once versus never). Analyses were adjusted for baseline BMI. Because all members in the sample had two or more log-ins and weigh-ins, and most members had at least one SparkPoint, those variables were not included.
In the analyses with categorical website usage variables, weekly log-ins, weight entries, and food entries (at least 4 in 30 days) were associated with greater reported weight loss than less frequent use (
Univariable analyses of associations between website usage variables (categorized) per 30 days and reported weight change per 30 daysa in the final cohort (n=1258).
|
|
Sample size | Coefficient | 95% CI |
|
|
|
|
|
|
|
|
0 to <4 | 842 | reference | Overall: <.001 | |
|
≥4 | 416 | -3.18 kg | -4.69 kg to -1.67 kg | <.001 |
|
|
|
|
|
|
|
0 to <4 | 1018 | reference | Overall: <.001 | |
|
≥4 | 240 | -5.09 kg | -6.89 kg to -3.29 kg | <.001 |
|
|
|
|
|
|
|
0 | 365 | reference | Overall:.002 | |
|
1 to <4 | 684 | 1.04 kg | -0.59 kg to 2.68 kg | .21 |
|
≥4 | 209 | -2.49 kg | -4.68 kg to -0.30 kg | .03 |
|
|
|
|
|
|
|
0 | 581 | reference | Overall: .021 | |
|
1 to <4 | 576 | 1.18 kg | -0.30 kg to 2.67 kg | .12 |
|
≥4 | 101 | -2.46 kg | -5.20 kg to 0.26 kg | .08 |
|
|
|
|
|
|
|
0 | 517 | reference | Overall: .054 | |
|
1 to <120 | 556 | 1.43kg | -0.12 kg to 2.98 kg | .07 |
|
≥ 120 | 185 | -0.89kg | -3.06 kg to 1.28 kg | .42 |
|
|
|
|
|
|
|
0 | 74 | reference | Overall: .003 | |
|
1 to <20 | 546 | 5.35 kg | 2.22 kg to 8.48 kg | <.001 |
|
≥20 | 638 | 4.47 kg | 1.37 kg to 7.58 kg | .005 |
|
|
|
|
|
|
|
0 | 967 | reference | Overall: .24 | |
|
1 to <4 | 233 | 0.99 kg | -0.86 kg to 2.84 kg | .30 |
|
≥4 | 58 | -2.09 kg | -5.52 kg to 1.33 kg | .23 |
|
|
|
|
|
|
|
0 | 711 | reference | Overall: .55 | |
|
1 | 327 | 0.88kg | -0.81 kg to 2.57 kg | .31 |
|
≥2 | 220 | 0.90 kg | -1.08 kg to 2.87 kg | .37 |
a Analyses were adjusted for baseline BMI.
Average weight loss, based on self-report, was modest in this free online weight loss program, but active users had better outcomes. Making weekly weight entries (at least 4 weight entry days per 30 days) was associated with an additional 5 kg weight loss. After controlling for weight entry days, the other website usage variables were not associated with weight loss.
To our knowledge, this is the first analysis of a naturalistic cohort of members of a free online weight loss program available to the general public. Prior studies evaluated naturalistic cohorts of members who paid for monthly subscriptions to commercial online weight loss programs in Australia [
Our results are consistent with prior studies documenting a positive relationship between engagement in online programs and weight control. Most of these involved research volunteers and strict eligibility criteria [
Although the current study discovered that several website usage variables were associated with weight loss, the multiple regression analysis indicates that weight entry days is the most important. It is possible that weight self-monitoring leads to frequent modification of diet and exercise behavior in response to weights. An alternative explanation is that members made weight entries only when they were losing (rather than gaining) weight. However, a post-hoc analysis found that 71% percent of members posted at least one weight reflecting a higher weight than a previous entry. In other words, they did not avoid documenting weight regain. The evidence in favor of weight self-monitoring improving weight control is strengthened by the consistency of the association in studies of online [
When website usage variables were coded as binary variables, making at least one forum post was associated with greater weight loss, although this analysis did not adjust for weight entry days. Because the content of the messages was not available for this study, we could not determine the nature of interactions on the forums. Furthermore, it was not possible to assess how often members read messages on a forum without posting (“lurking”). However, prior studies found that SparkPeople members receive high-quality advice [
The study had several limitations. First, attrition was high, which is a common problem plaguing commercial online weight loss programs [
Another limitation of the study is that the available records did not include ethnicity, race, or clinical characteristics. Furthermore, members were predominantly female, as is typical of weight loss programs. Taken together, these limitations indicate that results cannot be extrapolated to the general population, regular Internet users, or those who initially who register for this online program. The study is also limited by reliance on self-reported weights. However, weight reported by members of another online weight loss program was highly correlated with and similar to objective weights [
The public health impact of an intervention is determined by efficacy and dissemination [
We thank David Heilmann and Josh Knepfle at SparkPeople for providing the dataset. We also thank Weiwei Wang, PhD, for assisting with preliminary data analyses.
This work was supported in part by a Clinical Investigator Award from The University of Texas Health Science Center at Houston, Center for Clinical Research and Evidence-Based Medicine and The University of Texas at Houston - Memorial Hermann Center for Healthcare Quality and Safety. The funding sources were not involved in reviewing or approving the manuscript for publication.
None declared.