Incidence of Online Health Information Search: A Useful Proxy for Public Health Risk Perception
Bo Liang, MS, MBA; Debra L Scammon, PhD
Department of Marketing, David Eccles School of Business, University of Utah, Salt Lake City, UT, United States
Department of Marketing
David Eccles School of Business
University of Utah
1655 East Campus Center Drive
Salt Lake City, UT
Phone: 1 480 309 3478
Fax: 1 801 581 3152
Background: Internet users use search engines to look for information online, including health information. Researchers in medical informatics have found a high correlation between the occurrence of certain search queries and the incidence of certain diseases. Consumers’ search for information about diseases is related to current health status with regard to a disease and to the social environments that shape the public’s attitudes and behaviors.
Objective: This study aimed to investigate the extent to which public health risk perception as demonstrated by online information searches related to a health risk can be explained by the incidence of the health risk and social components of a specific population’s environment. Using an ecological perspective, we suggest that a population’s general concern for a health risk is formed by the incidence of the risk and social (eg, media attention) factors related to the risk.
Methods: We constructed a dataset that included state-level data from 32 states on the incidence of the flu; a number of social factors, such as media attention to the flu; private resources, such as education and health insurance coverage; public resources, such as hospital beds and primary physicians; and utilization of these resources, including inpatient days and outpatient visits. We then explored whether online information searches about the flu (seasonal and pandemic flu) can be predicted using these variables. We used factor analysis to construct indexes for sets of social factors (private resources, public resources). We then applied panel data multiple regression analysis to exploit both time-series and cross-sectional variation in the data over a 7-year period.
Results: Overall, the results provide evidence that the main effects of independent variables—the incidence of the flu (P<.001); social factors, including media attention (P<.001); private resources, including life quality (P<.001) and health lifestyles (P=.009); and public resources, such as hospital care utilization (P=.008) and public health funds (P=.02)—have significant effects on Web searches for queries related to the flu. After controlling for the number of reported disease cases and Internet access rate by state, we estimate the contribution of social factors to the public health risk perception levels by state (R2=23.37%). The interaction effects between flu incidence and social factors for our search terms did not add to the explanatory power of our regression models (R2<1%).
Conclusions: Our study suggests a practical way to measure the public’s health risk perception for certain diseases using online information search volume by state. The social environment influences public risk perception regardless of disease incidence. Thus, monitoring the social variables can be very helpful in being ready to respond to the public’s behavior in dealing with public health threats.
(J Med Internet Res 2013;15(6):e114)
health risk perception; social influence; ecological system
The Internet has rapidly become an important source of health information: 61% of American Internet users have searched for health information online. Most American Internet users primarily use search engines to look for information, including health information [2-4]. Researchers in medical informatics have found a high correlation between the occurrence of certain search queries and the incidence of certain diseases, especially infectious diseases (eg, the flu), and thus have suggested the use of search query data for syndromic surveillance, or early detection of outbreaks [5-10]; this body of research has been well framed [11-16] and termed infodemiology by Eysenbach [6,17].
The existence of a correlation between search query volume and disease outbreaks raises a number of questions: Is the occurrence of certain search queries fully accounted for by the incidence of certain diseases? Do consumers search for online information related to a certain disease only when they have symptoms related to the disease? Are there situations in which consumers without any symptoms related to the disease search for online information related to the disease?
The perception of risk can play a role in many consumer decisions. Risk perception is the judgment that people make about the characteristics and severity of risks. Over the past few decades, considerable research has been conducted on risk perception. The traditional theories of risk perception (eg, expected utility theory, prospect theory) were established by work in behavioral economics that focused on individuals’ statistical or heuristic estimation of the value of alternative choices [19-22]. Understanding the risks perceived by individuals, and collectively by populations, is very helpful as the basis for designing effective strategies for communicating about risks. As a result, risk perception and risk communication have been used extensively in the field of public health [23,24].
Ecological systems theory holds that people interact with multiple social systems (eg, cultures, communities) in an environment [25,26]. Since its first introduction, ecological systems theory has been applied in various areas, such as health promotion. From the ecological systems perspective, members of a specific population are influenced by the same sociocultural factors. Thus, their collective behavior is shaped by common factors.
We suggest that online information searches related to a health risk reflect the public’s collective perception of the risk, which is associated not only with current health status (eg, the incidence of a disease), but also with the social environments related to the risk (eg, availability of public health resources). Our study extends previous work by exploring the association between online health information search and multiple sociocultural factors related to public health risk perception.
We selected online information searches related to the flu as the object of this study. The seasonal flu occurs on a regular basis; in the United States, an average of 5% to 20% of the population gets seasonal flu and more than 200,000 people are hospitalized annually from seasonal flu-related complications. Thus, a significant proportion of the population has direct experience with the flu. Further, the flu can cause mild to severe illness and consumers are generally aware of the risks related to flu. In this study, we demonstrate the extent to which online information searches for the flu are explained by the incidence of the flu (including seasonal flu and pandemic flu) and sociocultural components of a population’s environment. We suggest that the occurrence of online information search related to a health risk can be a practical way to assess the public’s general concern for the risk, or public health risk perception.
The Ecological View of Risk Perception
From an ecological systems point of view, individuals grow and develop in different layered environmental systems, such as family, school, neighborhood, and community. Because risk perception is a sociocultural construct [29,30], individuals form their perception of risks under the influence of the sociocultural components within these systems. Individuals form their risk perception through 2 types of experiences: direct (personal) and indirect (social) experience with the risk [30,31]. For a specific risk, some members of the population may directly experience the risk (eg, patients during a pandemic and victims of a natural disaster) whereas others may experience the risk only vicariously (eg, through exposure to media accounts of the event).
Both groups have social experience with the risk through sociocultural activities, such as receiving information about the risk from multiple social sources (eg, news media, personal social networks), and interpreting the information based on certain values or cultural biases [29-31]. Thus, individuals in a particular ecological system form their perceptions about a risk in response to a set of sociocultural components in the environment (eg, news coverage by mass media, demographics). Because these individuals share the same sociocultural environment, their risk perceptions have features in common. Individuals’ risk perceptions are formed on the basis of a constellation of direct and indirect experience with a risk (sociocultural factors), and the interaction of the 2 types of experiences. The dynamic socioculturalization process through which individuals’ risk perceptions evolve over time leads to the formation of population risk perception.
As an example, individuals who live in a specific community may form their perception about the risk of smoking influenced by their own experience with smoking and a set of shared sociocultural factors, such as the news coverage by the local and national media about the risks of smoking and the behavioral norms of other individuals in their community. As this dynamic process continues over time for individuals within a community, public risk perception for smoking will develop.
Building from this, our main proposition is that public risk perception is predicted by individuals’ direct (personal) and indirect (social) experiences with a risk, and the interaction of the 2 experiences over time within an ecological system.
Previous research has identified a strong correlation between online information search and the incidence of diseases [5-10]. In the following, we establish our hypotheses related to the social experiences that shape public risk perception. Building from previous literature on risk perception, especially health risk perception, we identify 2 major categories of social factors that are associated with public health risk perception: (1) news coverage by mass media, and (2) the availability of resources (including private and public resources).
Agenda-setting theory proposes that mass media have an important influence on what issues the public considers to be important. A number of studies have found powerful effects of mass media on individual risk perception [33-36]. For example, one study found that the number of news articles about the H1N1 pandemic was positively associated with individual preventive pharmaceutical intervention and engagement in information seeking. It is important to note that media coverage may not always be factually correct. When the coverage of health risks by mass media is misleading (eg, exaggeration or stigmatization), the public may form misperceptions of the characteristics of the risks [37,38]. Regardless of the content of media coverage, the extent of coverage likely affects public risk perception.
We argue that when mass media pay more attention to a health risk by increasing coverage of the risk, the public will have higher awareness of the risk. Further, we suggest that when there is a high incidence of a health risk, the public will become more sensitive to the attention paid by media reporting. Their concern for the risk will be higher as the media attention increases. Thus, our first hypotheses are:
H1a: Online information search related to a health risk will be higher when mass media attention to the risk is higher;
H1b: The effect of the incidence of a health risk on online information search related to the risk will be greater when mass media attention to the risk is greater.
Availability of Resources
Research has shown that the availability of resources can reduce an individual’s perceived risk [39-42]. We classify the resources related to health risk perception into 2 categories: private resources and public resources. Private resources are the resources that individuals can acquire through their own efforts, such as financial, informational, physiological, and physical resources. Studies have found that the availability of private resources is negatively associated with health risk perceptions. Three types of private resources are particularly important in the context of health risk perceptions: life quality (eg, education, family income), health status [45,46], and health lifestyles (eg, tobacco and alcohol consumption).
Public resources are the resources that the public obtains from organizations such as charities and government. Little research has been conducted on the association between public resource support and risk perception. We classify public resources into 4 groups: natural resources (eg, population density, especially risks that are related to natural disasters), financial resources (eg, funding for public health), capacity of public resources (eg, hospital beds), and utilization of public resources (eg, hospital admissions). In our study, we use capacity and utilization of public health resources as measures of availability of health care resources. Because natural and financial resources and capacity and utilization of public resources are important resources with which the public can deal with health risks, we assume that the availability of these public resources is negatively associated with public health risk perception. Further, for those experiencing a health risk the availability of public resources may be particularly critical. Thus, we expect that their risk perception will be more likely to be influenced by the availability of resources.
We propose our second hypotheses:
H2a: Online information search for a health risk will be lower when the availability of private resources represented by life quality, health status, and health lifestyles, and public resources represented by natural and financial resources and capacity and utilization of public services is greater;
H2b: The effect of the incidence of a health risk on online information search related to the risk will be greater when the availability of private and public resources is lower.
Our study aims to explore the relationship between online information searches related to the flu and factors related to public health risk perception for the flu, including the incidence of the flu and the social factors related to the flu (news coverage and availability of resources). We used data from 2004 to 2011 from multiple published sources as detailed in the following section. The unit of analysis of our study is state population. In the following sections, we first detail the measures and data collection process for each variable necessary to test our hypotheses, and then present our analysis.
Online Information Search
Following methods used in previous studies [47-49], in this study we use Google Insights for Search (GIFS) to identify the changing patterns of Web searches used by consumers for queries related to the flu. Details of GIFS methodology are presented in Multimedia Appendix 1.
Research has shown that Internet users usually include 1 or 2 terms in a search query. Thus, for each of our search queries we include 1 or 2 related terms. People may have specific concerns related to prevention, diagnosis, and treatment of the flu. Thus, we preselected 96 search queries based on 3 categories: prevention (eg, flu shots, flu prevention), diagnosis (eg, flu symptoms, flu fever), and treatment of the flu (eg, Tamiflu). Our criterion for query selection was the availability of weekly search volume data for queries for 25 states in the United States (if there is not enough search volume for each query by state, GIFS shows only monthly search volume or no results). After checking the search volumes for these preselected queries, we identified 2 queries that fulfilled our criterion: flu shot(s) and flu symptom(s). The prevalence of the 2 search queries shows that the most common response by the public to the flu is to take preventive actions and to determine whether they have contracted the flu. In our study, we use the search volumes for flu shot(s) and flu symptom(s) to represent the state population’s risk perception for prevention and diagnosis of the flu, respectively.
Public health agencies in the United States often track the percentage of outpatient visits related to influenza-like illness (ILI), collected through the US Influenza Sentinel Provider Surveillance Network. A high ILI percentage indicates that a large fraction of patients are experiencing flu-like symptoms. Based on previous studies of the correlation of Web search and flu surveillance [9,52] and the availability of data, we used weekly ILI outpatient visit rates to measure the weekly incidence of the flu by state (like search data, this measure is automatically normed for the state’s population). We gathered these data from the official website of the Department of Health for each state. The data are not available for all states and all observed years. In all, the dataset includes the weekly ILI rate data for between 15 and 31 states over the time period of 2004 to 2011.
Mass Media Attention
Previous research on the influence of mass media on risk perception has used the number of news articles to measure mass media attention at the national level. Because states vary in their population, we use the number of news articles per 1 million population to measure the relative media attention for state populations. We collected weekly data on the number of news articles by using the news search function in LexisNexis Academic, a comprehensive database of national and regional news media. To find news articles that focused on the topic flu, we set the search term in LexisNexis Academic as flu and the restriction as “headline & lead.” We set the time intervals as those used for search volume data from GIFS, and the sources of news as US newspapers and wires. We also set the article location (articles about a geographic location) as each state, indicating that the articles cover the population of a specific state. We collected annual state population data from the website of the . The data for media attention cover all the states and all the observed years.
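The per-capita normalization of media attention can be illustrated with a minimal sketch. The function name, state labels, article counts, and populations below are hypothetical and are not taken from the study data; the sketch only shows the arithmetic of articles per 1 million population.

```python
def articles_per_million(article_count: int, population: int) -> float:
    """Return the number of news articles per 1 million residents."""
    if population <= 0:
        raise ValueError("population must be positive")
    return article_count * 1_000_000 / population


# Hypothetical weekly article counts and state populations (illustration only).
weekly = {
    "State A": (12, 3_000_000),
    "State B": (12, 600_000),
}

rates = {
    state: articles_per_million(count, pop)
    for state, (count, pop) in weekly.items()
}

for state, rate in rates.items():
    print(f"{state}: {rate:.1f} articles per 1 million population")
```

With the same raw article count, the less populous state receives a higher relative media attention value, which is the reason for normalizing by population.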
Private and Public Resources
For private resources, we have variables indicating health status, life quality, and health lifestyles. According to the Centers for Disease Control and Prevention (CDC), the population groups most vulnerable to the flu are young children under age 5 years, the population aged over 65 years, pregnant women, and the population with chronic diseases, such as HIV. Based on the availability of data, we included 2 variables indicating age-related health status: the percentages of the population under 5 years and over 65 years, and 2 variables indicating chronic disease-related health status: the percentages of the population that have asthma and diabetes. We also included variables indicating life quality: the percentage of the population that has completed a bachelor’s degree, median household income, the percentage of the population that reported good health status, and the health insurance coverage rate. For health lifestyles, we included variables indicating the percentage of the population that used tobacco, exercised regularly, and was overweight or obese. Preventive health behavior is an important part of health lifestyles. Thus, we included a variable indicating the percentage of people over 65 years old who have had flu shots.
For public resources, we included variables indicating natural and financial resources, and capacity and utilization of public resources. Because the flu is a contagious respiratory disease, we used population density as a measure of natural resources. The flu is a health-related risk; therefore, we used public health funding as a measure of financial resources. We use the number of primary physicians per 1 million population as a measure of capacity of ambulatory care, outpatient visits per 1000 population as a measure of utilization of ambulatory care, the number of hospital beds per 1000 population as a measure of hospital care capacity, and hospital admissions, emergency room visits per 1000 population, and inpatient days as measures of utilization of hospital care. We collected the data for private and public resources from the websites of the US Census Bureau, the CDC, Kaiser State Health Facts, and Trust for America’s Health. A majority of the data about health status is captured through state residents’ self-report surveys conducted by the CDC. In all, our dataset for private and public resources includes 20 annual variables. The data for each variable are available for all states (Hawaii is an exception because the data were lacking for the years 2004 and 2005) and for at least 4 years for the observed time period. We present the details about our measures in Tables 1 and 2.
To include as many observations and variables as possible, we use unbalanced panel data in our analyses. According to the CDC, the official annual flu season starts in October and ends in May, covering 33 weeks. As most of the states in our dataset have missing values for the incidence of the flu in some weeks outside of each flu season (especially in the years before the H1N1 flu pandemic occurred), we dropped the observations for these weeks. Because the Web search volume data has been normalized by the total Internet traffic from each respective state, we included control variables representing household Internet usage, the percentage of households with an Internet connection, and the percentage of households with an Internet connection through broadband for each state. These data were obtained from the website of the US Department of Commerce and were available for all of the states for 3 years: 2007, 2009, and 2010.
To account for weekly variations in search volumes, we included 33 dummy variables to indicate the specific weeks in each flu season. We also observed that in the 2 flu seasons following the 2009 H1N1 flu pandemic, search volumes for flu-related queries were higher than in the flu seasons before the pandemic occurred. To account for this variation, we used a dummy variable to indicate the weeks before and after the H1N1 flu pandemic. We present the trends of the means of the search volumes for flu and the incidence of the flu for all the states across the 33 weeks in each flu season in Figure 1.
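As an illustration of how such indicator variables might be coded, the following sketch builds the 33 week-of-season dummies and a before/after H1N1 indicator for one observation. The cutoff date is an assumption made for the example (the text does not state the exact date used), and the function names are hypothetical.

```python
from datetime import date

N_WEEKS = 33  # weeks per flu season, as stated in the text

# Assumption: the exact cutoff for the H1N1 indicator is not given in the
# text; this date is a placeholder for illustration only.
ASSUMED_H1N1_CUTOFF = date(2009, 4, 1)


def week_dummies(week_in_season: int) -> list[int]:
    """Return a one-hot list marking week 1..33 of a flu season."""
    if not 1 <= week_in_season <= N_WEEKS:
        raise ValueError(f"week_in_season must be between 1 and {N_WEEKS}")
    return [1 if w == week_in_season else 0 for w in range(1, N_WEEKS + 1)]


def after_h1n1(week_start: date, cutoff: date = ASSUMED_H1N1_CUTOFF) -> int:
    """Return 1 if the week starts on or after the assumed cutoff, else 0."""
    return int(week_start >= cutoff)


# One hypothetical observation: third week of a season that began in
# October 2010.
row = week_dummies(3) + [after_h1n1(date(2010, 10, 17))]
print(row)
```

In a model that also includes an intercept, one week indicator is normally left out as the reference category to avoid perfect collinearity; the text does not say which week, if any, served that role.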
Table 1. Study variables: measures and types of data.
Table 2. Study variables: data availability and sources.
Figure 1. Trends of the means of search volumes for flu and the incidence of the flu.
We normalized the data for all continuous variables included in the models by natural log transformations. To make the coefficients for interaction effects more interpretable, we centered all the continuous independent variables by subtracting the mean from each value.
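A minimal sketch of this preprocessing follows, assuming the log transformation is applied first and the centering second (the text does not state the order explicitly). The variable names and values are hypothetical.

```python
import math


def log_center(values):
    """Natural-log transform a variable, then subtract the mean of the logs."""
    if any(v <= 0 for v in values):
        raise ValueError("natural log requires strictly positive values")
    logged = [math.log(v) for v in values]
    mean = sum(logged) / len(logged)
    return [x - mean for x in logged]


# Hypothetical values for two continuous predictors (illustration only).
flu_incidence = [1.2, 2.5, 4.0, 0.8]
media_attention = [3.0, 7.5, 12.0, 2.0]

flu_c = log_center(flu_incidence)
media_c = log_center(media_attention)

# The interaction term is the product of the two centered variables.
interaction = [f * m for f, m in zip(flu_c, media_c)]
print(interaction)
```

With centered predictors, the coefficient on each main effect can be read as the effect of that variable when the other variable is at its mean, which is what makes the interaction models easier to interpret.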
Table 3 presents summary statistics for all dependent and independent variables included in the study. We used Stata (StataCorp LP, College Station, TX, USA) to perform factor analysis with varimax rotation to reduce the number of independent variables. Because factor analysis by Stata is conducted on the correlations (as opposed to the covariances), it is not a concern that the variables have different means and/or standard deviations (eg, variables are measured in different scales). Based on the components identified by factor analysis, 5 composite indexes were constructed: (1) life quality index, (2) age-related health status index, (3) chronic disease-related health status index, (4) health lifestyle index, and (5) hospital care utilization index. We used the values of the composite indexes generated by factor analysis in our regression models. The life quality index includes positive factors indicating the percentage of the population with a bachelor’s degree, good health status, health insurance, and median household income. The age-related health status index includes a positive factor indicating the percentage of the population younger than 5 years and a negative factor indicating the percentage of the population older than 65 years. The chronic disease–related health status index includes positive factors indicating the percentage of the population that has been diagnosed with asthma and diabetes. The health lifestyle index includes positive factors indicating the percentage of the population that consumes tobacco and is overweight or obese, and negative factors indicating the percentage of the population that exercises regularly and the percentage of the population older than 65 years who have had a flu shot. The hospital care utilization index includes positive factors indicating the number of hospital admissions, emergency department visits, and inpatient days. All other variables are represented by single data items.
We present factor loadings and the uniqueness for each variable in Table 4.
We applied panel data multiple regression analysis to exploit both time-series and cross-sectional variation in the data using Stata. We built regression models to examine the effects of the incidence of flu, social factors including media attention, and private and public resources and their interaction on the Web search volumes for the 2 queries: flu symptom(s) and flu shot(s). Because we assumed that variation across states was random and uncorrelated with the independent variables, we used random state-specific effects in our models. We also used robust Huber–White standard errors to address any potential heteroscedasticity and autocorrelation in our estimation. To check for collinearity, we examined the correlation matrix of independent variables (Table 5). We found that each pair of variables has a correlation coefficient of less than 0.8, with most less than 0.6.
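The pairwise correlation screening can be sketched as follows. This is an illustration only: the variable names and values are hypothetical, the 0.8 threshold is the one mentioned in the text, and the original analysis was run in Stata rather than Python.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def flag_collinear(columns, threshold=0.8):
    """Return (name_a, name_b, r) for every pair with |r| >= threshold."""
    flagged = []
    for (name_a, xa), (name_b, xb) in combinations(columns.items(), 2):
        r = pearson(xa, xb)
        if abs(r) >= threshold:
            flagged.append((name_a, name_b, r))
    return flagged


# Hypothetical predictors; "beds" is deliberately a multiple of "media".
columns = {
    "media": [1, 2, 3, 4, 5],
    "beds": [2, 4, 6, 8, 10],
    "income": [5, 1, 4, 2, 3],
}

flagged = flag_collinear(columns)
print(flagged)
```

Any pair that such a screen flags would be a candidate for dropping a variable or combining the two into an index before fitting the regression.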
To investigate the separate effects of control variables and independent variables on dependent variables, we ran 6 models for each dependent variable (each search query) sequentially as shown in Table 6.
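As a sketch, the fullest specification (Model 6) can be written in the form below. The notation is illustrative shorthand introduced here, not taken from the original tables: $y_{st}$ is the log search volume for a query in state $s$ and week $t$, $\mathbf{c}_{st}$ collects the control variables, $F_{st}$ is flu incidence, $M_{st}$ is media attention, $\mathbf{r}_{st}$ collects the private and public resource variables, $u_s$ is the random state-specific effect, and $\varepsilon_{st}$ is the error term.

```latex
\begin{equation*}
y_{st} = \beta_0
  + \underbrace{\mathbf{c}_{st}^{\top}\boldsymbol{\gamma}}_{\text{Model 1}}
  + \underbrace{\beta_1 F_{st}}_{\text{Model 2}}
  + \underbrace{\beta_2 M_{st}}_{\text{Model 3}}
  + \underbrace{\beta_3 F_{st} M_{st}}_{\text{Model 4}}
  + \underbrace{\mathbf{r}_{st}^{\top}\boldsymbol{\delta}}_{\text{Model 5}}
  + \underbrace{F_{st}\,\mathbf{r}_{st}^{\top}\boldsymbol{\theta}}_{\text{Model 6}}
  + u_s + \varepsilon_{st}
\end{equation*}
```

Each label marks the model in which that group of terms first enters, so Model $k$ contains every term labeled Model 1 through Model $k$. The change in R2 between consecutive models is then attributed to the newly added group.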
Overall, the results provide substantial evidence that the main effects of the independent variables we analyzed (the incidence of the flu, media attention, and private and public resources) have significant effects on Web search for queries related to the flu. Specifically, the results provide full support for hypotheses H1a and H1b, and partial support for hypotheses H2a and H2b.
The models including control variables (Model 1) were significant (P<.001) with coefficient of determination (R2) values of 37.88% and 59.11% for flu symptoms and flu shots, respectively. Specifically, the dummy variables for the occurrence of H1N1 and seasonality were significant in the models for both search queries. However, with the independent variables sequentially added in Models 2 to 6, H1N1 occurrence showed significant effects only in the model for flu symptoms, not in the model for flu shots. Seasonality contributed significantly to the variance of Web search for flu shots. We present the results for the control variables in Tables 7 and 8.
Flu Incidence and Media Attention
With flu incidence added as an independent variable in model 2, the R2 values increased slightly, by approximately 2% from model 1, for flu symptoms (P<.001) and flu shots (P<.001). Further, with media attention added as an independent variable in model 3, R2 values increased by approximately 7% and 1% from model 2 for flu symptoms (P<.001) and flu shots (P<.001), respectively. With the interaction of flu incidence and media attention added in model 4, the R2 values showed an increase of less than 1% from model 3 for flu symptoms (P<.001) and flu shots (P=.003).
All these models were significant (P<.001) with flu incidence (P<.001), media attention (P<.001), and their interaction (P<.001 for flu symptoms; P=.001 for flu shots) showing positive effects on Web search volume. The changes in R2 values from model 2 to model 3 show that media attention has a stronger positive influence on a population’s search for flu symptoms than for flu shots.
Private and Public Resources
With variables indicating private and public resources added in model 5, the R2 values showed a substantial increase from model 4 of 23.37% for flu symptoms (P<.001) and of 6.28% for flu shots (P<.001). For private resources, the life quality index (P=.001), health lifestyle index (P=.009), and chronic disease index (P=.004) had negative effects on search volume for flu symptoms.
For public resources, the number of outpatient visits (P<.001) and hospital care utilization index (P=.008) had positive effects, and the number of hospital beds had negative effects (P<.001) on search volume for flu symptoms. Public health funds (P=.02) had a negative effect, whereas population density (P=.001) and number of primary physicians (P=.006) had positive effects on search volume for flu shots.
With the interaction of these social factors and flu incidence added in model 6, the R2 values increased slightly from model 5: by 1.2% for flu symptoms (P<.001) and by 0.55% for flu shots (P=.004). The interaction of flu incidence and number of primary physicians had a negative effect on search volume for flu symptoms. The interaction of flu incidence and life quality index had a negative effect on search volume for flu shots. We present the results for the changes in the R2 values and the coefficient results for the independent variables in Tables 9 and 10.
Table 3. Summary statistics.
Table 4. Composite indexes from factor analysis.
Table 5. Correlation matrix of independent variables.
Table 6. Model construction.
Table 7. Coefficients for control variables for flu symptom(s).
Table 8. Coefficients for control variables for flu shot(s).
Table 9. Coefficient of determination (R2) change and coefficient results for dependent variable flu symptom(s).
Table 10. Coefficient of determination (R2) change and coefficient results for dependent variable flu shot(s).
Research on the correlation between the incidence of certain diseases and online information searches related to those diseases has increased in recent years. However, there has been little research on the effects of social factors on online information searches related to disease. In this paper, we demonstrate the usefulness of online search for queries related to a health risk as a measure of public health risk perception. We use publicly available data to demonstrate how such data can be used to provide insights into the factors that influence the public’s perception of health risks.
The results of our regression analyses provide strong support for our hypotheses: Web search volumes for flu-related queries, as a measure of public health risk perception, are predicted by the incidence of the flu and by social factors, including media attention and private and public resources. In addition to the independent impact of these social variables, we anticipated that the effect of incidence of the flu on public risk perception would be heightened by factors in the social environment. However, our models that incorporated the interaction effects between flu incidence and social factors did not add much to the explanatory power of our regression models. The social environment affects public health risk perception regardless of the incidence of diseases.
We modeled information searches for both risk prevention (flu shots) and risk diagnosis (flu symptoms). In our analyses, independent variables, especially media attention and private and public resources, had significant influence on search volumes for flu symptoms; however, seasonality variables had significant influence on search volumes for flu shots. As we anticipated, different factors appear to influence public perception of risk diagnosis and risk prevention.
Both the model for flu symptoms and that for flu shots demonstrate positive main effects for the incidence of the flu on search volumes. When a population’s flu incidence is higher, the population’s concerns for both prevention and diagnosis of the risk are higher.
Our data also support the expected positive effects of media attention, and of its interaction with the incidence of the flu, on search volumes for risk prevention and risk diagnosis. When mass media pay more attention to the risks facing a specific population, both the overall population and the population with the flu show more concern for prevention and diagnosis of the risk. Thus, our results show that the media play a significant role in setting the public agenda for health risk (agenda-setting theory).
Private resources represented by life quality and public resources represented by hospital beds were negatively related to search volumes for flu symptoms. For the risk of the flu, a population with higher life quality and more access to hospital services demonstrated fewer searches for symptoms, whereas a population with lower life quality and less access to hospital services demonstrated more searches for symptoms. These results suggest consumers may use information from the Internet as a substitute for health care resources. Specifically, consumers who are vulnerable because of lower life quality and less available hospital services engage in more Internet searches, perhaps because information available on the Internet is a relatively low-cost and easily accessed source of information about health risks.
With regard to private and public resources related specifically to health, our analyses suggest that there may be some synergistic effects of the 2 types of resources. Private resources represented by healthy lifestyles and public resources represented by outpatient visits and hospital care utilization are positively related to search volume for flu symptoms. Further, public resources represented by primary physicians and public funds were positively related to search volume for flu shots.
Based on these findings, we suggest that when a population has healthier lifestyles and more contact with health care professionals (through outpatient visits, emergency department visits, inpatient stays, availability of primary care physicians, and dedicated public health funds), it may be more conscious about current health risks. These results raise a question about the direction of the relationship between access to health care professionals and consumers’ searches for health information on the Internet. It could be that access to health care professionals stimulates consumers to be more vigilant about risk protective behaviors. If this is the case, primary care physicians and public health agencies play an important role in educating the public to take protective actions.
Private resources represented by chronic diseases had a negative effect on search volume for flu symptoms. For the risk of the flu, a population with a higher incidence of chronic diseases demonstrated fewer searches for flu symptoms. This finding may reflect an environmental constraint on Internet access rather than a lack of interest in such information. As noted in a report from the Pew Research Center, adults living with chronic disease are significantly less likely than healthy adults to have access to the Internet. Individuals with chronic diseases are also more likely to have regular contact with health professionals, which highlights the important role health care providers play in patient education about health risks.
Finally, in our analyses, public resources represented by population density had a positive effect on search volume for flu shots. This finding is important because the flu is a contagious illness, and similar search patterns may also emerge for other communicable diseases.
Our study has important implications for public policy makers and health care professionals theoretically and practically. First, based on ecological systems theory, we proposed that there is a correlation between online health information search and public health risk perception. Recognition of this relationship by policy makers and health care professionals is important. In designing health risk communications strategies and policies, it is critical to take the social environments in which the public engages in online health information search into consideration.
Second, we suggest that the analysis of Internet search query data related to a particular health risk can provide a bellwether of public health risk perception. Our analysis suggests that online health information search is a reflection of public health risk perception that can be predicted by social context variables. We demonstrate a practical way for policy makers and health professionals to monitor these contextual factors. Previous work has shown that aggregate search data reflect public concerns or interests. Following this stream of research, we demonstrate that when a population is concerned about a specific health risk, it engages in online searches about that risk. This online search is predicted by contextual factors. Monitoring these contextual variables on a regular basis can assist policy makers in identifying areas and populations that could benefit from enhanced education. It may also help identify areas on which to focus the development or expansion of resources.
The search volume data for queries representing different stages of risk response, such as prevention and diagnosis, can inform policy makers and health professionals about the likely response of a population to current and emerging threats. Social marketing resources should be allocated based on an understanding of public risk perception of prevention and diagnosis of a health risk. For example, seasonality had more influence on search volume related to prevention than did other variables. Social marketing efforts should be timed to coincide with the seasonal variation in public risk perception for flu prevention. Those states with high levels of public perception for risk prevention can use this finding to help prepare for a new flu season by arranging for extra supplies of flu vaccine and planning effective systems for distributing the vaccine. States with low levels of public risk perception for prevention might benefit from more health education and health promotion prior to the onset of a new flu season to help increase awareness of the impending risk.
We found that media attention and private and public resources had strong effects on public risk perception for symptoms. Populations with higher risk perception for the diagnosis of the flu are likely to have higher demand for products or services related to treatment of the flu (eg, vitamin C supplements, primary care visits). Retailers in states with high levels of public risk perception for response to the flu may want to ensure that they have adequate supply of over-the-counter medications for dealing with flu symptoms. Ambulatory care clinics and primary care providers can assist a population in dealing with the flu by providing educational materials focused on identification of symptoms and by ensuring same-day access to provider visits for patients experiencing flu symptoms. To respond to public risk perception for diagnosis of the flu, social marketing efforts should use sociocultural segmentation (eg, vulnerable and health conscious consumers) to target resources most needed by each segment.
With the popularity of mobile devices (eg, smartphones, iPads), mobile searches are growing among consumers. Surveys have shown that a search engine is the most used application by 77% of smartphone users, and 90% of mobile search activities result in actions (eg, purchasing, recommending). This suggests that search data from mobile devices may reflect the public’s perception of the urgency of the risks and their ability to manage the risks. We suggest that policy makers and health care professionals use mobile search data related to health risks to establish more actionable and timely strategies.
Online information searching is a bidirectional communication process, including sending search requests and receiving search results. Sending search requests reflects the public’s perception of the severity and urgency of risks, whereas receiving search results reflects the public’s perception of their ability to manage or respond to the risks. This study focused on public risk perception as demonstrated by the patterns of search requests. Policy makers and health professionals may further explore public risk perception by examining how the public responds to the results returned for search requests. We suggest that the public’s perception of the management of health risks may be revealed through behaviors that reflect 4 types of social relations (ie, hierarchical, egalitarian, individualist, and fatalist social relations). People with a hierarchical approach to social relations (ie, supporting patriotism, law, and order) may be more likely to click on search results links from official government websites, whereas people with an individualist way of life (ie, supporting individual efforts) may be more likely to click on search results from citizen media (eg, independent journalists). We suggest that investigating the association between the public’s response to search results and social-cultural factors may be a practical way to assess the public’s perception of the management of health risks. Policy makers and health care professionals may combine the patterns of search requests and response to search results to generate a composite index for public health risk perception.
Our study has several limitations. First, data gaps exist for the variables we used to indicate flu incidence and online information searches related to the flu. By using a different unit of analysis, additional relevant data may be available for study. For example, for each flu season from 1997 to the current year, the CDC posts ILI outpatient visit rates for 9 flu surveillance regions on its official website. Regional data are available for Pacific, Mountain, West South Central, East South Central, West North Central, East North Central, New England, mid-Atlantic, and South Atlantic. Using these data, regional variations in public health risk perception could be explored. Similarly, data that are available at the state level could be combined to facilitate regional analysis. Such regional analyses may be particularly relevant to public health risks that occur most commonly in particular geographic areas, such as those related to hurricanes.
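Combining state-level data into regions, as suggested above, could be sketched as follows. This is only an illustration under stated assumptions: the state-to-region mapping is a partial, hypothetical lookup covering four states, the population and rate values are made up, and `regional_rates` is a hypothetical helper that computes a population-weighted mean rather than any method used in the study.

```python
from collections import defaultdict

# Partial, illustrative mapping (assumption): only four states are listed.
STATE_TO_REGION = {
    "UT": "Mountain",
    "CO": "Mountain",
    "CA": "Pacific",
    "WA": "Pacific",
}

def regional_rates(state_rows):
    """Aggregate (state, population, rate) rows into population-weighted
    mean rates per region."""
    weighted_sum = defaultdict(float)
    population_sum = defaultdict(float)
    for state, population, rate in state_rows:
        region = STATE_TO_REGION[state]
        weighted_sum[region] += population * rate
        population_sum[region] += population
    return {r: weighted_sum[r] / population_sum[r] for r in weighted_sum}

# Made-up values: population in millions, rate in percent of visits.
rows = [("UT", 3.0, 2.0), ("CO", 5.0, 4.0), ("CA", 30.0, 1.0), ("WA", 10.0, 3.0)]
result = regional_rates(rows)
print(result)
```

Weighting by population keeps large states from being treated the same as small ones; an unweighted mean would be a different, equally simple design choice.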
Second, we base our findings on aggregate data. One limitation of aggregate data is that they represent the characteristics of a group as a whole but do not allow for analysis of individual variation. We cannot establish how individuals perceive their social environments related to health risks. Future research is needed to investigate individual responses to social factors related to health risks by collecting data from self-report surveys.
Next, our study has only shown the usefulness of sets of variables for the prediction of public risk perception related to the flu. Different types of health risks vary in their characteristics such as immediacy, frequency, and severity. These factors may lead to variations not only in the effects of disease incidence, but also the relationship of sociocultural factors to public risk perception as demonstrated by online information search. More research is needed to identify common and unique variables for the measurement of public risk perception related to different types of health risks. For example, food-borne illnesses and the flu are both common health risks. Vaccination is available for the flu but not for food-borne illnesses. Future research should consider the availability of preventive and treatment options for different health risks as they may affect public perception for the health risks.
Finally, our study has shown the strong effects of traditional mainstream mass media (ie, newspapers and news wires) on public risk perception. Research is needed to investigate the influence of multiple forms of mass media, especially social media (eg, blogs, online social networks), on public risk perception. May’s report about information channels and networks during Hurricane Katrina identified the prominence of digital communication for risk management.
Conflicts of Interest
Multimedia Appendix 1
Google Insights for Search (GIFS) Methodology (PDF file, 21KB).
- Fox S, Jones S. The social life of health information. Washington, DC: Pew Internet & American Life Project; 2009 Jun. URL: http://pewinternet.org/~/media//Files/Reports/2009/PIP_Health_2009.pdf [accessed 2012-10-11]
- Eysenbach G, Köhler C. How do consumers search for and appraise health information on the world wide web? Qualitative study using focus groups, usability tests, and in-depth interviews. BMJ 2002 Mar 9;324(7337):573-577.
- Schwartz KL, Roe T, Northrup J, Meza J, Seifeldin R, Neale AV. Family medicine patients' use of the Internet for health information: a MetroNet study. J Am Board Fam Med 2006 Feb;19(1):39-45.
- Fallows D. Search engine use. Washington, DC: Pew Internet & American Life Project; 2008. URL: http://www.pewinternet.org/~/media//Files/Reports/2008/PIP_Search_Aug08.pdf [accessed 2012-10-12]
- Cooper CP, Mallon KP, Leadbetter S, Pollack LA, Peipins LA. Cancer Internet search activity on a major search engine, United States 2001-2003. J Med Internet Res 2005 Jul 1;7(3):e36.
- Eysenbach G. Infodemiology: tracking flu-related searches on the web for syndromic surveillance. AMIA Annu Symp Proc 2006:244-248.
- Brownstein JS, Freifeld CC, Madoff LC. Influenza A (H1N1) virus, 2009--online monitoring. N Engl J Med 2009 May 21;360(21):2156.
- Pelat C, Turbelin C, Bar-Hen A, Flahault A, Valleron A. More diseases tracked by using Google Trends. Emerg Infect Dis 2009 Aug;15(8):1327-1328.
- Ginsberg J, Mohebbi MH, Patel RS, Brammer L, Smolinski MS, Brilliant L. Detecting influenza epidemics using search engine query data. Nature 2009 Feb 19;457(7232):1012-1014.
- Hulth A, Rydevik G, Linde A. Web queries as a source for syndromic surveillance. PLoS One 2009;4(2):e4378.
- Wong PW, Fu KW, Yau RS, Ma HH, Law YW, Chang SS, et al. Accessing suicide-related information on the internet: a retrospective observational study of search behavior. J Med Internet Res 2013;15(1):e3.
- Zheluk A, Gillespie JA, Quinn C. Searching for truth: internet search patterns as a method of investigating online responses to a Russian illicit drug policy debate. J Med Internet Res 2012;14(6):e165.
- Burton SH, Tanner KW, Giraud-Carrier CG, West JH, Barnes MD. "Right time, right place" health communication on Twitter: value and accuracy of location information. J Med Internet Res 2012;14(6):e156.
- Pervaiz F, Pervaiz M, Abdur Rehman N, Saif U. FluBreaks: early epidemic detection from Google flu trends. J Med Internet Res 2012;14(5):e125.
- Ayers JW, Althouse BM, Allem JP, Ford DE, Ribisl KM, Cohen JE. A novel evaluation of World No Tobacco day in Latin America. J Med Internet Res 2012;14(3):e77.
- Hill S, Mao J, Ungar L, Hennessy S, Leonard CE, Holmes J. Natural supplements for H1N1 influenza: retrospective observational infodemiology study of information and search activity on the Internet. J Med Internet Res 2011;13(2):e36.
- Eysenbach G. Infodemiology and infoveillance: framework for an emerging set of public health informatics methods to analyze search, communication and publication behavior on the Internet. J Med Internet Res 2009;11(1):e11.
- Slovic P. Perception of risk. Science 1987 Apr 17;236(4799):280-285.
- Arrow KJ. Aspects of the Theory of Risk Bearing. Chicago, IL: Markham Publication; 1965.
- Duxbury D, Summers B. Financial risk perception: are individuals variance averse or loss averse? Economics Letters 2004;84(1):21-28.
- Kahneman D, Tversky A. Prospect theory: an analysis of decision under risk. Econometrica 1979;47(2):263-292.
- Viscusi WK. The risks and rewards of criminal activity: a comprehensive test of criminal deterrence. Journal of Labor Economics 1986;4(3):317-340.
- Brewer NT, Chapman GB, Gibbons FX, Gerrard M, McCaul KD, Weinstein ND. Meta-analysis of the relationship between risk perception and health behavior: the example of vaccination. Health Psychol 2007 Mar;26(2):136-145.
- Finucane M, Slovic P, Mertz CK, Flynn J, Satterfield T. Gender, race, and perceived risk: the "white male" effect. Health Risk Society 2000;2(2):159-172.
- Bronfenbrenner U. Toward an experimental ecology of human development. American Psychologist 1977;32:513.
- Klein KJ, Tosi H, Cannella AA. Multilevel theory building: benefits, barriers, and new developments. Academy of Management Review 1999;24:243.
- McLeroy KR, Bibeau D, Steckler A, Glanz K. An ecological perspective on health promotion programs. Health Educ Q 1988;15(4):351-377.
- Edberg M. Essential Readings in Health Behavior: Theory and Practice. Burlington, MA: Jones & Bartlett Publishers; 2010.
- Douglas M, Wildavsky A. Risk and Culture: An Essay on the Selection of Technical and Environmental Dangers. Berkeley, CA: University of California Press; 1982.
- Short JF. The social fabric at risk: Toward the social transformation of risk analysis. American Sociological Review 1984;49(6):711-725.
- Kasperson RE, Renn O, Slovic P, Brown HS, Emel J, Goble R, et al. The social amplification of risk: a conceptual framework. Risk Analysis 1988;8(2):177-187.
- McCombs ME, Shaw DL. The agenda-setting function of mass media. Public Opinion Quarterly 1972;36:176-187.
- Coleman C. The Influence of Mass Media and Interpersonal Communication on Societal and Personal Risk Judgments. Communication Research 1993 Aug;20(4):611-628.
- Sjoberg L. Factors in risk perception. Risk Anal 2000 Feb;20(1):1-11.
- Wahlberg A, Sjoberg L. Risk perception and the media. Journal of Risk Research 2000;3(1):31-50.
- Ibuka Y, Chapman GB, Meyers LA, Li M, Galvani AP. The dynamics of risk perceptions and precautionary behavior in response to 2009 (H1N1) pandemic influenza. BMC Infect Dis 2010;10:296.
- Attavanich W, McCarl BA, Bessler D. The Effect of H1N1 (Swine Flu) Media Coverage on Agricultural Commodity Markets. Applied Economic Perspectives and Policy 2011 May;33(2):241-259.
- Holland K, Blood RW, Pirkis J, Dare A. Postpsychiatry in the Australian media: the "vulnerable" talk back. Asia Pacific Media Educator 2009;19:142-157.
- Pancioli AM, Broderick J, Kothari R, Brott T, Tuchfarber A, Miller R, et al. Public perception of stroke warning signs and knowledge of potential risk factors. JAMA 1998;279(16):1288-1292.
- Maswanya ES, Moji K, Horiguchi I, Nagata K, Aoyagi K, Honda S, et al. Knowledge, risk perception of AIDS and reported sexual behaviour among students in secondary schools and colleges in Tanzania. Health Educ Res 1999 Apr;14(2):185-196.
- Sjöberg L. Political decisions and public risk perception. Reliability Engineering & System Safety 2001 May;72(2):115-123.
- Wildavsky A, Dake K. Theories of risk perception: Who fears what and why? Daedalus 1990;119(4):41-60.
- Grasmück D, Scholz RW. Risk perception of heavy metal soil contamination by high-exposed and low-exposed inhabitants: the role of knowledge and emotional concerns. Risk Anal 2005 Jun;25(3):611-622.
- Rountree PW, Land KC. Perceived risk versus fear of crime: Empirical evidence of conceptually distinct reactions in survey data. Social Forces 1996 Jun;74(4):1353-1376.
- Redelmeier DA, Rozin P, Kahneman D. Understanding patients' decisions. Cognitive and emotional perspectives. JAMA 1993 Jul 7;270(1):72-76.
- Haomiao J, Santana A, Lubetkin EI. Measuring risk perception among low-income minority primary care patients. J Ambul Care Manage 2004 Dec;27(4):314-327.
- Schmidt T, Vosen S. Social Science Research Network. 2009. Forecasting Private Consumption: Survey-Based Indicators vs. Google Trends URL: http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1514369 [accessed 2012-10-12]
- Breyer BN, Eisenberg ML. Use of Google in study of noninfectious medical conditions. Epidemiology 2010 Jul;21(4):584-585.
- Baram-Tsabari A, Segev E. Exploring new web-based tools to identify public interest in science. Public Understanding of Science 2009 Oct;20(1):130-143.
- Jansen BJ, Spink A, Bateman J, Saracevic T. Real life information retrieval: a study of user queries on the Web. SIGIR Forum 1998 Apr;32(1):5-17.
- US Outpatient Influenza-like Illness Surveillance Network (ILINet). URL: http://www2a.cdc.gov/ilinet/ [accessed 2013-04-10]
- Doornik JA. Improving the timeliness of data on influenza-like illnesses using Google search data. 2009 Oct 8. URL: http://www.doornik.com/flu/Doornik(2009)_Flu.pdf [accessed 2012-10-12]
- LexisNexis Academic. URL: http://www.lexisnexis.com/hottopics/lnacademic/? [accessed 2013-04-10]
- US Census Bureau. Data access tools URL: http://www.census.gov/main/www/access.html [accessed 2013-04-10]
- Centers for Disease Control and Prevention. Seasonal influenza (flu): information for specific groups URL: http://www.cdc.gov/flu/groups.htm [accessed 2013-04-10]
- US Census Bureau. Fast facts for Congress URL: http://www.census.gov/fastfacts/ [accessed 2013-04-10]
- Centers for Disease Control and Prevention. Behavioral Risk Factor Surveillance System URL: http://apps.nccd.cdc.gov/gisbrfss/default.aspx [accessed 2013-04-10]
- The Henry J. Kaiser Foundation. State health facts URL: http://www.statehealthfacts.org/ [accessed 2013-04-10]
- Trust for America's Health. State data URL: http://healthyamericans.org/states/ [accessed 2013-04-10]
- Centers for Disease Control and Prevention. Seasonal influenza (flu): the flu season URL: http://www.cdc.gov/flu/about/season/flu-season.htm [accessed 2013-04-10]
- US Census Bureau. Computer and Internet use URL: http://www.census.gov/hhes/computer/publications/2009.html [accessed 2013-04-10]
- Fox S, Purcell K. Chronic disease and the Internet. Washington, DC: Pew Internet & American Life Project; 2010 Mar 24. URL: http://pewinternet.org/~/media//Files/Reports/2010/PIP_Chronic_Disease_with_topline.pdf [accessed 2012-10-12]
- Askitas N, Zimmermann KF. Google econometrics and unemployment forecasting. Applied Economics Quarterly 2009;55:107.
- Google Mobile Ads Blog. 2011 Apr 26. Smartphone user study shows mobile movement under way URL: http://googlemobileads.blogspot.com/2011/04/smartphone-user-study-shows-mobile.html [accessed 2013-02-18]
- Centers for Disease Control and Prevention. Seasonal influenza (flu): past weekly surveillance reports URL: http://www.cdc.gov/flu/weekly/pastreports.htm [accessed 2013-02-27]
- May AL. First informers in the disaster zone: the lessons of Katrina. Washington, DC: The Aspen Institute; 2007. URL: http://www.policyarchive.org/handle/10207/bitstreams/4525.pdf [accessed 2013-02-19]
|GIFS: Google Insights for Search|
|ILI: influenza-like illness|
|Edited by G Eysenbach; submitted 16.10.12; peer-reviewed by N Bragazzi, E Augustson; comments to author 09.02.13; accepted 25.02.13; published 17.06.13|
Please cite as:
Liang Bo, Scammon DL
Incidence of Online Health Information Search: A Useful Proxy for Public Health Risk Perception
J Med Internet Res 2013;15(6):e114
Copyright©Bo Liang, Debra L Scammon. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 17.06.2013.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.