Published on in Vol 22, No 2 (2020): February

Translating the Burden of Pollen Allergy Into Numbers Using Electronically Generated Symptom Data From the Patient’s Hayfever Diary in Austria and Germany: 10-Year Observational Study

Original Paper

1Aerobiology and Pollen Information Research Unit, Department of Oto-Rhino-Laryngology, Medical University of Vienna, Vienna, Austria

2Foundation German Pollen Information, Berlin, Germany

3Department of Dermatology, Venerology and Allergology, Charité Universitätsmedizin, Berlin, Germany

4Paracelsus Medizinische Privatuniversität, Salzburg, Austria

Corresponding Author:

Maximilian Bastl, MSc, PhD

Aerobiology and Pollen Information Research Unit

Department of Oto-Rhino-Laryngology

Medical University of Vienna

Währinger Gürtel 18-20

Vienna, 1090


Phone: 43 4040033380

Fax: 43 4040039040


Background: Pollen allergies affect a significant proportion of the population globally. At present, Web-based tools such as pollen diaries and mobile apps allow for easy and fast documentation of allergic symptoms via the internet.

Objective: This study aimed to characterize the users of the Patient’s Hayfever Diary (PHD), a Web-based platform and mobile app, to apply different symptom score calculations for comparison, and to evaluate the contribution of organs and medications to the total score for the first time.

Methods: The PHD users were filtered with regard to their location in Austria and Germany, significant positive correlation to the respective pollen type (birch/grass), and at least 15 entries in the respective season. Furthermore, 4 different symptom score calculation methods were applied to the datasets from 2009 until 2018, of which 2 were raw symptom scores and 2 were symptom load index (normalized) calculations. Pearson correlation coefficients were calculated pairwise for these 4 symptom score calculations.

Results: Users were mostly male and belonged to the age groups of 21 to 40 years or >40 years. User numbers have increased in the last 5 years, especially when mobile apps were made available. The Pearson correlation coefficients showed a significant linear relationship above 0.9 among the 4 symptom score datasets and thus indicated no significant difference between the different methods of symptom score calculation. The nose contributed the most to the symptom score and determined about 40% of the score.

Conclusions: The exact method of calculation of the symptom score is not critical. All computation methods show the same behavior (increase/decrease during the season). Therefore, the symptom load index is a useful computation method in all fields exploring pollen allergy, and Web-based diaries are a globally applicable tool to monitor the effect of pollen on human health via electronically generated symptom data.

J Med Internet Res 2020;22(2):e16767




Pollen allergy is an overreaction of the immune system to a foreign substance such as pollen grains or (free) allergens. This overreaction inflames the skin, sinuses, airways, or the digestive system [1]. The severity of allergies varies individually and may range from minor irritation to anaphylaxis. The most common symptoms of respiratory allergies are allergic rhinitis, allergic conjunctivitis, and asthma. Pollen allergy is a major problem globally [2] and affects a considerable percentage of the population ranging from 5% to 30% in industrialized countries [3]. The prevalence of pollen allergies is assumed to increase [4] along with its socioeconomic impact [2,5]. Furthermore, 1 million people of 8 million inhabitants in Austria are considered to be affected by pollen allergy [6], and almost 20% of the adults in Germany are affected by an allergy [7].

Only a minority of plants cause pollen allergies. Fewer than 100 species of 250,000 pollen-producing plants are of major interest in this respect [8-10]. For people with pollen allergy globally as well as in Austria and Germany, Betula (birch) and Poaceae (sweet grass family) are considered plants of high importance. Therefore, the birch and grass pollen seasons were selected for this study.

The allergenicity of pollen is influenced by climate, humidity, temperature, and air pollution [11]. The World Allergy Organization (WAO) recommends avoiding the main risk factors including outdoor air pollution [2,12]. Pollen itself may be seen as a green pollutant, and its occurrence in the air above a certain level or concentration may be regarded as an additional factor for air quality, comparable with the levels defined for sulfur dioxide, particulate matter, ozone, or nitrogen dioxide [12]. There is evidence that allergenicity, and thus the burden of allergy, increases with increased levels of air pollution [13-15]. However, allergen content and pollen concentrations are 2 different datasets and cannot always be compared with each other, especially because free allergens are not carried by pollen [16-19]. State-of-the-art pollen monitoring accounts for this fact and has to fulfill certain requirements to allow appropriate pollen information, for example, including symptom data to compensate for the lack of knowledge about the occurrence of major and minor allergens or personal exposure [20,21].

Value of Electronically Generated Symptom Data

The idea of using symptom data in pollen information originates from clinical trials of immunotherapies for allergic diseases, in which feedback from those affected by pollen allergy is recorded as so-called symptom scores in dose-finding or confirmatory studies [22]. Most questionnaires of the freely available crowdsourced symptom diaries are closely related to the questionnaires of the European Medicines Agency (EMA) for such immunotherapy trials and should, therefore, be comparable, but they have not been evaluated for comparability so far.

However, scaling the burden is as important as allergen avoidance itself to improve and monitor the quality of life of the persons concerned: pollen forecasts and pollen information are valuable tools for support [23,24] and are in strong demand during the pollen season [25]. Recently, pollen forecasts and pollen information have been distributed increasingly via mobile health (mHealth) technology such as mobile phones, tablets, and other wireless devices. The use of electronic health (eHealth) technology as a communication and information channel has gained significant importance in informing the public. This phenomenon is observed in countries with higher income [26]. The outreach via mHealth or eHealth technology allowed for symptom data to be used as a crowdsourced indication for the burden caused by pollen allergies and to monitor the impact of pollen on human health. Therefore, such data are integrated more often into pollen information besides pollen measurements and into studies dealing with pollen allergies. Working directly with patients is time consuming and not cost effective. To date, a number of internet tools and mobile apps have become available, depending on country and technology [27-30].

Crowdsourced User Data

The Patient’s Hayfever Diary (PHD), also called pollen diary, was first made available in 2009, developed by UB at the Medical University of Vienna. The pollen diary grew in terms of the included countries, available languages, and usability (available also as the mobile app, Pollen) as well as in user numbers since then. At present, the website is available in 13 countries (Austria, Germany, Switzerland, Great Britain, France, Spain, Slovenia, Sweden, Finland, Turkey, Hungary, Serbia, and Lithuania), whereas the mobile app is available in 8 countries/regions (Austria, Germany, Switzerland, Great Britain, France, Spain, Sweden, and South Tyrol in Italy). More than 240,000 users have entered data across Europe so far, with more than 32,000 users in Austria and more than 160,000 users in Germany over the whole period, making these 2 countries ideal for an in-depth study of electronically generated symptom data (data request on February 12, 2019). Symptom data retrieved from the pollen diary were already analyzed in a couple of studies [27,31]: Those show that an average based on a sufficiently high user number is robust and that symptom data give more insight into the onset of pollen allergy than pollen data alone.


The aims of this study were to (1) analyze the user profiles of the PHD, (2) perform an in-depth study for a 10-year dataset for 2 countries with the highest user numbers, and (3) apply and compare different symptom score calculations to judge their usability to monitor the effect of pollen on human health.

Patient’s Hayfever Diary

The PHD was used as a source for electronically generated symptom data. Data may derive from the webpage or mobile apps (Pollen or Husteblume, the latter only for Germany). The symptom data generated are crowdsourced and gained from users, not patients, because of privacy and data protection issues. Nonetheless, a couple of measures allow for high quality of generated data (see Symptom Data and Symptom Score Calculation Methods). Users were analyzed for the first time with regard to the frequency of certain age groups and gender (see Multimedia Appendices 1 and 2).

The following explanation of the technical background underlines the applicability of such a tool globally: The pollen diary runs a Java-based app on a server in a data center of the Medical University of Vienna. Data are stored in a Structured Query Language database, including a daily encrypted backup stored off-site. Users interact with the pollen diary via a multilingual Web user interface that can be used with any modern Web browser and currently supports 11 languages. In addition, the pollen diary provides a representational state transfer (REST)–based application programming interface (API), which is used by the Pollen app to provide nearly the same functionality as the Web user interface. The pollen diary gathers information via APIs from the European Aeroallergen Network (EAN) database (for displaying pollen loads compared with the user’s symptoms) and an internal data exchange platform, which provides forecasts for pollen and air quality parameters (used for creating personalized forecasts inside the pollen diary). Data gathered by the pollen diary are used (anonymized) in scientific studies and papers. Every communication is secured via HTTP secure/transport layer security (Web user interface and REST API), and access to the REST API is restricted by an internet protocol address, where possible.

Users are granted anonymity. The PHD fulfills the latest European Union (EU) regulation on data privacy (regulation EU 2016/679), adheres to the General Data Protection Regulation, Directive 95/46/EC, and the Council of the EU for data protection, and collects only a minimum of personal data such as email address. Personal data such as birthday, medical conditions, address, or true name are not obligatory. Moreover, personal and symptom datasets are saved on separate servers to avoid any unauthorized connection between them.

Symptom Data and Symptom Score Calculation Methods

The requirement for all users to be included in the study was based on their location (Austria and Germany). The PHD includes an automated background correlation service that correlates users to the pollen concentration of the respective region. For this study, only users with a significant positive correlation to the respective pollen type (birch or grass; P<.01 or P<.05) and 15 or more data entries within the respective pollen season (birch or grass) were included. This procedure limited the available symptom data but provided high-quality data of the symptom scores of users whose scores approach the scores of those diagnosed with pollen allergy the most.
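
The inclusion criteria above can be expressed as a simple filter. The following Python sketch is only an illustration: the `UserSeason` record and its field names are hypothetical and not taken from the PHD database, and the significance threshold is passed as a parameter because the text names both P<.01 and P<.05.

```python
from dataclasses import dataclass


@dataclass
class UserSeason:
    """One user's data for one pollen season (hypothetical record layout)."""
    country: str        # country code, e.g. "AT" or "DE"
    correlation: float  # correlation of the user's symptoms with pollen concentration
    p_value: float      # significance of that correlation
    n_entries: int      # number of diary entries within the pollen season


def is_included(user: UserSeason, alpha: float = 0.05, min_entries: int = 15) -> bool:
    """Apply the three filters described in the text."""
    return (
        user.country in {"AT", "DE"}         # location: Austria or Germany
        and user.correlation > 0             # positive correlation with the pollen type
        and user.p_value < alpha             # ... that is also significant
        and user.n_entries >= min_entries    # 15 or more entries in the season
    )
```

For example, a German user with a significant positive correlation but only 10 entries in the season would be excluded.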

A total of 4 different calculation methods of the symptom data have been applied to the dataset: (1) a raw symptom score (used automatically in the PHD), (2) the symptom load index (SLI) of that raw PHD score, (3) the EMA score, and (4) the SLI of the EMA raw score (Tables 1 and 2).

Table 1. Results of the calculation of the raw Patient’s Hayfever Diary symptom score and the raw European Medicines Agency symptom score per year, season, and country.
Country, allergen, and year | Patient’s Hayfever Diary symptom score | European Medicines Agency symptom score

The calculations of the first two methods are described in detail in the study by Bastl et al [27] but are summarized here for direct comparison. The PHD user process asks for 3 organs of interest: eyes, nose, and lungs. A severity score from 0 to 3 is possible for each organ, with no discomfort (no problems)=0, low discomfort (mild problems)=1, moderate discomfort (moderate problems)=2, and strong discomfort (severe problems)=3, resulting in a maximum of 9 points for all organs. Furthermore, 4 specific symptoms per organ can be selected in addition to this general severity: itching, foreign body sensation, redness, and watering (for the eyes); itching, sneezing, running, and blocked (for the nose); and wheezing, shortness of breath, cough, and asthma (for the lungs). Asthma was included in the PHD; although we are aware that asthma is a disease or condition rather than a symptom, it commonly manifests together with allergic rhinitis [2] and therefore should be documented as well. The highest severity for each organ (9 points) and all selected symptoms (12 points) together amount to 21 points. Medication was included as well through a weighted medication score that assigns more points to medications that affect more than one organ; for example, eye drops have an effect on the eyes but not on the lungs, whereas tablets influence all the organs. Eye medication gives a total of 1.8 points, with 1 point for eye drops or tablets, 0.5 for others, and 0.3 for homeopathic medicine. Nose medication gives a total of 2.05 points, with 1 point for nose drops or tablets, 0.25 for eye drops, 0.5 for others, and 0.3 for homeopathic medicine. Lung medication gives a total of 0.8 points, with 0.25 each for tablets and others and 0.3 for homeopathic medicine. All medications together amount to 4.65, thus resulting in a total symptom score ranging from 0 to a maximum of 25.65. This score is the raw PHD symptom score that was automatically generated by the pollen diary.
The PHD raw symptom score has been developed based on the (1) clinical standards of the General Hospital of Vienna (Austria) and (2) published knowledge at that time but has never been validated. However, it should be noted that a similar score has been validated as a reliable and valid instrument for observational studies and clinical trials and that symptom and medication scores are recommended as a primary outcome of clinical trials [32]. The scale and the inclusion of 3 organs are the same, but the specific symptoms (3 per organ vs 4 per organ for the raw PHD score) and the exact weighting of medication are different. The results of the raw PHD scores are listed in Table 1.
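
The arithmetic behind the maximum of 25.65 points can be checked with a short Python sketch. It is an illustration of the scoring rules described above, not the PHD implementation: the function name, the dictionary layout, and the assumption that medication points are supplied already summed per organ are all hypothetical.

```python
ORGANS = ("eyes", "nose", "lungs")
MAX_SEVERITY = 3      # 0 = none, 1 = mild, 2 = moderate, 3 = severe
MAX_SYMPTOMS = 4      # 4 selectable symptoms per organ, 1 point each
MAX_MEDICATION = {"eyes": 1.8, "nose": 2.05, "lungs": 0.8}  # per-organ totals from the text


def raw_phd_score(severity, symptom_counts, medication):
    """Sum severity, selected symptoms, and medication points over all organs.

    Each argument is a dict keyed by organ; missing organs count as 0.
    Medication points are assumed to be already summed per organ.
    """
    total = 0.0
    for organ in ORGANS:
        s = severity.get(organ, 0)
        n = symptom_counts.get(organ, 0)
        m = medication.get(organ, 0.0)
        if not 0 <= s <= MAX_SEVERITY:
            raise ValueError(f"severity for {organ} must be between 0 and {MAX_SEVERITY}")
        if not 0 <= n <= MAX_SYMPTOMS:
            raise ValueError(f"symptom count for {organ} must be between 0 and {MAX_SYMPTOMS}")
        if not 0 <= m <= MAX_MEDICATION[organ] + 1e-9:
            raise ValueError(f"medication points for {organ} exceed {MAX_MEDICATION[organ]}")
        total += s + n + m
    return total


# Worst case: highest severity, all symptoms, and all medications for every organ.
worst_case = raw_phd_score(
    {organ: MAX_SEVERITY for organ in ORGANS},
    {organ: MAX_SYMPTOMS for organ in ORGANS},
    MAX_MEDICATION,
)
```

Here `worst_case` equals 9 + 12 + 4.65 = 25.65 up to floating-point rounding, which matches the stated maximum.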

The SLI of this raw symptom score is calculated as the average raw PHD symptom score of the same pool of users (filtered per location, correlation with certain aeroallergens, and number of entries within a certain time frame, as mentioned previously), scaled to a range from a minimum of 0 to a maximum of 10. The SLI is thus a normalization of the PHD raw symptom score and was developed to compare crowdsourced symptom data of the PHD with other datasets in a clear and comprehensible way. It has been successfully applied, and its robustness has been proven in a couple of publications [27,31,33]. The results of the SLI scores based on the PHD raw symptom score are listed in Table 2.
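
The exact SLI formula is given in the cited publications and is not reproduced in this text. As a rough illustration only, the following Python sketch assumes the simplest possible normalization: the mean raw score of the user pool, rescaled linearly from the raw range (0 to 25.65) to the SLI range (0 to 10). The function name and the linear rescaling are assumptions, not the authors' confirmed method.

```python
PHD_RAW_MAX = 25.65   # maximum raw PHD symptom score stated in the text


def symptom_load_index(raw_scores, raw_max=PHD_RAW_MAX, scale_max=10.0):
    """Average the raw scores of a user pool and rescale them to 0..scale_max.

    Assumption: a plain linear rescaling; the published SLI may differ.
    """
    if not raw_scores:
        raise ValueError("at least one raw score is required")
    mean_raw = sum(raw_scores) / len(raw_scores)
    return scale_max * mean_raw / raw_max
```

Under this assumption, a pool in which every user reports the maximum raw score would yield an SLI of 10, and a pool without any symptoms would yield 0.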

The EMA raw symptom score is calculated based on the directive EMA/414476/2011 of the EMA.

Symptoms are rated on a 4-point scale that is comparable to the PHD raw symptom score, with absent symptoms=0, mild symptoms=1, moderate symptoms=2, and severe symptoms=3. The organs included are eyes and nose only (no lung symptoms). Two symptoms are included for eyes (tearing and itching/grittiness/redness), and 4 symptoms are included for nose (nasal itching, sneezing, rhinorrhea, and nasal obstruction). Therefore, the maximum EMA raw symptom score amounts to 12 points. The results of the EMA raw symptom scores are listed in Table 1.

The SLI of the EMA raw score is calculated based on the EMA raw symptom score data and thus considers only symptoms associated with eyes and nose. The results of the SLI based on the EMA raw symptom score are listed in Table 2. In addition, the percentage of the affected organ was calculated for the two SLI methods (Table 2).

Table 2. Results of the symptom load index calculations (traditional and European Medicines Agency symptom load index) per year, season, and country, including the percentages of the contribution of each affected organ and the medication score.
Country, allergen, and year | Traditional SLIa calculation: SLI, Eyes (%), Nose (%), Lungs (%), Medb (%) | European Medicines Agency SLI calculation: SLI, Eyes (%), Nose (%)

aSLI: symptom load index.

bMed: medication score.

Pollen Data

We followed the terminology recommended by Galán et al [34] for aerobiological data. Pollen data were selected only from pollen monitoring stations of known high quality, low occurrence of gaps, and wide geographical coverage during the study period of 10 years to allow a justified estimation for the whole of Austria and Germany. All stations included are listed in Multimedia Appendix 3, including their exact location and height above the sea level, with 17 stations for Austria and 28 for Germany. Pollen data were evaluated following the minimum recommendations of the European aerobiology community [35] and the EAN and were derived from automatic volumetric pollen and spore traps of the Hirst design [36]. The EAN standard pollen season definition was chosen, as percentage definitions are recommended for retrospective studies [37]. The season starts at 1% of the Annual Pollen Integral (APIn [34]) and ends at 95% of the APIn of the respective aeroallergen following this definition. The resulting birch and grass pollen seasons with their APIn are given in Multimedia Appendix 4.
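
The percentage-based season definition can be illustrated with a short Python sketch. It assumes a list of daily pollen concentrations for one year and one aeroallergen; the function name and the zero-based day index are choices made for this example and are not taken from the EAN tooling.

```python
def pollen_season(daily_concentrations, start_fraction=0.01, end_fraction=0.95):
    """Return (start_day, end_day) as zero-based indices into the daily series.

    The season starts on the first day the cumulative sum reaches 1% of the
    Annual Pollen Integral (APIn) and ends on the first day it reaches 95%.
    """
    apin = sum(daily_concentrations)   # Annual Pollen Integral
    if apin <= 0:
        raise ValueError("no pollen recorded, so no season can be defined")
    cumulative = 0.0
    start_day = end_day = None
    for day, value in enumerate(daily_concentrations):
        cumulative += value
        if start_day is None and cumulative >= start_fraction * apin:
            start_day = day
        if cumulative >= end_fraction * apin:
            end_day = day
            break
    return start_day, end_day


# Toy series with an APIn of 100: the cumulative sums are 0, 2, 10, 50, 90, 98, 100, 100.
season = pollen_season([0, 2, 8, 40, 40, 8, 2, 0])
```

In this toy series the season runs from index 1 (cumulative sum 2, above 1% of 100) to index 5 (cumulative sum 98, above 95% of 100).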


The graphs and correlation computations were performed using the statistical software R 3.4.3 [38]. The graphs were drafted with the package ggplot2 [39]. The correlations were computed to compare the 4 symptom score calculation methods (Tables 3 and 4). The Pearson correlation coefficients were computed pairwise for all symptom scores: the raw PHD symptom score, the SLI of the raw PHD symptom score, the EMA raw score, and the SLI of the EMA score. The Pearson correlation coefficient is a measure of linear correlation between 2 variables (with 1=total positive linear correlation, 0=no linear correlation, and −1=total negative linear correlation) and is commonly used when a linear relationship is assumed. This method was chosen because it shows the strength of the relationship between the different score calculations. In addition, cause and effect are not relevant in this study, as the goal was to examine possible differences between calculation methods. In the preanalysis, we found that most coefficients reached values of 0.99 when comparing the scores. Hence, we compared the difference between 2 days to remove a trend component, because the symptom data are dependent on pollen data and, thus, follow a trend. The resulting coefficients were slightly lower but still strongly significant, with most values above 0.9 (Tables 3 and 4).
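
The study itself used R; the following Python sketch only illustrates the idea of correlating day-to-day differences instead of the raw series. It assumes that "the difference between 2 days" means the difference between consecutive days (lag 1), which the text does not state explicitly, and all function names are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient of two equally long sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / math.sqrt(sxx * syy)


def day_to_day_differences(series):
    """First differences: value of each day minus the value of the day before."""
    return [later - earlier for earlier, later in zip(series, series[1:])]


def detrended_correlation(scores_a, scores_b):
    """Correlate two daily score series after removing the shared trend."""
    return pearson(day_to_day_differences(scores_a), day_to_day_differences(scores_b))
```

Differencing removes the slowly varying component that both series inherit from the pollen season, so the remaining correlation reflects how closely the two score calculations agree on day-to-day changes.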

Table 3. Pearson correlation coefficients for the birch (Betula) pollen season for the 4 symptom score calculation methods for Austria and Germany from 2009 to 2018. Note the high correlation values for every comparison.
ATd 2009 | 0.964 | 0.954 | 0.938 | 0.914 | 0.947 | 0.960
DEe 2009 | 0.953 | 0.946 | 0.914 | 0.909 | 0.960 | 0.942
AT 2010 | 0.981 | 0.984 | 0.975 | 0.959 | 0.982 | 0.980
DE 2010 | 0.932 | 0.974 | 0.918 | 0.908 | 0.972 | 0.930
AT 2011 | 0.967 | 0.979 | 0.954 | 0.952 | 0.971 | 0.971
DE 2011 | 0.979 | 0.985 | 0.965 | 0.966 | 0.975 | 0.980
AT 2012 | 0.973 | 0.987 | 0.970 | 0.957 | 0.977 | 0.981
DE 2012 | 0.947 | 0.928 | 0.898 | 0.962 | 0.947 | 0.904
AT 2013 | 0.982 | 0.992 | 0.979 | 0.974 | 0.995 | 0.979
DE 2013 | 0.991 | 0.983 | 0.974 | 0.970 | 0.976 | 0.988
AT 2014 | 0.989 | 0.991 | 0.979 | 0.982 | 0.990 | 0.985
DE 2014 | 0.969 | 0.980 | 0.945 | 0.954 | 0.975 | 0.970
AT 2015 | 0.963 | 0.975 | 0.941 | 0.934 | 0.962 | 0.968
DE 2015 | 0.985 | 0.982 | 0.980 | 0.977 | 0.981 | 0.988
AT 2016 | 0.977 | 0.976 | 0.930 | 0.955 | 0.945 | 0.965
DE 2016 | 0.989 | 0.978 | 0.973 | 0.971 | 0.980 | 0.987
AT 2017 | 0.974 | 0.951 | 0.937 | 0.902 | 0.931 | 0.967
DE 2017 | 0.980 | 0.984 | 0.965 | 0.973 | 0.982 | 0.980
AT 2018 | 0.980 | 0.984 | 0.970 | 0.968 | 0.989 | 0.976
DE 2018 | 0.986 | 0.976 | 0.965 | 0.954 | 0.970 | 0.975

aEMA: European Medicines Agency.

bSLI: symptom load index.

cPHD: Patient’s Hayfever Diary.

dAT: Austria.

eDE: Germany.

Table 4. Pearson correlation coefficients for the grass (Poaceae) pollen season for the 4 symptom score calculation methods for Austria and Germany from 2009 to 2018. Note the high correlation values for every comparison.
ATd 2009 | 0.954 | 0.956 | 0.913 | 0.914 | 0.934 | 0.953
DEe 2009 | 0.878 | 0.823 | 0.789 | 0.725 | 0.830 | 0.903
AT 2010 | 0.963 | 0.977 | 0.946 | 0.937 | 0.970 | 0.957
DE 2010 | 0.948 | 0.950 | 0.929 | 0.886 | 0.955 | 0.943
AT 2011 | 0.971 | 0.974 | 0.950 | 0.937 | 0.960 | 0.969
DE 2011 | 0.953 | 0.963 | 0.929 | 0.908 | 0.958 | 0.940
AT 2012 | 0.959 | 0.972 | 0.926 | 0.927 | 0.960 | 0.950
DE 2012 | 0.956 | 0.971 | 0.913 | 0.938 | 0.954 | 0.951
AT 2013 | 0.983 | 0.983 | 0.969 | 0.961 | 0.975 | 0.978
DE 2013 | 0.973 | 0.975 | 0.948 | 0.943 | 0.962 | 0.966
AT 2014 | 0.969 | 0.972 | 0.940 | 0.937 | 0.958 | 0.965
DE 2014 | 0.965 | 0.965 | 0.951 | 0.923 | 0.959 | 0.965
AT 2015 | 0.974 | 0.971 | 0.958 | 0.940 | 0.968 | 0.972
DE 2015 | 0.963 | 0.971 | 0.940 | 0.926 | 0.956 | 0.962
AT 2016 | 0.966 | 0.957 | 0.935 | 0.922 | 0.959 | 0.959
DE 2016 | 0.985 | 0.991 | 0.975 | 0.977 | 0.990 | 0.983
AT 2017 | 0.965 | 0.962 | 0.923 | 0.924 | 0.949 | 0.950
DE 2017 | 0.971 | 0.980 | 0.950 | 0.950 | 0.974 | 0.968
AT 2018 | 0.968 | 0.963 | 0.939 | 0.913 | 0.939 | 0.957
DE 2018 | 0.973 | 0.974 | 0.944 | 0.945 | 0.954 | 0.970

aEMA: European Medicines Agency.

bSLI: symptom load index.

cPHD: Patient’s Hayfever Diary.

dAT: Austria.

eDE: Germany.

User Characterization

In general, user numbers were low at the launch of the PHD and increased in the later years (Multimedia Appendices 1 and 2). The average user numbers over the whole period of 10 years were higher in the grass pollen season than in the birch pollen season. There was a notable increase in 2013, when the PHD became available as a mobile app (Pollen). The highest user numbers occurred in 2014 for the birch season and in 2015 for the grass pollen season in Austria. In contrast, the highest user numbers for Germany occurred in 2016 for both the birch and the grass pollen seasons.

In the gender and age group distribution, less variation in different years could be observed. The gender distribution is fairly similar between Austria and Germany in both pollen seasons: Approximately 55% of users are male. It is noteworthy that the gender is usually indicated.

The age distribution (younger than 21 years, 21-40 years, older than 40 years, and unknown) was indicated much less often by users, although only the age group was asked for and not a specific age or the birthday. On average, approximately 20% of users did not specify their age group. This applies to both countries and pollen seasons. The distribution across the aforementioned groups was fairly similar for Austria and Germany. Users younger than 21 years were the least frequent group, followed by the unknown age group. The most frequent age group varied for the birch and grass pollen seasons: The group older than 40 years dominated in the birch pollen season, whereas the group between the ages of 21 and 40 years dominated in the grass pollen season.

Symptom Score Calculation Methods

The following patterns became apparent when comparing all score calculations in the period from 2009 to 2018 in Austria and Germany (Tables 1 and 2): (1) The scores were usually higher in the birch pollen season, (2) the scores varied from year to year (or season to season), and (3) the scores varied between the countries under study. The highest values were identified for the PHD raw score, followed by the SLI for the raw score, the SLI of the EMA score, and the raw EMA score. This was expected as the EMA raw scores included fewer symptoms and fewer organs, resulting in a lower maximum score. The raw scores resulted in low values in general. However, it has to be considered that these were computed averages and that experiencing the highest severity for all organs with all symptoms and medications is more than unrealistic for a relevant fraction of the population. The same pattern, for example, an increase or decrease of the score, can be observed between the 4 calculation methods. This behavior became even more apparent when visualized for 2017 and 2018 (Figure 1). The curves show the same course, and this applies to both countries, both pollen seasons, and all years. Only the relative level (absolute score values) varied because of the different calculation methods (Figure 1 and Multimedia Appendices 5-8).

Figure 1. Pattern of the four calculation methods: dark continuous line=raw Patient’s Hayfever Diary score (PHD), gray dots=symptom load index of the raw Patient’s Hayfever Diary score (SLI), gray continuous line=European Medicines Agency raw score (EMA_RAW), gray dashed line=symptom load index of the EMA score (EMA_SLI) for Austria (A-D) and Germany (E-H) for the birch (A-B and E-F) and grass (C-D and G-H) pollen seasons for the years 2017 (A, C, E, and G) and 2018 (B, D, F, and H).

The percentages calculated for the SLIs showed their relative contribution to the score (Table 2). These percentages represented a rather robust pattern for both the pollen seasons and the 2 countries. The variation can be attributed mostly to a yearly variation. The highest percentage value was attributed to the nose, followed by eyes, medication, and lungs. The importance of symptoms of the nose was emphasized when calculated for the SLI of the EMA score. The lung percentage was slightly higher during the birch pollen seasons, whereas the percentage for medication intake was slightly higher during the grass pollen seasons.

All computed Pearson correlations (Tables 3 and 4) were highly significant, showing the visually recognizable strong linear relationship between the series. The evident trend because of the relationship between symptom and pollen data series was removed from the time series.


This study shows the evaluation of strictly filtered symptom data over 10 years in 2 Central European countries and pollen seasons. As such, it is informative for the symptom behavior and the user characterization in this region. In addition, 4 different symptom score calculation methods were applied to examine possible divergences in the results. The WAO recommends the inclusion of a concomitant symptom and medication score [40,41]. The PHD was developed based on this recommendation. Therefore, the PHD raw score and the resulting SLI included data on medication use. However, other score calculations were used as well, eg, those of the EMA that included only data on nose and eyes. Aerobiology and related fields often used nasal symptoms as a proxy, eg, nasal scores and medication use [42]; nose and eye symptoms with nose and eye medication [43]; nose and eye symptoms and a visual analog scale [44]; or eyes, nose, and lung symptoms without medication [24]. To our knowledge, the inclusion of nose symptoms applies to all symptom score calculations for pollen allergies.

Principal Findings and Relation to Previous Work

It is worth discussing that our results challenge the current dogma of using a combined symptom and medication score. It seems that scoring symptoms gives the most information, but any indication from medication is missing. This might still be important for clinical trials. An analysis of symptoms vs symptoms and medication scores for clinical trials showed that both measures are able to verify the difference between the placebo and the group receiving the active substance [45]. However, the symptom score leads to less severe values than the score considering rescue medication [45]. The conclusion of that study was that a combined score is a valuable alternative and that the inclusion of rescue medication use leads to an improvement in assessing the symptom severity and treatment effect. Our study focused only on the relationship between the scores without any relation to treatment. Therefore, we cannot give recommendations concerning clinical trials, but for observational studies and the aerobiological field, the use of a symptom or a combined symptom and medication score is justified, as suggested by our data.

The calculation of the percentage contribution of specific organs and of medication intake showed a value of about 40% on average for the nose in this study. This pattern is visible for 10 years in 2 different pollen seasons and for 2 countries. Thus, the nose is recognized as the most important organ reporting allergic symptoms, representing the main burden of a pollen allergy. These findings underline and complement previous studies concerning the significance of nose symptoms [46]. The eyes represent the second highest contribution to the main burden, directly followed by medication use. The additional use of one or the other is justified when analyzing symptom scores because of the similar contribution of both datasets. The lung symptoms contribute the least to the total score. This outcome is probably attributable to the fact that most people affected by a pollen allergy do not frequently experience lung symptoms [46].

Lessons Learned and Limitations

The 4 different symptom score calculation methods underpin the value of nose symptoms for any symptom score. The progress and pattern (increase/decrease during the season) correspond across all calculations, although at a different level depending on the maximum scale of the respective score studied herein (Figure 1 and Multimedia Appendices 5-8). The Pearson correlation coefficients show a significant linear relationship between all symptom score calculation methods (Tables 3 and 4). Most values reach 0.9 even when calculated as the difference between days excluding the trend component (the dependence of symptom data on pollen concentrations). Most values below 0.9 occurred in the first year of the launch of the PHD (in 2009) when user numbers were low and not significant for such analyses.

Data on the user characterization of the PHD are presented herein for the first time and give valuable insights. User numbers are higher during the grass pollen season (Multimedia Appendix 2). Grass pollen allergy is the most frequent pollen allergy in east Austria [47], Germany [48], and Europe in general [4]. User numbers increased significantly when mobile apps that included the PHD as an additional service were provided. This is evidenced by the launch of the Pollen app in 2013 in Austria, the introduction of personalized pollen information in 2014, and the launch of the mobile app Husteblume in 2016 in Germany. The increase in user numbers was observed for both the birch and the grass pollen season.

Nearly all users indicated their gender, but a relevant fraction of them did not indicate their age group. We observed that the PHD users are mostly male (60%:40% on average); thus, the results are biased toward male (and, because of the country selection, German-speaking) users. This finding should be taken into consideration for all conclusions and comparisons with the general population. The bias toward males could be explained by behavior regarding the use of mobile technologies and the internet in general. Recent studies indicate that internet consumption by men is higher than that by women, even when accounting for age and ethnicity, with younger people using the internet most [49]. Internet use is much lower in those older than 45 years and lower still in older adults (aged >65 years), who are less likely to adopt the internet [50]. An analysis of sex differences (not performed in this study) could introduce a gender bias, especially in an unbalanced sample [51]. Therefore, we have restricted our findings to our user pool as a whole (females and males) and leave possible differences and inferences open to future studies. Our findings underline the importance of mHealth technology as a mobile communication channel [52].

The most frequently indicated age group during the birch pollen season was older than 40 years, in contrast to the grass pollen season, in which the most frequent users were in the age group of 21 to 40 years. This pattern was observed in both countries analyzed in this study. It remains unknown why the user age groups differ between the 2 pollen seasons, which age group is most represented among the users whose age group is unknown, and for what reasons.

Finally, the data give more evidence on spatiotemporal aspects of symptom data. Observations of higher and lower symptom scores for different years and pollen seasons (Tables 1 and 2) provide more evidence that the burden of those affected by pollen allergy varies [27]. Seasons and years differ in intensity in terms of the severity of symptoms of those possibly affected by pollen allergy. The biogeographical component is obscured because the analyses were performed on a country level. Still, it is evident that there are also geographical differences and small variations between the datasets from Austria and Germany. The grass pollen season seems to impose a higher burden on average in Germany (Table 1), whereas the pattern of increase or decrease of the birch pollen seasons deviates between the 2 countries (eg, in 2015 and 2016; Table 1).


Users of the PHD and its mobile apps are mostly male and belong to the age groups of 21 to 40 years (grass pollen season) or >40 years (birch pollen season). Crowdsourced symptom datasets are beneficial in light of the increasing number of users of mHealth and eHealth technology and the availability of mobile apps: users receive personalized information based on their individual symptoms, and researchers gain insight into the real burden of those affected by pollen allergy. The user pools for Austria and Germany are fairly similar. The technique of a Web-based diary can be applied globally to allow international monitoring of the effect of pollen on human health.

The evaluation of 4 different symptom score calculations for 2 countries (Austria and Germany) and 2 pollen seasons (birch and grass) over the last decade showed that the choice of the calculation method is not critical. The inclusion of the nose as an affected organ and its symptoms is most relevant, as its contribution to the score calculation is the highest. Herein, the medication score is of similar importance as the eye symptom data. However, the Pearson correlation coefficients show a significant linear relationship for all calculation methods. The SLI calculations smooth the curves (see Figure 1) and give a more stable pattern with fewer high or low values than the raw score calculations. Therefore, the SLI can be recommended as a symptom score calculation method for all applications such as clinical trials. At the same time, all of the computation methods tested herein work as long as they are clearly defined, are used consistently, and include nose symptoms.

There is variation in the symptom scores between pollen seasons, years, and countries. Thus, studies should also refer to a comparison dataset to explore whether their findings can be explained by a known higher burden (specific pollen season), a strong season (year), a sample-specific reaction pattern (gender, age group, and other parameters), or biogeographical factors (country/region).

Symptom data are a most valuable data source for aerobiology, allergology, and all fields involved in pollen allergy research because they give a direct indication of the burden of persons affected. Nonetheless, standardization of symptom scores is needed for clinical trials and allergology in general and should be the goal of a joint effort from all institutions and organizations concerned.


The authors are deeply indebted to Christoph Jäger, who takes care of the EAN and the PHD database from a technical point of view. In addition, the authors owe thanks to all users of the pollen diary and the Pollen and Husteblume apps, who improved the understanding of allergic symptom onset and development, and to all the EAN data suppliers, who laid the foundation for such studies through routine aerobiology work.

Authors' Contributions

The study was designed by KBa, MB, KBe, and UB. Data preparation and analyses were performed by MBe and MBa. KBe additionally contributed data from Germany. Technical and scientific supervision was carried out by UB. All authors were involved in data interpretation and drafting, editing, and final approval of the manuscript.

Conflicts of Interest

KBe, MBe, KBa, and UB report having taken part in the development and/or implementation of the PHD or of freely available mobile apps (Pollen and Husteblume) that carry no advertisements and thus involve no financial interest. MBa has no conflicts to declare.

Multimedia Appendix 1

Characterization of user data from the Patient’s Hayfever Diary during the calculated birch (Betula) pollen season in Austria and Germany. Total user numbers, the percentage of gender (male/female/unknown), and the percentage of age groups (below 21 years/21-40 years/above 40 years/unknown) are presented per year and as an average over the 10-year study period.

PDF File (Adobe PDF File), 57 KB

Multimedia Appendix 2

Characterization of user data from the Patient’s Hayfever Diary during the calculated grass (Poaceae) pollen season in Austria and Germany. Total user numbers, the percentage of gender (male/female/unknown), and the percentage of age groups (below 21 years/21-40 years/above 40 years/unknown) are presented per year and as an average over the 10-year study period.

PDF File (Adobe PDF File), 49 KB

Multimedia Appendix 3

List of pollen monitoring stations included in this study for Austria and Germany and their exact location data and height above sea level. Pollen data were used only to calculate the respective pollen season and the Annual Pollen Integral.

PDF File (Adobe PDF File), 62 KB

Multimedia Appendix 4

Calculation of the Annual Pollen Integral and the pollen season for birch (Betula) and grasses (Poaceae) for Austria and Germany from 2009 to 2018.

PDF File (Adobe PDF File), 52 KB

Multimedia Appendix 5

Pattern of the four calculation methods: dark continuous line=raw Patient’s Hayfever Diary score (PHD), gray dots=symptom load index of the raw Patient’s Hayfever Diary score (SLI), gray continuous line=European Medicines Agency raw score (EMA_RAW), gray dashed line=symptom load index of the European Medicines Agency score (EMA_SLI) for Austria (A-D) and Germany (E-H) for the birch (A-B and E-F) and grass (C-D and G-H) pollen seasons for the years 2015 (A, C, E, and G) and 2016 (B, D, F, and H).

PDF File (Adobe PDF File), 72 KB

Multimedia Appendix 6

Pattern of the four calculation methods: dark continuous line=raw Patient’s Hayfever Diary score (PHD), gray dots=symptom load index of the raw Patient’s Hayfever Diary score (SLI), gray continuous line=European Medicines Agency raw score (EMA_RAW), gray dashed line=symptom load index of the European Medicines Agency score (EMA_SLI) for Austria (A-D) and Germany (E-H) for the birch (A-B and E-F) and grass (C-D and G-H) pollen seasons for the years 2013 (A, C, E, and G) and 2014 (B, D, F, and H).

PDF File (Adobe PDF File), 71 KB

Multimedia Appendix 7

Pattern of the four calculation methods: dark continuous line=raw Patient’s Hayfever Diary score (PHD), gray dots=symptom load index of the raw Patient’s Hayfever Diary score (SLI), gray continuous line=European Medicines Agency raw score (EMA_RAW), gray dashed line=symptom load index of the European Medicines Agency score (EMA_SLI) for Austria (A-D) and Germany (E-H) for the birch (A-B and E-F) and grass (C-D and G-H) pollen seasons for the years 2011 (A, C, E, and G) and 2012 (B, D, F, and H).

PDF File (Adobe PDF File), 73 KB

Multimedia Appendix 8

Pattern of the four calculation methods: dark continuous line=raw Patient’s Hayfever Diary score (PHD), gray dots=symptom load index of the raw Patient’s Hayfever Diary score (SLI), gray continuous line=European Medicines Agency raw score (EMA_RAW), gray dashed line=symptom load index of the European Medicines Agency score (EMA_SLI) for Austria (A-D) and Germany (E-H) for the birch (A-B and E-F) and grass (C-D and G-H) pollen seasons for the years 2009 (A, C, E, and G) and 2010 (B, D, F, and H).

PDF File (Adobe PDF File), 72 KB

  1. Rusznak C, Davies RJ. ABC of allergies. Diagnosing allergy. Br Med J 1998 Feb 28;316(7132):686-689.
  2. Pawankar R, Canonica G, Holgate S, Lockey R, Blaiss M. White Book on Allergy: Update 2013. Milwaukee, Wisconsin: World Allergy Organization; 2013.
  3. Asher MI, Montefort S, Björkstén B, Lai CK, Strachan DP, Weiland SK, ISAAC Phase Three Study Group. Worldwide time trends in the prevalence of symptoms of asthma, allergic rhinoconjunctivitis, and eczema in childhood: ISAAC Phases One and Three repeat multicountry cross-sectional surveys. Lancet 2006 Aug 26;368(9537):733-743.
  4. D'Amato G, Cecchi L, Bonini S, Nunes C, Annesi-Maesano I, Behrendt H, et al. Allergenic pollen and pollen allergy in Europe. Allergy 2007 Sep;62(9):976-990.
  5. Bousquet J, Anto J, Auffray C, Akdis M, Cambon-Thomsen A, Keil T, et al. MeDALL (Mechanisms of the Development of ALLergy): an integrated approach from phenotypes to systems medicine. Allergy 2011 May;66(5):596-604.
  6. Dorner T, Rieder A, Lawrence K, Kunze M. Österreichischer Allergiebericht. Vienna: Verein Altern mit Zukunft; 2006.
  7. Bergmann KC, Heinrich J, Niemann H. Current status of allergy prevalence in Germany: position paper of the Environmental Medicine Commission of the Robert Koch Institute. Allergo J Int 2016;25:6-10.
  8. Accorsi CA, Bandini-Mazzanti M, Romano B, Frenguelli G, Mincigrucci G. Allergenic pollen: morphology and microscopic photographs. In: D'Amato G, Bonini S, Spieksma FT, editors. Allergenic Pollen and Pollinosis in Europe. Oxford: Wiley-Blackwell; 1991:24-44.
  9. D'Amato G, Spieksma FT, Liccardi G, Jäger S, Russo M, Kontou-Fili K, et al. Pollen-related allergy in Europe. Allergy 1998 Jun;53(6):567-578.
  10. Gregory PH. The Microbiology of the Atmosphere. New York: Leonhard Hill (Books Limited, Interscience Publishers Inc); 1961.
  11. Sofiev M, Bergmann KC. Allergenic Pollen: A Review of the Production, Release, Distribution and Health Impacts. Dordrecht: Springer; 2012.
  12. World Health Organization. World Health Organization; 2006. WHO Air Quality Guidelines for Particulate Matter, Ozone, Nitrogen Dioxide and Sulfur Dioxide. Global Update 2005. Summary of Risk Assessment [accessed 2019-02-21]
  13. Obersteiner A, Gilles S, Frank U, Beck I, Häring F, Ernst D, et al. Pollen-associated microbiome correlates with pollution parameters and the allergenicity of pollen. PLoS One 2016;11(2):e0149545.
  14. D'Amato G, Cecchi L, D'Amato M, Liccardi G. Urban air pollution and climate change as environmental risk factors of respiratory allergy: an update. J Investig Allergol Clin Immunol 2010;20(2):95-102; quiz following 102.
  15. Pasqualini S, Tedeschini E, Frenguelli G, Wopfner N, Ferreira F, D'Amato G, et al. Ozone affects pollen viability and NAD(P)H oxidase release from Ambrosia artemisiifolia pollen. Environ Pollut 2011 Oct;159(10):2823-2830.
  16. Bastl K, Kmenta M, Pessi A, Prank M, Saarto A, Sofiev M, et al. First comparison of symptom data with allergen content (Bet v 1 and Phl p 5 measurements) and pollen data from four European regions during 2009-2011. Sci Total Environ 2016 Apr 1;548-549:229-235.
  17. Buters J, Prank M, Sofiev M, Pusch G, Albertini R, Annesi-Maesano I, et al. Variation of the group 5 grass pollen allergen content of airborne pollen in relation to geographic location and time in season. J Allergy Clin Immunol 2015 Jul;136(1):87-95.e6.
  18. Spieksma FT, Kramps JA, van der Linden AC, Nikkels BH, Plomp A, Koerten HK, et al. Evidence of grass-pollen allergenic activity in the smaller micronic atmospheric aerosol fraction. Clin Exp Allergy 1990 May;20(3):273-280.
  19. Süring K, Bach S, Bossmann K, Wolter E, Neumann A, Straff W, et al. PM10 contains particle-bound allergens: dust analysis by flow cytometry. Environ Technol Inno 2016;5:60-66.
  20. Bastl K, Berger M, Bergmann K, Kmenta M, Berger U. The medical and scientific responsibility of pollen information services. Wien Klin Wochenschr 2017 Jan;129(1-2):70-74.
  21. Berger U, Kmenta M, Bastl K. Individual pollen exposure measurements: are they feasible? Curr Opin Allergy Clin Immunol 2014 Jun;14(3):200-205.
  22. European Medicines Agency. London: European Medicines Agency; 2008. Guideline on the Clinical Development of Products for Specific Immunotherapy for the Treatment of Allergic Diseases. Doc. Ref. CHMP/EWP/18504/20   URL: https:/​/www.​​en/​documents/​scientific-guideline/​guideline-clinical-development-products-specific- immunotherapy-treatment-allergic-diseases_en.​pdf [accessed 2019-02-21]
  23. Gonzalo-Garijo MA, Tormo-Molina R, Palacios IS, Pérez-Calderón R, Fernández-Rodríguez S. Use of a short messaging service system to provide information about airborne pollen concentrations and forecasts. J Investig Allergol Clin Immunol 2009;19(5):418-419.
  24. Kiotseridis H, Cilio CM, Bjermer L, Tunsäter A, Jacobsson H, Dahl Å. Grass pollen allergy in children and adolescents-symptoms, health related quality of life and the value of pollen prognosis. Clin Transl Allergy 2013;3:19.
  25. Kmenta M, Zetter R, Berger U, Bastl K. Pollen information consumption as an indicator of pollen allergy burden. Wien Klin Wochenschr 2016 Jan;128(1-2):59-67.
  26. World Health Organization. Management of Patient Information: Trends and Challenges in Member States: Based on the Findings of the Second Global Survey on eHealth. Geneva: World Health Organization; 2012.
  27. Bastl K, Kmenta M, Jäger S, Bergmann K, Berger U. Development of a symptom load index: enabling temporal and regional pollen season comparisons and pointing out the need for personalized pollen information. Aerobiologia 2014;30(3):269-280.
  28. Kmenta M, Bastl K, Jäger S, Berger U. Development of personal pollen information-the next generation of pollen information and a step forward for hay fever sufferers. Int J Biometeorol 2014 Oct;58(8):1721-1726.
  29. de Weger LA, Hiemstra PS, Op den Buysch E, van Vliet AJ. Spatiotemporal monitoring of allergic rhinitis symptoms in The Netherlands using citizen science. Allergy 2014 Aug;69(8):1085-1091.
  30. Costa C, Menesatti P, Brighetti MA, Travaglini A, Rimatori V, Businco AR, et al. Pilot study on the short-term prediction of symptoms in children with hay fever monitored with e-Health technology. Eur Ann Allergy Clin Immunol 2014 Nov;46(6):216-225.
  31. Bastl K, Kmenta M, Berger M, Berger U. The connection of pollen concentrations and crowd-sourced symptom data: new insights from daily and seasonal symptom load index data from 2013 to 2017 in Vienna. World Allergy Organ J 2018;11(1):24.
  32. Häfner D, Reich K, Matricardi PM, Meyer H, Kettner J, Narkus A. Prospective validation of 'Allergy-Control-SCORE(TM)': a novel symptom-medication score for clinical trials. Allergy 2011 May;66(5):629-636.
  33. Bastl K, Kmenta M, Geller-Bernstein C, Berger U, Jäger S. Can we improve pollen season definitions by using the symptom load index in addition to pollen counts? Environ Pollut 2015 Sep;204:109-116.
  34. Galán C, Ariatti A, Bonini M, Clot B, Crouzy B, Dahl A, et al. Recommended terminology for aerobiological studies. Aerobiologia 2017;33(3):293-295.
  35. Galán C, Smith M, Thibaudon M, Frenguelli G, Oteros J, Gehrig R, et al. Pollen monitoring: minimum requirements and reproducibility of analysis. Aerobiologia 2014;30(4):385-395.
  36. Hirst JM. An automatic volumetric spore trap. Ann Appl Biol 1952;39(2):257-265.
  37. Bastl K, Kmenta M, Berger UE. Defining pollen seasons: background and recommendations. Curr Allergy Asthma Rep 2018 Oct 29;18(12):73.
  38. R Core Team. The R Project for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing; 2017. [accessed 2019-02-21]
  39. Wickham H. Ggplot2: Elegant Graphics For Data Analysis. New York: Springer International Publishing; 2016.
  40. Canonica GW, Baena-Cagnani CE, Bousquet J, Bousquet PJ, Lockey RF, Malling H, et al. Recommendations for standardization of clinical trials with Allergen Specific Immunotherapy for respiratory allergy. A statement of a World Allergy Organization (WAO) taskforce. Allergy 2007 Mar;62(3):317-324.
  41. Pfaar O, Demoly P, van Wijk RG, Bonini S, Bousquet J, Canonica GW, European Academy of Allergy and Clinical Immunology. Recommendations for the standardization of clinical outcomes used in allergen immunotherapy trials for allergic rhinoconjunctivitis: an EAACI Position Paper. Allergy 2014 Jul;69(7):854-867.
  42. Karatzas K, Katsifarakis N, Riga M, Werchan B, Werchan M, Berger U, et al. New European Academy of Allergy and Clinical Immunology definition on pollen season mirrors symptom load for grass and birch pollen-induced allergic rhinitis. Allergy 2018 Sep;73(9):1851-1859.
  43. Hjelmroos M. Long-distance transport of Betula pollen grains and allergic symptoms. Aerobiologia 1992;8(2):231-236.
  44. Bousquet J, Lund VJ, van Cauwenberge P, Bremard-Oury C, Mounedji N, Stevens MT, et al. Implementation of guidelines for seasonal allergic rhinitis: a randomized controlled trial. Allergy 2003 Aug;58(8):733-741.
  45. Grouin J, Vicaut E, Jean-Alphonse S, Demoly P, Wahn U, Didier A, et al. The average Adjusted Symptom Score, a new primary efficacy end-point for specific allergen immunotherapy trials. Clin Exp Allergy 2011 Sep;41(9):1282-1288.
  46. Zuberbier T, Abelson MB, Akdis CA, Bachert C, Berger U, Bindslev-Jensen C, Global Allergy and Asthma European Network (GA(2)LEN) European Union Network of Excellence in Allergy and Asthma. Validation of the Global Allergy and Asthma European Network (GALEN) chamber for trials in allergy: innovation of a mobile allergen exposure chamber. J Allergy Clin Immunol 2017 Apr;139(4):1158-1166.
  47. Hemmer W, Schauer U, Trinca AM, Neumann C. Land Niederösterreich: Startseite. St. Pölten: Amt der NÖ Landesregierung, Landesamtsdirektion, Abteilung Gebäudeverwaltung, Amtsdruckerei; 2010. Endbericht 2009 zur Studie: Prävalenz der Ragweedpollen-Allergie in Ostösterreich [accessed 2020-01-14]
  48. Haftenberger M, Laußmann D, Ellert U, Kalcklösch M, Langen U, Schlaud M, et al. [Prevalence of sensitisation to aeroallergens and food allergens: results of the German Health Interview and Examination Survey for Adults (DEGS1)]. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 2013 May;56(5-6):687-697.
  49. Dixon LJ, Correa T, Straubhaar J, Covarrubias L, Graber D, Spence J, et al. Gendered space: the digital divide between male and female users in internet public access sites. J Comput-Mediat Commun 2014;19(4):991-1009.
  50. Neves BB, Fonseca JR, Amaro F, Pasqualotti A. Social capital and internet use in an age-comparative perspective with a focus on later life. PLoS One 2018;13(2):e0192119.
  51. Hamberg K. Gender bias in medicine. Womens Health (Lond) 2008 May;4(3):237-243.
  52. Matricardi PM, Dramburg S, Alvarez-Perea A, Antolín-Amérigo D, Apfelbacher C, Atanaskovic-Markovic M, et al. The role of mobile health technologies in allergy care: An EAACI position paper. Allergy 2019 Jun 22.

API: application programming interface
APIn: Annual Pollen Integral
EAN: European Aeroallergen Network
eHealth: electronic health
EMA: European Medicines Agency
EU: European Union
mHealth: mobile health
PHD: Patient’s Hayfever Diary
REST: representational state transfer
SLI: symptom load index
WAO: World Allergy Organization

Edited by G Eysenbach; submitted 23.10.19; peer-reviewed by C Geller-Bernstein, L de Weger; comments to author 13.11.19; revised version received 20.11.19; accepted 15.12.19; published 21.02.20


©Katharina Bastl, Maximilian Bastl, Karl-Christian Bergmann, Markus Berger, Uwe Berger. Originally published in the Journal of Medical Internet Research, 21.02.2020.

This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication, as well as this copyright and license information must be included.