Published on in Vol 23, No 5 (2021): May

Preprints (earlier versions) of this paper are available at, first published .
Testing a Novel Web-Based Neurocognitive Battery in the General Community: Validation and Usability Study

Testing a Novel Web-Based Neurocognitive Battery in the General Community: Validation and Usability Study

Testing a Novel Web-Based Neurocognitive Battery in the General Community: Validation and Usability Study

Original Paper

1Department of Psychology, Temple University, Philadelphia, PA, United States

2Department of Psychiatry and Behavioral Sciences, University of Minnesota, Minneapolis, MN, United States

3Department of Research and Development, Posit Science Corporation, San Francisco, CA, United States

Corresponding Author:

Riley Capizzi, BSc

Department of Psychology

Temple University

1701 N 13th St

Philadelphia, PA, 19122

United States

Phone: 1 (847) 997 4148


Background: In recent years, there has been increased interest in the development of remote psychological assessments. These platforms increase accessibility and allow clinicians to monitor important health metrics, thereby informing patient-centered treatment.

Objective: In this study, we report the properties and usability of a new web-based neurocognitive assessment battery and present a normative data set for future use.

Methods: A total of 781 participants completed a portion of 8 tasks that captured performance in auditory processing, visual-spatial working memory, visual-spatial learning, cognitive flexibility, and emotional processing. A subset of individuals (n=195) completed a 5-question survey measuring the acceptability of the tasks.

Results: Between 252 and 426 participants completed each task. Younger individuals outperformed their older counterparts in 6 of the 8 tasks. Therefore, central tendency data metrics were presented using 7 different age bins. The broad majority of participants found the tasks interesting and enjoyable and endorsed some interest in playing them at home. Only 1 of 195 individuals endorsed not at all for the statement, “I understood the instructions.” Older individuals were less likely to understand the instructions; however, 72% (49/68) of individuals over the age of 60 years still felt that they mostly or very much understood the instructions.

Conclusions: Overall, the tasks were found to be widely acceptable to the participants. The use of web-based neurocognitive tasks such as these may increase the ability to deploy precise data-informed interventions to a wider population.

J Med Internet Res 2021;23(5):e25082




For decades, neuropsychological methods have been leveraged to understand and characterize the relative strengths and weaknesses of individuals experiencing an array of neuropsychiatric syndromes [1-5]. These profiles have also been shown to predict future deterioration in Alzheimer disease and other cognitive and functional outcomes in schizophrenia, bipolar disorder, depression, and traumatic brain injury [6-8]. However, traditional paper and pencil assessments are lengthy and expensive and require considerable training on the part of the assessor. Given these concerns and the rising availability of personal mobile technologies, a movement to capture reliable cognitive functioning digitally has begun [9-12]. Although remote cognitive technologies are still budding, they promise several benefits in clinical and research settings.

Remote platforms greatly increase the accessibility of psychological testing. The National Institute of Mental Health has established a priority to reach typically underserved populations, such as individuals living with limited physical mobility or in rural areas [13]. Digital measures allow testing to be conducted within the comfort of one’s own home, thereby increasing the ability to serve historically difficult-to-reach patients. Remote testing would also likely reduce the time providers spend scheduling and carrying out in-person assessments. These benefits may also help to reduce assessment costs. Accessibility concerns have taken center stage during the current global pandemic, highlighting the need for reliable, easy-to-use, and remote measures.

In addition to opportunities for accessibility, digital remote methods for cognitive testing encourage the use of longitudinal assessments. Repeated measurements provide a wealth of information above and beyond individual snapshots of performance [14]. A study of individuals at clinical high risk for conversion into a psychotic disorder found that although baseline measures of cognition helped differentiate at-risk individuals from healthy participants, cognitive trajectories followed for 2 years differentiated converters from nonconverters [15]. Another study of healthy older adults found that repeated memory assessments pushed to a handheld device were widely accepted and showed associations with the hippocampal brain structure, whereas typical baseline measures of cognition did not [16].

Although in-person neuropsychological methods unquestionably have their benefits and function optimally in distraction-free testing environments, the generalizability of data collected in these unique settings has been questioned [17]. There is a growing body of literature evaluating assessments that can be administered in people’s daily lives, thus potentially capturing more meaningful data on how they function within their own environments [18,19]. For example, a clinician may be more interested in how their patient performs cognitively while completing their night shift at work rather than in a highly controlled testing center at midday.

Digital methods also provide a means to measure important metrics such as reaction time (RT) and trial-by-trial performance, which have also shown associations with clinical outcomes [20-22]. These potential benefits, alongside the added ability of electronic health record integration, may further facilitate person-centered practice and move us closer toward the goal of precision medicine [23]. However, little to no normative data currently exists for these task batteries, limiting the interpretability of new findings [24].

A particular novel mobile assessment platform implemented by the Posit Science Corporation has shown recent utility capturing cognitive performance differences common to psychiatric disorders, such as schizophrenia and depression, using a neuroscience-informed approach to task development [25,26]. These tasks were designed to capture the variability associated with aberrant neural mechanisms underlying an individual’s ability to flexibly acquire new information and adapt to changing cognitive and emotional demands, integral processes for normative cognitive functioning and general learning ability [26-29]. Sound sweeps, a task of auditory processing speed, showed improvements during a cognitive training intervention in individuals with recent-onset and chronic schizophrenia [30]. The greatest improvement was observed during the first 20 hours of training, followed by a plateau at subsequent assessment points. Intraindividual variability in the time taken to reach this plateau was also associated with the likelihood that the intervention would generalize to other untrained cognitive domains, suggesting a potential target-mediated treatment response. Another study found that improvements in sound sweeps performance after cognitive training were correlated with gains in working memory and global cognition [31]. Despite the apparent utility of these tasks, they have yet to be adequately evaluated in normative data samples.


Therefore, we set out to test a battery of 8 web-based cognitive tasks developed by Posit Science in samples drawn from the general community. Sensory perception, social cognition, and executive functions were digitally assessed in-person with state fair attendees and remotely through testing of college students. In addition, the usability and feasibility of this new battery were investigated in a subset of participants.


All study procedures were approved by the institutional review board of the University of Minnesota (UMN). Recruitment and study participation took place either in-person at the Minnesota State Fair (MSF) or remotely for course credit at UMN. The MSF is the second-most highly attended state fair in the United States, with approximately 2 million Midwesterners visiting every year [32]. State fairgoers represent a wide array of demographic backgrounds [33]. MSF participants were asked to participate if they were aged between 18 years and 80 years (inclusive), and UMN students were asked to participate if they were aged between 18 years and 40 years (inclusive). Inclusion criteria for all participants were (1) no visual, auditory, or motor impairments that would prevent completion of the assessments; (2) fluent and literate in English; and (3) no use of illicit substances or alcohol over the prior 8 hours. We received cognitive data from 816 participants and usability survey responses from 219 participants who participated in a substudy. However, cognitive data from 35 participants who took the battery multiple times were omitted from the primary cognitive analyses below, and survey data were omitted for another 24 participants who completed usability questionnaires multiple times. Therefore, 781 participants were included in the primary analyses, and survey data from 195 participants were inspected (Table 1).

Table 1. Participant demographics by sample.
DemographicsSampleTotal (N=781)

College (n=202)State fair (n=579)

Female, n (%)144 (71.6)338 (58.6)482 (62.0)

Male, n (%)57 (28.4)237 (41.1)294 (37.8)

Intersex, n (%)0 (0.0)2 (0.3)2 (0.3)
Age, mean (SD)20.12 (2.28)44.82 (17.72)38.4 (18.7)
Years of education, mean (SD)14.59 (1.5)16.57 (2.77)16.1 (2.65)
Grade point average, mean (SD)b3.45 (0.35)N/Ac3.45 (0.35)
Occupation level, mean (SD)d5.16 (1.68)3.06 (1.84)3.81 (2.05)

aThree participants preferred not to respond.

bGrade point average only collected in the undergraduate samples.

cN/A: not applicable.

dHollingshead two-factor index: occupational scale.


After receiving informed consent, the study participants provided demographic information and completed a battery of cognitive tasks. College students participated remotely via their own personal devices (eg, tablets or personal computers) and were asked to find a stable internet connection to complete the tasks in a quiet, private environment using headphones, preferably over-the-ear headphones. MSF participants completed the study procedures in an enclosed fair structure using lab iPads (Apple Inc) and over-the-ear headphones. Participants were randomly assigned to complete 3 to 4 of the 8 cognitive assessments. State fair participants completed a subset of the measures because of the time limit set by the Driven to Discover State Fair program. A maximum of 30 minutes was allowed for giving consent and the completion of demographic information, cognitive measures, and questionnaires. Participants were randomly assigned to complete 1 of 2 cognitive batteries that were equivalent in length and comparable in terms of the number of auditory, visual, and social cognition tests.

Online Neurocognitive Assessments

Our work aims to translate measures from basic cognitive neuroscience into short, computerized assessments of discrete cognitive processes that individuals can easily complete with minimal assistance in various settings [25]. These assessments are designed to enable the interpretation of specific deficits that could signal that an individual is experiencing cognitive difficulties and impaired learning ability.

The first step in the development of the assessment suite was to decide on the cognitive domains and processes that are known to play a critical role in an individual’s ability to learn new information, to interact adaptively with cognitive and emotional challenges in the environment, and to adapt to new learning demands. In line with the principles of team science, we integrated theoretical perspectives, technical expertise, and empirical knowledge drawn from a team of cognitive neuroscientists working in human and animal model systems, clinical researchers, and preclinical translational behavioral neuroscientists. Through a consensus-building process, we identified 3 critical neural domains: (1) perceptual processing (sound sweeps and beep seeker), (2) executive functioning (bubble pop, pathfinder, and mind bender), and (3) social-emotional processing (face to face, tap the emotion, and emotional face). Within each domain, we identified constructs, that is, a definable cognitive process that could be measured at the behavioral level and for which there existed clearly hypothesized and measurable neural-circuit mechanisms (eg, for executive functioning, set-shifting). We identified a cognitive neuroscience paradigm that could selectively and parametrically measure each of these constructs at the behavioral level. Guided by item response theory, these assessments use adaptive testing models to adjust the difficulty level according to the user, thereby reducing the duration of the test [34]. Online neurocognitive assessments (ONAs) take an average of 184 (SD 201) seconds to complete. Instructions on sampling the ONAs are presented in Multimedia Appendix 1.

Beep Seeker: Auditory Discrimination and Sensory Memory

Beep seeker is an auditory discrimination task in which participants are presented with a target tone and are asked to identify it in later trials amidst 2 other distractor tones. Accurate identification of the target tone prompts subsequent trials with more similar distractor tones. A linear staircase method was used to identify the participant’s discrimination threshold on a scale of 1 to 15, with higher scores indicating better performance.

Sound Sweeps: Auditory Perception and Processing Speed

Sound sweeps is an auditory perception task in which participants are presented with 2 consecutive tones that may either sweep from a low to high pitch or high to low pitch. The participants were asked to identify the direction of each sweep. The sweep sounds’ speed varied according to trial performance, and participant performance was measured as log10(average RT in seconds).

Bubble Pop: Visual-Spatial Working Memory

Bubble pop is a working memory task in which participants are asked to follow a set of target bubbles that independently move around the screen alongside other distractor bubbles. The number of target bubbles varied with the performance. Accuracy was measured as the number of bubbles correctly tracked using a 2-up, 1-down staircase method.

Pathfinder: Visual-Spatial Learning

Pathfinder is a learning test. A path with 15 nodes is presented. The participant then attempted to recreate the path from memory. If a node is missed, the trial ends, and the path is shown again (5 trials in total). Accuracy was measured as the percentage of nodes correctly recalled out of the total number of nodes.

Mind Bender: Cognitive Flexibility

Mind bender is a task that instructs participants to identify images that follow an established rule around other pictures that violate the rule. Performance was measured as log10(correct trial RT in milliseconds).

Tap the Emotion: Emotion Detection and Inhibitory Control

Tap the emotion requires the participant to tap the screen when a happy or sad face appears and inhibits the prepotent response to tap when presented with a neutral face. Performance was measured as the mean accuracy of neutral trials.

Face to Face: Emotion Identification

A face is shown with a specific emotion, followed by a series of faces with various emotions. Participants were asked to identify the target emotion first shown among the series of faces. The task varies in difficulty by changing the number of emotions presented and the number of faces to choose from. Performance was measured as log10(duration of target stimulus presentation).

Emotional Face: Inhibitory Control of Emotion

Emotional face provides a measure of executive attention, in which an expressive face is shown and overlaid with a congruent or incongruent word (Stroop effect). The increased RT to incongruent stimulus combinations captures the capacity for conflict resolution. Performance was measured by subtracting the mean RT of the correct congruent trials from the mean RT of the correct incongruent trials.

Usability Questionnaire

This lab-designed measure asked participants to indicate how much they agreed with a series of 5 statements using a 5-point Likert scale ranging from not at all to very much. The specific questions are shown in Multimedia Appendix 2.

Statistical Analysis

All main outcomes were screened for normality. Although some measures showed skewness and kurtosis (eg, beep seeker, tap the emotion, and face to face), all had adequate variance. The task distributions are shown in Multimedia Appendix 1. Nonparametric statistics were subsequently used for beep seeker, tap the emotion, and face to face. Outliers greater than 2.5 SD from the mean were winsorized and represented less than 1% of the data. For ease of comparison, outputs from sound sweeps, mind bender, and face to face were transformed so that higher scores indicated better performance.

General tendencies, dispersion metrics, and associations with demographic variables were calculated for each ONA. A total of 2 individuals identified as intersex were excluded from the gender comparison of ONA performance because of insufficient power. Given that 6 of 8 ONAs showed significant associations with age, task performance was also summarized across 7 different age ranges. To investigate concurrent and discriminant validity across ONAs, a Spearman rank-order correlation matrix was summarized; however, a factor analysis was not conducted because of inefficient overlap of measures completed across samples and a subsequent lack of power.

In total, 781 individuals completed at least one ONA (college, n=202; MSF, n=579). Table 2 presents the means, SD, and ranges for each task.

Table 2. Normative statistics by online neurocognitive assessment.
AssessmentsParticipant, n (%)Task metric, mean (SD)MinimumaMaximumb
Beep seeker269 (34.4)5.01 (4.63)1.0015.00
Sound sweepsc269 (34.4)0.98 (0.33)0.001.70
Bubble pop293 (37.5)5.07 (1.20)1.007.80
Path finder323 (41.4)52.32 (24.97)1.00100.00
Mind benderc256 (32.8)−9.09 (1.03)−11.60−6.80
Tap the emotion426 (54.5)78.48 (17.88)20.00100.00
Face to facec339 (43.4)−2.03 (0.42)−3.45−1.50
Emotional face252 (32.3)53.19 (80.11)−169.00342.90

aMinimum value refers to the lowest task metric across participants on a given task.

bMaximum value refers to the highest task metric across participants on a given task.

cRaw task output was inverted so that higher scores indicated better performance.

Over 96.4% (188/195) of participants found the ONAs to be at least a little bit interesting enjoyable. Almost 91.8% (179/195) of individuals found the tasks at least a little bit easy. Only 1 participant endorsed not at all to the statement, “I understood the instructions.” The vast majority of participants found the ONA instructions easy to understand at the level of a little bit or more. Age was found to be negatively associated with the extent to which people agreed that they understood the instructions (ρ=−0.21; P=.004). Still, over 72% (49/68) of participants over the age of 60 years reported that they mostly or very much aligned with the statement, “I understood the instructions.” In addition, when asked how much people agreed with the statement, “I would play these games at home,” 69.2% (135/195) endorsed a little bit or more.

Task relationships with demographic variables such as gender, age, and education are presented in Table 3 and those with occupation level and grade point average (GPA) are presented in Table 4. Male participants outperformed female participants on beep seeker (Wilcoxon rank-sum [W]=6927; P=.01), sound sweeps (two-tailed t test: t163.65=−2.12; P=.04), and path finder (two-tailed t test: t264.93=−4.11; P<.001). Age was negatively correlated with task performance on sound sweeps (Pearson correlation [r]=−0.33; P<.001), bubble pop (r=−0.50; P<.001), path finder (r=−0.38; P<.001), mind bender (r=−0.41; P<.001), tap emotion (ρ=−0.24; P<.001), and face to face (r=−0.19; P<.001). Given the consistent association with age, task performance is displayed across 7 age bins, as shown in Tables 5 and 6. Counterintuitively, fewer years of education were associated with better performance on bubble pop (r=−0.13; P=.03), mind bender (r=−0.13; P=.04), and tap the emotion (ρ=−0.16; P=.001); however, when controlling for age, these relationships were not significant. Higher levels of occupation were significantly associated with an elevation in beep seeker (ρ=0.15; P=.02). Lower levels were associated with better performance on sound sweeps (r=−0.16; P=.03), bubble pop (r=−0.17; P=.01), path finder (r=−0.24; P<.001), mind bender (r=−0.21; P=.001), and tap the emotion (ρ=−0.20; P<.001), but when age was accounted for, only bubble pop remained associated (F1,227=4.12; P=.04). GPA was not significantly correlated with any of the ONAs.

Table 3. Association of online neurocognitive assessments with demographic variables (gender, age, and education).

GenderAge (years)Education (years)

Student two-tailed t testWilcoxon ranked sumPearson correlationSpearman rank-order correlationPearson correlationSpearman rank-order correlation

t test (df)P valueWP valuerP valueρP valuerP valueρP value
Beep seekera6927.
Sound sweepsb−2.12 (163.65).04−0.33<.001<0.001.99
Bubble pop−1.09 (185.61).28−0.50<.001−0.13.03
Path finder−4.11 (264.93)<.001−0.38<.001−0.10.07
Mind benderb−1.39 (145.12).17−0.41<.001−0.13.04
Tap the emotion19,480.28−0.24<.001−0.16.001
Face to faceb12,984.85−0.19<.001−0.01.85
Emotional face1.70 (223.3).09−0.03.610.01.82

aNonparametric test provided given violation of normality.

bRaw task output was inverted so a larger output indicated better performance.

Table 4. Association of online neurocognitive assessments with demographic variables (occupation level and grade point average).

Occupation levelGrade point average

Pearson correlationSpearman rank-order correlationPearson correlationSpearman rank-order correlation

rP valueρP valuerP valueρP value
Beep seekera0.
Sound sweepsb−
Bubble pop−
Path finder−0.24<.0010.13.26
Mind benderb−0.21.0010.08.41
Tap the emotion−0.20<.0010.02.83
Face to faceb−
Emotional face−0.04.560.04.74

aNonparametric test provided given violation of normality.

bRaw task output was inverted so a larger output indicated better performance.

Table 5. Normative data by age bin for ages 17-50 years.
AssessmentsAge bin (years)


Participant, nMean (SD)Participant, nMean (SD)Participant, nMean (SD)Participant, nMean (SD)
Beep seeker763.76 (4.06)635.11 (4.57)226.91 (5.87)326.50 (4.94)
Sound sweeps1021.08 (0.29)621.04 (0.32)190.84 (0.33)220.88 (0.27)
Bubble pop945.59 (0.95)685.41 (1.13)235.04 (1.01)334.93 (1.17)
Path finder9161.86 (21.22)7059.79 (25.02)2555.16 (24.87)3346.18 (21.99)
Mind bender97−8.7 (0.95)56−8.9 (0.94)20−8.82 (0.96)23−9.59 (0.84)
Tap the emotion13686.29 (10.91)10874.76 (20.27)3272.56 (21.05)3974.32 (18.31)
Face to face154−1.96 (0.41)80−1.95 (0.38)22−2.17 (0.35)25−2.09 (0.42)
Emotional face6256.42 (76.89)6353.82 (73.4)2467.65 (91.75)3236.49 (64.09)
Table 6. Normative data by age bin for ages 51-80 years.
AssessmentsAge bin (years)


Participant, nMean (SD)Participant, nMean (SD)Participant, nMean (SD)
Beep seeker356.29 (4.93)343.97 (3.66)73.71 (4.11)
Sound sweeps280.89 (0.28)290.79 (0.38)70.7 (0.40)
Bubble pop324.57 (1.08)333.93 (0.99)103.68 (0.92)
Path finder4945.96 (25.25)4534.98 (21.28)1035.7 (21.89)
Mind bender28−10.05 (1.07)28−9.62 (0.74)4−9.43 (0.48)
Tap the emotion5178.80 (14.93)4875.6 (18.52)1262.92 (25.71)
Face to face30−2.11 (0.39)24−2.33 (0.56)4−2.29 (0.25)
Emotional face3555.09 (73.18)3052.26 (107.65)638.07 (110.4)

Most visual tasks were significantly associated with one another, with Spearman rho coefficients ranging from 0.19 to 0.42, indicating some expected shared variance and evidence for the measurement of unique constructs (Table 7). Beep seeker and bubble pop, which measure similar memory processes across auditory and visual domains, were also correlated with one another; however, the Spearman rho was only 0.28, suggesting that they largely capture distinct constructs. In addition, although significant, sound sweeps was found to be only minimally associated (ρ=0.16-0.36) with other theoretically distinct constructs (eg, visual-spatial learning and emotion detection).

Table 7. Online neurocognitive assessments’ correlation matrix.
AssessmentsBeep seekerSound sweepsBubble popPath finderMind benderTap the emotionFace to face
Sound sweeps


P valueN/A
Bubble pop


P value.002.05
Path finder


P value.22<.001N/A
Mind bender


P value.05<.001.05.005
Tap the emotion


P value.<.001
Face to face


P value.
Emotional face


P value.005N/A.35.38N/A.32.55

aρ: Spearman\'s rank correlation coefficient.

bN/A: not applicable; the tasks were not included in the same battery.

cRedundant information.

Principal Findings and Significance

In this study, we tested a new web-based neurocognitive assessment battery for individuals from the general community. Normative metrics were collected across 8 tasks measuring auditory discrimination, auditory perception, visual-spatial working memory, visual-spatial learning, cognitive flexibility, emotion detection, emotion identification, and inhibitory control of emotion.

Participants found the assessments to be widely acceptable across a range of ages. Age was negatively associated with the extent to which participants were able to understand task instructions; however, the broad majority of individuals in older age groups found the tasks understandable. Still, future iterations of these tasks should consider adaptations to better reach these individuals, such as improving task instructions, increasing practice trials, embedding video tutorials, or lowering the starting level of difficulty to reduce initial frustration. Despite finding the tasks widely interesting and enjoyable, fewer individuals indicated an interest in playing games at home. This difference may suggest that fewer participants would choose to engage with these tasks as a leisure activity but may be open to completing them if recommended by a member of their health care team. It is important to note that contrary to the survey wording, these tasks are not considered to be games and should not be interpreted as such. In contrast to traditional neuropsychological assessments, these measures are brief, allowing clinicians to collect a wealth of cognitive information in a relatively short period. These findings highlight the acceptability and efficiency of the battery.

Participant age was associated with 6 of the 8 tasks, with younger participants performing better than older participants. This trend has been widely observed in the cognitive literature [35-37]. Therefore, normative task data were stratified by age. Hearing sensitivity and sensitivity to mistuned, oddball, or discontinuous tones have been shown to deteriorate with age [38-40]. The slowing of processing speed with age has also been well documented [41,42]. Visuospatial working memory performance also decreases with age, potentially because of differences in chunking strategies and proactive interference [43-45]. Better performance observed with path finder, a task of visuospatial learning, may be due in part to more efficient within-task adaptability by younger individuals, a result identified previously with visuospatial tasks [43]. Other similar age-related associations have been documented in learning paradigms [46]. The findings of this study of decreased cognitive flexibility with age are also largely supported by other studies, such as those examining task-switching capacity [47,48]. Finally, in the literature, older age has been found to be related to poorer emotion recognition, processing facial emotion, and inhibitory control [49-51]. These results likely explain why performance on tap the emotion and face to face, tasks of emotion processing and inhibitory control, was negatively associated with age; however, it is unclear why emotional face did not show an association. Given these findings, which are supported by the greater cognitive literature, normative task data were stratified by age. Therefore, normative task data were stratified by age.

Better performance on a visual learning task was observed in men, corroborating previous gender findings with learning tasks that involve spatial navigation and manipulation [52,53]. These results appear to follow previous findings, suggesting male advantage in detecting interaural time and intensity differences, complex masking tasks, and lateralization of auditory discrimination processing, although these differences are small [54,55]. With samples larger than those in this study, age- and gender-specific norms could be constructed to better characterize the individual performance.

Although previous research has demonstrated associations between performance on cognition and education tests, our task battery did not show relationships with GPA or years of education after controlling for age [56,57]. These findings may indicate that the ONAs successfully captured neural system functioning as opposed to more notion-based intelligence. In addition, only bubble pop performance was related to the occupation level after controlling for age, suggesting a potential benefit of work status.

The majority of predominantly visual ONAs were only minimally associated with one another, as expected, suggesting that they are likely to tap into unique domains of performance. Beep seeker and bubble pop, which measure sensory memory and working memory across auditory and visuospatial domains, respectively, were related. Previous research has supported this association and provided evidence for similar neural mechanisms underlying both memory systems [58,59]. In addition, 2 of the 3 tasks of emotion processing were only minimally correlated with one another. This finding is expected given the varying demands on processes of inhibition between the tasks and differences in the facial targets (eg, responding to faces with any emotion vs identifying faces with matching emotions). It is surprising that tap the emotion and emotional face were not associated, given that both tasks involved processing of facial emotions while inhibiting a prepotent response, although it is unclear whether this relationship would exist with a larger sample who performed both tasks.

Despite the recent development of various digital cognitive assessment tools, few studies have released normative data sets, thus limiting the information that can be gained from these tasks. Collecting normative data from the general community is an important and necessary first step before interpreting cognitive performance in clinical populations. The current global health crisis has arrested the ability to administer traditional in-person cognitive assessments in clinical and research settings. The development of valid and reliable assessment platforms, which can be delivered remotely, has become crucial.

Limitations and Future Work

Despite the strengths of our results, a few limitations exist that should be considered. Although 74.1% (579/781) of the sample came from the general community, a subset of participants was recruited through their attendance at a large public college institution. Therefore, young adults’ performance in this sample may differ somewhat from that of the general population. Due to the limited overlap of measures that each participant completed, we were unable to evaluate whether subsets of the tasks were explained by common underlying factors to investigate construct validity. Similarly, the correlations between tasks should be interpreted with caution, given the variability in the number of individuals who were administered each task. Data on participant race and ethnicity were also not collected as part of this study. In addition, 2 individuals identified as intersex prevented our ability to reliably predict how a greater population of intersex individuals may perform on the presented tasks. Future work should aim to evaluate race and gender minorities more specifically to better understand the performance and feasibility of these populations.

Experiential cognitive assessment in patients’ homes may help gain a better understanding of their true cognitive states, but it also raises the possibility that individuals will lack the attention or motivation to properly engage in testing [9]. Failure to prevent some patients from using substances or eliciting outside help during testing may also be a necessary aspect of remote testing [24,60]. In these situations, performance on a given task may not truly represent the intended cognitive domain, as it is likely to be in a distraction-limited environment. More information is also needed to understand how cognition in the general community fluctuates from shorter to more extended periods when assessed remotely.

The accessibility of these and other digital assessments provides an opportunity to integrate objective behavioral assessments directly into medical records to guide care. Given that the field of health informatics is still budding, future work should evaluate the extent to which these tasks capture unique variance and predict outcomes amidst other data.


This study presented the performance of individuals from the general community using a novel cognitive assessment tool that can be employed remotely. Participants found the tasks to be interactive and easy to use. The development and validation of web-based tasks such as these widely expand the accessibility of cognitive assessment to new populations such as rural groups and others with limited physical mobility and may also increase the ability of health practitioners to conduct repeated testing. Quick, easy-to-use digital assessment platforms with remote capabilities such as this may help bring the field closer to achieving more impactful patient-centered care.

Authors' Contributions

RC assisted with project development, conducted the analysis, and wrote the first draft of the manuscript. MF and BB assisted with project development, supervised the analysis, and revised and optimized further versions of the manuscript. SV led project development and revised versions of the manuscript. NG, AC, KF, and NA participated in data collection and reviewed the manuscript. All the authors contributed and have approved the final manuscript.

Conflicts of Interest

BB is a senior scientist at Posit Science, a company that produces cognitive training and assessment software. The assessments described in this study were provided for research purposes free of charge by Posit Science. SV has been a site PI on an NIH SBIR grant to Positscience Inc.

Multimedia Appendix 1

Supplementary material including online neurocognitive assessment task links and task distributions.

DOCX File , 275 KB

Multimedia Appendix 2

Participant usability feedback for online neurocognitive assessments.

PNG File , 16 KB

  1. Dickinson D, Ramsey ME, Gold JM. Overlooking the obvious: a meta-analytic comparison of digit symbol coding tasks and other cognitive measures in schizophrenia. Arch Gen Psychiatry 2007 May;64(5):532-542. [CrossRef] [Medline]
  2. Josiassen RC, Curry LM, Mancall EL. Development of neuropsychological deficits in Huntington's disease. Arch Neurol 1983 Dec 01;40(13):791-796. [CrossRef] [Medline]
  3. Austin MP, Mitchell P, Goodwin GM. Cognitive deficits in depression: possible implications for functional neuropathology. Br J Psychiatry 2001 Mar;178:200-206. [CrossRef] [Medline]
  4. Raven JC, Court JH, Raven JC. Raven's progressive matrices and vocabulary scales. Oxford Psychologists Press. Oxford, England: Oxford Psychologists Press; 1998.   URL: [accessed 2021-03-26]
  5. Nuechterlein KH, Green MF, Kern RS, Baade LE, Barch DM, Cohen JD, et al. The MATRICS Consensus Cognitive Battery, part 1: test selection, reliability, and validity. Am J Psychiatry 2008 Feb;165(2):203-213. [CrossRef] [Medline]
  6. Tabert MH, Manly JJ, Liu X, Pelton GH, Rosenblum S, Jacobs M, et al. Neuropsychological prediction of conversion to Alzheimer disease in patients with mild cognitive impairment. Arch Gen Psychiatry 2006 Aug;63(8):916-924. [CrossRef] [Medline]
  7. Lee RS, Hermens DF, Naismith SL, Lagopoulos J, Jones A, Scott J, et al. Neuropsychological and functional outcomes in recent-onset major depression, bipolar disorder and schizophrenia-spectrum disorders: a longitudinal cohort study. Transl Psychiatry 2015 Apr 28;5:e555 [FREE Full text] [CrossRef] [Medline]
  8. Millis SR, Rosenthal M, Novack TA, Sherer M, Nick TG, Kreutzer JS, et al. Long-term neuropsychological outcome after traumatic brain injury. J Head Trauma Rehabil 2001 Aug;16(4):343-355. [CrossRef] [Medline]
  9. Bauer RM, Iverson GL, Cernich AN, Binder LM, Ruff RM, Naugle RI. Computerized neuropsychological assessment devices: joint position paper of the American Academy of Clinical Neuropsychology and the National Academy of Neuropsychology. Clin Neuropsychol 2012;26(2):177-196 [FREE Full text] [CrossRef] [Medline]
  10. Hafiz P, Miskowiak KW, Kessing LV, Elleby Jespersen A, Obenhausen K, Gulyas L, et al. The internet-based cognitive assessment tool: system design and feasibility study. JMIR Form Res 2019 Jul 26;3(3):- [FREE Full text] [CrossRef] [Medline]
  11. Resnick HE, Lathan CE. From battlefield to home: a mobile platform for assessing brain health. Mhealth 2016;2:30 [FREE Full text] [CrossRef] [Medline]
  12. Monsch RJ, Burckhardt AC, Berres M, Thomann AE, Ehrensperger MM, Steiner LA, et al. Development of a novel self-administered cognitive assessment tool and normative data for older adults. J Neurosurg Anesthesiol 2019;31(2):218-226. [CrossRef]
  13. Ending disparities in mental health (EDIfy-MH).   URL: https:/​/www.​​about/​organization/​od/​odwd/​ending-disparities-in-mental-health-edify-mh.​shtml [accessed 2020-12-20]
  14. Hultsch DF, Strauss E, Hunter MA, MacDonald SW. Intraindividual variability, cognition, and aging. In: The Handbook of Aging and Cognition. London: Psychology Press; 2007:66.
  15. Lam M, Lee J, Rapisarda A, See YM, Yang Z, Lee S, et al. Longitudinal cognitive changes in young individuals at ultrahigh risk for psychosis. JAMA Psychiatry 2018 Sep 01;75(9):929-939 [FREE Full text] [CrossRef] [Medline]
  16. Allard M, Husky M, Catheline GL, Pelletier A, Dilharreguy B, Amieva H, et al. Mobile technologies in the early detection of cognitive decline. PLoS One 2014;9(12):- [FREE Full text] [CrossRef] [Medline]
  17. Sbordone RJ. Limitations of neuropsychological testing to predict the cognitive and behavioral functioning of persons with brain injury in real-world settings. NeuroRehabilitation 2001;16(4):199-201. [Medline]
  18. Burke LE, Shiffman S, Music E, Styn MA, Kriska A, Smailagic A, et al. Ecological momentary assessment in behavioral research: addressing technological and human participant challenges. J Med Internet Res 2017 Mar 15;19(3):e77 [FREE Full text] [CrossRef] [Medline]
  19. Stone AA, Shiffman S. Ecological momentary assessment (EMA) in behavorial medicine. Ann Behav Med 1994;16(3):199-202. [CrossRef]
  20. Schwartz F, Munich RL, Carr A, Bartuch E, Lesser B, Rescigno D, et al. Negative symptoms and reaction time in schizophrenia. J Psychiatr Res 1991;25(3):131-140. [CrossRef]
  21. Lövdén M, Li SC, Shing YL, Lindenberger U. Within-person trial-to-trial variability precedes and predicts cognitive decline in old and very old age: longitudinal data from the Berlin Aging Study. Neuropsychologia 2007 Sep 20;45(12):2827-2838. [CrossRef] [Medline]
  22. Geary EK, Kraus MF, Pliskin NH, Little DM. Verbal learning differences in chronic mild traumatic brain injury. J Int Neuropsychol Soc 2010 Mar 01;16(3):506-516. [CrossRef]
  23. Vinogradov S. The golden age of computational psychiatry is within sight. Nat Hum Behav 2017 Feb 8;1(2):-. [CrossRef]
  24. Moore RC, Swendsen J, Depp CA. Applications for self-administered mobile cognitive assessments in clinical research: a systematic review. Int J Methods Psychiatr Res 2017 Dec;26(4):A [FREE Full text] [CrossRef] [Medline]
  25. Biagianti B, Fisher M, Brandrett B, Schlosser D, Loewy R, Nahum M, et al. Development and testing of a web-based battery to remotely assess cognitive health in individuals with schizophrenia. Schizophr Res 2019 Jun;208:250-257 [FREE Full text] [CrossRef] [Medline]
  26. Van Vleet T, Stark-Inbar A, Merzenich MM, Jordan JT, Wallace DL, Lee MB, et al. Biases in processing of mood-congruent facial expressions in depression. Psychiatry Res 2019 May;275:143-148 [FREE Full text] [CrossRef] [Medline]
  27. Koo BM, Vizer LM. Mobile technology for cognitive assessment of older adults: a scoping review. Innov Aging 2019;3(1):- [FREE Full text] [CrossRef] [Medline]
  28. CANTAB Mobile.   URL: [accessed 2020-12-20]
  29. Carter CS, Barch DM. Cognitive neuroscience-based approaches to measuring and improving treatment effects on cognition in schizophrenia: the CNTRICS initiative. Schizophr Bull 2007 Sep 17;33(5):1131-1137 [FREE Full text] [CrossRef] [Medline]
  30. Biagianti B, Fisher M, Neilands TB, Loewy R, Vinogradov S. Engagement with the auditory processing system during targeted auditory cognitive training mediates changes in cognitive outcomes in individuals with schizophrenia. Neuropsychology 2016 Nov;30(8):998-1008 [FREE Full text] [CrossRef] [Medline]
  31. Fisher M, Holland C, Merzenich MM, Vinogradov S. Using neuroplasticity-based auditory training to improve verbal memory in schizophrenia. Am J Psychiatry 2009 Jul;166(7):805-811 [FREE Full text] [CrossRef] [Medline]
  32. The biggest state fairs in the United States (Top 10).   URL: [accessed 2020-05-17]
  33. Minnesota state fair. 2019.   URL: [accessed 2021-03-26]
  34. Reise SP, Waller NG. Item response theory and clinical measurement. Annu Rev Clin Psychol 2009 Apr;5(1):27-48. [CrossRef] [Medline]
  35. Salthouse TA, Fristoe N, Rhee SH. How localized are age-related effects on neuropsychological measures? Neuropsychology 1996;10(2):272-285. [CrossRef]
  36. Reitan RM, Wolfson D. Influence of age and education on neuropsychological test results. Clin Neuropsychol 1995 May;9(2):151-158. [CrossRef]
  37. Deary IJ, Der G. Reaction time, age, and cognitive ability: longitudinal findings from age 16 to 63 years in representative population samples. Aging Neuropsychol Cogn 2005 Jun;12(2):187-215. [CrossRef]
  38. Alain C, McDonald KL, Ostroff JM, Schneider B. Age-related changes in detecting a mistuned harmonic. J Acoust Soc Am 2001 May;109(5 Pt 1):2211-2216. [CrossRef] [Medline]
  39. Grube M, von Cramon DY, Rübsamen R. Inharmonicity detection. Effects of age and contralateral distractor sounds. Exp Brain Res 2003 Dec 3;153(4):637-642. [CrossRef] [Medline]
  40. Heinrich A, Schneider B. Age-related changes in within- and between-channel gap detection using sinusoidal stimuli. J Acoust Soc Am 2006 Apr;119(4):2316-2326. [CrossRef] [Medline]
  41. Finkel D, Reynolds CA, McArdle JJ, Pedersen NL. Age changes in processing speed as a leading indicator of cognitive aging. Psychol Aging 2007 Sep;22(3):558-568. [CrossRef] [Medline]
  42. Salthouse TA. The processing-speed theory of adult age differences in cognition. Psychol Rev 1996 Jul;103(3):403-428. [CrossRef] [Medline]
  43. Rowe G, Hasher L, Turcotte J. Age differences in visuospatial working memory. Psychol Aging 2008 Mar;23(1):79-84. [CrossRef] [Medline]
  44. Cornoldi C, Bassani C, Berto R, Mammarella N. Aging and the intrusion superiority effect in visuo-spatial working memory. Neuropsychol Dev Cogn B Aging Neuropsychol Cogn 2007 Jan;14(1):1-21. [CrossRef] [Medline]
  45. Bo J, Borza V, Seidler RD. Age-related declines in visuospatial working memory correlate with deficits in explicit motor sequence learning. J Neurophysiol 2009 Nov;102(5):2744-2754 [FREE Full text] [CrossRef] [Medline]
  46. Howard DV, Howard Jr JH. Age differences in learning serial patterns: direct versus indirect measures. Psychol Aging 1989 Sep;4(3):357-364. [CrossRef] [Medline]
  47. Taconnat L, Raz N, Toczé C, Bouazzaoui B, Sauzéon H, Fay S, et al. Ageing and organisation strategies in free recall: the role of cognitive flexibility. Eur J Cognit Psychol 2009 Mar;21(2-3):347-365. [CrossRef]
  48. Wecker NS, Kramer JH, Hallam BJ, Delis DC. Mental flexibility: age effects on switching. Neuropsychology 2005 May;19(3):345-352. [CrossRef] [Medline]
  49. Keightley ML, Winocur G, Burianova H, Hongwanishkul D, Grady CL. Age effects on social cognition: faces tell a different story. Psychol Aging 2006 Sep;21(3):558-572. [CrossRef] [Medline]
  50. Wieser MJ, Mühlberger A, Kenntner-Mabiala R, Pauli P. Is emotion processing affected by advancing age? An event-related brain potential study. Brain Res 2006 Jun 22;1096(1):138-147. [CrossRef] [Medline]
  51. West R, Alain C. Age-related decline in inhibitory control contributes to the increased Stroop effect observed in older adults. Psychophysiology 2000 Mar;37(2):179-189. [CrossRef]
  52. Collins DW, Kimura D. A large sex difference on a two-dimensional mental rotation task. Behav Neurosci 1997;111(4):845-849. [CrossRef]
  53. Astur RS, Ortiz ML, Sutherland RJ. A characterization of performance by men and women in a virtual Morris water task: a large and reliable sex difference. Behav Brain Res 1998;93(1-2):185-190. [CrossRef] [Medline]
  54. Brown CP, Fitch RH, Tallal P. Sex and hemispheric differences for rapid auditory processing in normal adults. Laterality 1999 Jan;4(1):39-50. [CrossRef] [Medline]
  55. McFadden D. Sex differences in the auditory system. Dev Neuropsychol 1998 Jan;14(2-3):261-298. [CrossRef]
  56. Ceci SJ. How much does schooling influence general intelligence and its cognitive components? A reassessment of the evidence. Dev Psychol 1991;27(5):703-722. [CrossRef]
  57. Ceci SJ, Williams WM. Schooling, intelligence, and income. Am Psychol 1997;52(10):1051-1058. [CrossRef]
  58. Pasternak T, Greenlee MW. Working memory in primate sensory systems. Nat Rev Neurosci 2005;6(2):97-107. [CrossRef] [Medline]
  59. Bonetti L, Haumann NT, Brattico E, Kliuchko M, Vuust P, Särkämö T, et al. Auditory sensory memory and working memory skills: association between frontal MMN and performance scores. Brain Res 2018 Dec 01;1700:86-98. [CrossRef] [Medline]
  60. Tiplady B, Oshinowo B, Thomson J, Drummond G. Alcohol and cognitive function: assessment in everyday life and laboratory settings using mobile phones. Alcohol Clin Exp Res 2009;33(12):2094-2102. [CrossRef] [Medline]

GPA: grade point average
MSF: Minnesota State Fair
ONA: online neurocognitive assessment
RT: reaction time
UMN: University of Minnesota

Edited by R Kukafka; submitted 18.10.20; peer-reviewed by P Hafiz, A Wright; comments to author 18.11.20; revised version received 04.01.21; accepted 16.03.21; published 06.05.21


©Riley Capizzi, Melissa Fisher, Bruno Biagianti, Neelufaer Ghiasi, Ariel Currie, Karrie Fitzpatrick, Nicholas Albertini, Sophia Vinogradov. Originally published in the Journal of Medical Internet Research (, 06.05.2021.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on, as well as this copyright and license information must be included.