This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
The transition to digital pathology usually takes months or years to complete. We were familiarizing ourselves with digital pathology solutions when the COVID-19 outbreak forced us to embark on an abrupt transition to digital pathology.
The aim of this study was to quantitatively describe how the abrupt transition to digital pathology might affect the quality of diagnoses, to identify possible causes through probabilistic modeling, and to qualitatively gauge the perception of this abrupt transition.
A total of 17 pathologists and residents participated in this study; each participant reviewed 25 additional test cases from the archives and completed a final psychological survey. For each case, participants performed several different diagnostic tasks, and their results were recorded and compared with the original diagnoses, which had been made using the gold standard method (ie, conventional microscopy). We performed Bayesian data analysis with probabilistic modeling.
The overall analysis, comprising 1345 different items, resulted in a 9% (117/1345) error rate in using digital slides. The task of differentiating a neoplastic process from a nonneoplastic one accounted for an error rate of 10.7% (42/392), whereas the distinction of a malignant process from a benign one accounted for an error rate of 4.2% (11/258). Apart from residents, senior pathologists generated the most discrepancies (7.9%, 13/164). Our model showed that these differences among career levels persisted even after adjusting for other factors.
Our findings are in line with previous reports, suggesting that the duration of the transition (ie, lengthy or abrupt) might not influence diagnostic performance. Moreover, our findings highlight that senior pathologists may be limited by a digital gap, which may negatively affect their performance with digital pathology. These results can guide the process of digital transition in the field of pathology.
Digital pathology (DP) intends to use computer workstations and digital whole slide imaging to diagnose a pathological process [
Most of the reported discordances between diagnoses in DP and those by the gold standard (ie, evaluation of a glass slide under a microscope) are less than 10% [
In this study, we aimed to (1) quantitatively describe how abrupt transition to DP might affect the quality of diagnosis, (2) model the possible causes via probabilistic modeling, and (3) qualitatively gauge the perception of this abrupt transition.
A detailed description of the study methods is described in
No ethics approval was required for this study. The study participants (ie, pathologists and residents) agreed to—and coauthored—the study.
This study involved 17 participants, divided into 4 groups (career levels) based on their pathology experience: (1) senior (pathologists with >20 years of experience, n=2), (2) expert (pathologists with 10-20 years of experience, n=5), (3) junior (pathologists with <10 years of experience, n=6), and (4) resident (1st year, n=1; 2nd year, n=3). Each of the 17 participants evaluated 25 digital cases (425 case evaluations in total). Overall, 1445 questions (ie, 85 per participant) were examined in the study.
In addition to their own diagnostic tasks, which were not considered in this study, the pathologists and residents received (1) a set of digital cases within the area of general surgical pathology, (2) specific questions to be addressed while reviewing the cases, and (3) a survey about their digital experience.
We set up 5 sets of digital cases representing 3 different specialties (breast: n=2; urology: n=1; gastrointestinal: n=2) and assigned them to each study participant. Each test comprised 5 cases, each represented by one or more slides from a single case previously diagnosed using conventional microscopy by the referral pathologist at our institution. The original diagnosis was considered the gold standard. To cover a spectrum of conditions reflecting routine practice, we considered both biopsy and surgical specimens (specimen type). Cases were digitized using the Aperio AT2 scanner (Leica Biosystems) and visualized using the WebViewer APERIO ImageScope (version 12.1). The slides used for the tests were from 8 nontumoral and 17 tumoral cases. Of the tumoral cases, 7 tumors were benign and 10 were malignant; all malignant tumors were infiltrative and equally distributed between grade 2 and grade 3. Overall, 14 cases were biopsy specimens and 11 were surgical specimens.
For each case, participants answered all or some of the following questions (ie, categories of diagnostic tasks): (1) Is it neoplastic or negative for neoplasia? (2) Is it a malignant (in situ or infiltrative) or a benign neoplasia? (3) What is the histopathological diagnosis? (4) What is the histotype of the lesion? (5) What is the grade of the lesion? Questions 1 and 3 were answered for all cases, question 2 only for neoplastic lesions, and questions 4 and 5 only for malignant neoplasms.
To model the data clusters, we used a varying-effects, multilevel (hierarchical) model [
For each pathologist (PID), their career level (LEVEL), the specific diagnostic question (CATEGORY), the specimen type (SPECIMEN), and the subspecialty of the case (SPECIALTY), we used the logit link function and modeled the varying intercepts as follows:

E_i ∼ Binomial(1, p_i)
logit(p_i) = ᾱ + α_PID[i] + β_LEVEL[i] + γ_CATEGORY[i] + δ_SPECIMEN[i] + ε_SPECIALTY[i]

The prior distributions for the intercepts and SD values were as follows:

α_PID ∼ Normal(0, σ_a); β_LEVEL ∼ Normal(0, σ_b); γ_CATEGORY ∼ Normal(0, σ_c); δ_SPECIMEN ∼ Normal(0, σ_d); ε_SPECIALTY ∼ Normal(0, σ_e)

Hyperpriors were placed on the hyperparameters, that is, on the average pathologist ᾱ and on the SD value σ for each category of cluster (see the supplementary materials for the full specification).
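As an illustration only, the varying-intercepts structure described above can be sketched as a prior predictive simulation in Python. The hyperprior values below (Normal(0, 1.5) for the average pathologist, Exponential(1) for the cluster SDs) are assumptions in the style of common multilevel-modeling practice, not necessarily the values used in the study:

```python
import numpy as np

rng = np.random.default_rng(42)

def inv_logit(x):
    """Inverse of the logit link: maps log-odds to an error probability."""
    return 1.0 / (1.0 + np.exp(-x))

n_sims = 10_000

# Hyperpriors (assumed values): the grand mean (the "average pathologist")
# and one SD per category of cluster (pathologist, career level,
# diagnostic task, specimen type, subspecialty).
alpha_bar = rng.normal(0.0, 1.5, n_sims)
sigmas = rng.exponential(1.0, (5, n_sims))

# One varying intercept per cluster type, each drawn from Normal(0, sigma).
offsets = sum(rng.normal(0.0, s) for s in sigmas)

# Prior predictive distribution of the error probability for one item.
p_error = inv_logit(alpha_bar + offsets)
print(p_error.mean(), p_error.std())
```

Because the priors are symmetric on the log-odds scale, the prior predictive error probability is centered on 0.5 with a wide spread; that is, before seeing the data, the model is agnostic about performance.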
The survey was inspired by previous published works [
The pathologists answered 1345 of the total 1445 questions (100 missing answers); of these, 1228 (91.30%) corresponded to the original diagnoses and were considered correct.
Proportion of errors among different groups.

| Group | Number of tasks performed | Number of errors | Error rate |
| --- | --- | --- | --- |
| Pathologist | | | |
| P1 | 84 | 5 | 0.06 |
| P2 | 78 | 4 | 0.05 |
| P3 | 82 | 7 | 0.09 |
| P4 | 67 | 1 | 0.01 |
| P5 | 82 | 7 | 0.09 |
| P6 | 82 | 6 | 0.07 |
| P7 | 83 | 2 | 0.02 |
| P8 | 84 | 3 | 0.04 |
| P9 | 82 | 5 | 0.06 |
| P10 | 83 | 3 | 0.04 |
| P11 | 82 | 9 | 0.11 |
| P12 | 83 | 3 | 0.04 |
| P13 | 81 | 26 | 0.32 |
| P14 | 64 | 9 | 0.14 |
| P15 | 84 | 12 | 0.14 |
| P16 | 79 | 9 | 0.11 |
| P17 | 65 | 6 | 0.09 |
| Career level | | | |
| Resident | 310 | 47 | 0.15 |
| Junior | 460 | 30 | 0.07 |
| Expert | 411 | 27 | 0.07 |
| Senior | 164 | 13 | 0.08 |
| Diagnostic task | | | |
| Neoplasia? | 392 | 42 | 0.11 |
| Malignant/benign? | 258 | 11 | 0.04 |
| Histopathological diagnosis? | 388 | 35 | 0.09 |
| Histotype? | 160 | 2 | 0.01 |
| Grade? | 147 | 27 | 0.18 |
| Specimen type | | | |
| Surgery | 716 | 40 | 0.06 |
| Biopsy | 629 | 77 | 0.12 |
| Subspecialty | | | |
| Breast | 550 | 64 | 0.12 |
| Gastrointestinal | 497 | 40 | 0.08 |
| Urology | 298 | 13 | 0.04 |
| Total | 1345 | 117 | 0.09 |
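The rates in the table are raw proportions. As an illustrative check using only the overall counts reported above, the overall error rate and a normal-approximation 95% CI can be recomputed as follows (the CI is our addition and was not reported in the study):

```python
import math

errors, tasks = 117, 1345  # overall counts from the table
rate = errors / tasks
se = math.sqrt(rate * (1 - rate) / tasks)  # normal-approximation standard error
ci_low, ci_high = rate - 1.96 * se, rate + 1.96 * se
print(f"{rate:.3f} (95% CI {ci_low:.3f}-{ci_high:.3f})")
```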
Error rates among different categories. This dot-bar plot depicts the median (IQR) values of error rates among different categories. The error rates showed the widest IQR among individual pathologists (PID), whereas the narrowest IQRs were noted for career level and specimen type (biopsy vs surgical).
Differences in error rates for two important tasks—differentiation between neoplastic and nonneoplastic processes and that between benign and malignant neoplastic processes—were observed among pathologists at different career levels and for different specimen types. The same error profile was observed across career levels, although the former task had a higher error rate (
Raw proportion of errors across (A) career levels and (B) specimen types in performing two important tasks: differentiation between neoplastic and nonneoplastic processes and between malignant and benign tumors.
Diagnostics of the model’s fit are shown in
Prediction of average pathologist performance. Pathologists at intermediate career levels perform better on average. The graph depicts the posterior predictive distributions for the multilevel model. Solid lines represent posterior means; shaded regions represent 89% highest posterior density intervals; dashed lines represent the raw data.
Most pathologists reported a very good score (ie, 4 or 5 indicating they “moderately agree” and “strongly agree,” respectively) for their attitude toward DP (44/68, 64%), confidence in DP (75/119, 63%), and satisfaction with DP (56/102, 54.9%). A detailed analysis of these parameters showed that the residents reported the highest value for confidence, junior pathologists reported the highest values for attitude and satisfaction, whereas expert and senior pathologists reported relatively lower levels of confidence in and satisfaction with DP (
Overview of the psychological aspect of the study. This series of graphs summarizes the results of the survey conducted among pathologists at different career levels (resident, junior, expert, and senior) to evaluate their attitudes toward, confidence in, and satisfaction with digital pathology solutions.
Our study showed an overall discordance rate of 9% between diagnoses performed on digital slides and those performed using the gold standard (ie, conventional microscopy). However, when we considered the different diagnostic tasks, this rate dropped to less than 5% for the category "benign versus malignant tumor," which is probably the most clinically impactful of the diagnostic tasks. A systematic review of 38 pertinent studies published before 2015 reported a 7.6% overall discordance rate between digital and glass slide diagnoses. Among these studies, 17 reported a discordance rate higher than 5%, and 8 reported a rate higher than 15% [
In our study, a high proportion of errors arose from small biopsy specimens (12.2%) and from diagnostic tasks involving tumor grading (23%). These results are consistent with those of the review by Williams et al [
Moreover, recent studies have consistently reported high, intermediate, and low discordant rates for bladder, breast, and gastrointestinal tract specimens, respectively [
Compared with the study by Hanna et al [
Our study describes how the abrupt transition to DP affected the quality of diagnoses and qualitatively gauged the psychological aspects of this abrupt transition. Moreover, our study model highlighted the potential causes for these challenges and might inform what could be expected in other laboratories. In conclusion, the exceptional conditions dictated by the COVID-19 pandemic highlighted that DP could be adopted safely for diagnostic purposes by any skilled pathologist, even abruptly.
Supplementary materials and methods.
Coefficients of model parameters from the prior predictive simulation.
Simulation from the prior. This figure shows the meaning of the priors (ie, what the model thinks before it sees the data).
Proportion of errors among individual pathologists. Upper left panel shows the overall error rates. Upper right panel shows the error rates among different diagnostic tasks. Lower left panel shows the error rate among different specimen types. Lower right panel highlights the different error rates among different case subspecialties. GI: gastrointestinal, Uro: urology.
Proportion of errors among different career levels. Upper left panel shows the overall error rates. Upper right panel shows the error rates among the different diagnostic tasks. Lower left panel shows the error rate among different specimen types. Lower right panel highlights the different error rates among different case subspecialties. GI: gastrointestinal, Uro: urology.
Traceplot of the model fit - part A.
Traceplot of the model fit - part B.
Traceplot of the model fit - part C.
Model coefficients. Graphical representation of the coefficients for the model parameters conditional on the data. The lowest box depicts the coefficients for the hyperparameter ᾱ (alpha_bar) and the SD values (σ: sigma_a through sigma_e) of the categories of clusters modeled. All other boxes depict the distributions of the mean value for each element of the category considered. From top to bottom: the first box depicts the parameters for the pathologists' performance; the second, the parameters for career level; the third, the diagnostic category analyzed; the fourth, the specimen type; and the fifth, the case subspecialty. Interpretation of the model at the parameter level is not possible because the parameters combine in a complicated way: prediction (ie, seeing how the model behaves on the outcome scale; Figure 4 in the manuscript) is the only practical way to understand what the model "thinks".
digital pathology
None declared.