gms | German Medical Science

68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS)

17.09. - 21.09.23, Heilbronn

Methods for estimation of diagnostic test accuracy using longitudinal data

Meeting Abstract

  • Julia Böhnke - University of Münster, Institute of Epidemiology and Social Medicine, Münster, Germany
  • Antonia Zapf - Department of Medical Biometry, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
  • Katharina Kramer - Institute of Mathematics, University of Augsburg, Augsburg, Germany
  • Philipp Weber - Department of Medical Biometry, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
  • André Karch - University of Münster, Institute of Epidemiology and Social Medicine, Münster, Germany
  • Nicole Rübsamen - University of Münster, Institute of Epidemiology and Social Medicine, Münster, Germany

Deutsche Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie. 68. Jahrestagung der Deutschen Gesellschaft für Medizinische Informatik, Biometrie und Epidemiologie e. V. (GMDS). Heilbronn, 17.-21.09.2023. Düsseldorf: German Medical Science GMS Publishing House; 2023. DocAbstr. 162

doi: 10.3205/23gmds056, urn:nbn:de:0183-23gmds0568

Veröffentlicht: 15. September 2023

© 2023 Böhnke et al.
Dieser Artikel ist ein Open-Access-Artikel und steht unter den Lizenzbedingungen der Creative Commons Attribution 4.0 License (Namensnennung). Lizenz-Angaben siehe http://creativecommons.org/licenses/by/4.0/.


Gliederung

Text

Introduction: Diagnostic test accuracy (DTA) studies enable the estimation of a test’s accuracy prior to its usage in clinical practice [1], [2]. DTA estimation methods are available for one-time test application [2], which are improper if a test is applied multiple times per patient, e.g. continuously during a hospital stay. We show how to perform and adjust the DTA estimation when using longitudinal data.

Methods: We identified two DTA estimation levels for the binary disease status classification of longitudinal data. Either each individual time unit (i.e., per minute/hour/day) was classified and included in the DTA estimation (i.e., time-level) or time units based on the result of the reference test were grouped together to blocks, classified, and included in the DTA estimation (i.e., event-level). Thus, the event-level uses fewer classified units than the time-level.

Afterwards, each patient was assigned to one of the three clusters (i.e., diagnosis ‘consistently absent’, ‘consistently present’, or ‘mixed (i.e., diseased and disease-free periods)’) by Konietschke & Brunner [3] and each subunit of a cluster was given the same weight. By using a weighted nonparametric rank statistic, based on Konietscke & Brunner [3] and Lange [4], which adjusts for multiple test applications, we estimated the adjusted DTA using longitudinal data.

Results: DTA estimates varied considerably depending on the used level and time unit. Precision of DTA estimates increased when the number of included observations increased (e.g., using the time-level for classification and minutes as time unit). DTA estimates were misleading if the used time unit and/or measurement unit did not adequately represent the target diagnosis.

Discussion and conclusion: Researchers should predefine early in the planning phase and as part of their research question(s) the estimand(s) and their methodological choices (i.e., the features: level and time unit). If the aim is precision and/or prediction, we recommend using the time-level and a small time unit. The event-level should be used if the focus is on a test’s diagnostic ability within a clinical setting because it only provides new information in cases of a change in a patient’s health status. Researchers can report multiple DTA estimates using various diagnosis-appropriate feature combinations while depicting differences in feature-related DTA interpretations. We will present three distinct use cases (i.e., systemic inflammatory response syndrome, epilepsy, and depression) and why the selected features should reflect the target diagnosis (i.e., episode length and intervals between episodes).

Nonetheless, further research is necessary to improve the available methodological approaches and the dependence of the underlying time series of the units included in the DTA estimation and how they behave in real-life scenarios (e.g., occurrence of missing values).

Funding: This work was supported by the German Federal Ministry of Health via the ELISE project (grant number 252DAT66C) and by the German Research Foundation (DFG; grant number 499188607).

The authors declare that they have no competing interests.

The authors declare that an ethics committee vote is not required.


References

1.
Chassé M, Fergusson DA. Diagnostic Accuracy Studies. Semin Nucl Med. 2019 Mar 1;49(2):87–93. DOI: 10.1053/J.SEMNUCLMED.2018.11.005 Externer Link
2.
Sitch AJ, Dekkers OM, Scholefield BR, Takwoingi Y. Introduction to diagnostic test accuracy studies. Eur J Endocrinol. 2021 Feb 1;184(2):E5–9. DOI: 10.1530/EJE-20-1239. Externer Link
3.
Konietschke F, Brunner E. Nonparametric analysis of clustered data in diagnostic trials: Estimation problems in small sample sizes. Comput Stat Data Anal. 2009 Jan 15;53(3):730–41. DOI: 10.1016/J.CSDA.2008.08.006. Externer Link
4.
Lange K. Nichtparametrische Analyse diagnostischer Gütemaße bei Clusterdaten [Dissertation]. Göttingen, Germany: Georg-August-University; 2011. DOI: 10.53846/GOEDISS-3538 Externer Link