- Original article
- Open access
- Published:
Interobserver agreement of an ED PoCUS video training dataset of normal appendix and appendicitis in children
The Ultrasound Journal volume 16, Article number: 38 (2024)
Abstract
Background
Educational video datasets can be an effective method for training in emergency department (ED) point-of-care ultrasound (PoCUS). A video dataset for normal appendix and appendicitis in children using ED PoCUS images was developed to assess interobserver agreement, as measured by Cohen’s Kappa on key sonographic findings.
Methods
Three sets of 25 ED PoCUS videos were selected and curated from pediatric patients with normal appendix and acute appendicitis. Four participant ED sonologist-physicians were trained on the first set of 25 videos showing normal appendix or normal bowel in patients without appendicitis to note if normal appendix was seen in any part or in it’s entirety from tip-to-cecum. They were then tested on the second set of similar videos. A third set of 25 videos from patients who had appendicitis where participant sonologists were asked to note if appendicitis was present or absent, with and without appendicolith or perforation. Cohen’s Kappa was calculated in aggregate and stratified by experience vs. novice against a senior sonologist-physician aware of all patient outcomes for visualization of: 1. any part of normal appendix, 2. normal appendix visualized from tip to cecum 3. any part of appendicitis, 4. appendicolith, 5. appendiceal perforation.
Results
Cohen’s Kappa for any part of normal appendix, 0.71, 95% CI (0.58–0.85); normal appendix tip-to-cecum, 0.43, 95% CI (0.19–0.67), appendicitis, 0.53, 95%CI (0.34–0.70), appendicolith, 0.63, 95%CI (0.43–0.84), perforated appendicitis, 0.46, 95%CI (0.22–0.70). Stratified by experienced vs. novice: any part of normal appendix, 0.75 vs. 0.68; normal appendix tip-to-cecum, 0.50 vs. 0.36; appendicitis, 0.78 vs. 0.31; appendicolith, 0.75 vs. 0.5; perforated appendicitis, 0.5 vs 0.42.
Conclusions
This educational video dataset may be used to train sonologist-physicians in ED PoCUS scanning for normal appendix and appendicitis in children. Sonologist experience affected interobserver agreement with respect to visualization of entire normal appendix and appendicitis.
Introduction
Visual training by review of ultrasound image datasets is an effective educational method of acquiring interpretation skills in specific ultrasound applications [1,2]. ED PoCUS videos can be selected and curated to create video datasets to assist ED clinicians acquire skills in PoCUS identification of normal appendix and appendicitis in children.
Interobserver agreement, as calculated by Cohen’s Kappa to measure diagnostic test precision for identifying normal appendix and appendicitis on point-of-care ultrasound in children has not been examined in depth in the medical literature. Variability in interobserver agreement of various ultrasound findings may lead to variation in time to definitive diagnosis, as well as obtaining further diagnostic testing (e.g. radiology ultrasound, CT scan or MRI) and management, operative or non-operative management of children (e.g. appendectomy vs. observation). For example, presence of an appendicolith is a contraindication to non-operative management of appendicitis [3], as these cases are ideally treated by appendectomy (Fig. 1A; https://youtu.be/tTSukgBuqnk). Perforated appendicitis often requires broad-spectrum antibiotics with anaerobic coverage with a variety of subsequent management approaches depending on degree of perforation and abscess size (Fig. 1B; https://youtu.be/At9rqgciZko). In lower risk patients with abdominal pain, being able to trace a normal appendix from tip to cecum (Figs. 2A, B; https://youtu.be/3-jkOw-YW5M) is reassuring to clinicians at the bedside when ruling out appendicitis, allowing for safe discharge, rather than further diagnostic imaging or observation in hospital for serial abdominal examinations.
We selected point-of-care ultrasound videos from a retrospective cohort of pediatric ED patients evaluated for appendicitis to determine interobserver agreement of various ultrasound findings for normal appendix and appendicitis and to create a video dataset for educational purposes.
Methods
Three sets of 25 PoCUS videos approximately 6 to 30 s in length were selected and curated by a senior-sonologist physician from ED PoCUS examinations of 57 pediatric patients evaluated for appendicitis from January 2016 to August 2022. PoCUS videos were obtained from QpathE (PoCUS image archive). Clinical information and outcomes (operative notes and/or surgical pathology reports in patients with appendicitis; 30 day follow-up from index ED visit in patients who did not receive appendectomy) were obtained from EPIC electronic medical records. The mean age was 9.9 years, 47% were female and 33% were diagnosed with appendicitis. Our institutional review board approved this study #22–0702.
The first set of 25 videos was curated from patients who did not have appendicitis and showed normal appendix, or normal bowel in the right lower quadrant: https://youtu.be/1XeBgjlZ23M. This first set was used for training 4 participant sonologist-physicians (3 attendings and 1emergency ultrasound fellow) by a senior sonologist-physician. Two of the participant sonologist-physicians were experienced with appendix PoCUS (using it in clinical practice) and the other two sonologist-physicians were emergency physicians who had emergency ultrasound fellowship training and novice to appendix PoCUS. Emergency ultrasound fellowship is a 1 year fellowship training in emergency point-of-care ultrasound after emergency medicine residency. The second set of 25 videos, like the first set, was curated from patients who did not have appendicitis and showed either normal appendix, or normal bowel in the right lower quadrant: https://youtu.be/PtDb7FoivIU. This second set was used for testing participant sonologist-physicians on whether no appendix was visualized, any part of normal appendix, or whether the entire appendix from tip to cecum visualized in the video. A third set of 25 videos was curated from 17 patients with a mean age of 12.6 years that were confirmed to have appendicitis on operative and/or surgical pathology reports: https://youtu.be/RAp9BjXFUjQ. After being shown 3–5 examples of appendicitis videos (https://youtu.be/1cHofuZJ3TA), participant sonologist-physicians were tested on the presence of appendicitis in the videos, the presence or absence of appendicolith in the videos (echogenic appendicolith with posterior acoustic shadowing), and presence or absence of appendiceal perforation or rupture (as noted by peri-appendiceal abscess[4] in the videos.
Interobserver agreement as measured by Cohen’s Kappa for these 5 sonographic findings were calculated using an online calculator (http://vassarstats.net/) in aggregate: 1. any part of normal appendix, 2. normal appendix visualized from tip to cecum 3. any part of appendicitis, 4. presence or absence of appendicolith, 5. presence or absence of appendiceal perforation or rupture [visualized by presence of peri-appendiceal abscess] was calculated between a senior sonologist-physician aware of all patient outcomes as a gold standard observer 1 against participant sonologist physicians (2 experienced, 2 novice) each as observer 2. Classification for Cohen’s Kappa which adjusts for interobserver agreement by chance that raw agreement does not, was defined as follows: 0–0.2 no or poor agreement; 0.21–0.4 fair agreement; 0.41–0.060 moderate agreement; 0.61–0.80 good agreement; 0.81–1.0 very good or near perfect agreement.
Results
Interobserver agreement as measured by Cohen’s Kappa for 5 sonographic findings are presented in Table 1. Overall Kappa results, and stratified by experience versus novice in performing appendix PoCUS are presented.
Discussion
Prior literature reports Kappa for appendicitis visualization in children at 0.69, 95% CI: 0.58–0.78 [5]. Our Kappa results for normal appendix visualization are similar (0.71, 95%CI 0.58–0.85), but for appendicitis visualization there was a large difference between Kappa results between experienced (0.78, 95%CI 0.58–0.99) versus novice (0.31, 95%CI 0.005–0.62) sonologist-physicians. We speculate this difference is due to the lack of an appendicitis training set for novice sonologist-physicians to learn from prior to testing. Interobserver agreement for appendicolith as measured by Kappa was a class higher (0.63—moderate) than for identifying appendiceal perforation with abscess (0.46—fair).
Variable skill and confidence in physician PoCUS for appendicitis as evidenced by lower interobserver agreement in PoCUS findings may result in greater variability in management and time to definitive diagnosis. Finding normal appendix in its entirety from tip to cecum (Fig. 2A, B; https://youtu.be/3-jkOw-YW5M) may be particularly impactful in a child with low clinical suspicion for appendicitis (e.g. normal wbc count, no right lower quadrant pain) who may be discharged home. This is opposed to a child with moderate to higher suspicion for appendicitis, who may undergo further radiology imaging (e.g., radiology ultrasound, MRI or CT abdomen/pelvis) or be admitted for serial abdominal examinations. Identification of an appendicolith (Fig. 1A; https://youtu.be/tTSukgBuqnk) can guide clinical course as a contra-indication to non-operative management of appendicitis thus favoring appendectomy. Suspected perforated appendicitis (Fig. 1B; https://youtu.be/At9rqgciZko) may also lead to further radiology imaging to definitively identify perforated appendicitis with various management options depending on the pediatric surgeon, from interventional radiology drainage versus intravenous antibiotics followed by interval appendectomy. In the hands of experienced sonologist physicians, finding appendicitis (Fig. 3A, B; https://youtu.be/alIkvtiVVAw) with PoCUS in a child with high clinical suspicion for appendicitis may avoid the need for further imaging [6] when working closely with pediatric surgeons who can review or be shown PoCUS images.
Test characteristics such as sensitivity, specificity and likelihood ratios are standard measures to describe diagnostic test performance of imaging modalities such as ultrasound. Generally, ED PoCUS for pediatric appendicitis has high specificity for ruling-in disease, and variable sensitivity for ruling-out disease [5,6]Sensitivity is the operator dependent test characteristic, generally increasing with operater experience[6, 7]Improvement in imaging resolution for PoCUS machines in the past decade has likely increased sensitivity for ruling out appendicitis by allowing better identification of normal appendix. However, scant data exists regarding interobserver agreement as measured by Cohen’s Kappa for various sonographic findings for normal appendix and appendicitis.
Limitations
We were limited by the relatively small number (n = 75) of selected and curated videos, but needed to avoid participant fatigue by limiting training and testing sessions to approximately 30 min for each of the three 25 video datasets for practical purposes. Normal appendix scanning likely is one of the more challenging of all ultrasound applications requiring skill. The skill of image acquisition and probe handling is key to visualizing entire normal appendix for ruling out appendicitis is likely best addressed with actual hands-on patient scanning.
We erred by not providing an appendicitis training video dataset for novice participant sonologist-physicians, as we initially hypothesized that learning on normal appendix videos would be sufficient to then identify appendicitis on PoCUS videos. We have included a 10 appendicitis video dataset (with and without appendicolith and/or perforation with abscess) here: https://youtu.be/1cHofuZJ3TA. The median number of video images to achieve mastery at interpretation or acceptable performance benchmarks has been estimated to be a low of 87 for lung US (IQR 54–118) to a high of 128 (IQR 86–201) for cardiac US; normal appendix and appendicitis interpretation is likely similar.2 We have additional normal appendix and appendicitis videos available for review: https://bit.ly/2Jnti7W and https://bit.ly/2RoxiYN.
Conclusion
We created an educational video dataset for normal appendix and appendicitis and present information on interobserver agreement on key sonographic findings. This video dataset may be used to train sonologist-physicians in ED PoCUS scanning for normal appendix and appendicitis in children. Sonologist experience affected interobserver agreement with respect to visualization of entire normal appendix and appendicitis.
Data availability
Data is available on request via corresponding author: jtsung@gmail.com.
Change history
14 August 2024
The name of the author has been corrected in the original publication. The article has been updated to rectify the error.
References
Kwan C, Pusic M, Pecaric M, Weerdenburg K, Tessaro M, Boutis K (2019) The variable journey in learning to Interpret pediatric point-of-care ultrasound images: a multicenter prospective cohort study. AEM Educ Train 4(2):111–122. https://doi.org/10.1002/aet2.10375
Kwan C, Weerdenburg K, Pusic M, Constantine E, Chen A, Rempell R, Herman JE, Boutis K (2022) Learning pediatric point-of-care ultrasound: how many cases does mastery of image interpretation take? Pediatr Emerg Care 38(2):e849–e855. https://doi.org/10.1097/PEC.0000000000002396
Minneci PC, Hade EM, Gil LA, Metzger GA, Saito JM, Mak GZ, Hirschl RB, Gadepalli S, Helmrath MA, Leys CM, Sato TT, Lal DR, Landman MP, Kabre R, Fallat ME, Cooper JN, Deans KJ (2022) Midwest pediatric surgery consortium. demographic and clinical characteristics associated with the failure of nonoperative management of uncomplicated appendicitis in children: secondary analysis of a nonrandomized clinical trial. JAMA Netw Open. https://doi.org/10.1001/jamanetworkopen.2022.9712
Carpenter JL, Orth RC, Zhang W, Lopez ME, Mangona KL, Guillerman RP (2017) Diagnostic performance of US for differentiating perforated from nonperforated pediatric appendicitis: a prospective cohort study. Radiology 282(3):835–841. https://doi.org/10.1148/radiol.2016160175
Sivitz AB, Cohen SG, Tejani C (2014) Evaluation of acute appendicitis by pediatric emergency physician sonography. Ann Emerg Med 64(4):358-364.e4. https://doi.org/10.1016/j.annemergmed.2014.03.028
Elikashvili I, Tay ET, Tsung JW (2014) The effect of point-of-care ultrasonography on ED length of stay and CT utilization in children with suspected appendicitis. Acad Emerg Med. https://doi.org/10.1111/acem.12319Cita
Tsung JW (2010) Rules for the road: an evidence-based approach to understanding diagnostic test performance of point-of-care ultrasound for pediatric abdominal emergencies Critical Ultrasound Journal 1(3):101-103 https://doi.org/10.1007/s13089-010-0020-9
Acknowledgements
We thank members of our Emergency Ultrasound Division for providing observations for the calculation of interobserver agreement.
Funding
There were no sources of funding for this manuscript.
Author information
Authors and Affiliations
Contributions
The authors are responsible for the reported research. All authors have participated in the concept and design of the study, analysis and interpretation of data, and drafting and revising of the manuscript, and all authors have approved the manuscript as submitted.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This work has been performed in accordance with the Declaration of Helsinki and approved by Icahn School of Medicine at Mount Sinai Program for Protection of Human Subjects #22–0702.
Consent for publication
Consent was determined to be exempt and waived as this was a retrospective study conforming to HIPAA policy with de-identified data.
Competing interests
James Tsung MD, MPH has served as a paid consultant in 2022 to DIA Imaging Analysis, LTD, an artificial intelligence ultrasound imaging company. The other authors have no financial or other conflicts of interests to declare with respect to this manuscript.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tsung, J.W., Firnberg, M. & Sosa, P. Interobserver agreement of an ED PoCUS video training dataset of normal appendix and appendicitis in children. Ultrasound J 16, 38 (2024). https://doi.org/10.1186/s13089-024-00386-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s13089-024-00386-1