- Original article
- Open Access
Bedside lung ultrasonography by emergency department residents as an aid for identifying heart failure in patients with acute dyspnea after a 2-h training course
The Ultrasound Journal volume 13, Article number: 5 (2021)
Ultrasonographic B-lines have recently emerged as a bedside imaging tool for the differential diagnosis of acute dyspnea in the Emergency Department (ED). However, despite its simplicity, LUS has not fully penetrated emergency department. This study aimed to assess the accuracy and reproducibility of ultrasonographic B-lines performed by emergency medicine (EM) residents for the diagnosis of congestive heart failure (CHF) in patients admitted to ED for acute dyspnea.
Patients and methods
This is a cross-sectional prospective study conducted between January 2016 and October 2017 including patients aged over 18 years admitted to ED for acute dyspnea. At admission, two consecutive bedside LUS study were performed by a pair of EM residents who received a 2-h course for recognition of sonographic B-lines to determine independently B-lines score and B-profile pattern. All participating sonographers were blinded to patients’ clinical data. B-lines score ≥ 15 or a B-profile pattern was considered as suggestive of CHF. The final leading diagnosis was assessed by two expert sonographers, who were blinded to the residents’ interpretations, based on clinical findings, chest X-ray, brain natriuretic peptide, cardiac and lung ultrasound testing. Accuracy and agreement of B-lines score and B-profile pattern were calculated.
We included 700 patients with a mean age of 68 ± 12.6 years and a sex ratio (M/F) of 1.43. The diagnosis of CHF was recorded in 371 patients (53%). The diagnostic performance of B-lines score at a cut-off 15 and B-profile pattern was, respectively, 88% and 82.5% for sensitivity, 75% and 84% for specificity, 80% and 85% for positive predictive value, 84% and 81% for negative predictive value. The area under receiver operating characteristic curve was 0.86 [0.83–0.89] and 0.83 [0.80–0.86], respectively, for B-lines score and B-profile pattern. There was an excellent agreement between residents for the diagnosis of CHF using both scores (kappa = 0.81 and 0.85, respectively, for ordinal scale B-lines score and B-profile pattern).
Lung ultrasound B-lines assessment has a good accuracy and an excellent reproducibility in the diagnosis of CHF in the hand of EM residents following a short training program.
Trial registration Name of the registry: clinicaltrials.gov; Trial registration number: NCT03717779; Date of registration: October 24, 2018 ‘Retrospectively registered’; URL of trial registry record: clinicaltrials.gov
Acute dyspnea is a common clinical emergency and a leading cause of hospital admissions . While the differential diagnosis is broad, congestive heart failure (CHF) is one of the most frequent causes that can be difficult to differentiate from other etiologies. Although immediate and accurate diagnosis is critical, available diagnostic modalities of CHF among dyspneic patients, lack either specificity or sensitivity [2,3,4]. Echocardiography was shown to be pivotal in the diagnostic workup of CHF, but such facility requires high skills and is not always available in many emergency departments [5, 6]. Recently, lung ultrasound (LUS) has emerged as a promising alternative tool that can be performed by novice sonographers [7,8,9,10,11,12,13]. This easy non-invasive bedside method provides rapid diagnostic information allowing an earlier and targeted treatment. Consequently, LUS is increasingly used in clinical practice particularly in acute care settings . Nonetheless, before accepting the widespread use of LUS, there is still need to assess its accuracy and reproducibility in the hand of non-experts.
The purpose of our study is to evaluate the accuracy and reproducibility of B-lines testing assessed by emergency medicine (EM) residents after 2-h training in the diagnosis of CHF in patients admitted to the emergency department with acute dyspnea.
Patients and methods
This is prospective cross-sectional study conducted in the Emergency Department (ED) of three University Hospitals (Fattouma Bourguiba University Hospital, Sahloul University Hospital, and Farhat Hached University Hospital, Tunisia) from January 2016 to October 2017.
A convenience sampling approach, including all patients admitted to the ED for acute dyspnea as chief complaint, was used. Exclusion criteria were: age less than 18 years, impossibility to give consent to participate in the study, post-traumatic dyspnea, pregnant women, and need for endotracheal intubation or inotropic drugs patients who were deemed too unstable for sonography by the treating team were also excluded.
All eligible patients underwent a complete physical examination. Blood pressure, heart rate, and pulse oximetry were measured and oxygen was delivered by face mask as needed. Research associates collected the following data: name, age, sex, previous medical history, ongoing treatment, and physical examination findings. The following additional tests were performed for all included patients: blood gas, hemoglobin, serum creatinine, BNP, electrocardiogram, chest X-ray, and echocardiogram. Lung ultrasonography was performed by EM residents using two ultrasound machines (Philips EnVisor C, Nederland; SonoSite M-Turbo, Sonosite Inc., Bothell, WA) and broadband curved array probes (3.5–5 MHz). The study period overlapped one and half academic year in three university hospitals, so a total of 40 residents were eligible to participate. ED residents were appointed to carry out this examination less than 4 h following patients’ admission. None of the ED residents used LUS for the assessment of B-lines prior to the study. All participating residents were previously attended a 2-h training session with at least 10 clinical tests supervised by a certified emergency physicians who had accomplished a full mentoring program for “Ultra-Sound Life Support”. The first 30 min of the training course included basic ultrasound physics, use of ultrasound equipment, probe positioning, and lung ultrasound interpretation (A-lines, B-lines, consolidation, lung sliding, lung pulse, and miscellaneous artifacts). In the second 30 min, real-time LUS was performed in healthy volunteers describing the technique and findings. The rest of the training was hands-on training on actual patients. Trainees had to identify the presence of lung sliding, A-lines, B-lines and consolidation.
For each patient, two LUS tests were performed by two independent residents who were not aware of patient's clinical data and did not participate in the patient’s management. We recorded the ED residents’ interpretation and images were recorded for each LUS study for later expert review. To not break the blind protocol, patients were asked to not provide information on their medical history to the operators during LUS. Patients were placed in a semi-recumbent or supine position depending on their respiratory tolerance. For each side of the chest, 4 zones have to be assessed (Fig. 1): 2 anterior and 2 lateral. The anterior chest wall was delineated from the sternum to the anterior axillary line and was subdivided into upper and lower halves (approximately from clavicle to the second–third intercostal spaces and from the third space to diaphragm). The lateral chest was delineated from the anterior to the posterior axillary line and was subdivided into upper and basal halves. The operator was asked to calculate the B-lines score which is the sum of the B-lines found in both sides (8 zones) ; the intercostal space with the greatest number of B-lines within each zone was used for scoring. B-line was defined as a vertical bright echogenic bundle with a narrow basis, spreading from the transducer to the deepest part of the screen (Fig. 2). For B-lines that were wide or confluent, the score was determined by assessing the percentage of the rib space occupied by B-lines and dividing it by ten .
According to the study of Gargani et al. the B-lines score is suggestive of CHF when it is ≥ 15 . The probability of CHF was also expressed according to the following ordinal scale: unlikely if B-lines score < 15, likely if B-lines score is between 16 and 29, and very likely if B-lines score ≥ 30. The operator also had to assess the presence or absence of B-profile pattern which is suggestive of CHF according to Lichtenstein criteria . B-profile pattern was defined as such if two or more lung zones per side were positive. A lung zone was positive if three or more B-lines were identified. The final leading diagnosis of dyspnea was assessed by two independent senior EM physicians after reviewing the entire medical record of each patient it was based on: (1) the clinical presentation (severe shortness of breath, worsening dyspnea, orthopnea, paroxysmal nocturnal dyspnea, coughing up or wheezing with white or pink blood-tinged phlegm, foamy mucus), and the physical exam findings (pulmonary congestion and/or peripheral edema, rales, crackles); (2) the diagnostic tests’ results including chest X-ray (pulmonary venous congestion, pleural effusion, interstitial or alveolar edema and cardiomegaly), echocardiography (structural or functional cardiac abnormalities), brain natriuretic peptide (BNP > 300 pg/mL, or NT-proBNP > 1200 pg/mL), the saved images of LUS study, treatment, and outcome . In case of a disagreement, a third senior physician was consulted and adjudicated the case. All senior physicians participating in the study were masked to LUS results. Informed consent was obtained in all the patients before the start of the protocol.
Prior to enrollment, a power analysis was performed to determine the sample size needed. Assuming an alpha of 0.05 and a desired precision of 0.07, we calculated a sample size of 502 patients required if we considered that the estimated prevalence of CHF is 25% and the targeted sensitivity and specificity would both be 0.80.
After analysis of normality distribution, variables were expressed by the arithmetic mean and standard deviation (SD) or the median and the 95% confidence interval (or interquartile range). Comparison between patients with CHF (HF group) and those without CHF (non-HF group) was performed by Student’s t-test for continuous variables and Chi-2 test for categorical variables. The difference was considered statistically significant for values of p ≤ 0.05. Discrimination power of the assessed models was studied by the area under the receiver operating characteristic (ROC) curve. An area under curve (AUC) = 1 represents a perfect test; an area of 0.5 represents a worthless test (random prediction), and an area greater than 0.70 means that accuracy of the diagnostic test is at least fair. For the assessment of diagnostic accuracy of B-lines, the scanning order was randomly determined according to an electronic randomization. Agreement between residents’ interpretation was assessed by kappa agreement index for qualitative indices (B-lines score as ordinal scale, and B-profile pattern recorded dichotomously as present or absent). Agreement was considered “low” when kappa value was less 0.40, “fair” from 0.41 to 0.60, “good” from 0.61 to 0.80 and “excellent” from 0.81 to 1. For the B-lines score, the Bland and Altman plot was constructed. A good match was defined when the differences between B-lines score pairs is around the average line and between the lines of − 2 and + 2 SD. The data obtained in this study were collected, recorded and analyzed using SPSS computer software version 18.0 (Chicago, IL).
During the study period, 1024 patients with acute dyspnea were screened. Two hundred forty-two patients were excluded for one or more predefined exclusion criteria; additional 64 patients were excluded for blind protocol violation, and 18 declined or were unable to tolerate a complete examination (Fig. 3). The characteristics of the remaining 700 patients are outlined in Table 1. Four hundred twelve patients (58.8%) were men with a mean age of 68 years (± 12.6). Heart failure was the final diagnosis in 53% of dyspneic patients (HF group, n = 371).
The most common etiology of dyspnea in non-HF group (n = 329) was chronic obstructive pulmonary disease exacerbation (n = 149), pneumonia (n = 57), pulmonary embolism (n = 19), and acute asthma (n = 12). The mean B-lines score was 29 ± 9 in HF group and 8 ± 3 in non-HF group. The difference was statistically significant (p < 0.001). In HF group, the B-lines score was suggestive of CHF (B-lines score ≥ 15) in 325 patients (87.6%). In the same group, B-profile pattern was present in 306 patients (82.5%). The difference in patients’ distribution between HF and non-HF groups according to B-profile and B-lines classes is summarized in Fig. 4. This difference was statistically significant (p < 0.001). The discriminating power of B-lines score and B-profile pattern was good as assessed by area under ROC curve of 0.86 (95% CI 0.83–0.89) and 0.83 (95% CI 0.80–0.86), respectively, for B-lines score and B-profile pattern (p = 0.91) (Fig. 5). Performance of B-lines score at a cut-off = 15 showed that sensitivity, specificity, negative predictive value and positive predictive value of the two models were similar with trends to a moderately higher sensitivity for B-lines score compared to B-profile pattern (87.6% versus 82.5%) and lower specificity (74.7% versus 83.9%) (Table 2). Agreement between residents in the determination of CHF diagnosis was excellent for both models as demonstrated by kappa agreement index value of 0.81 and 0.85, respectively, for B-lines score and B-profile pattern. For B-lines scoring, there is a good agreement between residents’ interpretation as shown in the Bland and Altman plot (mean differences between B-lines scores = 0.49 ± 0.22, p: not significant) (Fig. 6).
Our study has shown study that EM residents can be significantly aided to establish the diagnosis of CHF after a short and accelerated ultrasonographic B-lines assessment training, with an excellent inter-rater agreement in patients admitted for acute dyspnea.
Among the many potential underlying causes of acute dyspnea, CHF is one of most common and challenging etiologies . Among patients presenting to the ED with CHF, over 80% are admitted to the hospital, making it the most common reason for admission and a significant financial burden on the health care system. Despite this high prevalence, the standard workup for acute shortness of breath in the ED is non-specific and often fails to differentiate CHF from conditions such as chronic obstructive pulmonary disease exacerbation . This distinction is essential as inappropriate management has been shown to affect negatively the morbidity and mortality. Overall, approximately 20% of patients presenting to the ED with dyspnea are misdiagnosed and treated inappropriately . In fact, substantial diagnostic uncertainty is inevitable when relying only on traditional clinical findings . Lung ultrasonography, once considered inconceivable, is increasingly considered as a bedside imaging tool for evaluating pulmonary congestion . A recent systematic review showed that B-lines study is highly accurate in the diagnosis of acute heart failure with an area under ROC of 0.91, a sensitivity of 0.90 and a specificity of 0.93 . Of note, many of the studies included in this review had small sample sizes and were performed in settings other than EDs, even though our results are consistent with the findings of this meta-analysis. Importantly, our study is the largest in demonstrating the accuracy of B-lines study performed by residents with no previous experience of ultrasound techniques. Similar results were reported by Bedetti et al. in a smaller sample size study . In addition, in their estimations of the sensitivity and specificity, Chiem et al. showed results close to ours, but slightly lower than those reported in previous studies [5, 12, 23]. Of note, all these studies included non-expert operators. It is possible that, with sustained and more supervised practice, these novice trainees would improve significantly their performance.
The second important objective of the present study was to assess the reproducibility of B-lines. Available evidence regarding inter-observer agreement reveals that B-lines study has a good is reproducibility [7, 23, 24]. It should be highlighted that most studies assessing reproducibility were based on retrospective LUS imaging review performed longtime after the first LUS testing. B-lines is a dynamic phenomenon that can be influenced with number of technical and pathologic factors . Consequently, reproducibility should be assessed without delay between pairs of LUS examinations and ideally in the same conditions. In the present study, we minimized this time between each pair of operators testing (one immediately followed the other). Moreover, we demonstrated the excellent inter-observer agreement of B-lines study by using two different models, the B-lines scoring system and the B-profile pattern which reinforces the validity of the results.
Our study has some limitations. First, the study was conducted in academic EDs and the same evaluation in another setting may show different results. Second, since only hospitalized patients were considered eligible for the study purpose, a selection bias could not be excluded and our results may not be applicable to patients with milder symptoms. Third, some of our patients received specific heart failure treatment (intravenous diuretics, nitrates, CPAP) before undergoing LUS test, which could improve lung congestion, B-lines number would be reduced and this would probably underestimate the sensitivity of B-lines testing. Lastly, it is not clear whether introduction of LUS in routine clinical practice would influence medical decision-making and change patients’ prognosis? It is not possible for us to give a clear answer to this question; it is above the scope of the present study. Nonetheless, the fact that LUS can help to identify rapidly the diagnosis of CHF, this would give to physicians more confidence in choosing the most appropriate and effective treatment. Fourth, the training course of residents is limited in our study to 2 h; this could be insufficient to be comfortable to practice LUS. However, according to a recent meta-analysis in clinical lung ultrasound, the learning time spent in the different included studies ranged from 30 min sessions to 2.5 h sessions . Similar brief durations reported by Noble et al. (1 h) resulted in a significant improvement of image recognition skills for physicians without previous ultrasound experience. Moreover, a recent study by Gargani et al. showed that even web-based training in lung ultrasound can be a highly effective approach for training inexperienced operators .
In summary, the present study demonstrated that point-of-care B-lines study in the hand of non-expert residents is a reliable and reproducible technique. It can improve the identification of CHF in ED patients with undifferentiated dyspnea. Our results, if confirmed by other larger prospective high-quality studies, have potentially significant clinical implications. Being a rapid technique with high accuracy in the diagnosis of cardiogenic dyspnea, B-lines study could be suitable in departments with lack of technical and human resources.
Availability of data and materials
The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Congestive heart failure
OR interquartile range
Area under curve
Mebazaa A, Tolppanen H, Mueller C et al (2016) Acute heart failure and cardiogenic shock: a multidisciplinary practical guidance. Intensive Care Med 42:147–163. https://doi.org/10.1007/s00134-015-4041-5
Stevenson LW, Perloff JK (1989) The limited reliability of physical signs for estimating hemodynamics in chronic heart failure. JAMA 261:884–888
Mulrow CD, Lucey CR, Farnett LE (1993) Discriminating causes of dyspnea through clinical examination. J Gen Intern Med 8:383–392. https://doi.org/10.1007/BF02600079
Ponikowski P, Voors AA, Anker SD et al (2016) 2016 ESC Guidelines for the diagnosis and treatment of acute and chronic heart failure: the Task Force for the diagnosis and treatment of acute and chronic heart failure of the European Society of Cardiology (ESC)Developed with the special contribution of the Heart Failure Association (HFA) of the ESC. Eur Heart J 37:2129–2200. https://doi.org/10.1093/eurheartj/ehw128
Gallard E, Redonnet J-P, Bourcier J-E et al (2015) Diagnostic performance of cardiopulmonary ultrasound performed by the emergency physician in the management of acute dyspnea. Am J Emerg Med 33:352–358. https://doi.org/10.1016/j.ajem.2014.12.003
Price S, Platz E, Cullen L et al (2017) Expert consensus document: Echocardiography and lung ultrasonography for the assessment and management of acute heart failure. Nat Rev Cardiol 14:427–440. https://doi.org/10.1038/nrcardio.2017.56
Gullett J, Donnelly JP, Sinert R et al (2015) Interobserver agreement in the evaluation of B-lines using bedside ultrasound. J Crit Care 30:1395–1399. https://doi.org/10.1016/j.jcrc.2015.08.021
Lichtenstein DA, Mezière GA (2008) Relevance of lung ultrasound in the diagnosis of acute respiratory failure: the BLUE protocol. Chest 134:117–125. https://doi.org/10.1378/chest.07-2800
Gargani L (2011) Lung ultrasound: a new tool for the cardiologist. Cardiovasc Ultrasound 9:6. https://doi.org/10.1186/1476-7120-9-6
Volpicelli G, Elbarbary M, Blaivas M et al (2012) International evidence-based recommendations for point-of-care lung ultrasound. Intensive Care Med 38:577–591. https://doi.org/10.1007/s00134-012-2513-4
Wang Y, Shen Z, Lu X et al (2018) Sensitivity and specificity of ultrasound for the diagnosis of acute pulmonary edema: a systematic review and meta-analysis. Med Ultrason 1:32–36. https://doi.org/10.11152/mu-1223
Chiem AT, Chan CH, Ander DS et al (2015) Comparison of expert and novice sonographers’ performance in focused lung ultrasonography in dyspnea (FLUID) to diagnose patients with acute heart failure syndrome. Acad Emerg Med 22:564–573. https://doi.org/10.1111/acem.12651
Al Deeb M, Barbic S, Featherstone R et al (2014) Point-of-care ultrasonography for the diagnosis of acute cardiogenic pulmonary edema in patients presenting with acute dyspnea: a systematic review and meta-analysis. Acad Emerg Med 21:843–852. https://doi.org/10.1111/acem.12435
Frassi F, Gargani L, Tesorio P et al (2007) Prognostic value of extravascular lung water assessed with ultrasound lung comets by chest sonography in patients with dyspnea and/or chest pain. J Card Fail 13:830–835. https://doi.org/10.1016/j.cardfail.2007.07.003
Gargani L, Frassi F, Soldati G et al (2008) Ultrasound lung comets for the differential diagnosis of acute cardiogenic dyspnoea: a comparison with natriuretic peptides. Eur J Heart Fail 10:70–77. https://doi.org/10.1016/j.ejheart.2007.10.009
Maisel AS, Krishnaswamy P, Nowak RM et al (2002) Rapid measurement of B-type natriuretic peptide in the emergency diagnosis of heart failure. N Engl J Med 347:161–167. https://doi.org/10.1056/NEJMoa020233
Wang CS, FitzGerald JM, Schulzer M et al (2005) Does this dyspneic patient in the emergency department have congestive heart failure? JAMA 294:1944–1956. https://doi.org/10.1001/jama.294.15.1944
Collins SP, Peacock WF, Lindsell CJ et al (2009) S3 detection as a diagnostic and prognostic aid in emergency department patients with acute dyspnea. Ann Emerg Med 53:748–757. https://doi.org/10.1016/j.annemergmed.2008.12.029
Badgett RG, Lucey CR, Mulrow CD (1997) Can the clinical examination diagnose left-sided heart failure in adults? JAMA 277:1712–1719
Cardinale L, Volpicelli G, Binello F et al (2009) Clinical application of lung ultrasound in patients with acute dyspnea: differential diagnosis between cardiogenic and pulmonary causes. Radiol Med 114:1053–1064. https://doi.org/10.1007/s11547-009-0451-1
Staub LJ, Mazzali Biscaro RR, Kaszubowski E, Maurici R (2019) Lung ultrasound for the emergency diagnosis of pneumonia, acute heart failure, and exacerbations of chronic obstructive pulmonary disease/asthma in adults: a systematic review and meta-analysis. J Emerg Med 56:53–69. https://doi.org/10.1016/j.jemermed.2018.09.009
Bedetti G, Gargani L, Corbisiero A et al (2006) Evaluation of ultrasound lung comets by hand-held echocardiography. Cardiovasc Ultrasound 4:34. https://doi.org/10.1186/1476-7120-4-34
Pivetta E, Goffi A, Lupia E et al (2015) Lung ultrasound-implemented diagnosis of acute decompensated heart failure in the ED: a SIMEU multicenter study. Chest 148:202–210. https://doi.org/10.1378/chest.14-2608
Liteplo AS, Marill KA, Villen T et al (2009) Emergency thoracic ultrasound in the differentiation of the etiology of shortness of breath (ETUDES): sonographic B-lines and N-terminal pro-brain-type natriuretic peptide in diagnosing congestive heart failure. Acad Emerg Med 16:201–210. https://doi.org/10.1111/j.1553-2712.2008.00347.x
Pivetta E, Baldassa F, Masellis S et al (2018) Sources of variability in the detection of B-Lines, using lung ultrasound. Ultrasound Med Biol 44:1212–1216. https://doi.org/10.1016/j.ultrasmedbio.2018.02.018
Pietersen PI, Madsen KR, Graumann O et al (2018) Lung ultrasound training: a systematic review of published literature in clinical lung ultrasound training. Crit Ultrasound J. https://doi.org/10.1186/s13089-018-0103-6
Efficacy of a remote web-based lung ultrasound training for nephrologists and cardiologists: a LUST trial sub-project | Nephrology Dialysis Transplantation | Oxford Academic. https://academic.oup.com/ndt/article/31/12/1982/2661710. Accessed 26 Jan 2021
The authors acknowledge all of our Research Laboratory LR12SP18 University of Monastir members who contributed greatly to this study.
Ethics approval and consent to participate
The study was approved by the Ethics Committee of each participating institution; it was recorded in the ClinicaTrials.gov register under number NCT03660592.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Msolli, M.A., Sekma, A., Marzouk, M.B. et al. Bedside lung ultrasonography by emergency department residents as an aid for identifying heart failure in patients with acute dyspnea after a 2-h training course. Ultrasound J 13, 5 (2021). https://doi.org/10.1186/s13089-021-00207-9
- Lung ultrasonography
- Congestive heart failure