Sleep apnea–hypopnea syndrome (SAHS) is a complex, heterogeneous, multifactorial disorder with a variable clinical presentation.1 Continuous positive airway pressure (CPAP) treatment is effective in symptomatic patients. However, while there is strong evidence for the association between severe SAHS, cardiovascular disease (CVD), and mortality in observational studies, randomized clinical trials have shown that CPAP does not reduce the risk of these outcomes.2 One possible explanation is that, at present, the management of the disease is based mainly on the apnea–hypopnea index (AHI), despite the fact that it is difficult for a single parameter to capture the entire heterogeneity of the disease. This situation underlines the need for precision medicine in SAHS. Just as phenotypes have been described in other chronic diseases such as asthma and COPD, attempts have been made in the last decade to identify SAHS phenotypes that can characterize patients with a higher risk of morbidity and mortality and a higher likelihood of responding to treatment. A series of statistical techniques have been used to analyze multifactorial data, one of the most important being cluster analysis, a multivariate statistical technique that seeks to group elements (or variables) in the attempt to achieve the maximum homogeneity within groups and the greatest difference between groups.3
The validity of the identification of phenotypes will depend on the homogeneity of the variables analyzed, their progress over time, and their reproducibility.
One of the first cluster analyses in SAHS was the Iceland Cohort (ISAC) conducted in 2014.4 Since then, other studies have been published, many of which have focused on moderate-severe SAHS, although some include mild manifestations.5–8 However these studies differ widely in terms of variables studied, cluster methods applied, and the duration of follow-up. Therefore, many different phenotypes have been identified and few have been associated with relevant clinical outcomes.
The ISAC study identified 3 phenotypes, classified as patients with “disturbed sleep” (32.7%), “minimally symptomatic” patients (24.7%) and patients with “excessive daytime sleepiness” (42.6%). One of the aims of this study was to determine if there was a higher likelihood of comorbidities (hypertension [HT], diabetes, CVD) in the “minimally symptomatic” group. Vavougios et al.6 identified 6 groups, of which 2 had severe SAHS and 2 had moderate SAHS, differentiated by the presence of comorbidities. In patients with a similar AHI, the variables associated with “sick” phenotypes were age, body mass index (BMI), presence of HT, and lower daytime oxygen saturation. Similarly, Lacedonia et al.9 identified a phenotype with more comorbidities that was associated with higher nocturnal and daytime hypoxemia and a higher BMI. Turino et al.,10 in a cohort of patients receiving CPAP, found 2 phenotypes associated with higher mortality: the “neoplastic” group (with a high prevalence of malignant neoplasms) and “oldest (mean age 72 years) and cardiac disease patients”.
Another limitation of most of these studies, apart from the variability of the parameters analyzed, is that their cross-sectional design rules out any prediction of progress of the phenotypes over time with respect to relevant clinical findings. The first study to analyze the clinical course of phenotypes was that of Zinchuk et al.8: this group identified 7 phenotypes according to 65 polysomnographic variables. The groups that were significantly associated with the primary outcome (a combination of stroke, transient ischemic attack, acute coronary syndrome, or death) at 4.9 years presented “restless leg syndrome”, “hypopnea and hypoxia”, and “combined severe”. No association with AHI was identified. A second study of the original ISAC cohort described the 2-year follow-up of the original phenotypes in subjects treated with CPAP; improved symptoms were observed in all three groups. Although CPAP adherence and symptomatic improvement were greater in the group with excessive daytime sleepiness, there were no differences in the prevalence of comorbidities.11
ISAC is the only study that other cohorts have attempted to reproduce, but besides finding some differences in the number of clusters and their characteristics, the association with comorbidities of these studies also proved different:
- •
Kim et al.12 identified 3 clusters similar to those described in ISAC: in this case, the “minimally symptomatic” group was the most prevalent and no phenotype was found to be associated with more comorbidities, except for a higher prevalence of HT in patients with “disturbed sleep”. Unlike ISAC, the cohort was a general Korean population, and cultural differences may account for the different manifestation of symptoms and partly justify the different outcomes.
- •
Keenan et al.13 reproduced ISAC in a new clinical cohort in Iceland and in another multi-ethnic cohort. They found 5 clusters (80% of patients were in the original 3 clusters) with similar prevalences to ISAC, but the “disturbed sleep” group was the one with the highest prevalence of comorbidities.
- •
The Sleep Heart Health Study cohort went a step further by following patients for up to 11.8 years and analyzing the incidence of CVD. In addition to the three ISAC groups, they identified a new group: “moderate sleepiness”. In the “excessive daytime sleepiness” group (16.7%), a higher prevalence and incidence of CVD was observed, but surprisingly, the greater AHI score of this group is not discussed.14
The somewhat reproducible finding of the 3 clinical ISAC phenotypes in different cohorts is interesting. However, the phenotype most associated with comorbidities varies (“minimally symptomatic” in the original, “disturbed sleep” in the Kim et al.12 study, none in Keenan et al.’s Iceland cohort13), prompting us to ask if they are really identifying the same phenotypes.
To date, cluster studies in SAHS reflect biases derived from variables previously established in the analysis. Thus, the generalization of the described phenotypes is limited by methodological differences, and further studies using a multifactorial approach (clinical, physiological, biological, polysomnographic parameters, clinical consequences) and a consensus definition of the variables analyzed will be needed to evaluate the most appropriate grouping techniques. Incorporating large quantities of data (big data) from electronic medical records is a promising strategy for advancing precision medicine in sleep apnea.
Please cite this article as: Silveira MG, Lloberes P. Análisis de clúster para identificar fenotipos de síndrome de apneas-hipopneas del sueño: ¿hacia dónde vamos? Arch Bronconeumol. 2020;56:689–690.