Bronchial epithelial gene expression and interstitial lung abnormalities
Respiratory Research volume 24, Article number: 245 (2023)
Interstitial lung abnormalities (ILA) often represent early fibrotic changes that can portend a progressive fibrotic phenotype. In particular, the fibrotic subtype of ILA is associated with increased mortality and rapid decline in lung function. Understanding the differential gene expression that occurs in the lungs of participants with fibrotic ILA may provide insight into development of a useful biomarker for early detection and therapeutic targets for progressive pulmonary fibrosis.
Measures of ILA and gene expression data were available in 213 participants in the Detection of Early Lung Cancer Among Military Personnel (DECAMP1 and DECAMP2) cohorts. ILA was defined using Fleischner Society guidelines and determined by sequential reading of computed tomography (CT) scans. Primary analysis focused on comparing gene expression in ILA with usual interstitial pneumonia (UIP) pattern with those with no ILA.
ILA was present in 51 (24%) participants, of which 16 (7%) were subtyped as ILA with a UIP pattern. One gene, pro platelet basic protein (PPBP) and seventeen pathways (e.g. TNF-α signalling) were significantly differentially expressed between those with a probable or definite UIP pattern of ILA compared to those without ILA. 16 of these 17 pathways, but no individual gene, met significance when comparing those with ILA to those without ILA.
Our study demonstrates that abnormal inflammatory processes are apparent in the bronchial airway gene expression profiles of smokers with and without lung cancer with ILA. Future studies with larger and more diverse populations will be needed to confirm these findings.
Interstitial lung abnormalities (ILA) are chest computed tomography (CT) findings in persons not known to have interstitial lung disease (ILD) that may represent an early stage of pulmonary fibrosis (PF) . Overlapping associations with genetic polymorphisms  and protein biomarkers  between those with ILA and idiopathic pulmonary fibrosis (IPF) highlight the fact that these conditions may share pathobiology.
While there are numerous cell types in the lung that likely play important roles in the pathobiology of IPF, growing data suggests that patients with PF have altered patterns of bronchial epithelial cell gene expression . Significant alterations in airway morphology and gene expression signatures have been described in patients with IPF [5,6,7,8,9]. Additionally, several of the single nucleotide polymorphisms that are associated with increased risk for IPF, including MUC5B and DSP are expressed in bronchial airway cells [10, 11]. However, no prior study has characterized bronchial epithelial gene expression among persons with ILA.
To provide an assessment of the genes and pathways associated with ILA, and ILA subtypes we performed bronchial airway epithelial gene expression analyses in patients enrolled in Detection of Early Lung Cancer Among Military Personnel (DECAMP 1 and DECAMP 2) cohorts .
The DECAMP Study is a multicenter consortium comprised of 15 military treatment facilities, Veterans Affairs hospitals, and academic centers across the United States. Participants were recruited into one of two study protocols, designated as DECAMP-1 and DECAMP-2 . Studies were registered at clinicaltrials.gov as DECAMP 1 (NCT01785342) and DECAMP 2 (NCT02504697) respectively. Briefly, study participants of DECAMP-1 were adults aged 45 and older with indeterminate pulmonary nodules and heavy smoking history whereas, study participants of DECAMP-2 were aged 50–79 with a heavy smoking history and a family history of lung cancer or a personal history of chronic obstructive pulmonary disease (COPD). This study was approved by the Human Research Protection Office (HRPO) for the Department of Defense, and the individual site IRBs for every participating site. All subjects were approached for written informed consent to participate in the study per IRB regulations (Additional file 1: Table S1).
Biospecimen collection in DECAMP
All individuals in the DECAMP study underwent bronchoscopy. Bronchial airway epithelial cells were obtained from brushings of the right mainstem bronchus collected during fiberoptic bronchoscopy with an endoscopic cytobrush (Cellebrity Endoscopic Cytology Brush, Boston Scientific, Boston). The brushes were immediately placed in 1 mL of RNAprotect Cell Reagent (Qiagen, Valencia, CA) and kept at − 80 °C until RNA isolation was performed.
RNA isolation, sequencing and data pre-processing
Total RNA was isolated using the miRNeasy Mini Kit (Qiagen, Valencia, CA). RNA integrity was assessed by Agilent BioAnalyzer, and RNA purity confirmed using a NanoDrop spectrophotometer. Libraries were generated using the Illumina TruSeq Stranded Total RNA kit and sequenced on the Illumina NextSeq 500 and Illumina HiSeq3000 with 75 base-pair paired-end reads (Illumina, San Diego, CA). For data preprocessing, we developed an automatic pipeline using the Nextflow framework. Quality of FASTQ files was assessed with FastQC. Reads were aligned to the human genome with 2-pass STAR and gene-level and isoform-level expression quantified with RSEM. Splice junction saturation, transcript integrity, and biotype distributions were calculated for each sample with RSeQC. DESeq2 or edgeR was used to identify associations between gene expression profiles and clinical variables while controlling for confounding covariates. Genetic variants were called using the Broad Institute’s GATK RNA-seq best-practices workflow. Briefly, duplicates were marked with Picard tools, splitting of intronic reads, realignment around indels, and base quality score recalibration were performed with GATK, and variants were called with Haplotypecaller.
Imaging acquisition in DECAMP
DECAMP-1 utilized CT scans collected as part of routine clinical care while DECAMP-2 utilized a standardized protocol for image acquisition and reconstruction. DECAMP-2 scans were collected using low dose helical computed tomography on a minimum 16-slice scanner. The scans were acquired at 2.5 to 5 mm but reconstructed into 1 mm slice thickness using the soft tissue and lung algorithms. Images from all sites were then de-identified and submitted to the American College of Radiology Imaging Network (ACRIN) Core Laboratory for storage.
Gene expression analysis
The LIMMA package in R (version 3.4.0) was used to assess differential gene expression (DGE). To do this, raw count matrix of gene expression was initially filtered by counts per million (CPM) such that a gene could only be included if its CPM was greater than 1 in 10% of the total number of patients. DGE analysis was then performed using a pairwise comparison between the ILA phenotypes at a false discovery rate (FDR) of 0.05. The differentially expressed genes (DEG) identified by LIMMA were further analyzed by Enrichr for over-representation analysis. Heatmaps were used to visualize the data and identify unsupervised gene clusters using the “Ward.D2” algorithm. Gene set enrichment analysis (GSEA)  was performed on pre-ranked gene lists created by pairwise comparisons between ILA status using Hallmark gene sets . An FDR threshold of 0.01 was applied to select significant hallmark gene sets.
Measures of ILA were assessed on chest CT scans using a sequential method by readers (including radiologists and pulmonologists) blinded to prior interpretations and participant information and per Fleischner society recommendations . Participants with indeterminate ILA status were excluded from this analysis (Additional file 2: Fig. S1) . ILA with a UIP pattern was specifically characterised in those who had either a probable or definite UIP pattern as previously published .
Data are presented as means and standard deviations for continuous measurements and number and percentage for categorical features. P values were calculated using Fisher’s exact test or t test as applicable. All analyses were adjusted for age, gender, cohort, cancer status, smoking status and sequencing method as confounders. Genes and pathways demonstrating a p-value < 0.05 after adjusting for the false discovery rate (FDR) were considered statistically significant.
Among the 213 participants with both ILA phenotyping and gene expression data, there were 51 (24%) with ILA and 162 (76%) without ILA (Additional file 2: Fig. S1). Of the 51 with ILA, 16 (30%) had a probable or definite UIP pattern (hereafter referred to as ‘ILA with a UIP pattern’) and 35 (70%) had ILA without a UIP pattern (hereafter referred to as ‘ILA with no UIP’). Compared to those without ILA and those with ILA with no UIP pattern, participants with a UIP pattern of ILA were older, and had increased measures of forced expiratory volume in one second (FEV1), forced vital capacity (FVC) and ratios of FEV1 to forced vital capacity (FEV1/FVC) (Table 1). Consistent with these results, participants with a UIP pattern of ILA had lower rates of COPD on spirometry (37.5% vs 71%; Fisher’s Exact Test; p = 0.009; Table 1). Those with ILA both without UIP pattern and with a UIP pattern had more malignant lung nodules compared to those without ILA (Table 1).
After adjusting for multiple testing, no individual gene met statistical significance for differential expression between those with ILA (n = 51) compared to those without ILA (n = 162) (Additional file 2: Fig. S1 and Additional file 3: Table S2). Similarly no significant differential expression was observed when comparing ILA with no UIP (n = 35) and those without ILA (n = 162) (Additional file 4: Table S3).
Participants with ILA with a UIP pattern compared to those without ILA, demonstrated a > eightfold upregulation of the gene encoding Pro-Platelet basic protein [PPBP] (FDR < 0.05). At a less stringent threshold, the 122 genes with p < 0.01 (Additional file 5: Table S4) divided the samples into two clusters including one that was enriched for samples from participants with ILA with a UIP pattern (Fisher’s test, p = 0.001; Fig. 1). We did not detect any other significant differences between the clusters with regard to the characteristics of the participants from whom the samples were obtained. The list of genes with UIP associated gene expression was highly overlapping when COPD was included as covariate (Fisher’s Exact Test; p < 1e−200).
Gene set enrichment analysis (GSEA) demonstrated seventeen pathways whose genes were significantly enriched among those with a UIP pattern of ILA when compared to those without ILA (FDR < 0.01) including tumour necrosis factor-alpha (TNF-α) signalling through nuclear factor kappa-B (NF-κB) (Table 2). 16 out of these 17 pathways were also amongst the 24 pathways significantly upregulated (FDR < 0.01) when comparing participants with ILA to that without ILA (Additional file 6: Table S5), a highly significant overlap (hypergeometric test; p < 7 × 10–7). GSEA of the ILA with no UIP group showed 13 pathways that met significance; of which 9 were common to the ones in the ILA with UIP group (hypergeometric test p = 0.003) and 12 in common with the whole ILA group (hypergeometric test; p = 0.0001) (Additional file 7: Table S6).
This is the first assessment of bronchial epithelial gene expression among participants with ILA and these findings suggest that both the UIP pattern of ILA and ILA may be associated with similarly altered bronchial airway gene expression at the pathway level.
In this cohort, the most significantly differentially expressed gene was PPBP; a gene within the C-X-C motif cytokine family, specifically ligand 7. While CXCL7 has been associated with non-small cell lung cancer , it has also has been implicated in early stages of wound healing and has previously been associated with IPF, both in serum samples as well as bronchoalveolar lavage fluid [18, 19]. It is hypothesized that this cytokine helps to recruit mesenchymal stem cells from bone marrow and promote fibrosis .
Genes involved in TNF-α signalling were among the genes most induced in bronchial brushes from individuals with UIP pattern of ILA and in individuals with ILA relative to non-ILA controls. This pathway has similarly been demonstrated to be upregulated in the lungs of patients with pulmonary fibrosis , and it’s downregulation through the suppression of transforming growth factor-beta (TGF-β) is posited to be one of the therapeutic effects of pirfenidone . These findings suggest that studying persons with ILA, can help to identify biologic pathways that are dysregulated in patients with clinically identified PF.
ILA with no UIP group may represent an intermediate step between the No ILA group and ILA with UIP group based on clinical characteristics and gene expression.
There are limitations to the present study. First, our ability to identify significant pathway enrichment among the genes most differentially expressed despite our overall failure to detect differential expression of individual genes suggests that our study is limited by its small sample size. Second, although analyses were adjusted for the presence of malignancy, it is difficult to exclude the possibility that residual differential background gene expression and the potential covariance of lung cancer and ILA could have affected our results. Third, we cannot rule out that performing this analysis on a subset of DECAMP, due to data availability influenced these results. Specifically, the prevalence of ILA in this cohort is larger than previously described and this may be due to selection bias from using a limited dataset of cohort participants with both ILA phenotyping and RNA sequencing data available. The results of the study are also limited to subjects with a known current or former smoking history and may differ from subjects with ILA without a smoking history. For these reasons, replication of these findings in additional cohorts will be important to validate the observed associations and assess generalizability to other populations.
There are differentially expressed bronchial epithelial cell genes and pathways associated with ILA among patients in DECAMP. Some of the most differentially expressed genes and pathways have been similarly demonstrated to be upregulated among patients with IPF. Additional studies in larger cohorts are warranted to confirm these findings.
Availability of data and materials
The data that support the findings of this study are available from the DECAMP consortium but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors upon reasonable request and with permission of DECAMP consortium.
Interstitial lung abnormalities
Detection of Early Lung Cancer Among Military Personnel
Usual interstitial pneumonia
Interstitial lung disease
Idiopathic pulmonary fibrosis
False discovery rate
Pro-platelet basic protein
Gene set enrichment analysis
Tumour necrosis factor-alpha
Nuclear factor kappa-B
C-X-C motif cytokine family, ligand 7
Transforming growth factor-beta
Hatabu H, Hunninghake GM, Richeldi L, Brown KK, Wells AU, Remy-Jardin M, et al. Interstitial lung abnormalities detected incidentally on CT: a position paper from the Fleischner Society. Lancet Respir Med. 2020;8(7):726–37.
Hobbs BD, Putman RK, Araki T, Nishino M, Gudmundsson G, Gudnason V, et al. Overlap of genetic risk between interstitial lung abnormalities and idiopathic pulmonary fibrosis. Am J Respir Crit Care Med. 2019;200(11):1402–13.
Sanders JL, Putman RK, Dupuis J, Xu H, Murabito JM, Araki T, et al. The association of aging biomarkers, interstitial lung abnormalities, and mortality. Am J Respir Crit Care Med. 2021;203(9):1149–57.
Chakraborty A, Mastalerz M, Ansari M, Schiller HB, Staab-Weijnitz CA. Emerging roles of airway epithelial cells in idiopathic pulmonary fibrosis. Cells. 2022;11(6):1050.
Ikezoe K, Hackett TL, Peterson S, Prins D, Hague CJ, Murphy D, et al. Small airway reduction and fibrosis is an early pathologic feature of idiopathic pulmonary fibrosis. Am J Respir Crit Care Med. 2021;204(9):1048–59.
Tanabe N, McDonough JE, Vasilescu DM, Ikezoe K, Verleden SE, Xu F, et al. Pathology of idiopathic pulmonary fibrosis assessed by a combination of microcomputed tomography, histology, and immunohistochemistry. Am J Pathol. 2020;190(12):2427–35.
Habermann AC, Gutierrez AJ, Bui LT, Yahn SL, Winters NI, Calvi CL, et al. Single-cell RNA sequencing reveals profibrotic roles of distinct epithelial and mesenchymal lineages in pulmonary fibrosis. Sci Adv. 2020;6(28): eaba1972.
Prasse A, Binder H, Schupp JC, Kayser G, Bargagli E, Jaeger B, et al. BAL cell gene expression is indicative of outcome and airway basal cell involvement in idiopathic pulmonary fibrosis. Am J Respir Crit Care Med. 2019;199(5):622–30.
De Sadeleer LJ, Verleden SE, Schupp JC, McDonough JE, Goos T, Yserbyt J, et al. BAL transcriptomes characterize idiopathic pulmonary fibrosis endotypes with prognostic impact. Chest. 2022;161(6):1576–88.
Noth I, Zhang Y, Ma SF, Flores C, Barber M, Huang Y, et al. Genetic variants associated with idiopathic pulmonary fibrosis susceptibility and mortality: a genome-wide association study. Lancet Respir Med. 2013;1(4):309–17.
Hao Y, Bates S, Mou H, Yun JH, Pham B, Liu J, et al. Genome-wide association study: functional variant rs2076295 regulates desmoplakin expression in airway epithelial cells. Am J Respir Crit Care Med. 2020;202(9):1225–36.
Billatos E, Duan F, Moses E, Marques H, Mahon I, Dymond L, et al. Detection of early lung cancer among military personnel (DECAMP) consortium: study protocols. BMC Pulm Med. 2019;19(1):59.
Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci USA. 2005;102(43):15545–50.
Liberzon A, Birger C, Thorvaldsdottir H, Ghandi M, Mesirov JP, Tamayo P. The molecular signatures database (MSigDB) hallmark gene set collection. Cell Syst. 2015;1(6):417–25.
Washko GR, Hunninghake GM, Fernandez IE, Nishino M, Okajima Y, Yamashiro T, et al. Lung volumes and emphysema in smokers with interstitial lung abnormalities. N Engl J Med. 2011;364(10):897–906.
Putman RK, Gudmundsson G, Axelsson GT, Hida T, Honda O, Araki T, et al. Imaging patterns are associated with interstitial lung abnormality progression and mortality. Am J Respir Crit Care Med. 2019;200(2):175–83.
Crossno PF, Polosukhin VV, Blackwell TS, Johnson JE, Markin C, Moore PE, et al. Identification of early interstitial lung disease in an individual with genetic variations in ABCA3 and SFTPC. Chest. 2010;137:969–73.
Foster MW, Morrison LD, Todd JL, Snyder LD, Thompson JW, Soderblom EJ, et al. Quantitative proteomics of bronchoalveolar lavage fluid in idiopathic pulmonary fibrosis. J Proteome Res. 2015;14(2):1238–49.
Lee J, Arisi I, Puxeddu E, Mramba LK, Amicosante M, Swaisgood CM, et al. Bronchoalveolar lavage (BAL) cells in idiopathic pulmonary fibrosis express a complex pro-inflammatory, pro-repair, angiogenic activation pattern, likely associated with macrophage iron accumulation. PLoS ONE. 2018;13(4): e0194803.
Kalwitz G, Endres M, Neumann K, Skriner K, Ringe J, Sezer O, et al. Gene expression profile of adult human bone marrow-derived mesenchymal stem cells stimulated by the chemokine CXCL7. Int J Biochem Cell Biol. 2009;41(3):649–58.
Frankel SK, Cosgrove GP, Cha SI, Cool CD, Wynes MW, Edelman BL, et al. TNF-alpha sensitizes normal and fibrotic human lung fibroblasts to Fas-induced apoptosis. Am J Respir Cell Mol Biol. 2006;34(3):293–304.
Maher TM. Pirfenidone in idiopathic pulmonary fibrosis. Drugs Today. 2010;46(7):473–82.
DECAMP Investigators Executive Center—Boston University: Avi Spira, MD*Ŧ, Elizabeth Moses, PhD*, Jennifer Beane, PhD, Josh Campbell, PhD, Jack Cunningham, Gang Liu, PhD, Hanqiao Liu, Sherry Zhang, Jiarui Zhang, Xu Ke, Xingyi Shi, Carter Merenstein, MS, Yue Zhao (Boston University School of Medicine); Ehab Billatos, MD*Ŧ (Pulmonary Center, Boston University School of Medicine, Boston); Marc Lenburg, PhDŦ (Section of Computational Biomedicine, Boston University School of Medicine).
Coordinating Center—American College of Radiology: Denise Aberle, MD*, Mitchell Schnall, MD, PhD*, Charles Apgar, MBA*, Irene Mahon, RN, MPH*, Lindsey Dymond, Joe Bauza, Sarah Gevo (American College of Radiology).
Data Coordinating Center—Brown University: Constantine Gastonis, PhD*, Helga Marquez, David Elashoff, PhDŦ (Brown University); Fenghai Duan, PhD*Ŧ (Department of Biostatistics and Center for Statistical Sciences, Brown University School of Public Health, Providence, RI).
Pathology Core—MD Anderson Cancer Center: Ignacio Wistuba, MD, Humam Kadara, PhD, Junya Fujimoto, MD, PhD (MD Anderson Cancer Center).
Sequencing Core—Uniformed Services University of the Health Science: Clifton Dalgard, PhD, Matthew Wilkerson, PhD (Uniformed Services University of the Health Science).
Imaging Core: Denise Aberle, MD (American College of Radiology); George Washko, MD* (Division of Pulmonary and Critical Care Medicine, Brigham and Women’s Hospital, Boston, MA); Charles M Kinsey, MD, MPH (University of Vermont, Medical Center); Fenghai Duan, PhD (Department of Biostatistics and Center for Statistical Sciences, Brown University School of Public Health).
Clinical Centers: Denise Fine (Boston University); Ron Goldstein, MD, Kathleen LaCerda (Boston VA); John Battaile, MD, Mitchell Kroll, (Dallas VA); Bob Keith, MD, Mary Jackson, (Denver VA); Steve Dubinett, MD, Gina Lee, MD, Babak Aryanfar, Rafael Corona (UCLA); Anil Vachani, MDŦ, Sam Soloman (University of Pennsylvania); Charles Atwood, MD, Gregory Owens, Hanna Edvardsson (Pittsburgh VA); Pierre Massion, MD (Vanderbilt University); Trey Helton (Nashville VA); Mary Reid (Roswell Park Cancer Institute); Chris Kuzniewski, MD, Jacob Carmichael, MD, Holly LaPerriere (Naval Medical Center Portsmouth); J. Scott Parrish, MD, Lindsey White, Anna Kaur (Naval Medical Center San Diego); Robert Browning Jr., MD*Ŧ, Maggie Nelissery, Folashade Akanni, Luis Rojas (Walter Reed Army Medical Center).
*Denotes DECAMP Executive Committee Member.
ŦDenotes DECAMP Data Access Committee Member.
Dr. Putman is supported by National Institutes of Health (NIH) Grant K08 HL140087. Dr. Rose is supported by NIH Grant T32 HL 76331. Dr. Cho is supported by NIH Grants R01 HL153248, R01 HL135142, R01 HL149861, R01 HL137927, and R01 HL137148. Dr. O’Connor is supported by NIH Grant OT2 OD026553. Dr. Hatabu is supported by NIH grants R01 CA203636, U01 CA209414, R01 HL111024, R01 HL135142, and R01 HL130974. Dr. Lenburg is supported by NIH grants R01 CA210360, U01 CA214182, U01 CA196408, U2C CA233238. Dr. Hunninghake and this work was supported by NIH Grants R01 HL111024, R01 HL130974, and R01 135142. DECAMP study is supported by funds from the Department of Defense (W81XWH-11-2-0161), the National Cancer Institute (U01CA196408), and Johnson and Johnson Services, Inc (JJSI).
Ethics approval and consent to participate
The DECAMP studies were registered with clinicaltrials.gov and individual IRB approval was obtained from each participating institution. Data provided as Additional files.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Institutional Review Board committee names and project approval numbers for each center for the DECAMP Study.
Study design diagram.
List of differentially expressed genes that met p-value < 0.01 among those with ILA as compared to those without ILA.
List of differentially expressed genes that met p < 0.01 among those without probable or definite UIP pattern of ILA as compared to those without ILA.
List of differentially expressed genes that met p-value < 0.01 among those with Probable or Definite UIP pattern of ILA as compared to those without ILA.
List of differentially expressed pathways comparing ILA with no ILA.
List of differentially expressed pathways comparing ILA no UIP with no ILA.
Cancer breakdown of nodule identified in participants in DECAMP 1 Cohort.
About this article
Cite this article
Menon, A.A., Lee, M., Ke, X. et al. Bronchial epithelial gene expression and interstitial lung abnormalities. Respir Res 24, 245 (2023). https://doi.org/10.1186/s12931-023-02536-w