Skip to content

Profiling the Non-Small Cell Lung Cancer (NSCLC) microenvironment by integrating transcriptomics to uncover potential phenotypic profiles associated to patterns in immune infiltration

Notifications You must be signed in to change notification settings

VeraPancaldiLab/LungPredict1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

81 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Please visit https://github.com/VeraPancaldiLab/LungPredict1_paper/tree/main for the updated version of the Lung Predict project.

LungPredict

Profiling the Non-Small Cell Lung Cancer (NSCLC) microenvironment by integrating transcriptomics to uncover potential phenotypic profiles associated to patterns in immune infiltration

Summary

Here, we applied a computational immunology approach involving differential expression and pathway analyses, along with the quantification of immune cell proportions using computational deconvolution methods, inference of transcription factor activities, and specification of immune score types to better characterize bulk transcriptomics samples of lung adenocarcinoma. This comprehensive analysis enabled us to identify biomarkers of disease progression and immune infiltration patterns across disease stages and patient groups, thus allowing us to compute and evaluate specific hallmarks of immune response across patients. Through our methodology and feature selection pipeline, which we briefly describe, we identified niches of immune cells and report on a duality in the behavior of NK cells in the Tumor Microenvironment (TME), corresponding to either immune response or characteristic dysfunctional/exhausted states. We validated our results using an independent cohort, detailed in terms of size and nature, with both bulk and single-cell RNA sequencing (scRNAseq) samples. This validation enabled us to further investigate the NK cell profiles that we had identified.

Data

Bulk RNASeq data from NSCLC patients

  • LungPredict dataset: 62 patients with Lung Cancer (Adenocarcinoma) - Stage I, II, III, IV
  • Vanderbilt dataset: 76 patients NSCLC (Adenocarcinoma) - Early Stage
  • Vanderbilt Single-cell RNAseq data: 15 patients NSCL (Adenocarcinoma) - Early Stage

image

Project organization

  • Scripts: Codes used to develop the project's methods.
  • Figures: Images results from analysis in Early and Late Stage patients
  • Output: Result files from analysis.

For reproduce the figures of the paper, use file Scripts/Analysis_EarlyStage.rmd For reproduce the figures of the single cell part, use file Scripts/VANscRNAseq.rmd

Methodology

Analysis was divided in two parts for Early (Stage I, II) and Late Stage (Stage IV) patients. Some of the methods used:

  • Unsupervised and supervised analysis
  • Differential Gene expression (DESeq2)
  • Immune cell type Deconvolution (GEMDECan)
  • TFs inference for bulk and single cell data
  • Feature selection (Random forest, mutual information)
  • Clustering algorithms (PCA, UMAP)
  • Data visualization
  • Univariate Linear model
  • Pseudobulk analysis for SingleCell

Contributing

If you are interested in contributing to this project, please contact the project leader for more information ([email protected])

Authors

About

Profiling the Non-Small Cell Lung Cancer (NSCLC) microenvironment by integrating transcriptomics to uncover potential phenotypic profiles associated to patterns in immune infiltration

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages