Repository to store code used for the analyses presented in "Defining the Regulatory Logic of Breast Cancer Using Single-Cell Epigenetic And Transcriptome Profiling".
bioRxiv preprint: https://www.biorxiv.org/content/10.1101/2024.06.13.598858v1
Processed data will be made publicly available at GEO (https://www.ncbi.nlm.nih.gov/geo/) after publication.
Raw data will be available with controlled access at dbGaP (https://www.ncbi.nlm.nih.gov/gap/).
The following bash scripts were run in this order to execute specific tasks under the each heading. More specifically, these are slurm scripts that either sourced external data or submitted an R script to run in a Docker container executed with Singularity in an HPC environment.
Please visit the wiki for an in-depth walkthrough of these scripts and how they were used in the making of each figure.
-
scripts/make_cellranger_directories.sh
-
scripts/copy_cellranger_outputs.sh
-
scripts/find_versions.sh
-
scripts/get_reference_data.sh
-
scripts/Sumbit-Individual_Samples_scRNA-QC_DoubletRemoval_Preprocessing.sh
-
scripts/Submit-Individual_Samples_scRNA-MultiKClustering.sh,
scripts/Submit-Individual_Samples_scRNA-MultiKClustering_AlternateSeed.sh,
scripts/Submit-Individual_Samples_scRNA-MultiKClustering_AlternateSeed_SecondAttempt.sh,
scripts/Submit-Individual_Samples_scRNA-MultiKClustering_AlternateSeed_ThirdAttempt.sh
-
scripts/Submit-Individual_Samples_scRNA-FindClusterMarkerGenes.sh
-
scripts/Individual_Samples_scRNA-RemoveLowMappingRatePopulation_Reprocess_3FCDEL.sh
-
scripts/Wu_etal_2021_BRCA_scRNA-CreateSeuratObjectWithCCA.sh
-
scripts/Wu_etal_2021_BRCA_scRNA-CreateSeuratObjectWithOutCCA.sh
-
scripts/Submit-Individual_Samples_scRNA-CellTypeAnnotation.sh
-
scripts/Submit-Individual_Samples_scRNA-inferCNV_CancerCellDetection.sh
-
scripts/get_SCSubtype_training_data.sh
-
scripts/Submit-Individual_Samples_scRNA-SCSubtype_Classification.sh
-
scripts/Patient_Samples_scRNA-Merge_And_ReCluster.sh
- scripts/All_Samples_scATAC-QC_DoubletRemoval_Preprocessing.sh
- scripts/Patient_Samples_scATAC-Subset.sh
- scripts/Patient_Samples_scATAC-DimReduc_GeneScoring.sh
- scripts/Patient_Samples_scATAC-Transfer_Labels_from_scRNA.sh
- scripts/Basal_And_TN_Samples_scRNA-Subset_And_ReCluster-TESTING.sh
- scripts/Basal_And_TN_Samples_scATAC-Subset_DimReduc-TESTING3.sh
- scripts/Basal_And_TN_Samples_scATAC-Transfer_Labels_from_scRNA_Call_Peaks-TESTING3.sh
- scripts/Luminal_And_TN_Samples_scRNA-Subset_And_ReCluster-TESTING.sh
- scripts/Luminal_And_TN_Samples_scATAC-Subset_DimReduc-TESTING3.sh
- scripts/Luminal_And_TN_Samples_scATAC-Transfer_Labels_from_scRNA_Call_Peaks-TESTING3.sh
- scripts/CellLine_Samples_scRNA-Merge_And_ReCluster.sh
- scripts/CellLine_Samples_scATAC-Subset_GeneScoring_DimReduc_TransferLabels_CallPeaks.sh
- scripts/scLME_update-metacells-Basal-SingFits_OLS.sh
- scripts/scLME_update-metacells-Luminal-SingFits_OLS.sh
- scripts/scLME_update-metacells-SingFits_OLS-CellLines.sh
- scripts/Full_Cohort_Results.sh
- scripts/Basal_Cohort_Results-updates.sh
- scripts/Luminal_Cohort_Results-updates.sh
- scripts/DepMap_Analysis_Basal_Luminal-updates.sh
- scripts/Cell_Line_Cohort_Results.sh
- scripts/Table_1_S1.sh
- scripts/Supplemental_Tables-P2Gs.sh
- scripts/Supplemental_Tables-barcode_metadata.sh
R scripts were run in a Docker container executed with Singularity in an HPC environment.
To help users replicate our computational environment, we have uploaded the Docker image to DockerHub for public access: https://hub.docker.com/r/regnerm/scbreast_2023.
To pull the latest version of the Docker image, you may run the following command:
docker pull regnerm/scbreast_2023:1.8.0