Skip to content

Commit

Permalink
Add CT Scans for all TCGA Pancan (#2015)
Browse files Browse the repository at this point in the history
Add OHIF Viewer for all TCGA pancan studies. Add docs of how they were
generated

Co-authored-by: Ramya Madupuri <[email protected]>
  • Loading branch information
inodb and rmadupuri authored Aug 5, 2024
1 parent fc4b313 commit 10e3d3a
Show file tree
Hide file tree
Showing 125 changed files with 393 additions and 0 deletions.
21 changes: 21 additions & 0 deletions docs/tcga_pan_can_atlas/ohif-viewer.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@
# OHIF Viewer for TCGA CT Scans
OHIF URLs for CT Scans were obtained from the Imaging Data Commons and added as
resource data to all tcga_pan_can_2018 studies,

## Steps
### 1. Download Data from IDC

Go to IDC and select TCGA -> CT Scans. Download a manifest with all the links to ~/Downloads/idc_ohif.tsv.

### 2. Curate coadread_tcga semi-manually

Add all *resource* files for coad and read data and link patients to samples. This was done semi-manually.

### 3. Do the rest using a command one liner

Generate them for the rest:
```bash
for f in $(cut -f2 ~/Downloads/idc_ohif.tsv | gsort | uniq | grep tcga_ | grep -v Filters | grep -v coad | grep -v read); do (head -1 coadread_tcga_pan_can_atlas_2018/data_resource_patient.txt; cut -f1,2,4 ~/Downloads/idc_ohif.tsv | tail -n +9 | grep $f | cut -f1,3 | awk -vFS='\t' -vOFS='\t' '{$1=substr($1,0,12); $3="https://viewer.imaging.datacommons.cancer.gov/viewer/"$2; $2="IDC_OHIF_V2"; print $0}' | gsort -k1,1 | uniq | rev | uniq -f2 | rev; ) > ${f/tcga_/}_tcga_pan_can_atlas_2018/*data_resource*patient*; done
```

Note: there are a few patients that have multiple CT Scans. Not entirely sure what the difference is, the above command just selects the first one
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/acc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: acc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/acc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: acc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/blca_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: blca_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/blca_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: blca_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/brca_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: brca_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/brca_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: brca_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/cesc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: cesc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/cesc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: cesc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/chol_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: chol_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/chol_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: chol_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/dlbc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: dlbc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/dlbc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: dlbc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/esca_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: esca_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/esca_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: esca_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/gbm_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: gbm_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/gbm_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: gbm_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/hnsc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: hnsc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/hnsc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: hnsc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/kich_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: kich_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/kich_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: kich_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/kirc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: kirc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/kirc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: kirc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/kirp_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: kirp_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/kirp_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: kirp_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/laml_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: laml_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/laml_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: laml_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/lgg_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: lgg_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/lgg_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: lgg_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/lihc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: lihc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/lihc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: lihc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/luad_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: luad_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/luad_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: luad_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/lusc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: lusc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/lusc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: lusc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/meso_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: meso_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/meso_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: meso_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/ov_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: ov_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/ov_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: ov_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/paad_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: paad_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/paad_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: paad_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/pcpg_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: pcpg_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/pcpg_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: pcpg_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/prad_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: prad_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/prad_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: prad_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/sarc_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: sarc_tcga_pan_can_atlas_2018
resource_type: DEFINITION
data_filename: data_resource_definition.txt
3 changes: 3 additions & 0 deletions public/sarc_tcga_pan_can_atlas_2018/meta_resource_patient.txt
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
cancer_study_identifier: sarc_tcga_pan_can_atlas_2018
resource_type: PATIENT
data_filename: data_resource_patient.txt
Git LFS file not shown
3 changes: 3 additions & 0 deletions public/skcm_tcga_pan_can_atlas_2018/data_resource_patient.txt
Git LFS file not shown
Loading

0 comments on commit 10e3d3a

Please sign in to comment.