ScreenProcessing

There are two steps to analyzing sequencing data from pooled screens using the ScreenProcessing pipeline

Convert raw sequencing files into library counts using bowtie alignment

The script used for this step is fastqgz_to_counts.py, and requires a reference fasta file of the library used in the experiment. Indices for several published libraries are included here, and more can be generated upon request.
Generate sgRNA phenotype scores, gene-level scores, and gene-level p-values

This step relies on the script process_experiments.py. This script requires you to first fill out a configuration file which allows you to:
- Assign the read counts generated for each sequencing file to the appropriate sample condition
- Specify which conditions you want to compare in order to generate phenotypes
- Set several data quality filtering and normalization parameters
- Specify how to score genes
This script also generates a set of standard graphs using screen_analysis.py
[Optional] Generate graphs interactively using screen_analysis.py

Dependencies

Python v3 (tested in 3.7; legacy scripts for v2.7 are available in the python2 folder)
Biopython
Scipy/Numpy/Pandas/Matplotlib
iPython or iPython Notebook recommended for interactive graph plotting

(ScreenProcessing no longer uses Bowtie to align sequencing reads; if you want to use or fork from this functionality use an earlier version of the program)

Installation (In Progress)

A requirements.txt file has been added, as there may be issues with some current packages. This may not be the most recent functional version - testing is in progress. This file should be used to create a virtual environment.
Alternatively, a Singularity Definition file has been added, intended to be used to create a Singularity container that has the correct functional versions of dependencies. Here is how to create a container:

singularity build ScreenProcessing.sif ScreenProcessing.def

ScreenProcessing Demo

A PDF slideshow with a step-by-step tutorial of screen analysis using the data files included in the Demo folder can found here: ScreenProcessing Demo

The demo files represent a tiny slice of the full sequencing dataset to speed up the download and demo scripts. The full complement of sequencing data used for the cell growth and cholera toxin sensitivity CRISPRi screens published in Gilbert and Horlbeck et al., Cell 2014 can be accessed here: data link

Name		Name	Last commit message	Last commit date
Latest commit History 109 Commits
Demo		Demo
library_reference		library_reference
library_tables		library_tables
python2		python2
.gitignore		.gitignore
README.md		README.md
ScreenProcessing.def		ScreenProcessing.def
ScreenProcessing_tutorial.pdf		ScreenProcessing_tutorial.pdf
cell_doubling_measurements.xlsx		cell_doubling_measurements.xlsx
experiment_config_file_BLANK.txt		experiment_config_file_BLANK.txt
expt_config_parser.py		expt_config_parser.py
fastqgz_to_counts.py		fastqgz_to_counts.py
process_experiments.py		process_experiments.py
requirements.txt		requirements.txt
screen_analysis.py		screen_analysis.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ScreenProcessing

There are two steps to analyzing sequencing data from pooled screens using the ScreenProcessing pipeline

Dependencies

Installation (In Progress)

ScreenProcessing Demo

About

Releases 2

Packages

Contributors 6

Languages

mhorlbeck/ScreenProcessing

Folders and files

Latest commit

History

Repository files navigation

ScreenProcessing

There are two steps to analyzing sequencing data from pooled screens using the ScreenProcessing pipeline

Dependencies

Installation (In Progress)

ScreenProcessing Demo

About

Resources

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 6

Languages

Packages