NetBID (Network-based Bayesian Inference of Drivers) is a data-driven systems biology pipeline and toolkit for finding drivers from transcriptomics, proteomics and phosphoproteomics data, where the drivers can be either transcription factors (TF) or signaling factors (SIG).
NetBID 2.0 is an upgraded version of NetBID 1.0, which was published in Nature in 2018. NetBID 2.0 inherits all the main functions of NetBID 1.0 and adds many more functions and pipelines for advanced end-to-end analyses.
Requires R >= 3.6.0. Other dependencies are listed in the table at https://jyyulab.github.io/NetBID/docs/pre_request.
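The R version requirement can be checked before installation. The following is a minimal sketch using only base R; the variable name `min_version` is illustrative and not part of NetBID.

```r
# Minimum R version stated in the requirement above.
min_version <- "3.6.0"

# getRversion() returns the running R version as a comparable object.
if (getRversion() < min_version) {
  stop("R >= ", min_version, " is required; found ", as.character(getRversion()))
} else {
  message("R ", as.character(getRversion()), " satisfies the requirement.")
}
```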
Installation instructions are in the Installation section of the documentation.
Instructions, documentation, and tutorials can be found at:
A PDF manual, NetBID_manual.pdf, can be found in the repository.
Demo scripts can be found in the demo_scripts directory.
Summary of steps in pipeline_network_demo1.R:
- Step1: load in gene expression datasets for network construction (exp-load)
- Step2: normalization for the exp dataset (exp-QC)
- Step3: check sample cluster info, optional (exp-cluster)
- Step4: prepare SJARACNE (sjaracne-prep)
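To make the four steps above concrete, here is a conceptual sketch in base R on simulated data. It does not use NetBID's own functions and is not the content of pipeline_network_demo1.R. The toy matrix, the IQR cutoff, the clustering method, the output file names and the choice of candidate drivers are all assumptions made for illustration.

```r
set.seed(1)

# Step 1 (exp-load): a toy expression matrix, genes in rows, samples in columns.
# Real data would be read from a file or a public repository instead.
expr <- matrix(rexp(200 * 6, rate = 0.01), nrow = 200,
               dimnames = list(paste0("gene", 1:200), paste0("sample", 1:6)))

# Step 2 (exp-QC): log2-transform, then keep the more variable half of the genes by IQR.
log_expr <- log2(expr + 1)
gene_iqr <- apply(log_expr, 1, IQR)
filtered <- log_expr[gene_iqr > quantile(gene_iqr, 0.5), , drop = FALSE]

# Step 3 (exp-cluster): hierarchical clustering of samples on correlation distance.
sample_tree <- hclust(as.dist(1 - cor(filtered)))
sample_cluster <- cutree(sample_tree, k = 2)

# Step 4 (sjaracne-prep): write the filtered matrix and a candidate driver list.
# File names and the candidate selection are hypothetical placeholders.
out_dir <- tempdir()
candidate_drivers <- rownames(filtered)[1:10]
write.table(filtered, file.path(out_dir, "expression.txt"),
            sep = "\t", quote = FALSE, col.names = NA)
writeLines(candidate_drivers, file.path(out_dir, "candidate_drivers.txt"))
```

In a real run, the candidate list would hold known TF and SIG genes rather than the first ten rows, and the sample clusters would be compared against known sample annotations as a quality check.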
Summary of steps in pipeline_analysis_demo1.R:
- Step1: load in gene expression datasets for analysis (exp-load,exp-cluster,exp-QC)
- Step2: activity calculation (act-prep,act-get)
- Step3: get DE/DA (act-DA)
- Step4: generate master table (ms-tab)
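The idea behind steps 2 to 4 can be sketched in base R on simulated data. This is a simplified stand-in, not NetBID's implementation: activity is taken here as the mean z-scored expression of a driver's targets, differential activity is tested with a plain two-sample t-test, and the driver names, target sets and table columns are hypothetical.

```r
set.seed(2)

# Toy expression matrix: 50 genes x 8 samples, two groups of 4 samples.
expr <- matrix(rnorm(50 * 8), nrow = 50,
               dimnames = list(paste0("gene", 1:50), paste0("s", 1:8)))
group <- rep(c("G4", "other"), each = 4)

# Hypothetical network: each driver maps to a set of target genes.
targets <- list(driverA = paste0("gene", 1:10),
                driverB = paste0("gene", 11:25))

# Shift driverA's targets upward in G4 so there is a signal to detect.
expr[targets$driverA, group == "G4"] <- expr[targets$driverA, group == "G4"] + 2

# Activity (simplified): mean z-scored expression of the targets, per sample.
z <- t(scale(t(expr)))
activity <- t(sapply(targets, function(g) colMeans(z[g, , drop = FALSE])))

# Differential activity: two-sample t-test per driver, G4 vs. other.
da <- t(apply(activity, 1, function(a) {
  tt <- t.test(a[group == "G4"], a[group == "other"])
  c(diff = unname(tt$estimate[1] - tt$estimate[2]), p_value = tt$p.value)
}))

# Master table (simplified): one row per driver, sorted by p-value.
master_table <- data.frame(driver = rownames(da),
                           target_size = lengths(targets)[rownames(da)],
                           da, row.names = NULL)
master_table <- master_table[order(master_table$p_value), ]
print(master_table)
```

A master table in this sense gathers per-driver statistics into a single sortable data frame, which is what the downstream plotting and interpretation steps work from.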
Questions that the analyses in analysis_and_plot_demo1.R help to answer:
- Part I: More details about the top drivers
  - How to get the top drivers with significant differential activity (DA) in the comparison of G4 vs. other subtypes?
  - How to interpret the significance of top DA drivers?
  - What is the expression/activity pattern of these top DA drivers across sample subtypes?
  - What are the biological functions of these top DA drivers?
  - What are the biological functions of the target genes of these top DA drivers?
- Part II: More details about the selected driver
  - How to interpret the significance of the selected driver?
  - How to visualize the network structure of the selected driver?
  - What is the expression/activity of this selected driver across sample subtypes?
  - What are the functions of the target genes of this selected driver?
- Part III: Other analyses
  - What are the activities of the curated gene sets across all samples?
  - How to find drivers that share significantly overlapping target genes?
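As an illustration of the last question, one common way to test whether two drivers share more targets than expected by chance is Fisher's exact test on a 2x2 membership table. The sketch below uses base R with made-up gene sets; it assumes a background of 1000 genes and is not necessarily the test used in analysis_and_plot_demo1.R.

```r
# Hypothetical background gene universe and two drivers' target sets.
all_genes <- paste0("gene", 1:1000)
targets_a <- all_genes[1:100]
targets_b <- all_genes[61:180]

# Membership of every background gene in each target set.
in_a <- all_genes %in% targets_a
in_b <- all_genes %in% targets_b

# 2x2 contingency table and a one-sided test for enrichment of the overlap.
overlap_table <- table(in_a = in_a, in_b = in_b)
res <- fisher.test(overlap_table, alternative = "greater")

cat("Shared targets:", sum(in_a & in_b), "\n")
cat("Fisher p-value:", res$p.value, "\n")
```

The choice of background matters: restricting `all_genes` to genes actually present in the network gives a more conservative result than using every annotated gene.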