REPET-Slurm

A collection of scripts to get started with running the REPET pipeline on a cluster with the SLURM resource manager and a module system installed.

Caveats/Warnings

FASTA Format
- Header
  - Recommended format: ">XX_i" (XX = letters, i = numbers)
  - avoid spaces and symbols like "=;:|"
- 60 bps (or less) per line for sequences

Prerequisite Files

TEdenovo

Host genome (FASTA format)
REPET-specific Pfam HMM File
rDNA (FASTA format) of host genome
- RNAmmer
RepBase Amino Acid Database
RepBase Nucleotide Database
cDNA of host genome (FASTA format)

A RepeatScout bank can also be provided but there are additional pre-processing steps before it can be used in the pipeline. See the TEdenovo tuto webpage or text file included with REPET. These scripts currently do NOT perform this pre-processing steps.

TEannot

Host genome (FASTA format)
TE library (FASTA format)
- from TEdenovo or another source
RepBase Amino Acid Database
RepBase Nucleotide Database

Getting Started

TEdenovo

Clone the repository and copy the default configuration.

$ git clone https://github.com/stajichlab/REPET-slurm
$ cd REPET-slurm/TEdenovo
$ cp /path/to/REPET/config/TEdenovo.cfg .

Change the settings in TEdenovo.cfg and TEdenovo_AllSteps.sh to match your environment/project.
Copy/link the prerequisite files into the TEdenovo folder.
sh TEdenovo_AllSteps.sh or sbatch TEdenovo_AllSteps.sh.

TEannot

If you already ran TEdenovo, then skip step 1.

Clone the repository and copy the default configuration.

$ git clone https://github.com/stajichlab/REPET-slurm
$ cd REPET-slurm/TEannot
$ cp /path/to/REPET/config/TEannot.cfg .

Change the settings in TEannot.cfg and TEannot_AllSteps.sh to match your environment/project.
Copy/link the prerequisite files into the TEannot folder.
- TE library has a required naming format: <project_name>_refTEs.fa
sh TEannot_AllSteps.sh or sbatch TEannot_AllSteps.sh.

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
RepeatClassifier		RepeatClassifier
TEannot		TEannot
TEdenovo		TEdenovo
lib		lib
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

REPET-Slurm

Caveats/Warnings

Prerequisite Files

TEdenovo

TEannot

Getting Started

TEdenovo

TEannot

About

Releases 1

Packages

Contributors 2

Languages

License

stajichlab/REPET-slurm

Folders and files

Latest commit

History

Repository files navigation

REPET-Slurm

Caveats/Warnings

Prerequisite Files

TEdenovo

TEannot

Getting Started

TEdenovo

TEannot

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 2

Languages

Packages