GBS_barcode

This script split GBS fastq file by barcode sequences.

Getting Started

Usage: GBS_barcode.pl yourBarCodeFile yourEnzymeFile

You will need to create two text files first, a barcode file and an enzyme file. Barcode File Example (see the example file, tab delimited text file with 4 columns, code p for paired end, you will need to provide the first end file under the file column :

SampleName	code	paired	file
s1	CTCC	s	my_1.fastq
s2	TGCA	s	my_1.fastq
s3	ACTA	s	my_1.fastq
s4	GTCT	s	my_1.fastq
s5	GAAT	s	my_1.fastq
s6	GCGT	s	my_1.fastq
s7	TGGC	s	my_1.fastq
s8	CGAT	s	my_1.fastq
s9	CTTGA	s	my_1.fastq
s10	TCACC	s	my_1.fastq
s11	CTAGC	s	my_1.fastq
s12	ACAAA	s	my_1.fastq
s13	TTCTC	s	my_1.fastq
s14	AGCCC	s	my_1.fastq

Enzyme File Example (ApeKI):

Enzyme: C[AT]GC
FinalSize: 64
Ends: GCTGGATC,GCAGGATC,GCTGAGAT,GCAGAGAT,GCAGC,GCTGC
EnzymeEndSize: 4

Note:

If you works in Linux or Mac environment, you can use gzip compressed Illumina files directly, as long as the file names end with .gz.
In barcode file, "s" refere to single end Illumina data, "p" refer to paired end Illumina data. For paired end data files, you only need to provide the first file (with _1 in file name). The second file should have "_2" in file name, and will automatically be recognized.
In the enzyme file, you are required to provide final size. If reads are shorter than the final size after trimming of the barcode and 3' adapter, the read will be padded with "N" at 3' end.
Some sample enzyme files are provided here.

Prerequisites

Installing

Download the PERL script.

Authors

Qi Sun

License

This project is licensed under the MIT License - see the LICENSE.md file for details

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
GBS_barcode.pl		GBS_barcode.pl
GBS_barcode_template.txt		GBS_barcode_template.txt
README.md		README.md
apeki.txt		apeki.txt
ecot22i.txt		ecot22i.txt
enzyme_template.txt		enzyme_template.txt
psti.txt		psti.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

GBS_barcode

Getting Started

Prerequisites

Installing

Authors

License

Acknowledgments

About

Releases

Packages

Languages

qisun2/GBS_barcode

Folders and files

Latest commit

History

Repository files navigation

GBS_barcode

Getting Started

Prerequisites

Installing

Authors

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages