Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Supporting NextSeq and MiniSeq #65

Open
avilella opened this issue Aug 14, 2018 · 12 comments
Open

Supporting NextSeq and MiniSeq #65

avilella opened this issue Aug 14, 2018 · 12 comments

Comments

@avilella
Copy link

Hi, what would be needed to support NextSeq and MiniSeq instruments?
Anything I can provide?

@johandahlberg
Copy link
Contributor

Hi!

Adding new instruments should be relatively simple. What is needed is:

  1. to implement new classes for the instruments here
    class NovaSeq(IlluminaInstrument):
    for each of the new instruments
  2. add the correct instrument identifier prefix here:
    machine_type_mappings = {"M": "miseq",
  3. implement reasonable default qc criteria in the config file

I'd be happy to add support for them. What I would need from you, is if you could tell me what prefix the instruments uses, and what you think would be reasonable default qc criteria. And, since I don't have access to data from these instruments it would be great if you could run some beta testing making sure that everything seems to work (or if possible send me some data that I could try it out on).

@apeltzer
Copy link

Hm, I have a couple of NextSeq 500 runs here that I could get hands on.

@johandahlberg
Copy link
Contributor

That's great, @apeltzer. I found some information which indicated that the NextSeq instruments have serial numbers that start with SN, is that correct? If I get a pre-release out, would you be willing to beta test it?

@apeltzer
Copy link

I guess I could do that yes - regarding the serial number, I will check. Could however very well be the case yes.

@apeltzer
Copy link

Thats a normal FastQ file out here:

@NS500559:25:HJHMNBGXX:1:11101:4226:1073 1:N:0:TTACTTCT+CTAACTTA
GATCTNGGTCTGGTTTCATCCGCGGCATTTTGCCACCCTGACCGGAGTGGTCTTTGCCGTCGGTTATCTGGGAAA
+
AAAAA#EEEEEEEEEEEEEEAEEEEEEE/A<EAE/EEAEEEEAEEEEEAEAA/E/EEEEEEEEEEAAA6EEEEE/
@NS500559:25:HJHMNBGXX:1:11101:18957:1076 1:N:0:TTACTTCT+CTAACTTA
 

@johandahlberg
Copy link
Contributor

Thanks! Do you have any idea about what values can be used to differentiate between the High and Mid-output modes of the instrument? I'm guessing that information would be available somewhere like the runParameters.xml, but since I don't have a runfolder I can't check it.

@apeltzer
Copy link

apeltzer commented Sep 13, 2018

I'm linking in Stephen here who should have access to such runParameters.xml - could you maybe make some available to Johan for that purpose? one for High and one for Mid Output modes on a NextSeq 500?

@sc13-bioinf

@cbrueffer
Copy link
Contributor

Here's some information based on our NextSeq 550 DX. The DX instrument version is certified for diagnostic use, so has a different instrument ID, e.g. NDX550213 in our case.

I don't have access to medium output kit runs, but the high output ones have this in RunParameters.xml under the RunParameters node: <Chemistry>NextSeq High</Chemistry>

@johandahlberg
Copy link
Contributor

Sorry for the very later reply @cbrueffer , and thank you for the information. While we don't currently have the resources to implement this, we would very much welcome a PR to fix it.

There is a stale PR here #69 where I stared work on this, that basically should take you through most of the changes that needs to be made.

@cbrueffer
Copy link
Contributor

cbrueffer commented Oct 13, 2020

No worries Johan; I haven't had time to look into this further yet (hopefully soon), but for now I can at least add some more information:

The mid output kit is marked as <Chemistry>NextSeq Mid</Chemistry> in RunParameters.xml.

ID strings for the regular NextSeq start with @NS and @NB, according to https://github.com/OpenGene/fastp/blob/e30ec117f2dd45148942064128f0c9b3a48876e3/src/evaluator.cpp#L25

@matrulda
Copy link
Collaborator

Nice! I can add that Illumina have NextSeq and MiniSeq data in their demo data collection (requires registration to access), could perhaps serve as testdata.

@maleasy
Copy link

maleasy commented Sep 6, 2021

I was wondering if there is any progress in supporting NextSeq?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

6 participants