Differences
This shows you the differences between two versions of the page.
| Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
| support:demultiplexing [2024/04/11 13:17] – Completed page with links to sections. Richard Bowers | support:demultiplexing [2025/08/06 09:10] (current) – Richard Bowers | ||
|---|---|---|---|
| Line 3: | Line 3: | ||
| The data files the CRUK-CI sequencing service provides after sequencing your samples are demultiplexed according to the barcodes provided in the submission spreadsheet. It sometimes happens that users of the service make a mistake when filling in this spreadsheet and put an incorrect index against one or more samples, or samples are omitted from the submission spreadsheet. This manifests as those samples' | The data files the CRUK-CI sequencing service provides after sequencing your samples are demultiplexed according to the barcodes provided in the submission spreadsheet. It sometimes happens that users of the service make a mistake when filling in this spreadsheet and put an incorrect index against one or more samples, or samples are omitted from the submission spreadsheet. This manifests as those samples' | ||
| - | The good news is that the reads are still present in the data: they' | + | The good news is that the reads are still present in the data: they' |
| The bad news is that correcting the submission information regarding the indexes attached to the samples is troublesome to the point of impossibility in our Clarity LIMS system. We cannot therefore fix your submission to rerun demultiplexing and attach those files so you can fetch them with the download tool or on the FTP site. Rerunning demultiplexing is also not an automatic process: a little time and effort will be needed to reprocess the FASTQ files. | The bad news is that correcting the submission information regarding the indexes attached to the samples is troublesome to the point of impossibility in our Clarity LIMS system. We cannot therefore fix your submission to rerun demultiplexing and attach those files so you can fetch them with the download tool or on the FTP site. Rerunning demultiplexing is also not an automatic process: a little time and effort will be needed to reprocess the FASTQ files. | ||
| Line 15: | Line 15: | ||
| The initial creation and demultiplexing of the FASTQ files is done with [[https:// | The initial creation and demultiplexing of the FASTQ files is done with [[https:// | ||
| - | * [[https:// | + | * [[https:// |
| - | * [[https:// | + | * [[https:// |
| - | * [[https:// | + | * [[https:// |
| - | * [[https:// | + | |
| If you build from source, the INSTALL file in the tar ball gives instructions on how to build the program. | If you build from source, the INSTALL file in the tar ball gives instructions on how to build the program. | ||
| Line 59: | Line 58: | ||
| Regardless of the type of barcoding kit, you will need to produce a configuration file per read for your data. If your data is single end, one file is sufficient. If it is paired end, you will need a second configuration file with different file names for read two. In our naming conventions (as above), this is the "// | Regardless of the type of barcoding kit, you will need to produce a configuration file per read for your data. If your data is single end, one file is sufficient. If it is paired end, you will need a second configuration file with different file names for read two. In our naming conventions (as above), this is the "// | ||
| + | |||
| + | ==== Two or More Barcodes per Sample ==== | ||
| + | |||
| + | It is possible to demultiplex in such a way as to put reads from more than one barcode into a single file. One can do this by listing each barcode sequence in the index file with the same file name. | ||
| + | |||
| + | < | ||
| + | AGTTCC | ||
| + | AGTCAA | ||
| + | CAGATC | ||
| + | CTTGTA | ||
| + | </ | ||
| + | |||
| + | The example above will create two FASTQ files, each containing reads tagged with two different barcodes. | ||
| + | |||
| + | The most likely real world example of this is the old style single index 10x SIGA/SINA barcoding. These have four barcodes per sample, so do need to be combined into a single file for their relevant sample. For example, to demultiplex and extract the SINAA1 and SINAA2 barcodes from a multiplexed file one would use an index file similar to below. | ||
| + | |||
| + | < | ||
| + | AAACGGCG | ||
| + | CCTACCAT | ||
| + | GGCGTTTC | ||
| + | TTGTAAGA | ||
| + | AGCCCTTT | ||
| + | CAAGTCCA | ||
| + | GTGAGAAG | ||
| + | TCTTAGGC | ||
| + | </ | ||
| + | |||
| + | The examples here are for single index, but dual index works in exactly the same way with the additional column for the i5 sequence. | ||
| ===== Running the Demultiplexer ===== | ===== Running the Demultiplexer ===== | ||