action.skip

DriverMap Adaptive Immune Receptor (AIR) DNA TCR-BCR Profiling

Easy-to-run, single-day assay that uses multiplex PCR-NGS technology to profile T-cell receptor (TCR) or B-cell receptor (BCR) repertoire present in human DNA. The DriverMap AIR DNA assay measures the frequency of CDR3 clonotypes to quantify the clonal frequency of T and B cells.

Bellow you can see the structure of cDNA library.

The data for this tutorial consists of one sample prepared according to the manufacturer's protocol.

All data may be downloaded using the script bellow.

Use aria2c for efficient download of the full dataset with the proper filenames:

download.sh
mkdir -p raw
aria2c -c -s 16 -x 16 -k 1M -j 8 -i download-list.txt
download-list.txt
ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR264/031/SRR26446931/SRR26446931_1.fastq.gz
  out=raw/Sample1_R1.fastq.gz
ftp://ftp.sra.ebi.ac.uk/vol1/fastq/SRR264/031/SRR26446931/SRR26446931_2.fastq.gz
  out=raw/Sample1_R2.fastq.gz

Upstream analysis

MiXCR has a dedicated preset for this protocol, thus analysing the data is as easy as:

mixcr analyze cellecta-human-dna-xcr-umi-drivermap-air \
    raw/Sample1_R1.fastq.gz \
    raw/Sample1_R2.fastq.gz \
    results/Sample1

Running the command above will generate the following files:

> ls result/

# human-readable reports 
Sample1.align.report.txt
Sample1.align.report.json
Sample1.refine.report.txt
Sample1.refine.report.json
Sample1.assemble.report.txt
Sample1.assemle.report.json

# raw alignments (highly compressed binary file)
Sample1.vdjca

#Alignments with corrected barcode after ```mixcr refineTagsAndSort```
Sample1.refined.vdjca

# TCR and BCR CDR3 clonotypes (highly compressed binary file)
Sample1.clns

# TCR and BCR CDR3 clonotypes exported in tab-delimited txt
Sample1.clones_IGH.tsv
Sample1.clones_IGK.tsv
Sample1.clones_IGL.tsv
Sample1.clones_TRAD.tsv
Sample1.clones_TRB.tsv

While .clns file holds all data and is used for downstream analysis using mixcr postanalisis, the output .txt clonotype table will contain exhaustive information about each clonotype as well:

See Sample1.clones_IGH.tsv clonotype table
cloneId readCount readFraction uniqueMoleculeCount uniqueMoleculeFraction targetSequences targetQualities allVHitsWithScore allDHitsWithScore allJHitsWithScore allCHitsWithScore allVAlignments allDAlignments allJAlignments allCAlignments nSeqCDR3 minQualCDR3 aaSeqCDR3 refPoints
18756 248 0.0604141 3 0.0428571 TGTGCGAGAAACGACTGGTGGCAGATCAGGAAGGAGTACTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-2100(358.5),IGHV3-3300(356.7) IGHD2-800(40),IGHD3-900(35) IGHJ4*00(262.8) nan 463|472|494|0|9||90.0;463|472|494|0|9||90.0 41|49|93|13|21||40.0;46|53|93|12|19||35.0 28|37|68|33|42|SC30G|61.0 nan TGTGCGAGAAACGACTGGTGGCAGATCAGGAAGGAGTACTGG 58 CARNDWWQIRKEYW :::::::::0:-2:9:13:-10:-13:21:33:-8:42:::
10605 242 0.0589525 4 0.0571429 TGTGCGAGATTTGCGACCTGCCATATTTGCAAAGAAAACTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-33*00(523.5) IGHD6-600(32),IGHD2-2100(30),IGHD3-9*00(30) IGHJ2*00(269) nan 463|472|494|0|9||90.0 3|15|54|13|25|SG6CST11C|32.0;30|36|84|21|27||30.0;39|45|93|22|28||30.0 38|42|73|38|42||40.0 nan TGTGCGAGATTTGCGACCTGCCATATTTGCAAAGAAAACTGG 58 CARFATCHICKENW :::::::::0:-2:9:13:15:-21:25:38:-18:42:::
18754 206 0.0501827 3 0.0428571 TGTGCGAGAAGCTGGATCAGCAGCTGCCATGCCCGCGATTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV5-51*00(680.1) IGHD2-200(52),IGHD6-1300(48) IGHJ400(166.4),IGHJ500(166.4) nan 445|454|476|0|9||90.0 49|65|93|20|36|ST56CSG62C|52.0;4|22|63|17|35|ST8AST14CSA17G|48.0 34|37|68|39|42||30.0;37|40|71|39|42||30.0 nan TGTGCGAGAAGCTGGATCAGCAGCTGCCATGCCCGCGATTGG 58 CARSWISSCHARDW :::::::::0:-2:9:20:-18:3:36:39:-14:42:::
18755 193 0.0470158 3 0.0428571 TGTGCGAGAAGCGAGCTGTTCATCGAGAGCACCATCTGCAAGTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-11*00(327.2) IGHD6-600(35),IGHD2-200(32),IGHD6-13*00(31) IGHJ4*00(247.8) nan 459|468|490|0|9||90.0 3|10|54|11|18||35.0;44|56|93|27|39|ST46CSG51T|32.0;2|11|63|30|39|SG6T|31.0 34|37|68|42|45||30.0 nan TGTGCGAGAAGCGAGCTGTTCATCGAGAGCACCATCTGCAAGTGG 58 CARSELFIESTICKW :::::::::0:-2:9:11:15:-26:18:42:-14:45:::
176883 119 0.028989 1 0.0142857 TGTGCGAGAGATATTAGGAGTTACTATGGTTCGGGGAGTTATGATTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-46*00(646.4) IGHD3-10*00(91) IGHJ4*00(340) nan 447|459|478|0|12||120.0 34|55|93|20|41|SA46G|91.0 27|37|68|41|51|SC30T|71.0 nan TGTGCGAGAGATATTAGGAGTTACTATGGTTCGGGGAGTTATGATTACTGG 45 CARDIRSYYGSGSYDYW :::::::::0:1:12:20:-3:-7:41:41:-7:51:::
176878 111 0.0270402 1 0.0142857 TGTGCGAGAGATTCTCTAGCAGCAGCTGGTACAGGGGGATATTACTACTACTACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-30-3*00(1032.3) IGHD6-13*00(80) IGHJ6*00(635.4) nan 463|475|494|0|12||120.0 26|42|63|16|32||80.0 18|52|83|38|72||340.0 nan TGTGCGAGAGATTCTCTAGCAGCAGCTGGTACAGGGGGATATTACTACTACTACTACGGTATGGACGTCTGG 45 CARDSLAAAGTGGYYYYYYGMDVW :::::::::0:1:12:16:-5:0:32:38:2:72:::
176884 109 0.026553 1 0.0142857 TGTGCGAGAGAGCGACCCCCTCATGGAGTAGTACCAGCTGCTCACTACTACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-30-3*00(1035.2) IGHD2-2*00(81) IGHJ6*00(405.8) nan 463|474|494|0|11||110.0 38|57|93|23|42|ST40G|81.0 26|52|83|43|69||260.0 nan TGTGCGAGAGAGCGACCCCCTCATGGAGTAGTACCAGCTGCTCACTACTACTACGGTATGGACGTCTGG 45 CARERPPHGVVPAAHYYYGMDVW :::::::::0:0:11:23:-7:-5:42:43:-6:69:::
177188 108 0.0263094 1 0.0142857 TGTGCACACAGGAGGGGTGGGAGTTACTATAACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV2-5*00(942.9) IGHD1-2600(51),IGHD3-1000(51) IGHJ4*00(290.7) nan 451|462|484|0|11||110.0 26|39|60|16|29|SC33T|51.0;47|60|93|18|31|ST55C|51.0 32|37|68|31|36||50.0 nan TGTGCACACAGGAGGGGTGGGAGTTACTATAACTGG 45 CAHRRGGSYYNW :::::::::0:-2:11:16:-6:-1:29:31:-12:36:::
176885 106 0.0258222 1 0.0142857 TGTGCGAGAGAGAATATTTCGCCGTGGGGTACAGTAACTACGTATATCCTTGGGAACTACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-18*00(795.8) IGHD4-1100(75),IGHD4-400(75) IGHJ6*00(516.5) nan 446|457|477|0|11||110.0 20|35|48|29|44||75.0;20|35|48|29|44||75.0 29|52|83|55|78||230.0 nan TGTGCGAGAGAGAATATTTCGCCGTGGGGTACAGTAACTACGTATATCCTTGGGAACTACTACGGTATGGACGTCTGG 45 CARENISPWGTVTTYILGNYYGMDVW :::::::::0:0:11:29:-4:3:44:55:-9:78:::
94933 103 0.0250914 1 0.0142857 TGTGCTAAAGATATTCGAAAATTACGATCTTTTCACTGGCTATTATCTGGGGGCTACTACTTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-43*00(942.7) IGHD3-9*00(88) IGHJ4*00(476.2) nan 465|479|498|0|14|SA470T|111.0 33|59|93|20|46|SA41CSG46CST52C|88.0 19|37|68|54|72||180.0 nan TGTGCTAAAGATATTCGAAAATTACGATCTTTTCACTGGCTATTATCTGGGGGCTACTACTTTGACTACTGG 45 CAKDIRKLRSFHWLLSGGYYFDYW :::::::::0:1:14:20:-2:-3:46:54:1:72:::
176874 103 0.0250914 1 0.0142857 TGTGCGAGAGCCAAAGGCGATCTGTTCGCACTGCTGAGCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-1100(541.2),IGHV3-7100(541.2) IGHD6-1900(35),IGHD6-1300(30) IGHJ6*00(299.9) nan 459|469|490|0|10||100.0;471|481|502|0|10||100.0 8|15|63|28|35||35.0;7|13|63|30|36||30.0 48|52|83|38|42||40.0 nan TGTGCGAGAGCCAAAGGCGATCTGTTCGCACTGCTGAGCTGG 45 CARAKGDLFALLSW :::::::::0:-1:10:28:13:-27:35:38:-28:42:::
176893 100 0.0243605 1 0.0142857 TGTGCGAGAAACGAATGGTGGCATATCAGAAAGGAGTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-48*00(944.3) IGHD2-200(36),IGHD2-2100(32) IGHJ4*00(236.6) nan 465|474|496|0|9||90.0 0|10|93|19|29|SG6T|36.0;23|35|84|14|26|SC26GSA28G|32.0 28|37|68|33|42|SC30G|61.0 nan TGTGCGAGAAACGAATGGTGGCATATCAGAAAGGAGTACTGG 45 CARNEWWHIRKEYW :::::::::0:-2:9:19:31:-52:29:33:-8:42:::
43758 96 0.0233861 2 0.0285714 TGTGCGAGAGATCGTAGCGCCATTCGCGATGATGATTATTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV4-34*00(501.4) IGHD5-1200(40),IGHD5-1800(40) IGHJ4*00(236.4) nan 442|452|473|0|10||100.0 3|11|69|10|18||40.0;3|11|69|10|18||40.0 27|37|68|32|42|SC30TSC33T|42.0 nan TGTGCGAGAGATCGTAGCGCCATTCGCGATGATGATTATTGG 58 CARDRSAIRDDDYW :::::::::0:-1:10:10:20:-35:18:32:-7:42:::
176892 91 0.0221681 1 0.0142857 TGTGCGAGAAACGAGTGGTGGCACATCAGCAAGGAGTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNJNNNNNNN IGHV3-48*00(1020) IGHD2-1500(40),IGHD2-2100(35),IGHD3-3*00(35) IGHJ4*00(161.8) nan 465|474|496|0|9||90.0 41|49|93|13|21||40.0;36|43|84|14|21||35.0;46|53|93|12|19||35.0 28|37|68|33|42|SC30G|61.0 nan TGTGCGAGAAACGAGTGGTGGCACATCAGCAAGGAGTACTGG 41 CARNEWWHISKEYW :::::::::0:-2:9:13:-10:-13:21:33:-8:42:::
43755 90 0.0219245 2 0.0285714 TGTGCGAGAGCTCCGCCTCTCGAAACAATTAATATTTCTTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV1-46*00(583.2) IGHD5-2400(35),IGHD3-900(30) IGHJ400(127.6),IGHJ500(127.6) nan 447|460|478|0|13|SA457C|101.0 32|39|60|24|31||35.0;39|45|93|31|37||30.0 34|37|68|39|42||30.0;37|40|71|39|42||30.0 nan TGTGCGAGAGCTCCGCCTCTCGAAACAATTAATATTTCTTGG 58 CARAPPLETINISW :::::::::0:2:13:24:-12:-1:31:39:-14:42:::
176865 88 0.0214373 1 0.0142857 TGTGCGAGATTTGCAACCTGCCAGATTTGCAACGAAAACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-33*00(1006.1) IGHD1-100(26),IGHD1-2000(26),IGHD3-16*00(26) IGHJ2*00(206.5) nan 463|472|494|0|9||90.0 25|33|51|27|35|SG27C|26.0;25|33|51|27|35|SG27C|26.0;5|13|111|30|38|ST10A|26.0 38|42|73|38|42||40.0 nan TGTGCGAGATTTGCAACCTGCCAGATTTGCAACGAAAACTGG 45 CARFATCQICNENW :::::::::0:-2:9:27:-8:-1:35:38:-18:42:::
43757 86 0.0209501 2 0.0285714 TGTGCGAGAGGCAGGGCCTACGCCCTGGACGAGAGGTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-49*00(487.4) IGHD6-600(35),IGHD6-2500(32) IGHJ2*00(278.8) nan 471|481|502|0|10|SA474GST476G|42.0 0|7|54|26|33||35.0;25|37|54|10|22|SC29GSG31C|32.0 39|42|73|36|39||30.0 nan TGTGCGAGAGGCAGGGCCTACGCCCTGGACGAGAGGTGG 58 CARGRAYALDERW :::::::::0:-1:10:26:18:-29:33:36:-19:39:::
176910 86 0.0209501 1 0.0142857 TGTGCGAAAGAGTTGATGCGGACTGGTGTCGGGGAGGTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-23*00(902.4) IGHD2-800(35),IGHD3-900(35),IGHD3-10*00(34) IGHJ4*00(407.5) nan 465|476|496|0|11||110.0 41|48|93|21|28||35.0;46|53|93|20|27||35.0;40|52|93|23|36|I44GSA46G|34.0 26|37|68|37|48||110.0 nan TGTGCGAAAGAGTTGATGCGGACTGGTGTCGGGGAGGTTGACTACTGG 45 CAKELMRTGVGEVDYW :::::::::0:0:11:21:-10:-14:28:37:-6:48:::
176881 85 0.0207065 1 0.0142857 TGTGCGAGAGATCGGAGCGCAATTCGCGAGGATGATTATTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-400(494),IGHV4-3400(493.5) IGHD6-600(26),IGHD2-2100(25),IGHD5-24*00(25) IGHJ4*00(313.4) nan 444|457|475|0|13||130.0;442|452|473|0|10||100.0 4|21|54|14|30|DT8ST11ASA14TST16G|26.0;6|11|84|18|23||25.0;33|38|60|19|24||25.0 27|37|68|32|42|SC30TSC33T|42.0 nan TGTGCGAGAGATCGGAGCGCAATTCGCGAGGATGATTATTGG 45 CARDRSAIREDDYW :::::::::0:2:13:14:14:-15:30:32:-7:42:::
43756 77 0.0187576 2 0.0285714 TGTGCGAGAGCTCCACCTCTAGAAACGATTAATATTTCTTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV1-3*00(840.8) IGHD5-1200(35),IGHD5-1800(35),IGHD3-16*00(30) IGHJ400(223.7),IGHJ500(223.7) nan 447|460|478|0|13|SA457C|101.0 38|45|69|24|31||35.0;38|45|69|24|31||35.0;5|11|111|23|29||30.0 34|37|68|39|42||30.0;37|40|71|39|42||30.0 nan TGTGCGAGAGCTCCACCTCTAGAAACGATTAATATTTCTTGG 58 CARAPPLETINISW :::::::::0:2:13:24:-15:-1:31:39:-14:42:::
187949 77 0.0187576 1 0.0142857 TGTGCAACAGTCGAGAACTTACTATGGGAATACTACTTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-2400(464.8),IGHV1-69-200(464.8) IGHD3-1000(45),IGHD3-2200(40) IGHJ4*00(445.7) nan 447|457|478|0|10||100.0;447|457|478|0|10||100.0 34|43|93|18|27||45.0;34|42|93|18|26||40.0 19|37|68|30|48||180.0 nan TGTGCAACAGTCGAGAACTTACTATGGGAATACTACTTTGACTACTGG 45 CATVENLLWEYYFDYW :::::::::0:-1:10:18:-3:-19:27:30:1:48:::
176853 76 0.018514 1 0.0142857 TGTGCGACCGGCAAGGATGGGGGGGCCTTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-3000(930.6),IGHV3-30-300(930.6) IGHD2-200(30),IGHD3-1600(30) IGHJ4*00(287.5) nan 463|470|494|0|7||70.0;463|470|494|0|7||70.0 60|66|93|7|13||30.0;53|59|111|17|23||30.0 24|37|68|26|39||130.0 nan TGTGCGACCGGCAAGGATGGGGGGGCCTTTGACTACTGG 45 CATGKDGGAFDYW :::::::::0:-4:7:7:-29:4:13:26:-4:39:::
176879 76 0.018514 1 0.0142857 TGTGCGAGAGATCGCAGCGCGATTCGCGAAGATGATTATTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-34*00(996.6) IGHD6-2500(31),IGHD6-600(27) IGHJ4*00(312.6) nan 442|452|473|0|10||100.0 22|31|54|10|19|SA24C|31.0;22|33|54|10|21|SA24CST30G|27.0 27|37|68|32|42|SC30TSC33T|42.0 nan TGTGCGAGAGATCGCAGCGCGATTCGCGAAGATGATTATTGG 45 CARDRSAIREDDYW :::::::::0:-1:10:10:-4:-5:19:32:-7:42:::
176905 73 0.0177832 1 0.0142857 TGTGCGAAAGGGTTGAGTGACTCCCTTATCAGGTATTTCGATCTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-23*00(521) IGHD3-1000(35),IGHD3-1600(30),IGHD4-17*00(30) IGHJ2*00(447.5) nan 465|475|496|0|10||100.0 9|16|93|19|26||35.0;12|18|111|19|25||30.0;24|30|48|16|22||30.0 25|42|73|31|48|SC29T|141.0 nan TGTGCGAAAGGGTTGAGTGACTCCCTTATCAGGTATTTCGATCTCTGG 45 CAKGLSDSLIRYFDLW :::::::::0:-1:10:19:22:-46:26:31:-5:48:::
43754 70 0.0170524 2 0.0285714 TGTGCGAGACGGGACTCTATGATTACGTTTGGGGGAGTACGAGTAGATGCTTTTGATATCTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-21*00(450.3) IGHD3-16*00(91) IGHJ3*00(478.3) nan 463|472|494|0|9||90.0 41|62|111|17|38|SA50G|91.0 21|39|70|45|63||180.0 nan TGTGCGAGACGGGACTCTATGATTACGTTTGGGGGAGTACGAGTAGATGCTTTTGATATCTGG 58 CARRDSMITFGGVRVDAFDIW :::::::::0:-2:9:17:-4:-12:38:45👎63:::
43753 67 0.0163216 2 0.0285714 TGTGCGAGATATGCGAACGATCGCGAAGCGAGCTATTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-23*00(471.4) IGHD1-100(30),IGHD2-2100(30),IGHD3-16*00(30) IGHJ4*00(265.2) nan 465|472|496|0|7||70.0 27|33|51|14|20||30.0;21|27|84|8|14||30.0;5|11|111|15|21||30.0 30|37|68|32|39|SC33T|41.0 nan TGTGCGAGATATGCGAACGATCGCGAAGCGAGCTATTGG 58 CARYANDREASYW :::::::::0:-4:7:14:-10:-1:20:32:-10:39:::
187998 63 0.0153471 1 0.0142857 TGTGCAAGAAGGGGGGGTGGGAGCTACTCTGCTGCTTTTGATATCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-64D*00(478.5) IGHD1-26*00(60) IGHJ3*00(458.7) nan 461|465|492|0|4||40.0 26|38|60|16|28||60.0 20|39|70|29|48|SA22C|161.0 nan TGTGCAAGAAGGGGGGGTGGGAGCTACTCTGCTGCTTTTGATATCTGG 45 CARRGGGSYSAAFDIW :::::::::0:-7:4:16:-6:-2:28:29:0:48:::
176871 61 0.0148599 1 0.0142857 TGTGCGAGACATCGCCAAGTGGTGGTAGCTGCTACTATCTGGTTCGACCCCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-3900(489.6),IGHV4-3400(488.6) IGHD2-15*00(95) IGHJ5*00(376.2) nan 448|460|479|0|12||120.0;442|451|473|0|9||90.0 41|60|93|17|36||95.0 24|40|71|38|54||160.0 nan TGTGCGAGACATCGCCAAGTGGTGGTAGCTGCTACTATCTGGTTCGACCCCTGG 45 CARHRQVVVAATIWFDPW :::::::::0:1:12:17:-10:-2:36:38:-4:54:::
176876 59 0.0143727 1 0.0142857 TGTGCGAGAGGCCGCGCCCCGGAAAGCGCCCTGGCCGATTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-2*00(634.3) IGHD2-2100(33),IGHD1-1400(30) IGHJ1*00(180.3) nan 447|457|478|0|10||100.0 54|64|84|18|27|DT60|33.0;24|30|51|18|24||30.0 38|41|72|39|42||30.0 nan TGTGCGAGAGGCCGCGCCCCGGAAAGCGCCCTGGCCGATTGG 45 CARGRAPESALADW :::::::::0:-1:10:18:-26:8:27:39:-18:42:::
176873 58 0.0141291 1 0.0142857 TGTGCGAGAGCCAACGGCGAGCTGTTCGCCCTGCTGAGCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-53*00(543) IGHD6-6*00(38) IGHJ6*00(293.9) nan 460|470|491|0|10||100.0 35|46|54|14|24|DA38|38.0 48|52|83|38|42||40.0 nan TGTGCGAGAGCCAACGGCGAGCTGTTCGCCCTGCTGAGCTGG 45 CARANGELFALLSW :::::::::0:-1:10:14:-17:10:24:38:-28:42:::
176887 53 0.0129111 1 0.0142857 TGTGCGAGAGAAAAAAGCAGTGGCTGGTACGGTTACTGGTTCGACCCCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-2*00(940.3) IGHD6-19*00(80) IGHJ5*00(289.5) nan 447|458|478|0|11||110.0 27|43|63|15|31||80.0 23|40|71|34|51||170.0 nan TGTGCGAGAGAAAAAAGCAGTGGCTGGTACGGTTACTGGTTCGACCCCTGG 45 CAREKSSGWYGYWFDPW :::::::::0:0:11:15:-6:1:31:34:-3:51:::
176908 52 0.0126675 1 0.0142857 TGTGCGAAAGATCATCGGAGGATAGTTAACTACTGCTTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-30*00(1020.2) IGHD2-1500(30),IGHD2-800(30),IGHD4-4*00(30) IGHJ4*00(454.3) nan 463|476|494|0|13||130.0 31|37|93|18|24||30.0;31|37|93|18|24||30.0;1|7|48|22|28||30.0 19|37|68|30|48|SA23G|151.0 nan TGTGCGAAAGATCATCGGAGGATAGTTAACTACTGCTTTGACTACTGG 45 CAKDHRRIVNYCFDYW :::::::::0:2:13:18:0:-25:24:30:1:48:::
176877 51 0.0124239 1 0.0142857 TGTGCGAGAGGAGGTGATTACTATGATAGCAGTGGTTATCCTCGGGCCGTATACTACTACGGTTTGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-18*00(1019) IGHD3-22*00(101) IGHJ6*00(495.1) nan 446|456|477|0|10||100.0 33|56|93|16|39|ST46C|101.0 28|52|83|51|75|SA40T|211.0 nan TGTGCGAGAGGAGGTGATTACTATGATAGCAGTGGTTATCCTCGGGCCGTATACTACTACGGTTTGGACGTCTGG 45 CARGGDYYDSSGYPRAVYYYGLDVW :::::::::0:-1:10:16:-2:-6:39:51:-8:75:::
176869 50 0.0121803 1 0.0142857 TGTGCGAGACTGAATTACGATTTTTGGAGTGGTTACTACTACTACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-21*00(448.5) IGHD3-3*00(95) IGHJ6*00(596.4) nan 463|472|494|0|9||90.0 33|52|93|13|32||95.0 21|52|83|32|63||310.0 nan TGTGCGAGACTGAATTACGATTTTTGGAGTGGTTACTACTACTACTACGGTATGGACGTCTGG 45 CARLNYDFWSGYYYYYGMDVW :::::::::0:-2:9:13:-2:-10:32:32👎63:::
176890 49 0.0119367 1 0.0142857 TGTGCGAGAAGCAGACAGATTAGGAGTGAGCTGTGGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-69*00(893.7) IGHD6-1900(34),IGHD3-300(31),IGHD6-6*00(30) IGHJ6*00(207.6) nan 448|457|479|0|9||90.0 26|38|63|20|33|SC29GI34A|34.0;42|51|93|19|28|ST44A|31.0;4|10|54|27|33||30.0 50|52|83|35|37||20.0 nan TGTGCGAGAAGCAGACAGATTAGGAGTGAGCTGTGGG 45 CARSRQ_LGVSCG :::::::::0:-2:9:20:-5:-4:33:35:-30:37:::
177180 49 0.0119367 1 0.0142857 TGTGCACGGATGGGGTACTACTACTACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV2-7000(536.5),IGHV2-70D00(536.5) nan IGHJ6*00(468.1) nan 451|462|484|0|11||110.0;451|462|484|0|11||110.0 nan 22|52|83|15|45||300.0 nan TGTGCACGGATGGGGTACTACTACTACTACGGTATGGACGTCTGG 45 CARMGYYYYYGMDVW :::::::::0:-2:11:::::15:-2:45:::
176868 48 0.0116931 1 0.0142857 TGTGCGAGATATGCAAACGAGCGCGACGCGAGCTATTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-23*00(1006.3) IGHD2-2100(30),IGHD6-600(30) IGHJ4*00(212.1) nan 465|472|496|0|7||70.0 21|27|84|8|14||30.0;2|8|54|16|22||30.0 30|37|68|32|39|SC33T|41.0 nan TGTGCGAGATATGCAAACGAGCGCGACGCGAGCTATTGG 45 CARYANERDASYW :::::::::0:-4:7:8:7:-29:14:32:-10:39:::
176859 47 0.0114495 1 0.0142857 TGTGCGAGCGCCCGCCTGGCGGTGGGAGCGGCGCGGTACTACTACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-3100(451.8),IGHV4-3400(451.8) IGHD1-26*00(40) IGHJ6*00(553.6) nan 447|455|478|0|8||80.0;442|450|473|0|8||80.0 26|34|60|21|29||40.0 25|52|83|36|63||270.0 nan TGTGCGAGCGCCCGCCTGGCGGTGGGAGCGGCGCGGTACTACTACTACGGTATGGACGTCTGG 45 CASARLAVGAARYYYYGMDVW :::::::::0:-3:8:21:-6:-6:29:36:-5:63:::
43759 46 0.0112058 2 0.0285714 TGTGCGAGAAGCGACCTGTTCATGGAGAGCAGCATCTGCAAGTGG [[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[[ IGHV3-11*00(498.7) IGHD6-1300(41),IGHD2-1500(38),IGHD3-10*00(36) IGHJ1*00(91.9) nan 459|468|490|0|9||90.0 27|38|63|27|38|SG34T|41.0;0|11|93|23|33|DT4|38.0;42|52|93|17|27|SG47T|36.0 38|41|72|42|45||30.0 nan TGTGCGAGAAGCGACCTGTTCATGGAGAGCAGCATCTGCAAGTGG 58 CARSDLFMESSICKW :::::::::0:-2:9:27:-6:-4:38:42:-18:45:::
176911 45 0.0109622 1 0.0142857 TGTGCGAAAAGGGGCCCCTCCTGGGGATTCGATGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-23*00(453) IGHD7-27*00(40) IGHJ4*00(346.6) nan 465|474|496|0|9||90.0 15|23|33|20|28||40.0 27|37|68|32|42||100.0 nan TGTGCGAAAAGGGGCCCCTCCTGGGGATTCGATGACTACTGG 45 CAKRGPSWGFDDYW :::::::::0:-2:9:20:-4:1:28:32:-7:42:::
176872 44 0.0107186 1 0.0142857 TGTGCGAGAGTGGATACAGCCGAGTCCTTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-2100(550),IGHV3-3300(550) IGHD5-5*00(50) IGHJ4*00(438) nan 463|473|494|0|10||100.0;463|473|494|0|10||100.0 21|31|60|10|20||50.0 24|37|68|26|39||130.0 nan TGTGCGAGAGTGGATACAGCCGAGTCCTTTGACTACTGG 45 CARVDTAESFDYW :::::::::0:-1:10:10👎-9:20:26:-4:39:::
176875 43 0.010475 1 0.0142857 TGTGCGAGAGGTCCGGGGGGGGACTACGTTGACGAGTCCGGGTTTGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-18*00(1019) IGHD4-17*00(46) IGHJ4*00(418.1) nan 446|459|477|0|13|SA456G|101.0 17|29|48|21|33|SG24T|46.0 25|37|68|42|54||120.0 nan TGTGCGAGAGGTCCGGGGGGGGACTACGTTGACGAGTCCGGGTTTGACTACTGG 45 CARGPGGDYVDESGFDYW :::::::::0:2:13:21👎-3:33:42:-5:54:::
188543 42 0.0102314 1 0.0142857 TGTACTAGAGATTGTAGTAGTTCCAGTTGCATATGGGGGGGTCACTACGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-49*00(505.1) IGHD2-200(70),IGHD1-700(60) IGHJ6*00(461.1) nan 471|483|502|0|12||120.0 38|60|93|12|35|SA47TSC52TI56A|70.0;0|12|51|16|28||60.0 32|52|83|43|63||200.0 nan TGTACTAGAGATTGTAGTAGTTCCAGTTGCATATGGGGGGGTCACTACGGTATGGACGTCTGG 45 CTRDCSSSSCIWGGHYGMDVW :::::::::0:1:12:12:-7:-2:35:43:-12:63:::
188542 41 0.00998782 1 0.0142857 TGTACTAGATTGCCCTTCTCGGTGGCTACGCACATGACCGTTCCTTTTGATATCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-49*00(503.6) IGHD5-1200(45),IGHD5-1800(45),IGHD4-17*00(41) IGHJ3*00(435.6) nan 471|480|502|0|9||90.0 32|41|69|21|30||45.0;32|41|69|21|30||45.0;22|33|48|19|30|SA27G|41.0 25|39|70|43|57||140.0 nan TGTACTAGATTGCCCTTCTCGGTGGCTACGCACATGACCGTTCCTTTTGATATCTGG 45 CTRLPFSVATHMTVPFDIW :::::::::0:-2:9:21:-9:-5:30:43:-5:57:::
176901 39 0.00950061 1 0.0142857 TGTGCGAAGGTAGATCATTTCCACATATACTTCTACGGTTTGGATGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-23*00(500.3) IGHD3-1600(28),IGHD4-1700(28),IGHD4-4*00(28) IGHJ6*00(460.5) nan 465|473|496|0|8||80.0 24|32|111|9|18|I27G|28.0;9|17|48|9|18|I13A|28.0;9|17|48|9|18|I13A|28.0 28|52|83|27|51|SA32TSA40TSC45T|153.0 nan TGTGCGAAGGTAGATCATTTCCACATATACTTCTACGGTTTGGATGTCTGG 45 CAKVDHFHIYFYGLDVW :::::::::0:-3:8:9:13:-42:18:27:-8:51:::
176607 38 0.009257 1 0.0142857 TGTGCGTCTCATTGTAGTGGTGGTAGCTGCTACTCGGGGGACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-66*00(486.1) IGHD2-15*00(126) IGHJ400(354.9),IGHJ500(344.9) nan 460|466|491|0|6||60.0 36|64|93|10|38|SC61G|126.0 32|37|68|40|45||50.0;36|40|71|41|45||40.0 nan TGTGCGTCTCATTGTAGTGGTGGTAGCTGCTACTCGGGGGACTGG 45 CASHCSGGSCYSGDW :::::::::0:-5:6:10:-5:2:38:40:-12:45:::
176882 38 0.009257 1 0.0142857 TGTGCGAGAGATGCGGTATGGTTCGGGGAGTTATCTGGTATGGACGTCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-18*00(945.2) IGHD3-10*00(76) IGHJ6*00(352.3) nan 446|458|477|0|12||120.0 38|56|93|16|34|SA46G|76.0 37|52|83|36|51||150.0 nan TGTGCGAGAGATGCGGTATGGTTCGGGGAGTTATCTGGTATGGACGTCTGG 45 CARDAVWFGELSGMDVW :::::::::0:1:12:16:-7:-6:34:36:-17:51:::
176866 37 0.0090134 1 0.0142857 TGTGCGAGATGCGCGATGGAACTGCTGATTGCGACCCGCGAAGAATGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV1-8*00(571.6) IGHD2-2100(36),IGHD1-700(35) IGHJ400(203),IGHJ500(203) nan 448|460|479|0|12|SG457T|91.0 40|50|84|22|32|SG42C|36.0;25|32|51|16|23||35.0 34|37|68|45|48||30.0;37|40|71|45|48||30.0 nan TGTGCGAGATGCGCGATGGAACTGCTGATTGCGACCCGCGAAGAATGG 45 CARCAMELLIATREEW :::::::::0:1:12:22:-12:-6:32:45:-14:48:::
176909 35 0.00852619 1 0.0142857 TGTGCGAAAGATAGGGAGAACGCGTGTATTTCCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-3000(1042.3),IGHV3-30-300(1013.3) IGHD2-800(30),IGHD3-1000(30),IGHD1-1*00(28) IGHJ5*00(330.3) nan 463|475|494|0|12||120.0;463|475|494|0|12|SG470A|91.0 48|54|93|23|29||30.0;46|52|93|12|18||30.0;27|36|51|17|25|DA32|28.0 35|40|71|31|36||50.0 nan TGTGCGAAAGATAGGGAGAACGCGTGTATTTCCTGG 45 CAKDRENACISW :::::::::0:1:12:23:-17:-8:29:31:-15:36:::
176857 34 0.00828258 1 0.0142857 TGTGCGAGTCATTACTATGGTTCGGGGAGTTATTATAATGACTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-5900(499.2),IGHV4-6100(499.2) IGHD3-10*00(126) IGHJ4*00(372.6) nan 441|449|472|0|8||80.0;447|455|478|0|8||80.0 33|61|93|10|38|SA46G|126.0 27|37|68|38|48||100.0 nan TGTGCGAGTCATTACTATGGTTCGGGGAGTTATTATAATGACTACTGG 45 CASHYYGSGSYYNDYW :::::::::0:-3:8:10:-2:-1:38:38:-7:48:::
176889 34 0.00828258 1 0.0142857 TGTGCGAGAAGCGAACTGTTCATTGAGAGCATCATCTGCAAGTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-11*00(494.4) IGHD7-2700(31),IGHD6-1300(27) IGHJ4*00(228.5) nan 459|468|490|0|9||90.0 9|18|33|9|18|ST12G|31.0;27|38|63|27|38|SG31TSG34T|27.0 34|37|68|42|45||30.0 nan TGTGCGAGAAGCGAACTGTTCATTGAGAGCATCATCTGCAAGTGG 45 CARSELFIESIICKW :::::::::0:-2:9:9:2:-4:18:42:-14:45:::
176867 28 0.00682095 1 0.0142857 TGTGCGAGATATGCCAACGAACGCGATGCGAGCTATTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV3-23*00(992) IGHD2-200(30),IGHD2-2100(30) IGHJ4*00(211) nan 465|472|496|0|7||70.0 56|62|93|9|15||30.0;21|27|84|8|14||30.0 30|37|68|32|39|SC33T|41.0 nan TGTGCGAGATATGCCAACGAACGCGATGCGAGCTATTGG 45 CARYANERDASYW :::::::::0:-4:7:9:-25:0:15:32:-10:39:::
176880 19 0.0046285 1 0.0142857 TGTGCGAGAGATCGCAGCGCGATTCGCGAAGATGATTACTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-4*00(529.5) IGHD6-2500(30),IGHD6-600(26) IGHJ4*00(297.8) nan 444|457|475|0|13||130.0 25|31|54|13|19||30.0;25|33|54|13|21|ST30G|26.0 27|37|68|32|42|SC30T|71.0 nan TGTGCGAGAGATCGCAGCGCGATTCGCGAAGATGATTACTGG 45 CARDRSAIREDDYW :::::::::0:2:13:13:-7:-5:19:32:-7:42:::
176888 16 0.00389769 1 0.0142857 TGTGCGAGAATCATACCGGCATTGGCCCCCGCCTTGGCCTGG NNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNN IGHV4-5900(552.8),IGHV4-6100(552.8) IGHD3-300(37),IGHD1-1400(36),IGHD1-26*00(36) IGHJ5*00(148.6) nan 441|450|472|0|9||90.0;447|456|478|0|9||90.0 54|67|93|9|22|ST56CST64C|37.0;12|22|51|12|22|ST19C|36.0;15|25|60|12|22|ST22C|36.0 35|40|71|37|42||50.0 nan TGTGCGAGAATCATACCGGCATTGGCCCCCGCCTTGGCCTGG 45 CARIIPALAPALAW :::::::::0:-2:9:9:-23:5:22:37:-15:42:::

Quality control

Now let's look at the QC plots. That can be easily done using mixcr exportQc function.

First we will look at the alignment report:

mixcr exportQc align \
    Sample1.clns \
    figs/alignQc.pdf

align QC

From this plot we can tell that all samples have high alignment rate and nearly 90% of all reads from each sample have been successfully aligned to the reference sequences and CDR3 has been established.

Now we can check chains distribution plot:

mixcr exportQc chainUsage \
    results/Sample1.clns \
    figs/chainUsage.pdf

chain usage QC

Finally, we can take a look at the reads per UMI distribution.

mixcr exportQc tags \
    results/Sample1.clns \
    figs/tags.pdf

tags QC

Reports

Finally, MiXCR provides a very convenient way to look at the reports generated at ech step. Every .vdjca, .clns and .clna file holds all the reports for every MiXCR function that has been applied to this sample. E.g. in our case .clns file contains reports for mixcr align and mixcr assemble. To output this report use mixcr exportReports as shown bellow. Note --json parameter will output a JSON-formatted report.

mixcr exportReports \
    results/Sample1.clns \
    results/Sample1.report.txt
mixcr exportReports \
    --json \
    results/Sample1.clns \
    results/Sample1.report.json
Show report file
====================== report: align ======================
====================== report: align ======================
Total sequencing reads: 17311565
Successfully aligned reads: 17054645 (98.52%)
Coverage (percent of successfully aligned):
  VDJRegion: 0%
  CDR1_TO_FR4: 0%
  FR2_TO_FR4: 0%
  CDR2_TO_FR4: 0%
  FR3_TO_FR4: 0%
  CDR3: 99.55%
Alignment failed: no hits (not TCR/IG?): 4875 (0.03%)
Alignment failed: absence of V hits: 2187 (0.01%)
Alignment failed: absence of J hits: 47072 (0.27%)
Alignment failed: no target with both V and J alignments: 202786 (1.17%)
Overlapped: 16916473 (97.72%)
Overlapped and aligned: 16873568 (97.47%)
Overlapped and not aligned: 42905 (0.25%)
Alignment-aided overlaps, percent of overlapped and aligned: 10242 (0.06%)
No CDR3 parts alignments, percent of successfully aligned: 289 (0%)
Partial aligned reads, percent of successfully aligned: 76217 (0.45%)
V gene chimeras: 3939 (0.02%)
J gene chimeras: 20603 (0.12%)
Paired-end alignment conflicts eliminated: 68917 (0.4%)
Realigned with forced non-floating bound: 405334 (2.34%)
Realigned with forced non-floating right bound in left read: 2187 (0.01%)
Realigned with forced non-floating left bound in right read: 2187 (0.01%)
IGH chains: 4221 (0.02%)
IGH non-functional: 174 (4.12%)
IGK chains: 2161 (0.01%)
IGK non-functional: 158 (7.31%)
IGL chains: 1153 (0.01%)
IGL non-functional: 105 (9.11%)
TRA chains: 4162591 (24.41%)
TRA non-functional: 1554308 (37.34%)
TRB chains: 6743212 (39.54%)
TRB non-functional: 1524771 (22.61%)
TRD chains: 2119559 (12.43%)
TRD non-functional: 484960 (22.88%)
TRG chains: 4021748 (23.58%)
TRG non-functional: 2460360 (61.18%)
Trimming report:
  R1 reads trimmed left: 2117 (0.01%)
  R1 reads trimmed right: 90 (0%)
  Average R1 nucleotides trimmed left: 6.66490868965342E-4
  Average R1 nucleotides trimmed right: 2.0581616971082625E-4
  R2 reads trimmed left: 42956 (0.25%)
  R2 reads trimmed right: 7835 (0.05%)
  Average R2 nucleotides trimmed left: 0.21029929991886925
  Average R2 nucleotides trimmed right: 0.010215887471756597
Tag parsing report:
  Total reads: 17311565
  Matched reads: 17311565 (100%)
  Projection +R1 +R2: 17311565 (100%)
  For variant 0:
    For projection +R1 +R2:
      R1:Left position: 0
      R1:Right position: 148
      R2:Left position: 18
      R2:Right position: 148
      UMI:Left position: 0
      Variants: 0
      Cost: 0
      R1 length: 148
      R2 length: 130
      UMI length: 18
================ report: refineTagsAndSort ================
================ report: refineTagsAndSort ================
Time spent in correction: 0ns
Number of input records: 17053987
Number of output records: 16766984 (98.32%)
UMI correction report:
  UMI input diversity: 1009590
  UMI output diversity: 599709 (59.4%)
  UMI input reads: 17053987
  UMI output reads: 17037603 (99.9%)
  UMI mean reads per tag: 16.89
  UMI input core diversity: 984654 (97.53%)
  UMI input core reads: 17029051 (99.85%)
  UMI directly corrected diversity: 393497 (38.98%)
  UMI directly corrected reads: 396304 (2.32%)
  UMI diversity filtered by tag quality: 16384 (1.62%)
  UMI reads filtered by tag quality: 16384 (0.1%)
  UMI diversity filtered by whitelist: 0 (0%)
  UMI recursively corrected: 520
Filter report:
  Number of groups: 599709
  Number of groups accepted: 385705 (64.32%)
  Total records weight: 17037603
  Records weight accepted: 16766984 (98.41%)
  Operator #0:
    Effective threshold: 7.0
    Nested thresholds:
      #0: 7
      #1: 31
===================== report: assemble =====================
===================== report: assemble =====================
Final clonotype count: 206197
Reads used in clonotypes, percent of total: 16585588 (95.81%)
Average number of reads per clonotype: 80.44
Reads dropped due to the lack of a clone sequence, percent of total: 99754 (0.58%)
Reads dropped due to a too short clonal sequence, percent of total: 778 (0%)
Reads dropped due to low quality, percent of total: 0 (0%)
Reads dropped due to failed mapping, percent of total: 5722 (0.03%)
Reads dropped with low quality clones, percent of total: 611 (0%)
Aligned reads processed: 16600167
Reads used in clonotypes before clustering, percent of total: 16593056 (95.85%)
Number of reads used as a core, percent of used: 16591701 (99.99%)
Mapped low quality reads, percent of used: 1355 (0.01%)
Reads clustered in PCR error correction, percent of used: 7468 (0.05%)
Reads pre-clustered due to the similar VJC-lists, percent of used: 0 (0%)
Clonotypes dropped as low quality: 19
Clonotypes eliminated by PCR error correction: 154
Clonotypes pre-clustered due to the similar VJC-lists: 0
Clones dropped in post filtering: 0 (0%)
Reads dropped in post filtering: 0.0 (0%)
Alignments filtered by tag prefix: 0 (0%)
IGH chains: 54 (0.03%)
IGH non-functional: 1 (1.85%)
IGK chains: 30 (0.01%)
IGK non-functional: 2 (6.67%)
IGL chains: 17 (0.01%)
IGL non-functional: 1 (5.88%)
TRA chains: 54054 (26.21%)
TRA non-functional: 21102 (39.04%)
TRB chains: 82565 (40.04%)
TRB non-functional: 18802 (22.77%)
TRD chains: 30667 (14.87%)
TRD non-functional: 6750 (22.01%)
TRG chains: 38810 (18.82%)
TRG non-functional: 24308 (62.63%)
Pre-clone assembler report:
  Number of input groups: 385705
  Number of input groups with no assembling feature: 1753
  Number of input alignments: 16766984
  Number of alignments with assembling feature: 16667230 (99.41%)
  Number of output pre-clones: 385694
  Number of pre-clonotypes per group:  
    0: + 145 (0.04%) = 145 (0.04%)
    1: + 381930 (99.47%) = 382075 (99.51%)
    2: + 1867 (0.49%) = 383942 (100%)
    3: + 10 (0%) = 383952 (100%)
  Number of assembling feature sequences in groups with zero pre-clonotypes: 45521
  Number of dropped pre-clones by tag suffix conflict: 0
  Number of dropped alignments by tag suffix conflict: 0
  Number of core alignments: 16571535 (98.83%)
  Discarded core alignments: 95695 (0.58%)
  Empirically assigned alignments: 28632 (0.17%)
  Empirical assignment conflicts: 22 (0%)
  Tag+VJ-gene empirically assigned alignments: 28654 (0.17%)
  VJ-gene empirically assigned alignments: 0 (0%)
  Tag empirically assigned alignments: 0 (0%)
  Number of ambiguous groups: 1877
  Number of ambiguous V-genes: 383
  Number of ambiguous J-genes: 1468
  Number of ambiguous tag+V/J-gene combinations: 1851
  Ignored non-productive alignments: 0 (0%)
  Unassigned alignments: 100635 (0.6%)
{
  "align": {
    "type": "alignerReport",
    "commandLine": "align -f --report results/Sample1.align.report.txt --json-report results/Sample1.align.report.json --preset cellecta-human-dna-xcr-umi-drivermap-air --save-output-file-names results/Sample1.align.list raw/Sample1_R1.fastq.gz raw/Sample1_R2.fastq.gz results/Sample1.vdjca",
    "inputFiles": [
      "raw/Sample1_R1.fastq.gz",
      "raw/Sample1_R2.fastq.gz"
    ],
    "outputFiles": [
      "results/Sample1.vdjca"
    ],
    "version": "4.5.0-47-develop; built=Mon Oct 09 14:49:52 UTC 2023; rev=57293fb76d; lib=repseqio.v3.0.1",
    "totalReadsProcessed": 17311565,
    "aligned": 17054645,
    "notAligned": 256920,
    "notAlignedReasons": {
      "NoHits": 4875,
      "NoCDR3Parts": 0,
      "NoVHits": 2187,
      "NoJHits": 47072,
      "VAndJOnDifferentTargets": 202786,
      "LowTotalScore": 0,
      "NoBarcode": 0,
      "SampleNotMatched": 0
    },
    "overlapped": 16916473,
    "overlappedAligned": 16873568,
    "overlappedNotAligned": 42905,
    "alignmentAidedOverlaps": 10242,
    "noCDR3PartsAlignments": 289,
    "partialAlignments": 76217,
    "chimeras": 0,
    "vChimeras": 3939,
    "jChimeras": 20603,
    "pairedEndAlignmentConflicts": 68917,
    "realignedWithForcedNonFloatingBound": 405334,
    "realignedWithForcedNonFloatingRightBoundInLeftRead": 2187,
    "realignedWithForcedNonFloatingLeftBoundInRightRead": 2187,
    "chainUsage": {
      "type": "chainUsage",
      "chimeras": 0,
      "total": 17054645,
      "chains": {
        "IGH": {
          "total": 4221,
          "nonFunctional": 174,
          "isOOF": 69,
          "hasStops": 105
        },
        "IGK": {
          "total": 2161,
          "nonFunctional": 158,
          "isOOF": 141,
          "hasStops": 17
        },
        "IGL": {
          "total": 1153,
          "nonFunctional": 105,
          "isOOF": 93,
          "hasStops": 12
        },
        "TRA": {
          "total": 4162591,
          "nonFunctional": 1554308,
          "isOOF": 1468609,
          "hasStops": 85699
        },
        "TRB": {
          "total": 6743212,
          "nonFunctional": 1524771,
          "isOOF": 1227879,
          "hasStops": 296892
        },
        "TRD": {
          "total": 2119559,
          "nonFunctional": 484960,
          "isOOF": 422712,
          "hasStops": 62248
        },
        "TRG": {
          "total": 4021748,
          "nonFunctional": 2460360,
          "isOOF": 2357310,
          "hasStops": 103050
        }
      }
    },
    "trimmingReport": {
      "total": 17311565,
      "r1LeftTrimmedEvents": 2117,
      "r1RightTrimmedEvents": 90,
      "r2LeftTrimmedEvents": 42956,
      "r2RightTrimmedEvents": 7835,
      "r1LeftTrimmedNucleotides": 11538,
      "r1RightTrimmedNucleotides": 3563,
      "r2LeftTrimmedNucleotides": 3640610,
      "r2RightTrimmedNucleotides": 176853
    },
    "tagParsingReport": {
      "total": 17311565,
      "matched": 17311565,
      "totalBitCost": 0,
      "projections": {
        "1,2": 17311565
      },
      "detailedReport": [
        {
          "nested": {
            "1,2": {
              "variantId": 0,
              "nested": [
                [
                  {
                    "positionDistributions": {
                      "R1:Left": {
                        "0": 17311565
                      },
                      "R1:Right": {
                        "148": 17311565
                      }
                    },
                    "lengthDistributions": {
                      "R1": {
                        "148": 17311565
                      }
                    },
                    "costDistributions": {
                      "0": 17311565
                    },
                    "variantDistribution": {
                      "0": 17311565
                    }
                  }
                ],
                [
                  {
                    "positionDistributions": {
                      "R2:Left": {
                        "18": 17311565
                      },
                      "R2:Right": {
                        "148": 17311565
                      },
                      "UMI:Left": {
                        "0": 17311565
                      },
                      "UMI:Right": {
                        "18": 17311565
                      }
                    },
                    "lengthDistributions": {
                      "R2": {
                        "130": 17311565
                      },
                      "UMI": {
                        "18": 17311565
                      }
                    },
                    "costDistributions": {
                      "0": 17311565
                    },
                    "variantDistribution": {
                      "0": 17311565
                    }
                  }
                ]
              ],
              "costDistributions": {
                "0": 17311565
              }
            }
          }
        }
      ]
    },
    "notMatchedByHeader": 0,
    "transformerReports": [],
    "coverage": {
      "VDJRegion": 0,
      "CDR1_TO_FR4": 0,
      "FR2_TO_FR4": 0,
      "CDR2_TO_FR4": 0,
      "FR3_TO_FR4": 0,
      "CDR3": 16978139
    }
  },
  "assemble": {
    "type": "assemblerReport",
    "commandLine": "assemble -f --report results/Sample1.assemble.report.txt --json-report results/Sample1.assemble.report.json results/Sample1.refined.vdjca results/Sample1.clns",
    "inputFiles": [
      "results/Sample1.refined.vdjca"
    ],
    "outputFiles": [
      "results/Sample1.clns"
    ],
    "version": "4.5.0-47-develop; built=Mon Oct 09 14:49:52 UTC 2023; rev=57293fb76d; lib=repseqio.v3.0.1",
    "preCloneAssemblerReport": {
      "type": "preCloneAssemblerReport",
      "inputGroups": 385705,
      "groupsWithNoAssemblingFeature": 1753,
      "inputAlignments": 16766984,
      "inputAlignmentsWithAssemblingFeature": 16667230,
      "corePreClonesDroppedByTagSuffix": 0,
      "coreAlignmentsDroppedByTagSuffix": 0,
      "preClones": 385694,
      "preClonesPerGroup": {
        "0": 145,
        "1": 381930,
        "2": 1867,
        "3": 10
      },
      "assemblingFeatureSequencesInZeroPreClones": 45521,
      "coreAlignments": 16571535,
      "tagConflicts": 1877,
      "geneAndTagConflicts": 1851,
      "geneConflicts": {
        "Variable": 383,
        "Joining": 1468
      },
      "discardedCoreAlignments": 95695,
      "empiricallyAssignedAlignments": 28632,
      "vjEmpiricallyAssignedAlignments": 0,
      "tagEmpiricallyAssignedAlignments": 0,
      "geneAndTagEmpiricallyAssignedAlignments": 28654,
      "empiricalAssignmentConflicts": 22,
      "ignoredAlignments": 0,
      "unassignedAlignments": 100635
    },
    "totalReadsProcessed": 17311565,
    "totalAlignmentsProcessed": 385694,
    "totalAlignedReadsProcessed": 16600167,
    "initialClonesCreated": 206370,
    "readsDroppedNoTargetSequence": 99754,
    "readsDroppedTooShortClonalSequence": 778,
    "readsDroppedLowQuality": 0,
    "coreReads": 16591701,
    "readsDroppedFailedMapping": 5722,
    "lowQualityRescued": 1355,
    "clonesClustered": 154,
    "readsClustered": 7468,
    "clones": 206197,
    "clonesDroppedAsLowQuality": 19,
    "clonesPreClustered": 0,
    "readsPreClustered": 0,
    "readsInClones": 16585588,
    "readsInClonesBeforeClustering": 16593056,
    "readsDroppedWithLowQualityClones": 611,
    "clonalChainUsage": {
      "type": "chainUsage",
      "chimeras": 0,
      "total": 206197,
      "chains": {
        "IGH": {
          "total": 54,
          "nonFunctional": 1,
          "isOOF": 1,
          "hasStops": 0
        },
        "IGK": {
          "total": 30,
          "nonFunctional": 2,
          "isOOF": 2,
          "hasStops": 0
        },
        "IGL": {
          "total": 17,
          "nonFunctional": 1,
          "isOOF": 1,
          "hasStops": 0
        },
        "TRA": {
          "total": 54054,
          "nonFunctional": 21102,
          "isOOF": 20437,
          "hasStops": 665
        },
        "TRB": {
          "total": 82565,
          "nonFunctional": 18802,
          "isOOF": 17372,
          "hasStops": 1430
        },
        "TRD": {
          "total": 30667,
          "nonFunctional": 6750,
          "isOOF": 6161,
          "hasStops": 589
        },
        "TRG": {
          "total": 38810,
          "nonFunctional": 24308,
          "isOOF": 23285,
          "hasStops": 1023
        }
      }
    },
    "clonesFilteredInPostFiltering": 0,
    "readsFilteredInPostFiltering": 0,
    "postFilteringReports": null,
    "alignmentsFilteredByTagPrefix": 0
  },
  "refineTagsAndSort": {
    "type": "refineTagsAndSort",
    "commandLine": "refineTagsAndSort -f --report results/Sample1.refine.report.txt --json-report results/Sample1.refine.report.json results/Sample1.vdjca results/Sample1.refined.vdjca",
    "inputFiles": [
      "results/Sample1.vdjca"
    ],
    "outputFiles": [
      "results/Sample1.refined.vdjca"
    ],
    "version": "4.5.0-47-develop; built=Mon Oct 09 14:49:52 UTC 2023; rev=57293fb76d; lib=repseqio.v3.0.1",
    "correctionReport": {
      "inputRecords": 17053987,
      "outputRecords": 16766984,
      "steps": [
        {
          "tagName": "UMI",
          "inputGroups": 1,
          "inputDiversity": 1009590,
          "inputCount": 17053987,
          "coreDiversity": 984654,
          "coreCount": 17029051,
          "directlyCorrectedDiversity": 393497,
          "directlyCorrectedCount": 396304,
          "filteredDiversity": 16384,
          "filteredCount": 16384,
          "recursivelyCorrected": 520,
          "diversityFilteredByWhitelist": 0,
          "outputDiversity": 599709,
          "outputCount": 17037603
        }
      ],
      "filterReport": {
        "type": "filter_groups_report",
        "groupingKeys": [
          "allTags:Molecule"
        ],
        "numberOfGroups": 599709,
        "numberOfGroupsAccepted": 385705,
        "totalWeight": 17037603,
        "totalWeightAccepted": 16766984,
        "operatorReports": [
          {
            "operatorReport": {
              "type": "multi_operator_hist_report",
              "threshold": 7,
              "nestedThresholds": [
                7,
                31
              ]
            },
            "metricHists": [
              {
                "bins": [
                  {
                    "from": 1,
                    "to": 1.5848931924611136,
                    "weight": 183006
                  },
                  {
                    "from": 1.5848931924611136,
                    "to": 2.51188643150958,
                    "weight": 19144
                  },
                  {
                    "from": 2.51188643150958,
                    "to": 3.981071705534973,
                    "weight": 4625
                  },
                  {
                    "from": 3.981071705534973,
                    "to": 6.309573444801933,
                    "weight": 7229
                  },
                  {
                    "from": 6.309573444801933,
                    "to": 10,
                    "weight": 5897
                  },
                  {
                    "from": 10,
                    "to": 15.848931924611142,
                    "weight": 14373
                  },
                  {
                    "from": 15.848931924611142,
                    "to": 25.11886431509581,
                    "weight": 41976
                  },
                  {
                    "from": 25.11886431509581,
                    "to": 39.810717055349734,
                    "weight": 130591
                  },
                  {
                    "from": 39.810717055349734,
                    "to": 63.09573444801933,
                    "weight": 131624
                  },
                  {
                    "from": 63.09573444801933,
                    "to": 100,
                    "weight": 55624
                  },
                  {
                    "from": 100,
                    "to": 158.48931924611142,
                    "weight": 5557
                  },
                  {
                    "from": 158.48931924611142,
                    "to": 251.18864315095823,
                    "weight": 59
                  },
                  {
                    "from": 251.18864315095823,
                    "to": 398.1071705534973,
                    "weight": 2
                  },
                  {
                    "from": 398.1071705534973,
                    "to": 630.9573444801937,
                    "weight": 0
                  },
                  {
                    "from": 630.9573444801937,
                    "to": 1000,
                    "weight": 1
                  },
                  {
                    "from": 1000,
                    "to": 1584.893192461114,
                    "weight": 0
                  },
                  {
                    "from": 1584.893192461114,
                    "to": 2511.886431509582,
                    "weight": 0
                  },
                  {
                    "from": 2511.886431509582,
                    "to": 3981.0717055349733,
                    "weight": 0
                  },
                  {
                    "from": 3981.0717055349733,
                    "to": 6309.573444801937,
                    "weight": 0
                  },
                  {
                    "from": 6309.573444801937,
                    "to": 10000,
                    "weight": 0
                  },
                  {
                    "from": 10000,
                    "to": 15848.93192461114,
                    "weight": 0
                  },
                  {
                    "from": 15848.93192461114,
                    "to": 25118.864315095823,
                    "weight": 0
                  },
                  {
                    "from": 25118.864315095823,
                    "to": 39810.71705534977,
                    "weight": 0
                  },
                  {
                    "from": 39810.71705534977,
                    "to": 63095.73444801943,
                    "weight": 1
                  }
                ],
                "collectionSpec": {
                  "log": true,
                  "binNumber": 0,
                  "minBinWidth": 0.2,
                  "multiplyWeightByKey": false
                }
              }
            ]
          }
        ]
      }
    }
  }
}