Optimizing genomic alignment workloads
In some projects, there are a lot of files. Each of these files contain DNA sequences, and sometimes each of these files contain a different number of sequences. The workflow is to start several jobs that will align sequences for a subset of the files.
As an example, see the table below that contains files for the public human sample SRA000271.
Table 1: Number of sequences per file for sample SRA000271.
First, we need to use a more general terminology. Instead of talking about files, we will say objects. The weight of an object here is its number of sequences. As an example, the object SRR029335_2.fastq.bz2 has a weight of 7473894. And finally, instead of talking about jobs, we'll use the term 'bins'.
So let's say we have 16 bins and that we have the objects above. The goal is to balance the objects in 16 bins such as each bin has mostly the same weight.
With 508 objects and 16 bins, one way is to use 32 objects per bin (and 28 for the last, 15*32 + 1 * 28).
In that case, going for the 32-objects-per-bin will just not be very good because each bin will a very different weight.
Since the number of bins is known and the weight of each object is known, the expected weight per bin can be calculated by summing the weights of all objects and by dividing by the number of bins. In our case, we have 508 objects, and their total weight is 4045335994.
With 16 bins, we expect a average weight of 252833499 for any bin.
Given a distribution of objects, a score can be calculated by summing, for each bin, the absolute difference beween expected and actual weights. By minimizing this score, a really good distribution can be obtained very fast.
For the first state, all the objects are put in the first bin.
Given a state, the next one is generated by picking up 2 random objects. Each of these 2 objects is removed from its bin, and is deposited in a new randomly-selected bin. A new score is computed, and if it is better, the change is accepted. This is repeated as long as there is an improvement.
After 46620000 iterations (3 minutes), this solution was obtained:
Table 2: Weight of each bin after the balancing.
For this solution, the objects were distributed as follows.
Table 3: Distribution of objects into bin.
The solver is available here: https://github.com/sebhtml/NGS-Pipelines/blob/master/Balance-Objects.py
As an example, see the table below that contains files for the public human sample SRA000271.
Table 1: Number of sequences per file for sample SRA000271.
File | Sequences |
SRR002271_1.fastq.bz2 | 22243273 |
SRR002271_2.fastq.bz2 | 22243273 |
SRR002272_1.fastq.bz2 | 35756808 |
SRR002272_2.fastq.bz2 | 35756808 |
SRR002273_1.fastq.bz2 | 4276214 |
SRR002273_2.fastq.bz2 | 4276214 |
SRR002274_1.fastq.bz2 | 18095255 |
SRR002274_2.fastq.bz2 | 18095255 |
SRR002275_1.fastq.bz2 | 33729638 |
SRR002275_2.fastq.bz2 | 33729638 |
SRR002276_1.fastq.bz2 | 47074312 |
SRR002276_2.fastq.bz2 | 47074312 |
SRR002277_1.fastq.bz2 | 6757955 |
SRR002277_2.fastq.bz2 | 6757955 |
SRR002278_1.fastq.bz2 | 6093595 |
SRR002278_2.fastq.bz2 | 6093595 |
SRR002279_1.fastq.bz2 | 7177292 |
SRR002279_2.fastq.bz2 | 7177292 |
SRR002280_1.fastq.bz2 | 6580048 |
SRR002280_2.fastq.bz2 | 6580048 |
SRR002281_1.fastq.bz2 | 16693687 |
SRR002281_2.fastq.bz2 | 16693687 |
SRR002282_1.fastq.bz2 | 13383178 |
SRR002282_2.fastq.bz2 | 13383178 |
SRR002283_1.fastq.bz2 | 18374441 |
SRR002283_2.fastq.bz2 | 18374441 |
SRR002284_1.fastq.bz2 | 5600070 |
SRR002284_2.fastq.bz2 | 5600070 |
SRR002285_1.fastq.bz2 | 6286076 |
SRR002285_2.fastq.bz2 | 6286076 |
SRR002286_1.fastq.bz2 | 5709652 |
SRR002286_2.fastq.bz2 | 5709652 |
SRR002287_1.fastq.bz2 | 6309083 |
SRR002287_2.fastq.bz2 | 6309083 |
SRR002288_1.fastq.bz2 | 6006869 |
SRR002288_2.fastq.bz2 | 6006869 |
SRR002289_1.fastq.bz2 | 4776565 |
SRR002289_2.fastq.bz2 | 4776565 |
SRR002290_1.fastq.bz2 | 13044591 |
SRR002290_2.fastq.bz2 | 13044591 |
SRR002291_1.fastq.bz2 | 34898264 |
SRR002291_2.fastq.bz2 | 34898264 |
SRR002292_1.fastq.bz2 | 22854308 |
SRR002292_2.fastq.bz2 | 22854308 |
SRR002293_1.fastq.bz2 | 18042487 |
SRR002293_2.fastq.bz2 | 18042487 |
SRR002294_1.fastq.bz2 | 8109548 |
SRR002294_2.fastq.bz2 | 8109548 |
SRR002295_1.fastq.bz2 | 30697316 |
SRR002295_2.fastq.bz2 | 30697316 |
SRR002296_1.fastq.bz2 | 6101023 |
SRR002296_2.fastq.bz2 | 6101023 |
SRR002297_1.fastq.bz2 | 46467908 |
SRR002297_2.fastq.bz2 | 46467908 |
SRR002298_1.fastq.bz2 | 10848356 |
SRR002298_2.fastq.bz2 | 10848356 |
SRR002299_1.fastq.bz2 | 6078655 |
SRR002299_2.fastq.bz2 | 6078655 |
SRR002300_1.fastq.bz2 | 6483460 |
SRR002300_2.fastq.bz2 | 6483460 |
SRR002301_1.fastq.bz2 | 7260908 |
SRR002301_2.fastq.bz2 | 7260908 |
SRR002302_1.fastq.bz2 | 6283624 |
SRR002302_2.fastq.bz2 | 6283624 |
SRR002303_1.fastq.bz2 | 6092616 |
SRR002303_2.fastq.bz2 | 6092616 |
SRR002304_1.fastq.bz2 | 8669549 |
SRR002304_2.fastq.bz2 | 8669549 |
SRR002305_1.fastq.bz2 | 7079980 |
SRR002305_2.fastq.bz2 | 7079980 |
SRR002306_1.fastq.bz2 | 6061012 |
SRR002306_2.fastq.bz2 | 6061012 |
SRR002307_1.fastq.bz2 | 7990437 |
SRR002307_2.fastq.bz2 | 7990437 |
SRR002308_1.fastq.bz2 | 6403325 |
SRR002308_2.fastq.bz2 | 6403325 |
SRR002309_1.fastq.bz2 | 5929366 |
SRR002309_2.fastq.bz2 | 5929366 |
SRR002310_1.fastq.bz2 | 9745625 |
SRR002310_2.fastq.bz2 | 9745625 |
SRR002311_1.fastq.bz2 | 6113076 |
SRR002311_2.fastq.bz2 | 6113076 |
SRR002312_1.fastq.bz2 | 9808666 |
SRR002312_2.fastq.bz2 | 9808666 |
SRR002313_1.fastq.bz2 | 7841529 |
SRR002313_2.fastq.bz2 | 7841529 |
SRR002314_1.fastq.bz2 | 4334295 |
SRR002314_2.fastq.bz2 | 4334295 |
SRR002315_1.fastq.bz2 | 4857885 |
SRR002315_2.fastq.bz2 | 4857885 |
SRR002316_1.fastq.bz2 | 14025787 |
SRR002316_2.fastq.bz2 | 14025787 |
SRR002317_1.fastq.bz2 | 9732073 |
SRR002317_2.fastq.bz2 | 9732073 |
SRR002318_1.fastq.bz2 | 1670064 |
SRR002318_2.fastq.bz2 | 1670064 |
SRR002319_1.fastq.bz2 | 9264776 |
SRR002319_2.fastq.bz2 | 9264776 |
SRR003810_1.fastq.bz2 | 4605877 |
SRR003810_2.fastq.bz2 | 4605877 |
SRR003812_1.fastq.bz2 | 4891691 |
SRR003812_2.fastq.bz2 | 4891691 |
SRR003813_1.fastq.bz2 | 4908529 |
SRR003813_2.fastq.bz2 | 4908529 |
SRR003814_1.fastq.bz2 | 3958657 |
SRR003814_2.fastq.bz2 | 3958657 |
SRR003815_1.fastq.bz2 | 4780683 |
SRR003815_2.fastq.bz2 | 4780683 |
SRR003816_1.fastq.bz2 | 4673240 |
SRR003816_2.fastq.bz2 | 4673240 |
SRR003817_1.fastq.bz2 | 1226466 |
SRR003817_2.fastq.bz2 | 1226466 |
SRR003818_1.fastq.bz2 | 6450471 |
SRR003818_2.fastq.bz2 | 6450471 |
SRR003819_1.fastq.bz2 | 6593471 |
SRR003819_2.fastq.bz2 | 6593471 |
SRR003820_1.fastq.bz2 | 6553586 |
SRR003820_2.fastq.bz2 | 6553586 |
SRR003821_1.fastq.bz2 | 6328227 |
SRR003821_2.fastq.bz2 | 6328227 |
SRR003823_1.fastq.bz2 | 6247413 |
SRR003823_2.fastq.bz2 | 6247413 |
SRR003824_1.fastq.bz2 | 6025237 |
SRR003824_2.fastq.bz2 | 6025237 |
SRR003825_1.fastq.bz2 | 1561734 |
SRR003825_2.fastq.bz2 | 1561734 |
SRR003837_1.fastq.bz2 | 7512805 |
SRR003837_2.fastq.bz2 | 7512805 |
SRR003838_1.fastq.bz2 | 7152063 |
SRR003838_2.fastq.bz2 | 7152063 |
SRR003839_1.fastq.bz2 | 7162665 |
SRR003839_2.fastq.bz2 | 7162665 |
SRR003840_1.fastq.bz2 | 7316431 |
SRR003840_2.fastq.bz2 | 7316431 |
SRR003841_1.fastq.bz2 | 1978964 |
SRR003841_2.fastq.bz2 | 1978964 |
SRR003845_1.fastq.bz2 | 7425426 |
SRR003845_2.fastq.bz2 | 7425426 |
SRR003846_1.fastq.bz2 | 6496063 |
SRR003846_2.fastq.bz2 | 6496063 |
SRR003847_1.fastq.bz2 | 7303975 |
SRR003847_2.fastq.bz2 | 7303975 |
SRR003848_1.fastq.bz2 | 7322831 |
SRR003848_2.fastq.bz2 | 7322831 |
SRR003849_1.fastq.bz2 | 7353518 |
SRR003849_2.fastq.bz2 | 7353518 |
SRR003850_1.fastq.bz2 | 1956095 |
SRR003850_2.fastq.bz2 | 1956095 |
SRR003851_1.fastq.bz2 | 6349952 |
SRR003851_2.fastq.bz2 | 6349952 |
SRR003852_1.fastq.bz2 | 6750815 |
SRR003852_2.fastq.bz2 | 6750815 |
SRR003853_1.fastq.bz2 | 6676849 |
SRR003853_2.fastq.bz2 | 6676849 |
SRR003854_1.fastq.bz2 | 6582580 |
SRR003854_2.fastq.bz2 | 6582580 |
SRR003855_1.fastq.bz2 | 6764442 |
SRR003855_2.fastq.bz2 | 6764442 |
SRR003856_1.fastq.bz2 | 6721795 |
SRR003856_2.fastq.bz2 | 6721795 |
SRR003857_1.fastq.bz2 | 1637242 |
SRR003857_2.fastq.bz2 | 1637242 |
SRR003859_1.fastq.bz2 | 3741843 |
SRR003859_2.fastq.bz2 | 3741843 |
SRR003860_1.fastq.bz2 | 3946844 |
SRR003860_2.fastq.bz2 | 3946844 |
SRR003861_1.fastq.bz2 | 4062736 |
SRR003861_2.fastq.bz2 | 4062736 |
SRR003863_1.fastq.bz2 | 6555145 |
SRR003863_2.fastq.bz2 | 6555145 |
SRR003864_1.fastq.bz2 | 6419174 |
SRR003864_2.fastq.bz2 | 6419174 |
SRR003866_1.fastq.bz2 | 6498466 |
SRR003866_2.fastq.bz2 | 6498466 |
SRR003867_1.fastq.bz2 | 6459933 |
SRR003867_2.fastq.bz2 | 6459933 |
SRR003868_1.fastq.bz2 | 7433254 |
SRR003868_2.fastq.bz2 | 7433254 |
SRR003869_1.fastq.bz2 | 7308992 |
SRR003869_2.fastq.bz2 | 7308992 |
SRR003870_1.fastq.bz2 | 7557219 |
SRR003870_2.fastq.bz2 | 7557219 |
SRR003871_1.fastq.bz2 | 7443963 |
SRR003871_2.fastq.bz2 | 7443963 |
SRR003872_1.fastq.bz2 | 7499185 |
SRR003872_2.fastq.bz2 | 7499185 |
SRR003873_1.fastq.bz2 | 6323645 |
SRR003873_2.fastq.bz2 | 6323645 |
SRR003874_1.fastq.bz2 | 6145944 |
SRR003874_2.fastq.bz2 | 6145944 |
SRR003875_1.fastq.bz2 | 6348097 |
SRR003875_2.fastq.bz2 | 6348097 |
SRR003876_1.fastq.bz2 | 6296738 |
SRR003876_2.fastq.bz2 | 6296738 |
SRR003877_1.fastq.bz2 | 6450647 |
SRR003877_2.fastq.bz2 | 6450647 |
SRR003878_1.fastq.bz2 | 6543352 |
SRR003878_2.fastq.bz2 | 6543352 |
SRR003879_1.fastq.bz2 | 6418736 |
SRR003879_2.fastq.bz2 | 6418736 |
SRR003960_1.fastq.bz2 | 9722922 |
SRR003960_2.fastq.bz2 | 9722922 |
SRR003961_1.fastq.bz2 | 8995012 |
SRR003961_2.fastq.bz2 | 8995012 |
SRR003962_1.fastq.bz2 | 9249927 |
SRR003962_2.fastq.bz2 | 9249927 |
SRR003963_1.fastq.bz2 | 9185577 |
SRR003963_2.fastq.bz2 | 9185577 |
SRR003964_1.fastq.bz2 | 9484360 |
SRR003964_2.fastq.bz2 | 9484360 |
SRR003965_1.fastq.bz2 | 8959911 |
SRR003965_2.fastq.bz2 | 8959911 |
SRR003966_1.fastq.bz2 | 5851368 |
SRR003966_2.fastq.bz2 | 5851368 |
SRR003967_1.fastq.bz2 | 5236932 |
SRR003967_2.fastq.bz2 | 5236932 |
SRR003968_1.fastq.bz2 | 6170713 |
SRR003968_2.fastq.bz2 | 6170713 |
SRR003969_1.fastq.bz2 | 6276516 |
SRR003969_2.fastq.bz2 | 6276516 |
SRR003970_1.fastq.bz2 | 5765690 |
SRR003970_2.fastq.bz2 | 5765690 |
SRR003971_1.fastq.bz2 | 1615149 |
SRR003971_2.fastq.bz2 | 1615149 |
SRR004105_1.fastq.bz2 | 6815628 |
SRR004105_2.fastq.bz2 | 6815628 |
SRR004106_1.fastq.bz2 | 6857870 |
SRR004106_2.fastq.bz2 | 6857870 |
SRR004107_1.fastq.bz2 | 6961705 |
SRR004107_2.fastq.bz2 | 6961705 |
SRR004108_1.fastq.bz2 | 7019609 |
SRR004108_2.fastq.bz2 | 7019609 |
SRR004109_1.fastq.bz2 | 7017622 |
SRR004109_2.fastq.bz2 | 7017622 |
SRR004110_1.fastq.bz2 | 1510651 |
SRR004110_2.fastq.bz2 | 1510651 |
SRR004111_1.fastq.bz2 | 6500202 |
SRR004111_2.fastq.bz2 | 6500202 |
SRR004112_1.fastq.bz2 | 6382108 |
SRR004112_2.fastq.bz2 | 6382108 |
SRR004113_1.fastq.bz2 | 6769812 |
SRR004113_2.fastq.bz2 | 6769812 |
SRR004114_1.fastq.bz2 | 6749632 |
SRR004114_2.fastq.bz2 | 6749632 |
SRR004116_1.fastq.bz2 | 6660494 |
SRR004116_2.fastq.bz2 | 6660494 |
SRR004117_1.fastq.bz2 | 6894164 |
SRR004117_2.fastq.bz2 | 6894164 |
SRR004118_1.fastq.bz2 | 7012762 |
SRR004118_2.fastq.bz2 | 7012762 |
SRR004119_1.fastq.bz2 | 7106679 |
SRR004119_2.fastq.bz2 | 7106679 |
SRR004120_1.fastq.bz2 | 1890977 |
SRR004120_2.fastq.bz2 | 1890977 |
SRR004121_1.fastq.bz2 | 6688708 |
SRR004121_2.fastq.bz2 | 6688708 |
SRR004122_1.fastq.bz2 | 6442268 |
SRR004122_2.fastq.bz2 | 6442268 |
SRR004123_1.fastq.bz2 | 7219728 |
SRR004123_2.fastq.bz2 | 7219728 |
SRR004124_1.fastq.bz2 | 6650577 |
SRR004124_2.fastq.bz2 | 6650577 |
SRR004125_1.fastq.bz2 | 6980725 |
SRR004125_2.fastq.bz2 | 6980725 |
SRR004126_1.fastq.bz2 | 7121389 |
SRR004126_2.fastq.bz2 | 7121389 |
SRR004127_1.fastq.bz2 | 1920925 |
SRR004127_2.fastq.bz2 | 1920925 |
SRR004186_1.fastq.bz2 | 6039526 |
SRR004186_2.fastq.bz2 | 6039526 |
SRR004187_1.fastq.bz2 | 6189345 |
SRR004187_2.fastq.bz2 | 6189345 |
SRR004188_1.fastq.bz2 | 6202959 |
SRR004188_2.fastq.bz2 | 6202959 |
SRR004190_1.fastq.bz2 | 5896818 |
SRR004190_2.fastq.bz2 | 5896818 |
SRR004191_1.fastq.bz2 | 5619426 |
SRR004191_2.fastq.bz2 | 5619426 |
SRR004192_1.fastq.bz2 | 1399560 |
SRR004192_2.fastq.bz2 | 1399560 |
SRR004193_1.fastq.bz2 | 6978310 |
SRR004193_2.fastq.bz2 | 6978310 |
SRR004194_1.fastq.bz2 | 6918542 |
SRR004194_2.fastq.bz2 | 6918542 |
SRR004195_1.fastq.bz2 | 5848753 |
SRR004195_2.fastq.bz2 | 5848753 |
SRR004197_1.fastq.bz2 | 7159088 |
SRR004197_2.fastq.bz2 | 7159088 |
SRR004198_1.fastq.bz2 | 1930314 |
SRR004198_2.fastq.bz2 | 1930314 |
SRR004199_1.fastq.bz2 | 6536064 |
SRR004199_2.fastq.bz2 | 6536064 |
SRR004200_1.fastq.bz2 | 6839637 |
SRR004200_2.fastq.bz2 | 6839637 |
SRR004201_1.fastq.bz2 | 6943961 |
SRR004201_2.fastq.bz2 | 6943961 |
SRR004202_1.fastq.bz2 | 6886712 |
SRR004202_2.fastq.bz2 | 6886712 |
SRR004203_1.fastq.bz2 | 6920882 |
SRR004203_2.fastq.bz2 | 6920882 |
SRR004204_1.fastq.bz2 | 1862546 |
SRR004204_2.fastq.bz2 | 1862546 |
SRR004205_1.fastq.bz2 | 4905164 |
SRR004205_2.fastq.bz2 | 4905164 |
SRR004206_1.fastq.bz2 | 4757265 |
SRR004206_2.fastq.bz2 | 4757265 |
SRR004207_1.fastq.bz2 | 5253223 |
SRR004207_2.fastq.bz2 | 5253223 |
SRR004208_1.fastq.bz2 | 5061637 |
SRR004208_2.fastq.bz2 | 5061637 |
SRR004209_1.fastq.bz2 | 5250983 |
SRR004209_2.fastq.bz2 | 5250983 |
SRR004210_1.fastq.bz2 | 5348659 |
SRR004210_2.fastq.bz2 | 5348659 |
SRR004211_1.fastq.bz2 | 1331516 |
SRR004211_2.fastq.bz2 | 1331516 |
SRR004809_1.fastq.bz2 | 6281732 |
SRR004809_2.fastq.bz2 | 6281732 |
SRR004810_1.fastq.bz2 | 6479897 |
SRR004810_2.fastq.bz2 | 6479897 |
SRR004811_1.fastq.bz2 | 6428286 |
SRR004811_2.fastq.bz2 | 6428286 |
SRR004812_1.fastq.bz2 | 6386024 |
SRR004812_2.fastq.bz2 | 6386024 |
SRR004813_1.fastq.bz2 | 6397082 |
SRR004813_2.fastq.bz2 | 6397082 |
SRR004814_1.fastq.bz2 | 6108069 |
SRR004814_2.fastq.bz2 | 6108069 |
SRR004815_1.fastq.bz2 | 6525195 |
SRR004815_2.fastq.bz2 | 6525195 |
SRR004816_1.fastq.bz2 | 6511455 |
SRR004816_2.fastq.bz2 | 6511455 |
SRR004817_1.fastq.bz2 | 6462803 |
SRR004817_2.fastq.bz2 | 6462803 |
SRR004818_1.fastq.bz2 | 6433169 |
SRR004818_2.fastq.bz2 | 6433169 |
SRR004819_1.fastq.bz2 | 6338234 |
SRR004819_2.fastq.bz2 | 6338234 |
SRR004820_1.fastq.bz2 | 7308557 |
SRR004820_2.fastq.bz2 | 7308557 |
SRR004821_1.fastq.bz2 | 7156960 |
SRR004821_2.fastq.bz2 | 7156960 |
SRR004822_1.fastq.bz2 | 6213660 |
SRR004822_2.fastq.bz2 | 6213660 |
SRR004823_1.fastq.bz2 | 5949864 |
SRR004823_2.fastq.bz2 | 5949864 |
SRR004824_1.fastq.bz2 | 6087720 |
SRR004824_2.fastq.bz2 | 6087720 |
SRR004825_1.fastq.bz2 | 6078285 |
SRR004825_2.fastq.bz2 | 6078285 |
SRR004826_1.fastq.bz2 | 5882054 |
SRR004826_2.fastq.bz2 | 5882054 |
SRR004827_1.fastq.bz2 | 5541480 |
SRR004827_2.fastq.bz2 | 5541480 |
SRR004828_1.fastq.bz2 | 7078458 |
SRR004828_2.fastq.bz2 | 7078458 |
SRR004829_1.fastq.bz2 | 7524672 |
SRR004829_2.fastq.bz2 | 7524672 |
SRR004830_1.fastq.bz2 | 7820645 |
SRR004830_2.fastq.bz2 | 7820645 |
SRR004831_1.fastq.bz2 | 8105229 |
SRR004831_2.fastq.bz2 | 8105229 |
SRR004832_1.fastq.bz2 | 7956781 |
SRR004832_2.fastq.bz2 | 7956781 |
SRR004833_1.fastq.bz2 | 7547869 |
SRR004833_2.fastq.bz2 | 7547869 |
SRR004834_1.fastq.bz2 | 6447823 |
SRR004834_2.fastq.bz2 | 6447823 |
SRR004835_1.fastq.bz2 | 8616259 |
SRR004835_2.fastq.bz2 | 8616259 |
SRR004836_1.fastq.bz2 | 8975677 |
SRR004836_2.fastq.bz2 | 8975677 |
SRR004837_1.fastq.bz2 | 8642976 |
SRR004837_2.fastq.bz2 | 8642976 |
SRR004838_1.fastq.bz2 | 7023195 |
SRR004838_2.fastq.bz2 | 7023195 |
SRR004839_1.fastq.bz2 | 8062696 |
SRR004839_2.fastq.bz2 | 8062696 |
SRR004840_1.fastq.bz2 | 8060450 |
SRR004840_2.fastq.bz2 | 8060450 |
SRR004841_1.fastq.bz2 | 6678858 |
SRR004841_2.fastq.bz2 | 6678858 |
SRR004842_1.fastq.bz2 | 7091815 |
SRR004842_2.fastq.bz2 | 7091815 |
SRR004843_1.fastq.bz2 | 6890492 |
SRR004843_2.fastq.bz2 | 6890492 |
SRR004844_1.fastq.bz2 | 8549164 |
SRR004844_2.fastq.bz2 | 8549164 |
SRR004845_1.fastq.bz2 | 8837159 |
SRR004845_2.fastq.bz2 | 8837159 |
SRR004846_1.fastq.bz2 | 7759225 |
SRR004846_2.fastq.bz2 | 7759225 |
SRR004847_1.fastq.bz2 | 5661124 |
SRR004847_2.fastq.bz2 | 5661124 |
SRR004848_1.fastq.bz2 | 5408098 |
SRR004848_2.fastq.bz2 | 5408098 |
SRR004849_1.fastq.bz2 | 5950977 |
SRR004849_2.fastq.bz2 | 5950977 |
SRR004850_1.fastq.bz2 | 9849395 |
SRR004850_2.fastq.bz2 | 9849395 |
SRR004851_1.fastq.bz2 | 9534571 |
SRR004851_2.fastq.bz2 | 9534571 |
SRR004852_1.fastq.bz2 | 9340649 |
SRR004852_2.fastq.bz2 | 9340649 |
SRR004853_1.fastq.bz2 | 6474762 |
SRR004853_2.fastq.bz2 | 6474762 |
SRR004854_1.fastq.bz2 | 6360372 |
SRR004854_2.fastq.bz2 | 6360372 |
SRR004855_1.fastq.bz2 | 6512147 |
SRR004855_2.fastq.bz2 | 6512147 |
SRR004856_1.fastq.bz2 | 10212539 |
SRR004856_2.fastq.bz2 | 10212539 |
SRR004857_1.fastq.bz2 | 10214756 |
SRR004857_2.fastq.bz2 | 10214756 |
SRR004858_1.fastq.bz2 | 9948010 |
SRR004858_2.fastq.bz2 | 9948010 |
SRR004859_1.fastq.bz2 | 10152277 |
SRR004859_2.fastq.bz2 | 10152277 |
SRR004860_1.fastq.bz2 | 9972388 |
SRR004860_2.fastq.bz2 | 9972388 |
SRR004861_1.fastq.bz2 | 8818998 |
SRR004861_2.fastq.bz2 | 8818998 |
SRR004862_1.fastq.bz2 | 8128087 |
SRR004862_2.fastq.bz2 | 8128087 |
SRR004863_1.fastq.bz2 | 8579536 |
SRR004863_2.fastq.bz2 | 8579536 |
SRR004864_1.fastq.bz2 | 8333329 |
SRR004864_2.fastq.bz2 | 8333329 |
SRR004865_1.fastq.bz2 | 283573 |
SRR004865_2.fastq.bz2 | 283573 |
SRR004866_1.fastq.bz2 | 4325494 |
SRR004866_2.fastq.bz2 | 4325494 |
SRR004867_1.fastq.bz2 | 4441061 |
SRR004867_2.fastq.bz2 | 4441061 |
SRR004868_1.fastq.bz2 | 4417212 |
SRR004868_2.fastq.bz2 | 4417212 |
SRR004869_1.fastq.bz2 | 4426593 |
SRR004869_2.fastq.bz2 | 4426593 |
SRR004870_1.fastq.bz2 | 4325451 |
SRR004870_2.fastq.bz2 | 4325451 |
SRR004871_1.fastq.bz2 | 4437127 |
SRR004871_2.fastq.bz2 | 4437127 |
SRR005657_1.fastq.bz2 | 5950061 |
SRR005657_2.fastq.bz2 | 5950061 |
SRR005658_1.fastq.bz2 | 6023796 |
SRR005658_2.fastq.bz2 | 6023796 |
SRR005659_1.fastq.bz2 | 6046660 |
SRR005659_2.fastq.bz2 | 6046660 |
SRR005660_1.fastq.bz2 | 5778417 |
SRR005660_2.fastq.bz2 | 5778417 |
SRR005661_1.fastq.bz2 | 6083478 |
SRR005661_2.fastq.bz2 | 6083478 |
SRR005718_1.fastq.bz2 | 32158952 |
SRR005718_2.fastq.bz2 | 32158952 |
SRR005719_1.fastq.bz2 | 19748766 |
SRR005719_2.fastq.bz2 | 19748766 |
SRR005720_1.fastq.bz2 | 26060741 |
SRR005720_2.fastq.bz2 | 26060741 |
SRR005721_1.fastq.bz2 | 11956691 |
SRR005721_2.fastq.bz2 | 11956691 |
SRR005734_1.fastq.bz2 | 24962746 |
SRR005734_2.fastq.bz2 | 24962746 |
SRR005735_1.fastq.bz2 | 32958349 |
SRR005735_2.fastq.bz2 | 32958349 |
SRR006550_1.fastq.bz2 | 6485562 |
SRR006550_2.fastq.bz2 | 6485562 |
SRR006551_1.fastq.bz2 | 7621955 |
SRR006551_2.fastq.bz2 | 7621955 |
SRR006552_1.fastq.bz2 | 6982624 |
SRR006552_2.fastq.bz2 | 6982624 |
SRR006553_1.fastq.bz2 | 7074228 |
SRR006553_2.fastq.bz2 | 7074228 |
SRR006554_1.fastq.bz2 | 7003372 |
SRR006554_2.fastq.bz2 | 7003372 |
SRR006555_1.fastq.bz2 | 7308114 |
SRR006555_2.fastq.bz2 | 7308114 |
SRR006556_1.fastq.bz2 | 4359382 |
SRR006556_2.fastq.bz2 | 4359382 |
SRR006557_1.fastq.bz2 | 4337291 |
SRR006557_2.fastq.bz2 | 4337291 |
SRR006558_1.fastq.bz2 | 3599374 |
SRR006558_2.fastq.bz2 | 3599374 |
SRR006559_1.fastq.bz2 | 3450777 |
SRR006559_2.fastq.bz2 | 3450777 |
SRR006560_1.fastq.bz2 | 3022934 |
SRR006560_2.fastq.bz2 | 3022934 |
SRR006561_1.fastq.bz2 | 5884872 |
SRR006561_2.fastq.bz2 | 5884872 |
SRR006562_1.fastq.bz2 | 5477538 |
SRR006562_2.fastq.bz2 | 5477538 |
SRR006563_1.fastq.bz2 | 6660338 |
SRR006563_2.fastq.bz2 | 6660338 |
SRR006564_1.fastq.bz2 | 7568813 |
SRR006564_2.fastq.bz2 | 7568813 |
SRR029278_1.fastq.bz2 | 9014183 |
SRR029278_2.fastq.bz2 | 9014183 |
SRR029333_1.fastq.bz2 | 3422203 |
SRR029333_2.fastq.bz2 | 3422203 |
SRR029334_1.fastq.bz2 | 5765255 |
SRR029334_2.fastq.bz2 | 5765255 |
SRR029335_1.fastq.bz2 | 7473894 |
SRR029335_2.fastq.bz2 | 7473894 |
SRR029336_1.fastq.bz2 | 3905986 |
SRR029336_2.fastq.bz2 | 3905986 |
SRR029337_1.fastq.bz2 | 18361953 |
SRR029337_2.fastq.bz2 | 18361953 |
SRR029338_1.fastq.bz2 | 3602769 |
SRR029338_2.fastq.bz2 | 3602769 |
First, we need to use a more general terminology. Instead of talking about files, we will say objects. The weight of an object here is its number of sequences. As an example, the object SRR029335_2.fastq.bz2 has a weight of 7473894. And finally, instead of talking about jobs, we'll use the term 'bins'.
So let's say we have 16 bins and that we have the objects above. The goal is to balance the objects in 16 bins such as each bin has mostly the same weight.
With 508 objects and 16 bins, one way is to use 32 objects per bin (and 28 for the last, 15*32 + 1 * 28).
In that case, going for the 32-objects-per-bin will just not be very good because each bin will a very different weight.
Since the number of bins is known and the weight of each object is known, the expected weight per bin can be calculated by summing the weights of all objects and by dividing by the number of bins. In our case, we have 508 objects, and their total weight is 4045335994.
With 16 bins, we expect a average weight of 252833499 for any bin.
Given a distribution of objects, a score can be calculated by summing, for each bin, the absolute difference beween expected and actual weights. By minimizing this score, a really good distribution can be obtained very fast.
For the first state, all the objects are put in the first bin.
Given a state, the next one is generated by picking up 2 random objects. Each of these 2 objects is removed from its bin, and is deposited in a new randomly-selected bin. A new score is computed, and if it is better, the change is accepted. This is repeated as long as there is an improvement.
After 46620000 iterations (3 minutes), this solution was obtained:
Table 2: Weight of each bin after the balancing.
Bin | ExpectedWeight | ActualWeight |
0 | 252833499 | 252837466 |
1 | 252833499 | 252829890 |
2 | 252833499 | 252835796 |
3 | 252833499 | 252832018 |
4 | 252833499 | 252839402 |
5 | 252833499 | 252832650 |
6 | 252833499 | 252827646 |
7 | 252833499 | 252833920 |
8 | 252833499 | 252830266 |
9 | 252833499 | 252833886 |
10 | 252833499 | 252827036 |
11 | 252833499 | 252836770 |
12 | 252833499 | 252828776 |
13 | 252833499 | 252840966 |
14 | 252833499 | 252835218 |
15 | 252833499 | 252834288 |
For this solution, the objects were distributed as follows.
Table 3: Distribution of objects into bin.
Object | Weight | Bin |
SRR002271_1.fastq.bz2 | 22243273 | 11 |
SRR002271_2.fastq.bz2 | 22243273 | 11 |
SRR002272_1.fastq.bz2 | 35756808 | 10 |
SRR002272_2.fastq.bz2 | 35756808 | 10 |
SRR002273_1.fastq.bz2 | 4276214 | 9 |
SRR002273_2.fastq.bz2 | 4276214 | 9 |
SRR002274_1.fastq.bz2 | 18095255 | 15 |
SRR002274_2.fastq.bz2 | 18095255 | 15 |
SRR002275_1.fastq.bz2 | 33729638 | 0 |
SRR002275_2.fastq.bz2 | 33729638 | 0 |
SRR002276_1.fastq.bz2 | 47074312 | 13 |
SRR002276_2.fastq.bz2 | 47074312 | 13 |
SRR002277_1.fastq.bz2 | 6757955 | 12 |
SRR002277_2.fastq.bz2 | 6757955 | 12 |
SRR002278_1.fastq.bz2 | 6093595 | 3 |
SRR002278_2.fastq.bz2 | 6093595 | 3 |
SRR002279_1.fastq.bz2 | 7177292 | 14 |
SRR002279_2.fastq.bz2 | 7177292 | 14 |
SRR002280_1.fastq.bz2 | 6580048 | 14 |
SRR002280_2.fastq.bz2 | 6580048 | 14 |
SRR002281_1.fastq.bz2 | 16693687 | 12 |
SRR002281_2.fastq.bz2 | 16693687 | 12 |
SRR002282_1.fastq.bz2 | 13383178 | 9 |
SRR002282_2.fastq.bz2 | 13383178 | 9 |
SRR002283_1.fastq.bz2 | 18374441 | 2 |
SRR002283_2.fastq.bz2 | 18374441 | 2 |
SRR002284_1.fastq.bz2 | 5600070 | 12 |
SRR002284_2.fastq.bz2 | 5600070 | 12 |
SRR002285_1.fastq.bz2 | 6286076 | 14 |
SRR002285_2.fastq.bz2 | 6286076 | 14 |
SRR002286_1.fastq.bz2 | 5709652 | 11 |
SRR002286_2.fastq.bz2 | 5709652 | 11 |
SRR002287_1.fastq.bz2 | 6309083 | 0 |
SRR002287_2.fastq.bz2 | 6309083 | 0 |
SRR002288_1.fastq.bz2 | 6006869 | 10 |
SRR002288_2.fastq.bz2 | 6006869 | 10 |
SRR002289_1.fastq.bz2 | 4776565 | 15 |
SRR002289_2.fastq.bz2 | 4776565 | 15 |
SRR002290_1.fastq.bz2 | 13044591 | 4 |
SRR002290_2.fastq.bz2 | 13044591 | 4 |
SRR002291_1.fastq.bz2 | 34898264 | 8 |
SRR002291_2.fastq.bz2 | 34898264 | 8 |
SRR002292_1.fastq.bz2 | 22854308 | 7 |
SRR002292_2.fastq.bz2 | 22854308 | 7 |
SRR002293_1.fastq.bz2 | 18042487 | 10 |
SRR002293_2.fastq.bz2 | 18042487 | 10 |
SRR002294_1.fastq.bz2 | 8109548 | 4 |
SRR002294_2.fastq.bz2 | 8109548 | 4 |
SRR002295_1.fastq.bz2 | 30697316 | 11 |
SRR002295_2.fastq.bz2 | 30697316 | 11 |
SRR002296_1.fastq.bz2 | 6101023 | 3 |
SRR002296_2.fastq.bz2 | 6101023 | 3 |
SRR002297_1.fastq.bz2 | 46467908 | 8 |
SRR002297_2.fastq.bz2 | 46467908 | 8 |
SRR002298_1.fastq.bz2 | 10848356 | 13 |
SRR002298_2.fastq.bz2 | 10848356 | 13 |
SRR002299_1.fastq.bz2 | 6078655 | 5 |
SRR002299_2.fastq.bz2 | 6078655 | 5 |
SRR002300_1.fastq.bz2 | 6483460 | 2 |
SRR002300_2.fastq.bz2 | 6483460 | 2 |
SRR002301_1.fastq.bz2 | 7260908 | 15 |
SRR002301_2.fastq.bz2 | 7260908 | 15 |
SRR002302_1.fastq.bz2 | 6283624 | 15 |
SRR002302_2.fastq.bz2 | 6283624 | 15 |
SRR002303_1.fastq.bz2 | 6092616 | 9 |
SRR002303_2.fastq.bz2 | 6092616 | 9 |
SRR002304_1.fastq.bz2 | 8669549 | 14 |
SRR002304_2.fastq.bz2 | 8669549 | 14 |
SRR002305_1.fastq.bz2 | 7079980 | 1 |
SRR002305_2.fastq.bz2 | 7079980 | 1 |
SRR002306_1.fastq.bz2 | 6061012 | 12 |
SRR002306_2.fastq.bz2 | 6061012 | 12 |
SRR002307_1.fastq.bz2 | 7990437 | 0 |
SRR002307_2.fastq.bz2 | 7990437 | 0 |
SRR002308_1.fastq.bz2 | 6403325 | 13 |
SRR002308_2.fastq.bz2 | 6403325 | 13 |
SRR002309_1.fastq.bz2 | 5929366 | 9 |
SRR002309_2.fastq.bz2 | 5929366 | 9 |
SRR002310_1.fastq.bz2 | 9745625 | 14 |
SRR002310_2.fastq.bz2 | 9745625 | 14 |
SRR002311_1.fastq.bz2 | 6113076 | 8 |
SRR002311_2.fastq.bz2 | 6113076 | 8 |
SRR002312_1.fastq.bz2 | 9808666 | 0 |
SRR002312_2.fastq.bz2 | 9808666 | 0 |
SRR002313_1.fastq.bz2 | 7841529 | 15 |
SRR002313_2.fastq.bz2 | 7841529 | 15 |
SRR002314_1.fastq.bz2 | 4334295 | 13 |
SRR002314_2.fastq.bz2 | 4334295 | 13 |
SRR002315_1.fastq.bz2 | 4857885 | 7 |
SRR002315_2.fastq.bz2 | 4857885 | 7 |
SRR002316_1.fastq.bz2 | 14025787 | 13 |
SRR002316_2.fastq.bz2 | 14025787 | 13 |
SRR002317_1.fastq.bz2 | 9732073 | 11 |
SRR002317_2.fastq.bz2 | 9732073 | 11 |
SRR002318_1.fastq.bz2 | 1670064 | 11 |
SRR002318_2.fastq.bz2 | 1670064 | 11 |
SRR002319_1.fastq.bz2 | 9264776 | 9 |
SRR002319_2.fastq.bz2 | 9264776 | 9 |
SRR003810_1.fastq.bz2 | 4605877 | 12 |
SRR003810_2.fastq.bz2 | 4605877 | 12 |
SRR003812_1.fastq.bz2 | 4891691 | 3 |
SRR003812_2.fastq.bz2 | 4891691 | 3 |
SRR003813_1.fastq.bz2 | 4908529 | 11 |
SRR003813_2.fastq.bz2 | 4908529 | 11 |
SRR003814_1.fastq.bz2 | 3958657 | 12 |
SRR003814_2.fastq.bz2 | 3958657 | 12 |
SRR003815_1.fastq.bz2 | 4780683 | 2 |
SRR003815_2.fastq.bz2 | 4780683 | 2 |
SRR003816_1.fastq.bz2 | 4673240 | 0 |
SRR003816_2.fastq.bz2 | 4673240 | 0 |
SRR003817_1.fastq.bz2 | 1226466 | 3 |
SRR003817_2.fastq.bz2 | 1226466 | 3 |
SRR003818_1.fastq.bz2 | 6450471 | 1 |
SRR003818_2.fastq.bz2 | 6450471 | 1 |
SRR003819_1.fastq.bz2 | 6593471 | 0 |
SRR003819_2.fastq.bz2 | 6593471 | 0 |
SRR003820_1.fastq.bz2 | 6553586 | 9 |
SRR003820_2.fastq.bz2 | 6553586 | 9 |
SRR003821_1.fastq.bz2 | 6328227 | 5 |
SRR003821_2.fastq.bz2 | 6328227 | 5 |
SRR003823_1.fastq.bz2 | 6247413 | 9 |
SRR003823_2.fastq.bz2 | 6247413 | 9 |
SRR003824_1.fastq.bz2 | 6025237 | 11 |
SRR003824_2.fastq.bz2 | 6025237 | 11 |
SRR003825_1.fastq.bz2 | 1561734 | 12 |
SRR003825_2.fastq.bz2 | 1561734 | 12 |
SRR003837_1.fastq.bz2 | 7512805 | 6 |
SRR003837_2.fastq.bz2 | 7512805 | 6 |
SRR003838_1.fastq.bz2 | 7152063 | 0 |
SRR003838_2.fastq.bz2 | 7152063 | 0 |
SRR003839_1.fastq.bz2 | 7162665 | 14 |
SRR003839_2.fastq.bz2 | 7162665 | 14 |
SRR003840_1.fastq.bz2 | 7316431 | 9 |
SRR003840_2.fastq.bz2 | 7316431 | 9 |
SRR003841_1.fastq.bz2 | 1978964 | 13 |
SRR003841_2.fastq.bz2 | 1978964 | 13 |
SRR003845_1.fastq.bz2 | 7425426 | 10 |
SRR003845_2.fastq.bz2 | 7425426 | 10 |
SRR003846_1.fastq.bz2 | 6496063 | 9 |
SRR003846_2.fastq.bz2 | 6496063 | 9 |
SRR003847_1.fastq.bz2 | 7303975 | 5 |
SRR003847_2.fastq.bz2 | 7303975 | 5 |
SRR003848_1.fastq.bz2 | 7322831 | 3 |
SRR003848_2.fastq.bz2 | 7322831 | 3 |
SRR003849_1.fastq.bz2 | 7353518 | 2 |
SRR003849_2.fastq.bz2 | 7353518 | 2 |
SRR003850_1.fastq.bz2 | 1956095 | 15 |
SRR003850_2.fastq.bz2 | 1956095 | 15 |
SRR003851_1.fastq.bz2 | 6349952 | 6 |
SRR003851_2.fastq.bz2 | 6349952 | 6 |
SRR003852_1.fastq.bz2 | 6750815 | 6 |
SRR003852_2.fastq.bz2 | 6750815 | 6 |
SRR003853_1.fastq.bz2 | 6676849 | 11 |
SRR003853_2.fastq.bz2 | 6676849 | 11 |
SRR003854_1.fastq.bz2 | 6582580 | 1 |
SRR003854_2.fastq.bz2 | 6582580 | 1 |
SRR003855_1.fastq.bz2 | 6764442 | 2 |
SRR003855_2.fastq.bz2 | 6764442 | 2 |
SRR003856_1.fastq.bz2 | 6721795 | 6 |
SRR003856_2.fastq.bz2 | 6721795 | 6 |
SRR003857_1.fastq.bz2 | 1637242 | 12 |
SRR003857_2.fastq.bz2 | 1637242 | 12 |
SRR003859_1.fastq.bz2 | 3741843 | 7 |
SRR003859_2.fastq.bz2 | 3741843 | 7 |
SRR003860_1.fastq.bz2 | 3946844 | 7 |
SRR003860_2.fastq.bz2 | 3946844 | 7 |
SRR003861_1.fastq.bz2 | 4062736 | 4 |
SRR003861_2.fastq.bz2 | 4062736 | 4 |
SRR003863_1.fastq.bz2 | 6555145 | 11 |
SRR003863_2.fastq.bz2 | 6555145 | 11 |
SRR003864_1.fastq.bz2 | 6419174 | 6 |
SRR003864_2.fastq.bz2 | 6419174 | 6 |
SRR003866_1.fastq.bz2 | 6498466 | 0 |
SRR003866_2.fastq.bz2 | 6498466 | 0 |
SRR003867_1.fastq.bz2 | 6459933 | 10 |
SRR003867_2.fastq.bz2 | 6459933 | 10 |
SRR003868_1.fastq.bz2 | 7433254 | 11 |
SRR003868_2.fastq.bz2 | 7433254 | 11 |
SRR003869_1.fastq.bz2 | 7308992 | 8 |
SRR003869_2.fastq.bz2 | 7308992 | 8 |
SRR003870_1.fastq.bz2 | 7557219 | 3 |
SRR003870_2.fastq.bz2 | 7557219 | 3 |
SRR003871_1.fastq.bz2 | 7443963 | 1 |
SRR003871_2.fastq.bz2 | 7443963 | 1 |
SRR003872_1.fastq.bz2 | 7499185 | 3 |
SRR003872_2.fastq.bz2 | 7499185 | 3 |
SRR003873_1.fastq.bz2 | 6323645 | 15 |
SRR003873_2.fastq.bz2 | 6323645 | 15 |
SRR003874_1.fastq.bz2 | 6145944 | 7 |
SRR003874_2.fastq.bz2 | 6145944 | 7 |
SRR003875_1.fastq.bz2 | 6348097 | 12 |
SRR003875_2.fastq.bz2 | 6348097 | 12 |
SRR003876_1.fastq.bz2 | 6296738 | 9 |
SRR003876_2.fastq.bz2 | 6296738 | 9 |
SRR003877_1.fastq.bz2 | 6450647 | 10 |
SRR003877_2.fastq.bz2 | 6450647 | 10 |
SRR003878_1.fastq.bz2 | 6543352 | 4 |
SRR003878_2.fastq.bz2 | 6543352 | 4 |
SRR003879_1.fastq.bz2 | 6418736 | 13 |
SRR003879_2.fastq.bz2 | 6418736 | 13 |
SRR003960_1.fastq.bz2 | 9722922 | 6 |
SRR003960_2.fastq.bz2 | 9722922 | 6 |
SRR003961_1.fastq.bz2 | 8995012 | 14 |
SRR003961_2.fastq.bz2 | 8995012 | 14 |
SRR003962_1.fastq.bz2 | 9249927 | 10 |
SRR003962_2.fastq.bz2 | 9249927 | 10 |
SRR003963_1.fastq.bz2 | 9185577 | 1 |
SRR003963_2.fastq.bz2 | 9185577 | 1 |
SRR003964_1.fastq.bz2 | 9484360 | 4 |
SRR003964_2.fastq.bz2 | 9484360 | 4 |
SRR003965_1.fastq.bz2 | 8959911 | 5 |
SRR003965_2.fastq.bz2 | 8959911 | 5 |
SRR003966_1.fastq.bz2 | 5851368 | 5 |
SRR003966_2.fastq.bz2 | 5851368 | 5 |
SRR003967_1.fastq.bz2 | 5236932 | 5 |
SRR003967_2.fastq.bz2 | 5236932 | 5 |
SRR003968_1.fastq.bz2 | 6170713 | 4 |
SRR003968_2.fastq.bz2 | 6170713 | 4 |
SRR003969_1.fastq.bz2 | 6276516 | 12 |
SRR003969_2.fastq.bz2 | 6276516 | 12 |
SRR003970_1.fastq.bz2 | 5765690 | 1 |
SRR003970_2.fastq.bz2 | 5765690 | 1 |
SRR003971_1.fastq.bz2 | 1615149 | 12 |
SRR003971_2.fastq.bz2 | 1615149 | 12 |
SRR004105_1.fastq.bz2 | 6815628 | 7 |
SRR004105_2.fastq.bz2 | 6815628 | 7 |
SRR004106_1.fastq.bz2 | 6857870 | 2 |
SRR004106_2.fastq.bz2 | 6857870 | 2 |
SRR004107_1.fastq.bz2 | 6961705 | 3 |
SRR004107_2.fastq.bz2 | 6961705 | 3 |
SRR004108_1.fastq.bz2 | 7019609 | 7 |
SRR004108_2.fastq.bz2 | 7019609 | 7 |
SRR004109_1.fastq.bz2 | 7017622 | 5 |
SRR004109_2.fastq.bz2 | 7017622 | 5 |
SRR004110_1.fastq.bz2 | 1510651 | 6 |
SRR004110_2.fastq.bz2 | 1510651 | 6 |
SRR004111_1.fastq.bz2 | 6500202 | 12 |
SRR004111_2.fastq.bz2 | 6500202 | 12 |
SRR004112_1.fastq.bz2 | 6382108 | 9 |
SRR004112_2.fastq.bz2 | 6382108 | 9 |
SRR004113_1.fastq.bz2 | 6769812 | 9 |
SRR004113_2.fastq.bz2 | 6769812 | 9 |
SRR004114_1.fastq.bz2 | 6749632 | 2 |
SRR004114_2.fastq.bz2 | 6749632 | 2 |
SRR004116_1.fastq.bz2 | 6660494 | 6 |
SRR004116_2.fastq.bz2 | 6660494 | 6 |
SRR004117_1.fastq.bz2 | 6894164 | 9 |
SRR004117_2.fastq.bz2 | 6894164 | 9 |
SRR004118_1.fastq.bz2 | 7012762 | 4 |
SRR004118_2.fastq.bz2 | 7012762 | 4 |
SRR004119_1.fastq.bz2 | 7106679 | 0 |
SRR004119_2.fastq.bz2 | 7106679 | 0 |
SRR004120_1.fastq.bz2 | 1890977 | 1 |
SRR004120_2.fastq.bz2 | 1890977 | 1 |
SRR004121_1.fastq.bz2 | 6688708 | 13 |
SRR004121_2.fastq.bz2 | 6688708 | 13 |
SRR004122_1.fastq.bz2 | 6442268 | 2 |
SRR004122_2.fastq.bz2 | 6442268 | 2 |
SRR004123_1.fastq.bz2 | 7219728 | 7 |
SRR004123_2.fastq.bz2 | 7219728 | 7 |
SRR004124_1.fastq.bz2 | 6650577 | 10 |
SRR004124_2.fastq.bz2 | 6650577 | 10 |
SRR004125_1.fastq.bz2 | 6980725 | 5 |
SRR004125_2.fastq.bz2 | 6980725 | 5 |
SRR004126_1.fastq.bz2 | 7121389 | 4 |
SRR004126_2.fastq.bz2 | 7121389 | 4 |
SRR004127_1.fastq.bz2 | 1920925 | 15 |
SRR004127_2.fastq.bz2 | 1920925 | 15 |
SRR004186_1.fastq.bz2 | 6039526 | 4 |
SRR004186_2.fastq.bz2 | 6039526 | 4 |
SRR004187_1.fastq.bz2 | 6189345 | 15 |
SRR004187_2.fastq.bz2 | 6189345 | 15 |
SRR004188_1.fastq.bz2 | 6202959 | 3 |
SRR004188_2.fastq.bz2 | 6202959 | 3 |
SRR004190_1.fastq.bz2 | 5896818 | 4 |
SRR004190_2.fastq.bz2 | 5896818 | 4 |
SRR004191_1.fastq.bz2 | 5619426 | 4 |
SRR004191_2.fastq.bz2 | 5619426 | 4 |
SRR004192_1.fastq.bz2 | 1399560 | 2 |
SRR004192_2.fastq.bz2 | 1399560 | 2 |
SRR004193_1.fastq.bz2 | 6978310 | 15 |
SRR004193_2.fastq.bz2 | 6978310 | 15 |
SRR004194_1.fastq.bz2 | 6918542 | 1 |
SRR004194_2.fastq.bz2 | 6918542 | 1 |
SRR004195_1.fastq.bz2 | 5848753 | 3 |
SRR004195_2.fastq.bz2 | 5848753 | 3 |
SRR004197_1.fastq.bz2 | 7159088 | 3 |
SRR004197_2.fastq.bz2 | 7159088 | 3 |
SRR004198_1.fastq.bz2 | 1930314 | 0 |
SRR004198_2.fastq.bz2 | 1930314 | 0 |
SRR004199_1.fastq.bz2 | 6536064 | 5 |
SRR004199_2.fastq.bz2 | 6536064 | 5 |
SRR004200_1.fastq.bz2 | 6839637 | 4 |
SRR004200_2.fastq.bz2 | 6839637 | 4 |
SRR004201_1.fastq.bz2 | 6943961 | 2 |
SRR004201_2.fastq.bz2 | 6943961 | 2 |
SRR004202_1.fastq.bz2 | 6886712 | 4 |
SRR004202_2.fastq.bz2 | 6886712 | 4 |
SRR004203_1.fastq.bz2 | 6920882 | 6 |
SRR004203_2.fastq.bz2 | 6920882 | 6 |
SRR004204_1.fastq.bz2 | 1862546 | 13 |
SRR004204_2.fastq.bz2 | 1862546 | 13 |
SRR004205_1.fastq.bz2 | 4905164 | 15 |
SRR004205_2.fastq.bz2 | 4905164 | 15 |
SRR004206_1.fastq.bz2 | 4757265 | 5 |
SRR004206_2.fastq.bz2 | 4757265 | 5 |
SRR004207_1.fastq.bz2 | 5253223 | 7 |
SRR004207_2.fastq.bz2 | 5253223 | 7 |
SRR004208_1.fastq.bz2 | 5061637 | 7 |
SRR004208_2.fastq.bz2 | 5061637 | 7 |
SRR004209_1.fastq.bz2 | 5250983 | 7 |
SRR004209_2.fastq.bz2 | 5250983 | 7 |
SRR004210_1.fastq.bz2 | 5348659 | 6 |
SRR004210_2.fastq.bz2 | 5348659 | 6 |
SRR004211_1.fastq.bz2 | 1331516 | 6 |
SRR004211_2.fastq.bz2 | 1331516 | 6 |
SRR004809_1.fastq.bz2 | 6281732 | 3 |
SRR004809_2.fastq.bz2 | 6281732 | 3 |
SRR004810_1.fastq.bz2 | 6479897 | 12 |
SRR004810_2.fastq.bz2 | 6479897 | 12 |
SRR004811_1.fastq.bz2 | 6428286 | 5 |
SRR004811_2.fastq.bz2 | 6428286 | 5 |
SRR004812_1.fastq.bz2 | 6386024 | 9 |
SRR004812_2.fastq.bz2 | 6386024 | 9 |
SRR004813_1.fastq.bz2 | 6397082 | 8 |
SRR004813_2.fastq.bz2 | 6397082 | 8 |
SRR004814_1.fastq.bz2 | 6108069 | 3 |
SRR004814_2.fastq.bz2 | 6108069 | 3 |
SRR004815_1.fastq.bz2 | 6525195 | 3 |
SRR004815_2.fastq.bz2 | 6525195 | 3 |
SRR004816_1.fastq.bz2 | 6511455 | 3 |
SRR004816_2.fastq.bz2 | 6511455 | 3 |
SRR004817_1.fastq.bz2 | 6462803 | 8 |
SRR004817_2.fastq.bz2 | 6462803 | 8 |
SRR004818_1.fastq.bz2 | 6433169 | 7 |
SRR004818_2.fastq.bz2 | 6433169 | 7 |
SRR004819_1.fastq.bz2 | 6338234 | 5 |
SRR004819_2.fastq.bz2 | 6338234 | 5 |
SRR004820_1.fastq.bz2 | 7308557 | 1 |
SRR004820_2.fastq.bz2 | 7308557 | 1 |
SRR004821_1.fastq.bz2 | 7156960 | 2 |
SRR004821_2.fastq.bz2 | 7156960 | 2 |
SRR004822_1.fastq.bz2 | 6213660 | 15 |
SRR004822_2.fastq.bz2 | 6213660 | 15 |
SRR004823_1.fastq.bz2 | 5949864 | 2 |
SRR004823_2.fastq.bz2 | 5949864 | 2 |
SRR004824_1.fastq.bz2 | 6087720 | 3 |
SRR004824_2.fastq.bz2 | 6087720 | 3 |
SRR004825_1.fastq.bz2 | 6078285 | 5 |
SRR004825_2.fastq.bz2 | 6078285 | 5 |
SRR004826_1.fastq.bz2 | 5882054 | 4 |
SRR004826_2.fastq.bz2 | 5882054 | 4 |
SRR004827_1.fastq.bz2 | 5541480 | 5 |
SRR004827_2.fastq.bz2 | 5541480 | 5 |
SRR004828_1.fastq.bz2 | 7078458 | 12 |
SRR004828_2.fastq.bz2 | 7078458 | 12 |
SRR004829_1.fastq.bz2 | 7524672 | 11 |
SRR004829_2.fastq.bz2 | 7524672 | 11 |
SRR004830_1.fastq.bz2 | 7820645 | 14 |
SRR004830_2.fastq.bz2 | 7820645 | 14 |
SRR004831_1.fastq.bz2 | 8105229 | 0 |
SRR004831_2.fastq.bz2 | 8105229 | 0 |
SRR004832_1.fastq.bz2 | 7956781 | 14 |
SRR004832_2.fastq.bz2 | 7956781 | 14 |
SRR004833_1.fastq.bz2 | 7547869 | 6 |
SRR004833_2.fastq.bz2 | 7547869 | 6 |
SRR004834_1.fastq.bz2 | 6447823 | 9 |
SRR004834_2.fastq.bz2 | 6447823 | 9 |
SRR004835_1.fastq.bz2 | 8616259 | 6 |
SRR004835_2.fastq.bz2 | 8616259 | 6 |
SRR004836_1.fastq.bz2 | 8975677 | 5 |
SRR004836_2.fastq.bz2 | 8975677 | 5 |
SRR004837_1.fastq.bz2 | 8642976 | 12 |
SRR004837_2.fastq.bz2 | 8642976 | 12 |
SRR004838_1.fastq.bz2 | 7023195 | 3 |
SRR004838_2.fastq.bz2 | 7023195 | 3 |
SRR004839_1.fastq.bz2 | 8062696 | 3 |
SRR004839_2.fastq.bz2 | 8062696 | 3 |
SRR004840_1.fastq.bz2 | 8060450 | 0 |
SRR004840_2.fastq.bz2 | 8060450 | 0 |
SRR004841_1.fastq.bz2 | 6678858 | 9 |
SRR004841_2.fastq.bz2 | 6678858 | 9 |
SRR004842_1.fastq.bz2 | 7091815 | 15 |
SRR004842_2.fastq.bz2 | 7091815 | 15 |
SRR004843_1.fastq.bz2 | 6890492 | 2 |
SRR004843_2.fastq.bz2 | 6890492 | 2 |
SRR004844_1.fastq.bz2 | 8549164 | 13 |
SRR004844_2.fastq.bz2 | 8549164 | 13 |
SRR004845_1.fastq.bz2 | 8837159 | 5 |
SRR004845_2.fastq.bz2 | 8837159 | 5 |
SRR004846_1.fastq.bz2 | 7759225 | 14 |
SRR004846_2.fastq.bz2 | 7759225 | 14 |
SRR004847_1.fastq.bz2 | 5661124 | 9 |
SRR004847_2.fastq.bz2 | 5661124 | 9 |
SRR004848_1.fastq.bz2 | 5408098 | 10 |
SRR004848_2.fastq.bz2 | 5408098 | 10 |
SRR004849_1.fastq.bz2 | 5950977 | 7 |
SRR004849_2.fastq.bz2 | 5950977 | 7 |
SRR004850_1.fastq.bz2 | 9849395 | 14 |
SRR004850_2.fastq.bz2 | 9849395 | 14 |
SRR004851_1.fastq.bz2 | 9534571 | 6 |
SRR004851_2.fastq.bz2 | 9534571 | 6 |
SRR004852_1.fastq.bz2 | 9340649 | 9 |
SRR004852_2.fastq.bz2 | 9340649 | 9 |
SRR004853_1.fastq.bz2 | 6474762 | 14 |
SRR004853_2.fastq.bz2 | 6474762 | 14 |
SRR004854_1.fastq.bz2 | 6360372 | 12 |
SRR004854_2.fastq.bz2 | 6360372 | 12 |
SRR004855_1.fastq.bz2 | 6512147 | 1 |
SRR004855_2.fastq.bz2 | 6512147 | 1 |
SRR004856_1.fastq.bz2 | 10212539 | 1 |
SRR004856_2.fastq.bz2 | 10212539 | 1 |
SRR004857_1.fastq.bz2 | 10214756 | 1 |
SRR004857_2.fastq.bz2 | 10214756 | 1 |
SRR004858_1.fastq.bz2 | 9948010 | 8 |
SRR004858_2.fastq.bz2 | 9948010 | 8 |
SRR004859_1.fastq.bz2 | 10152277 | 5 |
SRR004859_2.fastq.bz2 | 10152277 | 5 |
SRR004860_1.fastq.bz2 | 9972388 | 4 |
SRR004860_2.fastq.bz2 | 9972388 | 4 |
SRR004861_1.fastq.bz2 | 8818998 | 8 |
SRR004861_2.fastq.bz2 | 8818998 | 8 |
SRR004862_1.fastq.bz2 | 8128087 | 1 |
SRR004862_2.fastq.bz2 | 8128087 | 1 |
SRR004863_1.fastq.bz2 | 8579536 | 11 |
SRR004863_2.fastq.bz2 | 8579536 | 11 |
SRR004864_1.fastq.bz2 | 8333329 | 14 |
SRR004864_2.fastq.bz2 | 8333329 | 14 |
SRR004865_1.fastq.bz2 | 283573 | 6 |
SRR004865_2.fastq.bz2 | 283573 | 6 |
SRR004866_1.fastq.bz2 | 4325494 | 11 |
SRR004866_2.fastq.bz2 | 4325494 | 11 |
SRR004867_1.fastq.bz2 | 4441061 | 12 |
SRR004867_2.fastq.bz2 | 4441061 | 12 |
SRR004868_1.fastq.bz2 | 4417212 | 0 |
SRR004868_2.fastq.bz2 | 4417212 | 0 |
SRR004869_1.fastq.bz2 | 4426593 | 13 |
SRR004869_2.fastq.bz2 | 4426593 | 13 |
SRR004870_1.fastq.bz2 | 4325451 | 13 |
SRR004870_2.fastq.bz2 | 4325451 | 13 |
SRR004871_1.fastq.bz2 | 4437127 | 7 |
SRR004871_2.fastq.bz2 | 4437127 | 7 |
SRR005657_1.fastq.bz2 | 5950061 | 14 |
SRR005657_2.fastq.bz2 | 5950061 | 14 |
SRR005658_1.fastq.bz2 | 6023796 | 2 |
SRR005658_2.fastq.bz2 | 6023796 | 2 |
SRR005659_1.fastq.bz2 | 6046660 | 12 |
SRR005659_2.fastq.bz2 | 6046660 | 12 |
SRR005660_1.fastq.bz2 | 5778417 | 0 |
SRR005660_2.fastq.bz2 | 5778417 | 0 |
SRR005661_1.fastq.bz2 | 6083478 | 7 |
SRR005661_2.fastq.bz2 | 6083478 | 7 |
SRR005718_1.fastq.bz2 | 32158952 | 6 |
SRR005718_2.fastq.bz2 | 32158952 | 6 |
SRR005719_1.fastq.bz2 | 19748766 | 12 |
SRR005719_2.fastq.bz2 | 19748766 | 12 |
SRR005720_1.fastq.bz2 | 26060741 | 1 |
SRR005720_2.fastq.bz2 | 26060741 | 1 |
SRR005721_1.fastq.bz2 | 11956691 | 2 |
SRR005721_2.fastq.bz2 | 11956691 | 2 |
SRR005734_1.fastq.bz2 | 24962746 | 10 |
SRR005734_2.fastq.bz2 | 24962746 | 10 |
SRR005735_1.fastq.bz2 | 32958349 | 15 |
SRR005735_2.fastq.bz2 | 32958349 | 15 |
SRR006550_1.fastq.bz2 | 6485562 | 14 |
SRR006550_2.fastq.bz2 | 6485562 | 14 |
SRR006551_1.fastq.bz2 | 7621955 | 15 |
SRR006551_2.fastq.bz2 | 7621955 | 15 |
SRR006552_1.fastq.bz2 | 6982624 | 7 |
SRR006552_2.fastq.bz2 | 6982624 | 7 |
SRR006553_1.fastq.bz2 | 7074228 | 2 |
SRR006553_2.fastq.bz2 | 7074228 | 2 |
SRR006554_1.fastq.bz2 | 7003372 | 4 |
SRR006554_2.fastq.bz2 | 7003372 | 4 |
SRR006555_1.fastq.bz2 | 7308114 | 4 |
SRR006555_2.fastq.bz2 | 7308114 | 4 |
SRR006556_1.fastq.bz2 | 4359382 | 0 |
SRR006556_2.fastq.bz2 | 4359382 | 0 |
SRR006557_1.fastq.bz2 | 4337291 | 11 |
SRR006557_2.fastq.bz2 | 4337291 | 11 |
SRR006558_1.fastq.bz2 | 3599374 | 13 |
SRR006558_2.fastq.bz2 | 3599374 | 13 |
SRR006559_1.fastq.bz2 | 3450777 | 2 |
SRR006559_2.fastq.bz2 | 3450777 | 2 |
SRR006560_1.fastq.bz2 | 3022934 | 6 |
SRR006560_2.fastq.bz2 | 3022934 | 6 |
SRR006561_1.fastq.bz2 | 5884872 | 13 |
SRR006561_2.fastq.bz2 | 5884872 | 13 |
SRR006562_1.fastq.bz2 | 5477538 | 3 |
SRR006562_2.fastq.bz2 | 5477538 | 3 |
SRR006563_1.fastq.bz2 | 6660338 | 1 |
SRR006563_2.fastq.bz2 | 6660338 | 1 |
SRR006564_1.fastq.bz2 | 7568813 | 14 |
SRR006564_2.fastq.bz2 | 7568813 | 14 |
SRR029278_1.fastq.bz2 | 9014183 | 5 |
SRR029278_2.fastq.bz2 | 9014183 | 5 |
SRR029333_1.fastq.bz2 | 3422203 | 4 |
SRR029333_2.fastq.bz2 | 3422203 | 4 |
SRR029334_1.fastq.bz2 | 5765255 | 2 |
SRR029334_2.fastq.bz2 | 5765255 | 2 |
SRR029335_1.fastq.bz2 | 7473894 | 3 |
SRR029335_2.fastq.bz2 | 7473894 | 3 |
SRR029336_1.fastq.bz2 | 3905986 | 0 |
SRR029336_2.fastq.bz2 | 3905986 | 0 |
SRR029337_1.fastq.bz2 | 18361953 | 7 |
SRR029337_2.fastq.bz2 | 18361953 | 7 |
SRR029338_1.fastq.bz2 | 3602769 | 14 |
SRR029338_2.fastq.bz2 | 3602769 | 14 |
The solver is available here: https://github.com/sebhtml/NGS-Pipelines/blob/master/Balance-Objects.py
Comments