

Abreak: evaluating de novo assemblies
source link: http://lh3.github.io/2014/07/07/abreak-evaluating-de-novo-assemblies
“Abreak” is a subcommand of htsbox, a little toy forked from the lite branch of htslib. It takes an assembly-to-reference alignment as input and counts the number of alignment break points. An earlier version was used in my fermi paper to measure the misassembly rate of human de novo assemblies. A typical output looks like:
Number of unmapped contigs: 239
Total length of unmapped contigs: 54588
Number of alignments dropped due to excessive overlaps: 0
Mapped contig bases: 2933399461
Mapped N50: 6241
Number of break points: 102146
Number of Q10 break points longer than (0,100,200,500)bp: (28719,7206,4644,3222)
Number of break points after patching gaps short than 500bp: 94298
Number of Q10 break points longer than (0,100,200,500)bp after gap patching: (23326,5320,3369,2194)
Here it gives the mapped contig bases, mapped N50 and number of break points with flanking sequences longer than 0, 100, 200 and 500bp.
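As a rough illustration of the "Mapped N50" line, here is a minimal sketch that assumes the usual N50 definition: the largest length L such that segments of length at least L cover half of the total. The function name and the idea of feeding it aligned segment lengths are assumptions made for illustration, not a description of how abreak computes the value internally.

```python
def n50(lengths):
    """Return the N50 of a collection of lengths, or 0 if it is empty.

    Lengths are visited from longest to shortest; the N50 is the length
    at which the running sum first reaches half of the total.
    """
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0


if __name__ == "__main__":
    # Hypothetical aligned segment lengths, in bp.
    print(n50([2, 3, 4, 5, 6]))
```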
Although GAGE-B and QUAST are more powerful, their reliance on MUMmer limits them to small genomes. In contrast, “abreak” works with any aligner that supports chimeric alignment. When BWA-SW or BWA-MEM is used to map contigs, “abreak” works easily and efficiently with mammal-sized assemblies.
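To make the idea of a break point concrete: a chimeric-aware aligner reports a contig that does not align end to end as several segments, and each junction between neighbouring segments on the contig can be counted as one break. The sketch below is a simplified model, not abreak's actual code. The `Segment` type, the rule that both flanking segments must reach the mapping-quality cutoff, and the use of the shorter flank for the length thresholds are all assumptions chosen to mirror the Q10 and (0,100,200,500)bp columns of the sample output.

```python
from collections import defaultdict
from typing import NamedTuple


class Segment(NamedTuple):
    """One aligned piece of a contig (hypothetical, simplified record)."""
    contig: str
    q_start: int  # start on the contig, 0-based
    q_end: int    # end on the contig, exclusive
    mapq: int     # mapping quality of this piece


def count_break_points(segments, min_mapq=10, min_flanks=(0, 100, 200, 500)):
    """Return (total_breaks, counts), one count per threshold in min_flanks.

    A break is the junction between two neighbouring segments of the same
    contig. It is counted for a threshold when both flanks have
    mapq >= min_mapq and the shorter flank is longer than the threshold.
    """
    by_contig = defaultdict(list)
    for seg in segments:
        by_contig[seg.contig].append(seg)

    total = 0
    counts = [0] * len(min_flanks)
    for segs in by_contig.values():
        segs.sort(key=lambda s: s.q_start)
        for left, right in zip(segs, segs[1:]):
            total += 1
            if min(left.mapq, right.mapq) < min_mapq:
                continue  # low-confidence junction: skip the filtered counts
            flank = min(left.q_end - left.q_start, right.q_end - right.q_start)
            for i, threshold in enumerate(min_flanks):
                if flank > threshold:
                    counts[i] += 1
    return total, tuple(counts)
```

A contig with a single segment contributes no breaks, so a perfectly collinear assembly would report zero in every column under this model.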
UPDATE on 11/24/2014: changed links to htslib/htsbox. BTW, the results vary with mapping parameters. If contigs and references are very close to each other, it is recommended to use:
bwa mem -B9 -O16 -E1
for mapping. The default works better when the divergence is high.
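For scripted runs, the choice between the two settings can be wrapped in a small helper. This is only a sketch: the function name, its `close_divergence` switch and the file names are hypothetical, and only the `bwa mem` options quoted above are used. In BWA-MEM, `-B` is the mismatch penalty, `-O` the gap open penalty and `-E` the gap extension penalty.

```python
def bwa_mem_command(reference, contigs, close_divergence=True):
    """Build the argument list for mapping contigs to a reference.

    With close_divergence=True the stricter penalties suggested for
    closely related contigs and references are added; otherwise the
    aligner's defaults are left in place.
    """
    cmd = ["bwa", "mem"]
    if close_divergence:
        # -B9: mismatch penalty, -O16: gap open, -E1: gap extension
        cmd += ["-B9", "-O16", "-E1"]
    cmd += [reference, contigs]
    return cmd


if __name__ == "__main__":
    # Hypothetical file names; prints the command instead of running it.
    print(" ".join(bwa_mem_command("ref.fa", "contigs.fa")))
```

The returned list can be handed to `subprocess.run` with standard output redirected to a SAM file, which then serves as the alignment input for abreak.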