Taxonomic Classification Of Microbial Metagenomes Using MinHash Signatures - ASM Microbe 2017
I’m extremely interested in metagenomics, especially assembly and classification. One of the projects I’ve been working on is taxonomic classification of metagenomes using sourmash, a tool for calculating and comparing MinHash sketches. The authors of "Mash: fast genome and metagenome distance estimation using MinHash" demonstrated the utility of MinHash sketches for accurate clustering of genomes and metagenomes. Using their work as inspiration, our lab developed sourmash to explore some additional questions we’re interested in. The sbt gather functionality of sourmash compares MinHash signatures computed with sourmash to an index containing signatures calculated for all of the microbial genomes in the NCBI RefSeq database. My advisor, C. Titus Brown, and Luis Irber, a graduate student in our lab, describe the method for building the index and its functionality.
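To make the idea of comparing MinHash sketches a bit more concrete, here is a minimal sketch in plain Python. It is not sourmash's or Mash's actual implementation: the function names, the use of MD5 as the hash function, and the tiny values of k and num_hashes are all assumptions chosen for illustration. It keeps the smallest hash values of each sequence's k-mers (a "bottom" sketch) and estimates the Jaccard similarity from those small sketches instead of the full k-mer sets.

```python
import hashlib


def kmers(seq, k):
    """Yield every overlapping k-mer in seq."""
    for i in range(len(seq) - k + 1):
        yield seq[i:i + k]


def hash_kmer(kmer):
    """Map a k-mer to a 64-bit integer (MD5 is an illustrative choice)."""
    digest = hashlib.md5(kmer.encode("ascii")).digest()
    return int.from_bytes(digest[:8], "big")


def bottom_sketch(seq, k=5, num_hashes=50):
    """Return the num_hashes smallest distinct k-mer hashes, sorted."""
    hashes = {hash_kmer(km) for km in kmers(seq, k)}
    return sorted(hashes)[:num_hashes]


def estimate_jaccard(sketch_a, sketch_b, num_hashes=50):
    """Estimate Jaccard similarity from two bottom sketches.

    Take the num_hashes smallest values of the merged sketches and
    count how many of them appear in both inputs.
    """
    merged = sorted(set(sketch_a) | set(sketch_b))[:num_hashes]
    if not merged:
        return 0.0
    shared = set(sketch_a) & set(sketch_b)
    matches = sum(1 for h in merged if h in shared)
    return matches / len(merged)


if __name__ == "__main__":
    # Toy sequences, invented for this example.
    genome = "ACGTTGCAAGCTTAGGCTAACGTTAGCCGATAGCTAGGATCC"
    variant = "ACGTTGCAAGCTTAGGCTAACGTTAGCCGATAGCTAGGTTCC"
    print(estimate_jaccard(bottom_sketch(genome), bottom_sketch(variant)))
```

Real tools use much longer k-mers and larger sketches, and a search such as sbt gather compares a query against an index of many signatures instead of a single pair, but the core comparison is the same kind of estimate.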