Mikael Huss gives two examples of how Hadoop could be used in the genomics space:
- Statistical associations between various kinds of data vectors: clinical, environmental, molecular, microbial… This is more or less a batch-processing problem and thus well suited to Hadoop.
- Metagenomics: This calls for smart use of proper data structures for indexing and querying, and as much parallelization as possible, very likely in some Hadoopy kind of way.
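To make the first example concrete, here is a minimal sketch of how a statistical-association job could be phrased in map/reduce terms. It is plain Python that imitates the map, shuffle, and reduce phases in memory. It does not use any Hadoop API, and it is not taken from Huss's post. The function names (`map_sample`, `reduce_pair`, `run_job`), the choice of Pearson correlation as the association measure, and the dictionary-per-sample input layout are all assumptions made for illustration.

```python
from collections import defaultdict
from itertools import combinations
from math import sqrt


def map_sample(sample):
    """Map step (hypothetical): for one sample, emit a record per feature pair.

    `sample` is assumed to be a dict mapping feature name -> numeric value.
    """
    for a, b in combinations(sorted(sample), 2):
        yield (a, b), (sample[a], sample[b])


def reduce_pair(values):
    """Reduce step (hypothetical): Pearson correlation from running sums.

    Returns None when either feature has zero variance.
    """
    n = sx = sy = sxx = syy = sxy = 0.0
    for x, y in values:
        n += 1
        sx += x
        sy += y
        sxx += x * x
        syy += y * y
        sxy += x * y
    cov = sxy - sx * sy / n
    var_x = sxx - sx * sx / n
    var_y = syy - sy * sy / n
    if var_x <= 0 or var_y <= 0:
        return None
    return cov / sqrt(var_x * var_y)


def run_job(samples):
    """In-memory stand-in for the shuffle: group mapped records by key, then reduce."""
    grouped = defaultdict(list)
    for sample in samples:
        for key, value in map_sample(sample):
            grouped[key].append(value)
    return {key: reduce_pair(values) for key, values in grouped.items()}


if __name__ == "__main__":
    # Made-up toy values; the feature names are placeholders, not real data.
    toy_samples = [
        {"clinical": 1.0, "molecular": 2.0, "microbial": 3.0},
        {"clinical": 2.0, "molecular": 4.0, "microbial": 2.0},
        {"clinical": 3.0, "molecular": 6.0, "microbial": 1.0},
    ]
    for pair, r in sorted(run_job(toy_samples).items()):
        print(pair, r)
```

Each sample is processed independently in the map step and each feature pair independently in the reduce step, which is the property that makes this kind of workload a batch-processing fit. On a real cluster the same two functions would be wrapped in whatever job framework is in use.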
Original title and link: What Hadoop Can Do for Genomics? (NoSQL database©myNoSQL)
