Indian Institute of Science, Bengaluru

  •  Indian Institute of Science
    CV Raman Rd, Bengaluru
    Karnataka - 560012, India
  •  +91 80 2293 2228
  •   Website

Principal Investigator: Y. Narahari

Co-Investigators: Yogesh Simmhan and Arun Kumar

Contributors: Chirag Jain

Role of the Institution in the GenomeIndia Project: Developing novel algorithms based on big data analytics for compression and decompression of Whole Genome Sequence (WGS) datasets for efficient data storage and transfer.

Accomplishments and Outcomes

We have developed pipelines, based on advanced bioinformatics algorithms, for seamless and guaranteed lossless compression and decompression of GenomeIndia uBAM datasets. The pipelines use parallel optimizations to achieve a 5x reduction in size (from ~50 GB to ~5 GB per sequence), which saves on storage and transfer costs, with a parallelized runtime of 120 minutes.
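
As an illustration only, the idea of a parallel, verifiably lossless round trip can be sketched in a few lines of Python. This is not the GenomeIndia pipeline: it substitutes the general-purpose zlib codec for the project's own algorithms, and the chunk size, worker count, and all function names (`split_chunks`, `compress_parallel`, `decompress_parallel`, `roundtrip_report`) are hypothetical choices made for the example. It splits the input into fixed-size chunks, compresses them concurrently, decompresses them again, and confirms via SHA-256 that the restored bytes match the original.

```python
import hashlib
import zlib
from concurrent.futures import ThreadPoolExecutor

# Hypothetical chunk size (1 MiB); a real pipeline would tune this.
CHUNK_SIZE = 1 << 20


def split_chunks(data: bytes, size: int = CHUNK_SIZE) -> list[bytes]:
    """Cut the input into consecutive fixed-size chunks."""
    return [data[i:i + size] for i in range(0, len(data), size)]


def compress_parallel(data: bytes, workers: int = 4) -> list[bytes]:
    """Compress each chunk independently; zlib stands in for the real codec."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so blocks stay in sequence.
        return list(pool.map(zlib.compress, split_chunks(data)))


def decompress_parallel(blocks: list[bytes], workers: int = 4) -> bytes:
    """Decompress the blocks concurrently and rejoin them in order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return b"".join(pool.map(zlib.decompress, blocks))


def roundtrip_report(data: bytes) -> dict:
    """Compress, decompress, and check that the round trip is lossless."""
    blocks = compress_parallel(data)
    restored = decompress_parallel(blocks)
    compressed_size = sum(len(b) for b in blocks)
    return {
        # Matching digests mean the restored data is byte-identical.
        "lossless": hashlib.sha256(restored).digest()
        == hashlib.sha256(data).digest(),
        "original_bytes": len(data),
        "compressed_bytes": compressed_size,
        # Ratio = original size / compressed size (undefined for empty input).
        "ratio": len(data) / compressed_size if compressed_size else float("nan"),
    }


if __name__ == "__main__":
    sample = b"ACGTTGCA" * 250_000  # synthetic stand-in, not real uBAM data
    print(roundtrip_report(sample))
```

Compressing chunks independently is what allows the work to be spread across workers; the checksum comparison is the step that would back a "guaranteed lossless" claim in any such pipeline.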