Butler enables rapid cloud-based analysis of thousands of human genomes.
journal contributionposted on 2020-04-17, 11:18 authored by Sergei Yakneen, Sebastian M Waszak, PCAWG Technical Working Group, Michael Gertz, Jan O Korbel, PCAWG Consortium
We present Butler, a computational tool that facilitates large-scale genomic analyses on public and academic clouds. Butler includes innovative anomaly detection and self-healing functions that improve the efficiency of data processing and analysis by 43% compared with current approaches. Butler enabled processing of a 725-terabyte cancer genome dataset from the Pan-Cancer Analysis of Whole Genomes (PCAWG) project in a time-efficient and uniform manner.