→ SlidesThe amount of raw genomics data is continuously growing with some estimating that the amount of data world wide is on the order of Exabytes. Processing such mountains of FASTQs into science ready formats like VCFs, expression matrices, etc is no trivial task and requires workflow architectures that can scale in both performance and cost efficiency. The cloud offers practically unlimited compute capacity, elasticity, and flexibility to process enormous amounts of genomics data cost effectively and on-demand. In this talk, we’ll highlight the core patterns, architectures, and tooling used by many genomics customers who are leveraging the cloud to tackle their biggest genomics data processing challenges.
Sponsorship:Amazon Web Services is a
Gold Level sponsor of BCC2020.
Lee Pang will also be giving this talk talk during
BCC West. AWS is
used in the research behind several presentations at BCC2020.