Live webinar

Population genomics is a data management problem

 

November 4, 2021  |  10:00 ΑM EDT
The webinar has been completed but is available on-demand. Provide your details to access the recording.

Watch on-demand

Join us to learn how TileDB is changing the population genomics landscape with its unprecedented scalability

Population genomics is an important and challenging problem plagued by non-scalable domain-specific formats, which make it difficult to efficiently store, access, share and analyze massive amounts of variant-call data at the scale required for gaining meaningful insights. TileDB addresses this challenge with a universal database that stores variant-call data as multi-dimensional arrays that can be updated, governed and analyzed at unprecedented scale and low cost.

What you will learn

In this comprehensive presentation of TileDB’s population genomics solution, TileDB-VCF, you will learn how to:

  • Model genomic variants as a 3D sparse array
  • Efficiently update variant datasets, solving the N+1 problem
  • Ingest huge collections of VCF samples in parallel on TileDB Cloud
  • Export to VCF for full compatibility with existing tools
  • Share access to TB of variant datasets avoiding file downloads
  • Implement scalable genome-wide analyses using serverless compute
  • Enable reproducible science and collaboration through code and data sharing

Stavros, the TileDB CEO, will explain the important data management problems with VCF data drawing on his academic and research experiences. He will be joined by guest speaker, Dr. Stephen Kingsmore, President and CEO of Rady Children's Institute for Genomics who will share insights about genome-informed inpatient pediatric care. Aaron Wolen, Senior Software Engineer, will walk through code examples and answer your questions.


Prior to attending this webinar, it is highly recommended to review this deep dive into TileDB Embedded webinar on the open-source storage engine that powers TileDB-VCF. We also suggest that you sign-up for TileDB Cloud (you will get $10 of free credits, no credit card needed) if you’d like to run the examples shown in the webinar and get access to public datasets such as the NY Genome Center 1000 Genomes dataset.

Speakers

Stavros Papadopoulos

Stavros Papadopoulos

Founder and CEO, TileDB

Stephen Kingsmore

Dr. Stephen Kingsmore, MD, DSc.

                  President and CEO,                 Rady Children's Institute of Genomic Medicine

 

Aaron Wolen

Aaron Wolen

Senior Software Engineer, TileDB