Learn how Intel and Broad Institute are collaborating to advance performance, scale, and ease of deployment for computational genomics and analytics through tools like the Genome Analysis Toolkit* (GATK*) and Intel® Select Solutions.
Intel and Broad Institute have collaborated on computing infrastructure and software optimization for years. In 2017, they launched a new effort: the Intel-Broad Center for Genomic Data Engineering, a five-year collaboration between the two organizations to simplify and accelerate genomics workflow execution using GATK, Burrows-Wheeler Aligner (BWA), Cromwell, Intel® Genomics Kernel Library (Intel® GKL), GenomicsDB*, and other tools and techniques. Together, experts from Broad Institute and Intel will build, optimize, and widely share tools and infrastructure to help scientists integrate and process genomic data. The result will be a growing set of optimized best practices in hardware and software for genomics analytics on Intel® architecture–based platforms that can be applied to research data sets stored in private data centers and that will extend to private, public, and hybrid clouds.
With the massive growth of genomics data, the collaboration focuses on technology that enables genomics analytics at scale. It has already resulted in Intel® Select Solutions for Genomics Analytics: a suite of optimized software and reference architectures for turnkey configuration, setup, and deployment of genomics analysis systems, qualified for GATK pipelines, Cromwell, and GenomicsDB.
The introduction of Intel® Select Solutions for Genomics Analytics makes it easier to run genomics workloads and speeds the deployment of predictable clusters designed for genomics analytics. As a result, many integrators of high-performance systems have partnered with Intel and now offer design and deployment of solutions that meet the needs of their customers in the genomics community.
Genomics science is critical to the understanding of disease and the creation of diagnostic tools and safe and effective therapies. Genomics data and analytics are quickly advancing as researchers use technology to build massive genomics data repositories and come to understand the power of that data. Broad Institute is one of the largest contributors of genomics data in the world, and its GATK software is the world’s leading genome analysis tool for analytics and variant call research. The Intel-Broad Center for Genomic Data Engineering brings together science and technology to optimize genomics analytics codes and workflows and to define an optimized infrastructure, Intel® Select Solutions for Genomics Analytics, to run those workloads. The results enable faster analysis and faster deployment of hardware solutions tailored to genomics analysis. Several system integrators already offer services to install such systems, which will continue to enable further discoveries through genetics.