How Cloud and Big Data are Impacting the Human Genome -- Touching 7 Billion Lives - Forbes 2012-08-20


“Mapping the ‘blueprint for building a person’ is no small undertaking. While the Human Genome Project formally began in 1990 and was completed in 2003, researchers continue to study the role of genes and proteins in building life. The discovery of DNA is considered by some to be ‘the most important biological work of the last 100 years,’ and perhaps ‘the scientific frontier for the next 100.’ Little did Watson and Crick know that when they first suggested the double helix structure for DNA, having one’s personal genome mapped could later become an affordable diagnostic test to personalize medical treatment for all. At least that is the vision of genomics experts like DNAnexus CEO and co-Founder Andreas Sundquist. And, according to Dr. Sundquist, ‘DNAnexus would not exist if not for the cloud.’ Founded in 2009, DNAnexus is a Silicon Valley start-up who early on recognized how cloud computing could provide the ideal solution for scalable compute power and data storage to ‘power the genomics revolution.‘ Many genomics researchers today agree that cloud computing is the only compute model that can provide the elastic scale needed for DNA sequencing, whose rate of technology advancement could now exceed Moore’s Law. For example, it took 13 years to map the first genome, at a cost of several billion dollars. In less than 10 years, the time and cost of DNA sequencing of another genome was reduced by a factor of 1 million.  And today, your or mypersonal genome could be mapped in just a few days for a few thousand dollars. Personal genomics is a key enabler for predictive medicine, for which a patient’s genetic profile can be used to determine the most appropriate medical treatment. In 2-3 more years, mapping a personal genome could cost as little as a few hundred dollars, or who knows? In terms of size, a personal genome has about 100 gigabytes of data, or the equivalent of 102,400 photos. A million genomes would therefore constitute hundreds of petabytes of data... Today’s 1000 Genomes Project was launched by a public/private consortium to build out the most detailed map of human genetic variation available. This project reposits the largest central database of genomic data publically available. The database is managed by the National Center for Biotechnology Information (NCBI), who has recently made it more broadly available via Amazon Web Services. The cloud’s nearly infinite scale will address any future need for continuous data storage. DNAnexus recognized early on that computation and data storage – not sequencing technology – would become the limiting factor to continue innovating beyond the initial Human Genome Project at such a fast pace. So DNAnexus built a low-cost web interface to provide their state-of-the-art genomic analysis and visualization tools for DNA sequencing as a service, with automatic data upload to scalable infrastructure powered by AWS. Using DNAnexus solutions in the cloud, biologists can now push a button to immediately access sophisticated tools and infrastructure, without having to wait for expert bioinformatics colleagues to assist with complex computational projects. And, researchers all over the world can collaborate and share data with ease because the data is instantly accessible from one central location. Many clinical researchers and biopharmaceutical partners can now collaborate through DNAnexus per its ability to make both tools and data sets available via the cloud. Perhaps the greatest ‘cloud power’ possible but not yet fully tapped will be the ability to offer a) Platform as a Service for broad development of new analysis and visualization tools, and b) Software as a Service for use of those tools with shared data sets in a secure, collaborative workspace. Dr. Sundquist likens the opportunity to a ‘genomics version’ of an iPhone App Store, from which hundreds of specialty applications can be developed by technologists all over the world and instantly downloaded from one central place. In fact, an entirely new genomics industry could be made possible by cloud computing, which will transform medicine and life sciences to benefit the world’s seven billion people. One of the most intriguing insights from Dr. Sundquist was that, while DNAnexus is pioneering a field that is heavily dependent upon IT infrastructure, he employs not a single IT person. Instead, the role of IT is ‘subsumed by the software developers,’ who are also developing the core product. And, the IT infrastructure management is owned by the cloud services provider, who can manage it more cost-effectively. So the savings for DNAnexus in product time-to-market rather then IT time-to-build is months – and for a start-up funded by venture capital partners, achieving that kind of capital efficiency is a make-or-break proposition... For DNAnexus, the IT savings means Dr. Sundquist can hire another ‘star developer’ who might invent something new to further revolutionize medicine. With such transformative technology made possible and even accelerated by the cloud – the result will be better health for all of us!”



08/16/2012, 06:08

From feeds:

Open Access Tracking Project (OATP) »


oa.medicine oa.biology oa.business_models oa.comment oa.usa oa.costs oa.funding oa.1000genomes oa.biomedicine oa.ncbi



Date tagged:

08/20/2012, 18:13

Date published:

04/19/2012, 16:09