(Page 2 of 2)
When I spoke to Arnold to get an update on the company last week, he made it sound like Geospiza is trying to transform this product from a “nice-to-have” into a “must-have.”
“We can generate an analyzed data set for about $2,500 that would otherwise cost $25,000 if you do it yourself,” Arnold says. “What we can do in a matter of hours, would otherwise take months.”
How do they achieve that savings? Scientists have to start by taking a deep breath and letting somebody else host their precious data. But when they do, this frees them up from the expensive and daunting task of hiring their own in-house bioinformatics guru to take care of all the data on the lab’s own servers, Arnold says.
Cost is a key part of the Geospiza pitch, but it is also benefitting from the trend toward better, faster, cheaper gene sequencing. Once the price for DNA sequencing drops to a certain point, it may be common for researchers to want the whole 3-billion-letter string of DNA from, say, each individual who enrolls in a clinical trial. It is now estimated that existing sequencing equipment around the world has enough capacity to sequence 500,000 entire individual genomes in the next three to five years. Right now, it is still thought that fewer than 100 genomes have been sequenced.
“It’s pretty mind-boggling when you think about it,” Arnold says. All that rapid sequencing is going to create enormous haystacks of data that will be increasingly hard to pull the needle out of, he says.
Software, of course, isn’t some kind of magic bullet for this data problem. Human beings still need time to sort through, analyze, and study the data to make use of it, Arnold says. And while all this exponential data is being produced, we humans are falling farther behind. U.K.-based biophysicist Cameron Neylon made an important point about this a couple weeks ago during a talk about open-source science at Microsoft. He showed a slide which pointed out that the average capacity of the human mind isn’t keeping up with all this data, and that we as individual humans don’t “scale up” to process all of this data.
“Researchers are overwhelmed,” Arnold says.
There’s no one company that dominates this world of software for biological data, either. Microsoft has taken a crack at this with its Amalga Life Sciences program, which now incorporates assets it acquired from Merck’s Rosetta Biosoftware operation in Seattle. Victoria, BC-based Genologics, a company backed by Kirkland, WA-based OVP Venture Partners, overlaps some with Geospiza, although it has a broader strategy of stitching together basic genomic data with other health records. Bridgewater, NJ-based LabVantage makes some competitive laboratory software, as does St. Louis-based Partek. Geospiza has tried to build its competitive edge around being the only one to capture the genomic data and combine it with analytical capabilities, Arnold says.
It’s still very early days to see where this is all going. Over the coming years, researchers are going to have to learn to work in bigger collaborative teams to crunch all the genomic data, Arnold says. The ones who thrive will have high “IQ and EQ,” Arnold says, referring to not just brainpower, but people skills. Companies are going to have to work out standards across many of the existing proprietary “stovepipes” that make it hard to get consistent formats on data for things like whole genome sequences, transcriptomes, and other biological data points, Arnold says. It’s going to take a lot of collaboration among scientists, and companies, to tease out the most meaningful data to get close to that ultimate goal of personalized medicine.
“No one company is going to dominate this field,” Arnold says.
By posting a comment, you agree to our terms and conditions.