Early last year, three researchers set out to create one genetic data set to rule them all. The trio wanted to assemble the worldโs most comprehensive catalogue of human genetic variation, a single reference database that would be useful to researchers hunting rare disease-causing genetic variants.
Unlike past โbig dataโ projects, which have involved large groups of scientists, this one deliberately kept itself small, deploying just five analysts. Nearly two years in, it has identified about 50ย million genetic variants โ points at which one personโs DNA differs from anotherโs โ in whole-genome sequence data collected by 23ย other research collaborations. The group, called the Haplotype Reference Consortium, will unveil its database in San Diego, California, on 20ย October, at the annual meeting of the American Society of Human Genetics.
Geneticists have not always been so willing to share data. But that seems to be changing. โItโs been surprisingly easy to bring all these data sets together,โ says Jonathan Marchini, a statistical geneticist at the University of Oxford, UK, and one of the consortiumโs leaders. โThere is a lot of goodwill between the people in the field; they all understand the benefits of doing this and have worked hard to make their data available.โ
Read full, original article: Giant gene banks take on disease















