Building a genomic encyclopedia of life

Centrum securitatis Tree of life

The total number of species on Earth is estimated at 100,000 to 10 million. Despite the wide variation in estimates here, one thing scientists agree on is that it is a Very Big Number. Of the 100,000 to 10 million species that exist, a mere 19,000 or so have complete genomes sequenced.

On the one hand, given the relative novelty of sequencing technology, 19,000 is pretty impressive. On the other, it pales in comparison to the amount of sequencing needed if we’re to assemble a comprehensive, genomic version of  the “Tree of Life”.

The Scientist’s Anna Azvolinsky reports:

Large-scale sequencing consortia have been churning out data at an impressive rate, yet significant gaps remain in the genomic tree of life. And while these groups have largely been working independent of one another, together they might address more far-reaching questions, such as how life has evolved, how it currently functions, and how it might look down the line.

We’re at a turning point, Azvolinsky argues, where a truly cooperative approach to genomics will make or break efforts to take the independent work of many consortia and create a global database.

One of the biggest problems here isn’t necessarily the independent, competitive approach of genomics groups thus far, but the independent, competitive nature of funding for genomics research. She writes:

 

“We thought there was a turning point three years ago,” said Nikos Kyrpides, who heads up the microbial genomics and metagenomics program at the JGI. “The community believed that more funding agencies would begin to support microbial sequencing studies, not just for public health and industry applications, but to cover the reference genomes of the phylogenetic tree.”

Unfortunately, this turning point failed to materialize.

Part of the problem is that many government and private funding agencies are most interested in supporting scientists sequencing the genomes of species that impact human health, industry concerns, and environmental issues.

“Many times it is easier to receive funding to sequence species important for agriculture, for example. These types of projects go faster through the pipeline because there is more funding from governments or companies interesting in funding directed efforts,” said Toni Gabaldon, the head of bioinformatics and genomics at the Center for Genomic Regulation in Barcelona, Spain. One issue, said Kyrpides, is that funding agencies don’t often work together, and it typically takes more than a single funding body to support broad, encyclopedic sequencing efforts. “We are pushing for funding agencies to change: to stop delineating projects by application, and to work together.”

Other challenges to a genomic encyclopedia of life include the sheer volume of genetic data that needs to be properly analyzed and sorted. As such, bioinformatics and computational science are increasingly vital disciplines alongside traditional biology, taxonomy, and genetics. “Even within a single consortium’s database,” Azvolinsky writes, “as the number of genomic sequences increases from tens to many hundreds, scaling the storage and analytical tools has been a challenge.”

New groups are appearing whose goal isn’t to add to the grand sequencing efforts but to help coordinate the sequencing that already exists. One such group is iPlant, which is focused on making “cloud computing, data storage, and genomic analysis tools available to the broader plant community.”

One aspect that Azvolinsky doesn’t get into is how genomics is upending traditional taxonomy, in many cases revealing several species where once we saw one. Her article is based on the best understanding of the tree of life currently available, but it’s certain that our map of the tree will change as we incorporate more and more genomic data of ever-increasing sophistication.

Co-operation is going to be necessary, even if — according to scientist behind iPlant — it’s only just starting to evolve within the genomics community. Not only are there at least 81,000 sequences left to collect, analyze, and map against our current understanding of the tree of life, but in doing so biologists will likely need to redraw significant portions of the tree.

Read Anna Azvolinsky’s original article at The Scientist: “Sequencing the Tree of Life

Kenrick Vezina is Gene-ius Editor for the Genetic Literacy Project and a freelance science writer, educator, and naturalist based in the Greater Boston area.

Additional Resources:

 

 

{{ reviewsTotal }}{{ options.labels.singularReviewCountLabel }}
{{ reviewsTotal }}{{ options.labels.pluralReviewCountLabel }}
{{ options.labels.newReviewButton }}
{{ userData.canReview.message }}

Related Articles

Infographic: Global regulatory and health research agencies on whether glyphosate causes cancer

Infographic: Global regulatory and health research agencies on whether glyphosate causes cancer

Does glyphosate—the world's most heavily-used herbicide—pose serious harm to humans? Is it carcinogenic? Those issues are of both legal and ...

Most Popular

ChatGPT-Image-Feb-16-2026-01_04_32-PM
Raw milk myth wake-up call
Screenshot-2026-04-13-at-3.54.04-PM
AI disinformation stress test: Challenges and response strategies
ChatGPT-Image-Apr-23-2026-09_20_20-PM
Kennedy’s CDC blocks publication of study that shows vaccines reduce hospitalizations by 50%, then misrepresents why
ChatGPT-Image-Apr-20-2026-11_17_18-AM-2
10,000 scientists gone: Trump’s cuts create an unprecedented brain drain
ChatGPT-Image-Apr-22-2026-04_31_20-PM
‘Irresponsible decision’? On mandatory military flu shots, Hegseth chooses ‘freedom’ over health
ChatGPT-Image-Mar-11-2026-11_58_46-AM
The Trump administration has run out more than 4,000 National Institutes of Health employees. Here are the consequences
Screenshot-2026-04-12-135256
Bixonimania: The fake disease scam that AI swallowed whole
Screenshot-2026-04-22-at-1.14.34-PM
Latest fevered, right-wing conspiracy: Fox, New York Post, and kooky GOP legislators push ‘Dead Scientists’ scare
images
The never-ending GMO debate: Pros and cons
ChatGPT-Image-Mar-2-2026-03_22_54-PM
Why ‘support supplements’ for GLP-1 users are mostly a waste of money
glp menu logo outlined

Get news on human & agricultural genetics and biotechnology delivered to your inbox.