Missing Genes: How Eurocentric Bias in Human Gene Maps Affects Global Health (2025)

Imagine a world where the very blueprint of humanity is incomplete, leaving out vast swaths of our genetic heritage—now that's a startling reality we're facing with human gene maps. But here's where it gets controversial: these maps, built mostly from European DNA, are riddled with blind spots that could be hiding crucial insights into diseases and health disparities across the globe. Let's dive deeper into this groundbreaking study published in Nature Communications, and uncover why it matters for everyone.

Human gene maps reveal extensive omissions in representation from non-European populations, exposing major shortcomings due to their heavy reliance on DNA from individuals of European descent. This revelation comes from a fresh study that highlights thousands of absent transcripts—those vital RNA molecules that ferry a gene's directives—particularly in people from African, Asian, and American ancestries. And this is the part most people miss: some of these overlooked transcripts might stem from genes scientists haven't even identified yet, potentially encoding entirely new proteins.

Intriguingly, many of these transcripts tie into genes associated with conditions that vary by ancestry, such as lupus, rheumatoid arthritis, asthma, and traits related to cholesterol levels. This suggests that disparities in disease prevalence or behavior among different groups could stem from how their genes generate alternative transcripts and proteins via mechanisms like splicing—a process where cells edit and reassemble genetic information to create varied outcomes. These subtle molecular differences have been virtually undetectable in our current gene maps, obscuring possibly vital clues about disease risks and genetic influences.

"Scientists rely on gene maps daily, yet we've excluded enormous portions of the global population. For the first time, this research demonstrates the scale of what we've overlooked," notes Pau Clavell-Revelles, the lead author from the Barcelona Supercomputing Center (BSC) and Centre for Genomic Regulation (CRG). It's a wake-up call that underscores how Eurocentric our genetic databases have been.

The historical roots of Eurocentric genetics trace back to monumental achievements like the initial human genome draft in 2001. While revolutionary, it merely provided the DNA sequence without mapping gene locations, quantities, or the intricate ways a single gene can yield multiple proteins through splicing. To bridge this gap, comprehensive gene annotation maps were developed—think of them as detailed directories pinpointing every gene's spot and all its RNA transcripts. Initiatives like GENCODE transformed the genome's three billion base pairs into actionable knowledge, aiding researchers in pinpointing disease drivers and understanding genetic variances among people.

But here's the catch: these maps carry an inherited bias. Despite humans sharing 99.9% identical DNA, that tiny fraction captures evolutionary divergences shaped by millennia of separation, environmental adaptations, and random genetic drift. Certain populations have evolved unique variants over thousands of years, yet these distinctions remain inadequately recorded. The standard human genome reference, along with subsequent annotations, predominantly drew from European samples, sidelining biology specific to Africa, Asia, Oceania, and the Americas.

Consequently, our grasp of cellular gene utilization is skewed toward a limited demographic, rendering key transcripts and disease indicators invisible. "The bulk of sequencing data has been from Europeans, leading to reference catalogs that omit genes or transcripts unique to non-European groups," explains Dr. Roderic Guigó, a senior co-author affiliated with the Centre for Genomic Regulation and University Pompeu Fabra in Barcelona. "When a variant resides in one of these missing genes, we might wrongly dismiss its biological impact. In reality, that could be a mistake," he adds. This raises a provocative question: are we underestimating genetic effects in diverse populations, and what ethical implications does that have for global health equity?

Advancements in long-read RNA sequencing have begun peeling back these layers of hidden biology. Transcripts, the RNA messengers showing gene activity within cells, were targeted using this cutting-edge tech that sequences entire molecules end-to-end—unlike older methods that only grabbed fragments, causing reconstruction puzzles. The team analyzed blood cells from 43 participants across eight diverse groups: Yoruba from Nigeria, Luhya from Kenya, Mbuti from Congo, Han Chinese, Indian Telugu, Peruvians in Lima, Ashkenazi Jewish, and Utah Europeans. These groups overlap with the 1000 Genomes Project, enabling direct RNA-DNA comparisons.

Their analysis unveiled 41,000 transcripts absent from official GENCODE maps. Of those from known protein-coding genes, 41% likely produce novel protein versions. Picture this: thousands of protein variations previously unmapped, like in the SUB1 gene, crucial for DNA repair. Peruvian participants, for instance, generate a unique SUB1 transcript altering the protein output, yet it was nowhere in existing annotations. When sorted by ancestry, non-European samples showed far more novel transcripts—2,267 population-specific ones, mostly new outside Europe. Additionally, 773 stemmed from unrecognized gene regions, hinting at undiscovered genes.

To push boundaries further, the researchers tested personalized DNA references instead of the universal one. This approach unearthed hundreds more transcripts per person, especially in African ancestries, proving that a one-size-fits-all genome masks real biological diversity. It's a bold counterpoint: relying on a single reference might not just be convenient—it's potentially misleading for personalized medicine.

Why do these absent transcripts carry such weight? The study explored allele-specific transcript usage, where maternal and paternal gene copies yield different RNAs, affecting function. Detecting these requires complete transcript catalogs; omissions mean invisible effects. Incorporating the new findings boosted detection of ancestry-influenced genetic behaviors, particularly in non-European groups.

"Numerous novel transcripts biased by ancestry appear in genes linked to autoimmune diseases, asthma, and metabolic conditions," states Dr. Marta Melé, another senior co-author and group leader at BSC. She clarifies that these don't directly cause disparities but illuminate previously concealed genetic signals, explaining why diseases manifest differently or more frequently in certain populations. Without them, we'd miss critical data on health inequalities—think of it as piecing together a puzzle where key pieces from non-European ancestries were discarded.

Aiming toward a comprehensive human 'pantranscriptome,' the team acknowledges this as an initial stride with limitations: only blood cells from 43 people, excluding many global regions and complex organs like the brain or heart. Yet, even within this narrow scope, they unearthed tens of thousands of missing transcripts. "The modest scale and the volume of discoveries highlight how this is merely the iceberg's tip," remarks Dr. Fairlie Reese, a postdoctoral researcher at BSC. The authors advocate shifting to inclusive mapping, building on efforts like the Human Pangenome Project, which expands DNA diversity.

DNA is the blueprint, but the pantranscriptome—the full RNA catalog across tissues, life stages, and populations—reveals execution. "The pangenome is the instruction book; the pantranscriptome highlights the active words in each cell. Both are vital for grasping human diversity," Dr. Melé explains. Constructing this demands massive resources—their study generated over 10 terabytes of data and 800 million sequences, processed via machine learning on BSC's MareNostrum5 supercomputer. Scaling to hundreds of tissues and thousands of people requires unprecedented collaboration.

But is the effort worthwhile? Absolutely, they argue. "This work lays groundwork and invites global scientists to share data, methods, and diverse samples. Collective action alone can deliver a complete, equitable human biology map, crucial for unbiased genomic medicine," Dr. Melé concludes.

Related Stories:
- Eli Lilly’s Adverum move: The future of ocular gene therapy at the 4th Ophthalmic Drug Delivery Summit (https://www.news-medical.net/news/20251126/Eli-Lillye28099s-Adverum-move-The-future-of-ocular-gene-therapy-at-the-4th-Ophthalmic-Drug-Delivery-Summit.aspx)
- Multiple protein forms from a single gene offer fresh insight into rare disease mechanisms (https://www.news-medical.net/news/20251107/Multiple-protein-forms-from-a-single-gene-offer-fresh-insight-into-rare-disease-mechanisms.aspx)
- Scientists map the genetic control system behind FOXP3 in immune cells (https://www.news-medical.net/news/20251113/Scientists-map-the-genetic-control-system-behind-FOXP3-in-immune-cells.aspx)

Source:
Journal reference: Clavell-Revelles et al. (2023). Widespread missing transcripts in human gene maps due to Eurocentric bias. Nature Communications. DOI: [insert actual DOI if available]

This study challenges us to rethink genetic research: Is prioritizing European data perpetuating inequalities, or is it just a historical artifact of resource allocation? What do you think—should funding shift toward global inclusivity, even if it slows progress in some areas? Share your views in the comments; I'd love to hear agreements, disagreements, or fresh perspectives!

Missing Genes: How Eurocentric Bias in Human Gene Maps Affects Global Health (2025)
Top Articles
Latest Posts
Recommended Articles
Article information

Author: Dan Stracke

Last Updated:

Views: 6282

Rating: 4.2 / 5 (43 voted)

Reviews: 90% of readers found this page helpful

Author information

Name: Dan Stracke

Birthday: 1992-08-25

Address: 2253 Brown Springs, East Alla, OH 38634-0309

Phone: +398735162064

Job: Investor Government Associate

Hobby: Shopping, LARPing, Scrapbooking, Surfing, Slacklining, Dance, Glassblowing

Introduction: My name is Dan Stracke, I am a homely, gleaming, glamorous, inquisitive, homely, gorgeous, light person who loves writing and wants to share my knowledge and understanding with you.