November 2024

Genomic Databases Need More Diversity

November 7, 2024    |  

It is commonly known that most genomic databases are biased toward people with European ancestry. Scientists have warned that leaving out other populations could skew results in areas such as drug development, diagnostic testing, and polygenic risk scores — which looks at many genetic variations in a person’s DNA to predict their disease risk.

(L-R) Timothy O'Connor and Victor Borda

(L-R) Timothy O'Connor and Victor Borda

Now, researchers at the Institute for Genome Sciences (IGS) at the University of Maryland School of Medicine (UMSOM) have developed a broad and deep genomic database of Latin Americans that gathers genome-wide data of Latin American populations into a single source and allows other scientists to easily add that population to their own research studies without straining budgets. Cell Genomics published their work on Oct. 31.

The researchers define Latin Americans as those with heritage from Spanish- or Portuguese-speaking countries in the Americas. In the United States, Latin Americans make up 18 percent of the population and is the fastest growing group but remains understudied in biomedical research, leading to health disparities in the population. Worldwide the Latin American population is 656 million or about 8.5 percent of the world’s population.

The Genetics of Latin American Diversity Database (GLADdb) includes genome-wide information from almost 54,000 Latin Americans representing 46 geographical regions. The new resource explores patterns of Latin American population structure and allows other scientists to match genes to their external samples through GLAD-match, a new web tool the researchers developed. GLAD-match provides a strong foundation for the human genetics community as it moves from categorizing people broadly to viewing ancestry as a continuum.

“Treating Latin Americans as one homogeneous group oversimplifies their genetic diversity and hinders efforts to improve population health and clinical treatment for many diseases,” said Timothy O’Connor, PhD, the lead author of the paper, scientist at IGS, and associate professor of medicine at UMSOM. “By promoting genomic research in Latin Americans, we are contributing to the promise of precision medicine for more people.”

The team built GLADdb and GLAD-match by pulling Latin Americans’ data from whole genome sequencing projects across the Americas, as well as from dbGaP — the database of Genotypes and Phenotypes — developed by the National Human Genome Research Institute (NHGRI) to archive and distribute data and results from genotype-phenotype studies in humans. Most genomic studies funded by NHGRI are required to submit their data to dbGaP upon publication of the results.

“Since we define ‘heritage’ as including culture, geography, and genetics, one of the most interesting parts of this research is that we were able to explore the distant genetic relatedness among Latin American countries through population structure and migration patterns,” said Victor Borda, PhD, corresponding author on the paper and an IGS research associate. “For example, in the early 1900s, social and economic problems caused a large migration out of Puerto Rico. We found that those who migrated to Hawaii were recognized as countryside people who traditionally farmed, while those who left Puerto Rico for New York represented a cross-section of economic and social classes.”

Since Latin Americans represent only about 0.38 percent of participants in genome-wide association studies (GWAS), the researchers say that GLADdb should contribute significantly to the understanding of Latin American population genetics and genetic epidemiology when comparing external samples to those within the database.

“Our lack of knowledge of Latin American genetic diversity and environmental factors influencing the health has limited medicine’s understanding of complex traits in this population,” O’Connor added. “Our hope is that other scientists use GLADdb and GLAD-match in their research to help deepen our understanding and lessen disparities in the Latin American population during the evolving age of precision medicine.”

About the Institute for Genome Sciences

The Institute for Genome Sciences (IGS) has been part of the University of Maryland School of Medicine (UMSOM) since 2007. IGS scientists work in diverse areas, applying genomics and systems biology approaches to better understand health issues in premature infants, women, and transgender people; to improve vaccine development; to study evolutionary biology; and to understand cancer, parasitic, fungal, and infectious diseases, as well as identifying the underpinnings of aging, brain development, addiction, and mental health. IGS also remains at the forefront of high-throughput genomic technologies and bioinformatics analyses through its core facility, Maryland Genomics, which provides researchers around the world with cutting-edge, collaborative, and cost-effective sequencing and analysis.

About the University of Maryland School of Medicine

Now in its third century, the University of Maryland School of Medicine was chartered in 1807 as the first public medical school in the United States. It continues today as one of the fastest growing, top-tier biomedical research enterprises in the world — with 46 academic departments, centers, institutes, and programs, and a faculty of more than 3,000 physicians, scientists, and allied health professionals, including members of the National Academy of Medicine and the National Academy of Sciences, and a distinguished two-time winner of the Albert E. Lasker Award in Medical Research. With an operating budget of more than $1.2 billion, the School of Medicine works closely in partnership with the University of Maryland Medical Center and Medical System to provide research-intensive, academic, and clinically based care for nearly 2 million patients each year. The School of Medicine has more than $500 million in extramural funding, with most of its academic departments highly ranked among all medical schools in the nation in research funding. As one of the seven professional schools that make up the University of Maryland, Baltimore campus, the School of Medicine has a total population of nearly 9,000 faculty and staff, including 2,500 students, trainees, residents, and fellows. The School of Medicine, which ranks as the eighth highest among public medical schools in research productivity (according to the Association of American Medical Colleges profile) is an innovator in translational medicine, with 606 active patents and 52 start-up companies. In the latest U.S. News & World Report ranking of the Best Medical Schools, published in 2023, the UM School of Medicine is ranked No. 10 among the 92 public medical schools in the U.S., and in the top 16 percent (No. 32) of all 192 public and private U.S. medical schools. The School of Medicine works locally, nationally, and globally, with research and treatment facilities in 36 countries around the world. Visit medschool.umaryland.edu.