Developing a drug repurposing knowledge graph can guide COVID-19 efforts

The fight against COVID-19 has provided a host of challenges to experts in varying fields to find ways that improve or redesign current systems in a way better suited for international pandemic. Among these systems is drug discovery, by which new medications and remedies are discovered via ingredient identification or serendipity. However, thanks to the power of big data and the work of a team including TDAI affiliate Xia Ning, an entirely new model for drug discovery has been developed for public use.

The team of scientists, including Ning and representatives from University of Minnesota, Hunan University, and Amazon’s AWS AI laboratories, have developed what is called the Drug Repurposing Knowledge Graph (DRKG). Drug-repurposing is a drug discovery paradigm that uses existing drugs for new therapeutic indications. It has the advantages of significantly reducing the time and cost compared to de novo drug discovery, and due to its efficient design, may be of significant use in fighting the COVID-19 pandemic.

DRKG itself is a comprehensive biological knowledge graph that relates human genes, compounds, biological processes, drug side effects, diseases and symptoms. DRKG includes, curates, and normalizes information from six publicly available databases and data that were collected from recent publications related to Covid-19. It has 97,238 entities belonging to 13 types of entities, and 5,874,261 triplets belonging to 107 types of relations.

Alongside the graph, Ning and team also developed a set of machine learning tools that can be used to prioritize drugs for repurposing studies. The tools use the state-of-the-art deep graph learning methods (DGL-KE) to compute embeddings of DRKG entities and relations, and use these embeddings to predict how likely a drug can treat a disease or how likely a drug can bind to a protein associated with the disease. When tested against the human proteins associated with Covid-19, these tools identified with high scores many of the Covid-19 drug candidates that are currently under clinical trials.

Xia Ning and her collaborators have made DRKG publicly available on github along with the set of machine learning tools and pre-computed embeddings. This free infrastructure will ultimately facilitate researchers to conduct computational drug repurposing more efficiently and effectively for Covid-19 and for other diseases.


Share this page
Suggested Articles
Data analytics reveal how we decide

Can you identify the critical difference between these two scenarios?  In one, a medication has a 1 percent rate of adverse side effects in those who use it. In the...

Visualizations show health impact of patients' choices

Different views of personal information helps patients understand probabilities When it comes to conversations with patients, there are many ways for healthcare providers to give information about managing cardiovascular disease....

Moritz’s Distinguished Lecture to tackle big data governance vs. constitutional rights

Jack Balkin Moritz College of Law will host a Distinguished Lecture on Oct. 27 of one of today’s hottest topics not just in data analytics but in society: balancing the...

Panda lab’s open-source libraries give big data users high-performance speeds

In the world of big data, computer technology holds the keys to the castle—computer software that’s built to store, sort, and interpret information collected in data analytics projects. The better...

TDAI hosts new Business Data Ethics Lab in Residence

In early January, TDAI faculty affiliate Dennis Hirsch, who is Faculty Director of the Program of Data and Governance, and Christina Drummond launched a Business Data Ethics Lab in Residence...