Developing a drug repurposing knowledge graph can guide COVID-19 efforts

The fight against COVID-19 has provided a host of challenges to experts in varying fields to find ways that improve or redesign current systems in a way better suited for international pandemic. Among these systems is drug discovery, by which new medications and remedies are discovered via ingredient identification or serendipity. However, thanks to the power of big data and the work of a team including TDAI affiliate Xia Ning, an entirely new model for drug discovery has been developed for public use.

The team of scientists, including Ning and representatives from University of Minnesota, Hunan University, and Amazon’s AWS AI laboratories, have developed what is called the Drug Repurposing Knowledge Graph (DRKG). Drug-repurposing is a drug discovery paradigm that uses existing drugs for new therapeutic indications. It has the advantages of significantly reducing the time and cost compared to de novo drug discovery, and due to its efficient design, may be of significant use in fighting the COVID-19 pandemic.

DRKG itself is a comprehensive biological knowledge graph that relates human genes, compounds, biological processes, drug side effects, diseases and symptoms. DRKG includes, curates, and normalizes information from six publicly available databases and data that were collected from recent publications related to Covid-19. It has 97,238 entities belonging to 13 types of entities, and 5,874,261 triplets belonging to 107 types of relations.

Alongside the graph, Ning and team also developed a set of machine learning tools that can be used to prioritize drugs for repurposing studies. The tools use the state-of-the-art deep graph learning methods (DGL-KE) to compute embeddings of DRKG entities and relations, and use these embeddings to predict how likely a drug can treat a disease or how likely a drug can bind to a protein associated with the disease. When tested against the human proteins associated with Covid-19, these tools identified with high scores many of the Covid-19 drug candidates that are currently under clinical trials.

Xia Ning and her collaborators have made DRKG publicly available on github along with the set of machine learning tools and pre-computed embeddings. This free infrastructure will ultimately facilitate researchers to conduct computational drug repurposing more efficiently and effectively for Covid-19 and for other diseases.


Share this page
Suggested Articles
Why some friends make you feel more supported than others

It’s good to have friends and family to back you up when you need it – but it’s even better if your supporters are close with each other too, a...

To be translational, data analytics must consider data context and meaning

Data analytics can be computationally powerful and sophisticated but not be translational or have much leverage in the real world. This is a cautionary note about the dangers of big...

Study gathers first-ever measure of public support for algal bloom prevention

Scientists have found good strategies for curbing the toxic algae blooms that have threatened some of the nation’s water supplies. Farmers are willing to adopt these strategies. The American public...

Tanya Berger-Wolf named new TDAI faculty director

Randy Moses, Ohio State’s Senior Associate Vice President for Research, announced today that Tanya Berger-Wolf has joined the university as faculty director of the Translational Data Analytics Institute. Tanya Berger-Wolf...

Registration open for TDA Fall Forum Poster Session

Ohio State postdoctoral researchers, graduate and undergraduate students, faculty, and staff will present on their data analytics and decision science research at TDA’s second annual Fall Forum Research Poster Session on...