
Text-Mining Algorithm for Identifying Relevant Research on Chemicals and Disease for Public Database Developed

by Kathy Jones on April 19, 2013 at 9:35 PM

Researchers at North Carolina State University have developed a new text-mining algorithm that helps identify the most relevant scientific research for a public database documenting the effects of environmental chemicals on human health.

The Comparative Toxicogenomics Database (CTD), managed in part by the lead authors, is a manually curated, public database that correlates environmental chemicals with their effects on genes and human health. Thousands of new research papers are published each day, and finding the most relevant ones to include can be challenging. As Davis explains, "Over 33,000 scientific papers have been published on heavy metal toxicity alone, going as far back as 1926. We simply can't read and code them all. And, with the help of this new algorithm, we don't have to."

The algorithm described in the study assigns scientific articles a score based on data content, biological and toxicological relevance and several other parameters. Integrating this algorithm with the current system of manual curation helped the researchers significantly improve their process by prioritizing more relevant articles for inclusion in the database, increasing productivity by 27 percent and novel data content by 100 percent.
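
The scoring-and-prioritizing idea can be illustrated with a minimal sketch. The article does not give the actual features, weights, or formula, so everything here is an assumption made for illustration: the feature names, the weights, the `Article` class, and the use of a simple weighted sum are all hypothetical and are not the researchers' method.

```python
from dataclasses import dataclass, field

# Hypothetical weights: the article does not state the real parameters.
WEIGHTS = {"data_content": 0.5, "bio_relevance": 0.3, "tox_relevance": 0.2}


@dataclass
class Article:
    article_id: str
    # Hypothetical feature values, each assumed to lie in [0, 1].
    features: dict = field(default_factory=dict)


def score(article, weights=WEIGHTS):
    """Weighted sum of feature values; a missing feature counts as 0."""
    return sum(w * article.features.get(name, 0.0) for name, w in weights.items())


def prioritize(articles, weights=WEIGHTS):
    """Return the articles ordered from highest to lowest score."""
    return sorted(articles, key=lambda a: score(a, weights), reverse=True)


# Made-up example data, not taken from the study.
articles = [
    Article("A", {"data_content": 0.9, "bio_relevance": 0.8, "tox_relevance": 0.7}),
    Article("B", {"data_content": 0.2, "bio_relevance": 0.1}),
    Article("C", {"data_content": 0.6, "bio_relevance": 0.9, "tox_relevance": 0.4}),
]

for a in prioritize(articles):
    print(a.article_id, round(score(a), 2))
```

In a workflow like the one described, curators would read the top of such a ranked list first, so that manual effort goes to the papers most likely to contain new data.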

Only 15 percent of the papers studied were incorrectly identified by the algorithm as being highly relevant, but the researchers were able to identify the reasons for these inaccurate results. "Now, we can go back and tweak the algorithm to account for this and fine-tune the system," says Wiegers.

"We're not at the point yet where a computer can read and extract all the relevant data on its own," concludes Davis, "but having this text-mining process to direct us toward the most informative articles is a huge first step."



Source: Eurekalert