4.6 Review

Effective use of latent semantic indexing and computational linguistics in biological and biomedical applications

期刊

FRONTIERS IN PHYSIOLOGY
卷 4, 期 -, 页码 -

出版社

FRONTIERS MEDIA SA
DOI: 10.3389/fphys.2013.00008

关键词

latent semantic indexing; data mining; computational linguistics; molecular interactions; drug discovery

资金

  1. Intramural Research Program of the National Institute on Aging, National Institutes of Health

向作者/读者索取更多资源

Text mining is rapidly becoming an essential technique for the annotation and analysis of large biological data sets. Biomedical literature currently increases at a rate of several thousand papers per week, making automated information retrieval methods the only feasible method of managing this expanding corpus. With the increasing prevalence of open-access journals and constant growth of publicly-available repositories of biomedical literature, literature mining has become much more effective with respect to the extraction of biomedically-relevant data In recent years, text mining of popular databases such as MEDLINE has evolved from basic term searches to more sophisticated natural language processing techniques, indexing and retrieval methods, structural analysis and integration of literature with associated metadata. In this review, we will focus on Latent Semantic Indexing (LSI), a computational linguistics technique increasingly used for a variety of biological purposes. It is noted for its ability to consistently outperform benchmark Boolean text searches and co-occurrence models at information retrieval and its power to extract indirect relationships within a data set. LSI has been used successfully to formulate new hypotheses, generate novel connections from existing data, and validate empirical data

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据