Metabolite Named Entity Recognition: A Hybrid Approach
Conference proceedings article
Publication Details
Author list: Wutthipong Kongburan, Praisan Padungweang, Worarat Krathu, Jonathan H Chan
Publication year: 2016
Title of series: Lecture Notes in Computer Science (LNTCS, volume 9947)
Volume number: 9947
Start page: 451
End page: 460
Number of pages: 10
URL: https://link.springer.com/chapter/10.1007/978-3-319-46687-3_50
Abstract
Because manual curation is labor intensive and time consuming, it has been replaced by text mining (TM) in metabolic information extraction. Although TM in the metabolic domain has been attempted previously, it remains challenging because of the variety of domain-specific terms and their different meanings across contexts. Named Entity Recognition (NER) is generally used to identify keywords of interest (protein and metabolite terms) in a sentence, so this preliminary task strongly influences the performance of a metabolic TM framework. NER based on Conditional Random Fields (CRFs) has been actively used over the last decade because it clearly outperforms other approaches. However, an efficient CRFs-based NER depends entirely on the quality of the corpus, which is nontrivial to produce. This paper introduces a hybrid solution that combines CRFs-based NER, dictionary usage, and complementary modules (constructed from an existing corpus) to improve the performance of NER in the metabolic domain and in other similar domains.
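The abstract does not say how the CRF output and the dictionary are combined, so the following is only a minimal sketch of one plausible combination, not the authors' method. It assumes BIO-style tags, a hypothetical lexicon of lowercased token tuples, and a list of tags that stands in for the output of a trained CRF tagger (no CRF is trained here). The function names `dictionary_tags` and `merge_tags` and the label `METABOLITE` are illustrative assumptions.

```python
from typing import List, Set, Tuple


def dictionary_tags(tokens: List[str],
                    lexicon: Set[Tuple[str, ...]],
                    label: str = "METABOLITE") -> List[str]:
    """Tag the longest lexicon match at each position with BIO labels."""
    tags = ["O"] * len(tokens)
    lowered = [t.lower() for t in tokens]
    max_len = max((len(entry) for entry in lexicon), default=0)
    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            if tuple(lowered[i:i + n]) in lexicon:
                matched = n
                break
        if matched:
            tags[i] = f"B-{label}"
            for j in range(i + 1, i + matched):
                tags[j] = f"I-{label}"
            i += matched
        else:
            i += 1
    return tags


def merge_tags(model_tags: List[str], dict_tags: List[str]) -> List[str]:
    """Keep the model's tags; add a dictionary span only where the
    model left every token of that span untagged ("O")."""
    merged = list(model_tags)
    i = 0
    while i < len(dict_tags):
        if dict_tags[i].startswith("B-"):
            j = i + 1
            while j < len(dict_tags) and dict_tags[j].startswith("I-"):
                j += 1
            if all(t == "O" for t in model_tags[i:j]):
                merged[i:j] = dict_tags[i:j]
            i = j
        else:
            i += 1
    return merged


# Hypothetical example: model_tags stands in for CRF predictions.
tokens = ["Hexokinase", "phosphorylates", "glucose", "while",
          "pyruvic", "acid", "accumulates"]
lexicon = {("glucose",), ("pyruvic", "acid")}
model_tags = ["B-PROTEIN", "O", "O", "O", "O", "O", "O"]

dict_out = dictionary_tags(tokens, lexicon)
merged = merge_tags(model_tags, dict_out)
print(list(zip(tokens, merged)))
```

In this sketch the statistical tagger takes precedence and the dictionary only fills gaps. Whether that ordering matches the paper's design is an assumption, and the opposite priority would be an equally simple variant.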