Metabolite Named Entity Recognition: A Hybrid Approach

Conference proceedings article


Authors/Editors


Strategic Research Themes


Publication Details

Author listWutthipong Kongburan, Praisan Padungweang, Worarat Krathu, Jonathan H Chan

Publication year2016

Title of seriesLecture Notes in Computer Science book series (LNTCS,volume 9947)

Volume number9947

Start page451

End page460

Number of pages10

URLhttps://link.springer.com/chapter/10.1007/978-3-319-46687-3_50


Abstract

Since labor intensive and time consuming issue, manual curation in metabolic information extraction currently was replaced by text mining (TM). While TM in metabolic domain has been attempted previously, it is still challenging due to variety of specific terms and their meanings in different contexts. Named Entity Recognition (NER) generally used to identify interested keyword (protein and metabolite terms) in sentence, this preliminary task therefore highly influences the performance of metabolic TM framework. Conditional Random Fields (CRFs) NER has been actively used during a last decade, because it explicitly outperforms other approaches. However, an efficient CRFs-based NER depends purely on a quality of corpus which is a nontrivial task to produce. This paper introduced a hybrid solution which combines CRFs-based NER, dictionary usage, and complementary modules (constructed from existing corpus) in order to improve the performance of metabolic NER and another similar domain.


Keywords

No matching items found.


Last updated on 2024-23-02 at 23:05