Metabolite Named Entity Recognition: A Hybrid Approach
Conference proceedings article
ผู้เขียน/บรรณาธิการ
กลุ่มสาขาการวิจัยเชิงกลยุทธ์
รายละเอียดสำหรับงานพิมพ์
รายชื่อผู้แต่ง: Wutthipong Kongburan, Praisan Padungweang, Worarat Krathu, Jonathan H Chan
ปีที่เผยแพร่ (ค.ศ.): 2016
ชื่อชุด: Lecture Notes in Computer Science book series (LNTCS,volume 9947)
Volume number: 9947
หน้าแรก: 451
หน้าสุดท้าย: 460
จำนวนหน้า: 10
URL: https://link.springer.com/chapter/10.1007/978-3-319-46687-3_50
บทคัดย่อ
Since labor intensive and time consuming issue, manual curation in metabolic information extraction currently was replaced by text mining (TM). While TM in metabolic domain has been attempted previously, it is still challenging due to variety of specific terms and their meanings in different contexts. Named Entity Recognition (NER) generally used to identify interested keyword (protein and metabolite terms) in sentence, this preliminary task therefore highly influences the performance of metabolic TM framework. Conditional Random Fields (CRFs) NER has been actively used during a last decade, because it explicitly outperforms other approaches. However, an efficient CRFs-based NER depends purely on a quality of corpus which is a nontrivial task to produce. This paper introduced a hybrid solution which combines CRFs-based NER, dictionary usage, and complementary modules (constructed from existing corpus) in order to improve the performance of metabolic NER and another similar domain.
คำสำคัญ
ไม่พบข้อมูลที่เกี่ยวข้อง