A CONDITIONAL RANDOM FIELDS APPROACH TO BIOMEDICAL NAMED ENTITY RECOGNITION

(Full issue, priority publication) Online publication date: 2007-06-16
Named entity recognition is a fundamental task in biomedical data mining. In this letter, a named entity recognition system based on CRFs (Conditional Random Fields) for biomedical texts is presented. The system makes extensive use of a diverse set of features, including local features, full text features and external resource features. All features incorporated in this system are described in detail, and the impacts of different feature sets on the performance of the system are evaluated. In order to improve the performance of the system, post-processing modules are exploited to deal with the abbreviation phenomena, cascaded named entities and the identification of boundary errors. Evaluation of this system showed that feature selection has an important impact on system performance, and that the post-processing explored makes an important contribution to achieving better results.
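To make the notion of local features concrete, the following is a minimal sketch of per-token feature extraction of the kind commonly fed to a CRF tagger. The abstract does not list the specific features, so the feature names, the word-shape scheme, the three-character affixes and the context window of one token are all illustrative assumptions, not the feature set used in this system. The example sentence is likewise made up for illustration.

```python
import re


def word_shape(token: str) -> str:
    """Collapse a token into a coarse shape (assumed scheme), e.g. 'IL-2' -> 'A-0'."""
    shape = re.sub(r"[A-Z]+", "A", token)   # runs of capitals become 'A'
    shape = re.sub(r"[a-z]+", "a", shape)   # runs of lowercase become 'a'
    shape = re.sub(r"[0-9]+", "0", shape)   # runs of digits become '0'
    return shape


def local_features(tokens, i, window=1):
    """Build a feature dict for tokens[i] from the token and its neighbours.

    All feature names here are hypothetical; a real system would choose its own.
    """
    tok = tokens[i]
    feats = {
        "word.lower": tok.lower(),
        "shape": word_shape(tok),
        "prefix3": tok[:3],
        "suffix3": tok[-3:],
        "has_digit": any(c.isdigit() for c in tok),
        "has_hyphen": "-" in tok,
    }
    # Context words inside the window; positions outside the sentence are padded.
    for offset in range(-window, window + 1):
        if offset == 0:
            continue
        j = i + offset
        inside = 0 <= j < len(tokens)
        feats[f"word[{offset:+d}]"] = tokens[j].lower() if inside else "<PAD>"
    return feats


def sentence_features(tokens, window=1):
    """Return one feature dict per token, in sentence order."""
    return [local_features(tokens, i, window) for i in range(len(tokens))]


# Illustrative sentence, not taken from the paper's data.
tokens = ["IL-2", "gene", "expression", "requires", "NF-kappa", "B"]
features = sentence_features(tokens)
```

A list of feature dicts per sentence, paired with a label sequence, is the input shape that many CRF toolkits accept. Full text features and external resource features (for example, dictionary lookups) could be added as further keys in the same dict.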