Improving effectiveness of mutual information for substantival multiword expression extraction
Zhang, Wen1,3; Yoshida, Taketoshi1; Tang, Xijin2; Ho, Tu-Bao1
2009-10-01
Journal: EXPERT SYSTEMS WITH APPLICATIONS
ISSN: 0957-4174
Volume: 36; Issue: 8; Pages: 10919-10930
Abstract: One deficiency of mutual information is its poor capacity to measure the association of words with unsymmetrical co-occurrence, which is common among multiword expressions in text. Moreover, threshold setting, which is decisive for the practical success of mutual information in multiword extraction, requires many parameters to be predefined manually when extracting multiword expressions with different numbers of individual words. In this paper, we propose a new method, EMICO (Enhanced Mutual Information and Collocation Optimization), to extract substantival multiword expressions from text. Specifically, enhanced mutual information is proposed to measure the association of words, and collocation optimization is proposed to automatically determine the number of individual words contained in a multiword expression when the multiword expression occurs in a candidate set. Our experiments showed that EMICO significantly improves the performance of substantival multiword expression extraction in comparison with a classic extraction method based on mutual information. (C) 2009 Elsevier Ltd. All rights reserved.
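To illustrate the first deficiency named in the abstract, the sketch below computes classic pointwise mutual information for two word pairs from raw counts. It is not the EMICO measure, whose formula is not given in this record. The function name `pmi`, the corpus size, and all counts are hypothetical values chosen for illustration.

```python
import math

def pmi(count_xy, count_x, count_y, total):
    """Classic pointwise mutual information (base 2) of a word pair from raw counts."""
    p_xy = count_xy / total
    p_x = count_x / total
    p_y = count_y / total
    return math.log2(p_xy / (p_x * p_y))

TOTAL = 100_000  # hypothetical corpus size in tokens

# Symmetric co-occurrence: each word occurs 10 times, always together.
symmetric = pmi(count_xy=10, count_x=10, count_y=10, total=TOTAL)

# Unsymmetrical co-occurrence: x occurs 10 times and always next to y,
# but y occurs 1000 times overall, mostly with other words.
unsymmetrical = pmi(count_xy=10, count_x=10, count_y=1000, total=TOTAL)

print(f"symmetric pair:     {symmetric:.2f}")
print(f"unsymmetrical pair: {unsymmetrical:.2f}")
```

In both cases x never appears without y, yet the unsymmetrical pair scores lower only because y is frequent elsewhere. A single fixed threshold on this score could therefore keep the first pair and drop the second, which is the kind of behavior the abstract describes as a weakness.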
Keywords: Substantival multiword expression; Mutual information; Enhanced mutual information; Collocation optimization; EMICO
DOI: 10.1016/j.eswa.2009.02.026
Language: English
Funding: Ministry of Education, Culture, Sports, Science and Technology of Japan; National Natural Science Foundation of China [70571078]; National Natural Science Foundation of China [70221001]
WOS research areas: Computer Science; Engineering; Operations Research & Management Science
WOS categories: Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; Operations Research & Management Science
WOS accession number: WOS:000267179500016
Publisher: PERGAMON-ELSEVIER SCIENCE LTD
Document type: Journal article
Item identifier: http://ir.amss.ac.cn/handle/2S8OKBNM/8564
Collection: Institute of Systems Science
Corresponding author: Zhang, Wen
Author affiliations:
1. Japan Adv Inst Sci & Technol, Sch Knowledge Sci, Ishikari, Hokkaido 9231292, Japan
2. Chinese Acad Sci, Acad Math & Syst Sci, Inst Syst Sci, Beijing 100080, Peoples R China
3. Chinese Acad Sci, Inst Software, Lab Internet Software Technol, Beijing 100190, Peoples R China
Recommended citation:
GB/T 7714: Zhang, Wen, Yoshida, Taketoshi, Tang, Xijin, et al. Improving effectiveness of mutual information for substantival multiword expression extraction[J]. EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36(8): 10919-10930.
APA: Zhang, Wen, Yoshida, Taketoshi, Tang, Xijin, & Ho, Tu-Bao. (2009). Improving effectiveness of mutual information for substantival multiword expression extraction. EXPERT SYSTEMS WITH APPLICATIONS, 36(8), 10919-10930.
MLA: Zhang, Wen, et al. "Improving effectiveness of mutual information for substantival multiword expression extraction". EXPERT SYSTEMS WITH APPLICATIONS 36.8 (2009): 10919-10930.