基于依存关系的复句关系词搭配库建设
Establishment of Relation Markers Collocation Corpus for Compound Sentences Based on Dependency Relations

作者: 司贝贝 , 杨进才 :华中师范大学计算机学院,湖北 武汉;

关键词: 复句关系词提取关系词搭配依存关系Compound Sentences Extraction of Relation Markers Collocation of Relation Markers Dependency Relations

摘要:
复句作为联系句子与篇章的桥梁,在中文信息处理中具有重要的地位,关系词的识别研究是复句研究的切入点。本文基于汉语依存句法、关系词及搭配的特征与规律、辅以关系词本体知识库,自动识别并提取关系词,建立了关系词搭配语料库。该关系词搭配库记录了各种关系词在复句中使用与搭配的状态,将有利于分析与统计关系词搭配的规律,从中获取用于关系词自动识别的规则,为关系词更准确的识别打下基础。

Abstract: Compound sentences, connecting sentences and paragraph, play an important role in Chinese in-formation processing. The research of relation word recognition is regarded as the breakthrough point for the research of compound sentences. Based on the dependency relationship in Chinese syntax and the characteristics and regularity of relation words and their collocations, this paper recognizes as well as extracts relation words automatically and established the relationship word collocation corpus with CCCS. The collocation corpus records the status of the match and use of various relation words in compound sentences, which will be advantageous to analyze the matching rule of the word collocation rule, and obtain rules for automatic relationship recognition, ultimately lay the foundation for the more accurate identification of the relation word.

文章引用: 司贝贝 , 杨进才 (2015) 基于依存关系的复句关系词搭配库建设。 软件工程与应用, 4, 81-87. doi: 10.12677/SEA.2015.44011

参考文献

[1] 胡金柱, 舒江波, 姚双云, 等 (2009) 面向中文信息处理的复句关系词提取算法研究. 计算机工程与科学, 10, 90-93.

[2] 李艳翠, 孙静, 周国栋, 等 (2013) 基于清华汉语树库的复句关系词识别与分类研究. 北京大学学报(自然科学版), 12, 118-124.

[3] 胡金柱, 陈江曼, 杨进才, 等 (2012) 基于规则的连用关系标记的自动标识研究. 计算机科学, 7, 190-194.

[4] 王慧兰 (2013) 汉语句类依存树库的构建研究. 北京大学学报(自然科学版), 1, 25-30.

[5] 李晓琪 (1991) 现代汉语复句中关联词的位置. 语言教学与研究, 2, 79-91.

[6] 张仕仁 (1993) 汉语复句的结构分析. 中文信息学报, 4, 43-54.

[7] 胡金柱, 吴锋文, 李琼, 等 (2010) 汉语复句关系词库的建设及其利用. 语言科学, 3, 133-142.

[8] 向磊 (2014) 基于决策树的汉语复句关系词自动识别中规则挖掘方法研究. 华中师范大学, 武汉.

[9] 姚双云 (2008) 复句关系标记的搭配研究. 华中师范大学出版社, 武汉.

[10] 舒江波 (2011) 面向中文信息处理的复句关系词自动标识研究. 华中师范大学, 武汉.

分享
Top