A Study of Entity Similarity Computation in Knowledge Graphs

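Source: The 15th China National Conference on Computational Linguistics (CCL2016) and the 4th International Symposium on Natural Language Processing Based on Naturally Annotated Big Data (NLP-NABD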

【Author】: Li Yang (李阳)

【Affiliation】: Department of Computer Science and Engineering, East China University of Science and Technology, Shanghai 200237

【Publication date】: 2016

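【Keywords】: knowledge graph; entity similarity; computation method; ensemble learning model; Logistic regression; noisy data; learning problem; data type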

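Excerpt from the paper

　　Entity similarity computation has many applications, such as recommending similar products on e-commerce platforms and identifying groups of similar patients when analyzing medical treatment outcomes. In the knowledge-graph entity similarity task, the attribute values of every entity are given and similarity is annotated for some of the entities; the goal is to obtain the similarity between the remaining entities. This paper casts the task as a supervised learning problem and proposes a general method for computing entity similarity. Noisy data is cleaned, and attributes of different data types (numeric values, lists, and long text) are preprocessed. The problem is then modeled with classification models such as SVM and Logistic regression, ensemble learning models such as Random Forest, and learning-to-rank models, and good results are obtained.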
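The supervised formulation described above can be illustrated with a small sketch. The excerpt does not give the paper's actual features, data, or hyperparameters, so everything below is an assumption made for illustration: the attribute names (`price`, `tags`, `description`), the per-type similarity functions, the toy entities, and the hand-written logistic regression (chosen only so the example runs with the Python standard library). It shows one plausible way to turn a pair of entities with numeric, list, and text attributes into a feature vector and to learn a similarity score from labeled pairs.

```python
import math

def numeric_sim(a, b):
    """Relative closeness of two numbers, clamped to [0, 1]; 1.0 when equal."""
    denom = max(abs(a), abs(b))
    return 1.0 if denom == 0 else max(0.0, 1.0 - abs(a - b) / denom)

def jaccard(xs, ys):
    """Jaccard overlap of two collections; 1.0 when both are empty."""
    sx, sy = set(xs), set(ys)
    union = sx | sy
    return 1.0 if not union else len(sx & sy) / len(union)

def pair_features(e1, e2):
    """One feature per attribute type: numeric, list, text (hypothetical schema)."""
    return [
        numeric_sim(e1["price"], e2["price"]),
        jaccard(e1["tags"], e2["tags"]),
        jaccard(e1["description"].lower().split(), e2["description"].lower().split()),
    ]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train_logistic(X, y, lr=0.5, epochs=2000):
    """Batch gradient descent for logistic regression; returns (weights, bias)."""
    w, b, n = [0.0] * len(X[0]), 0.0, len(X)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * len(w), 0.0
        for xi, yi in zip(X, y):
            err = sigmoid(sum(wj * xj for wj, xj in zip(w, xi)) + b) - yi
            for j, xj in enumerate(xi):
                grad_w[j] += err * xj
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b

def predict_similarity(model, e1, e2):
    """Predicted probability that the two entities are similar."""
    w, b = model
    return sigmoid(sum(wj * xj for wj, xj in zip(w, pair_features(e1, e2))) + b)

# Toy entities (invented for this sketch, not taken from the paper).
A = {"price": 100.0, "tags": ["phone", "android"], "description": "large screen android phone"}
B = {"price": 110.0, "tags": ["phone", "android"], "description": "android phone with large screen"}
C = {"price": 5.0, "tags": ["book"], "description": "paperback novel"}
D = {"price": 6.0, "tags": ["book", "novel"], "description": "hardcover novel"}
E = {"price": 95.0, "tags": ["phone"], "description": "compact android phone"}

# Labeled pairs: 1 = similar, 0 = not similar.
labeled = [(A, B, 1), (C, D, 1), (A, C, 0), (B, D, 0), (A, D, 0), (B, C, 0)]
X = [pair_features(p, q) for p, q, _ in labeled]
y = [label for _, _, label in labeled]

model = train_logistic(X, y)
score_close = predict_similarity(model, E, A)  # unseen phone vs. phone
score_far = predict_similarity(model, E, C)    # unseen phone vs. book
print(f"phone vs phone: {score_close:.2f}, phone vs book: {score_far:.2f}")
```

In this sketch the same `pair_features` vectors could equally be passed to an SVM or a Random Forest from a machine-learning library; only the logistic regression is written out here to keep the example self-contained.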
