Leveraging Auxiliary Knowledge for Web Service Clustering

来源 :Chinese Journal of Electronics | 被引量 : 0次 | 上传用户:pingerk
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
By grouping Web services that share similar functionalities, Web service clustering can greatly enhance Web service discovery and selection. Most existing clustering techniques are designed to handle long text documents. However, the descriptions of most publicly available Web services are in the form of short text, which impairs the quality of service clustering due to the sparseness of useful information. Towards this issue, we propose a new service clustering approach based on transfer learning from auxiliary long text data obtained from Wikipedia.To handle the inconsistencies in semantics and topics between service descriptions and auxiliary data, we introduce a novel topic model – Tag aided dual Author topical model(TD-ATM), which jointly learns two sets of topics on the two data sets and automatically couples the topic parameters to avoid the potential inconsistencies between these two data sets. Experimental results show the proposed approach outperforms several existing Web service clustering approaches. Most existing clustering techniques are designed to handle long text documents. However, the descriptions of most of the available available web services are in the form of short text , which impairs the quality of service clustering due to the sparseness of useful information. Towards this issue, we propose a new service clustering approach based on transfer learning from auxiliary long text data obtained from Wikipedia. To handle the inconsistencies in semantics and topics between service descriptions and auxiliary data, we introduce a novel topic model - Tag aided dual Author topical model (TD-ATM), which jointly learns two sets of topics on the two data sets and automatically couples the topic parameters to avoid the potential inconsistencies between these two data sets. Experimental results show the proposed approach outperforms several existing Webs ervice clustering approaches.
其他文献
妇科疾病指的是女性生殖系统疾病,是一种临床多发病,包括输卵管疾病、盆腔疾病、阴道疾病、卵巢疾病等.妇科疾病发生率高,病程长,治疗难度大,可使患者的日常生活质量受到不良
期刊
After studying the routing and forwarding process of network stream and the implementation of SDN,we propose a retractable management model for flow table.A str
期刊
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
焦虑、抑郁是慢性阻塞性肺疾病(即慢阻肺)的常见并发症,发病率普遍高于普通人群.其发病机制与疾病本身、吸烟以及经济受收入水平相关.临床对于慢阻肺合并焦虑、抑郁的识别及
期刊
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥
该文从挂篮荷载计算、施工流程、支座及临时固结施工、挂篮安装及试验、合拢段施工、模板制作安装、钢筋安装、混凝土的浇筑及养生、测量监控等方面人手,介绍了S226海滨大桥