Cost-aware Learning Rate for Neural Machine Translation

来源 :第十六届全国计算语言学学术会议暨第五届基于自然标注大数据的自然语言处理国际学术研讨会 | 被引量 : 0次 | 上传用户:gchy111
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Neural Machine Translation(NMT)has drawn much attention due to its promising translation performance in recent years.The conventional optimiza-tion algorithm for NMT sets a unified learning rate for each gold target word dur-ing training.However,words under different probability distributions should be handled differently.Thus,we propose a cost-aware learning rate method,which can produce different learning rates for words with different costs.Specifically,for the gold word which ranks very low or has a big probability gap with the best candidate,the method can produce a larger learning rate and vice versa.The extensive experiments demonstrate the effectiveness of our proposed method.
其他文献
Recently long short-term memory language model(LSTMLM)has received tremendous interests from both language and speech communities,due to its superiorty on modelling long-term dependency.Moreover,integ
Tibetan syntactic functional chunk parsing is aimed at identifyingsyntactic constituents of Tibetan sentences.In this paper,based on the Tibetan syntactic functional chunk description system,we propos
We consider the task of entity linking over question answering pair(QA-pair).In conventional approaches of entity linking,all the entities whether in one sentence or not are considered the same.We foc
Obtaining bilingual parallel data from the multilingual websites is along-standing research problem,which is very benefit for resource-scarce lan-guages.In this paper,we present an approach for obtain
This paper proposes a neural model for closed-set Chinese word segmentation.The model follows the character-based approach which assigns a class label to each character,indicating its relative po-siti
Event detection suffers from data sparseness and label imbalance prob-lem due to the expensive cost of manual annotations of events.To address this problem,we propose a novel approach that allows for
会议
In this paper,we focus on the problem of answer triggering ad-dressed by Yang et al.(2015),which is a critical component for a real-world question answering system.We employ a hierarchical gated recur
This paper proposes a novel end-to-end neural model to jointly extract entities and relations in a sentence.Unlike most exist-ing approaches,the proposed model uses a hybrid neural network to automati
Mongolian text proofreading is the particularly difficult task because of its unique polyphonic alphabet,morphological ambiguity and agglutinative feature,and coding errors are currently pervasive in
Given a source document with extracted mentions,entity linking callsfor map-ping the mention to an entity in reference knowledge base.Previous en-tity linking approaches mainly focus on generic statis