您当前的位置: 首页 > 
  • 0浏览

    0关注

    2393博文

    0收益

  • 0浏览

    0点赞

    0打赏

    0留言

私信
关注
热门博文

DL之Encoder-Decoder:Encoder-Decoder结构的相关论文、设计思路、关键步骤等配图集合之详细攻略

一个处女座的程序猿 发布时间:2018-10-19 20:08:50 ,浏览量:0

DL之Encoder-Decoder:Encoder-Decoder模型的相关论文、设计思路、关键步骤等配图集合之详细攻略

 

 

目录

Encoder-Decoder模型的相关论文

Encoder-Decoder模型的设计思路

Encoder-Decoder模型的关键步骤

 

 

Encoder-Decoder模型的相关论文

 

1、Encoder-Decoder 结构做机器翻译任务的更多细节,可以参考 原始论文《Learning Phrase Representations using RNN Encoder– Decoder for Statistical Machine Translation》 论文地址:https://arxiv.org/pdf/1406.1078.pdf

 

 

Encoder-Decoder模型的设计思路

Abstract:In this paper, we propose a novel neural network model called RNN Encoder– Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixedlength vector representation, and the other decodes the representation into another sequence of symbols. The encoder and decoder of the proposed model are jointly trained to maximize the conditional probability of a target sequence given a source sequence. The performance of a statistical machine translation system is empirically found to improve by using the conditional probabilities of phrase pairs computed by the RNN Encoder–Decoder as an additional feature in the existing log-linear model. Qualitatively, we show that the proposed model learns a semantically and syntactically meaningful representation of linguistic phrases.

1、An illustration of the proposed RNN Encoder–Decoder.

2、An illustration of the proposed hidden activation function. The update gate z selects whether the hidden state is to be updated with a new hidden state h˜. The reset gate r decides whether the previous hidden state is ignored. See Eqs. (5)–(8) for the detailed equations of r, z, h and h˜.

3、: BLEU scores computed on the development and test sets using different combinations of approaches. WP denotes a word penalty, where we penalizes the number of unknown words to neural networks.

4、2–D embedding of the learned word representation. The left one shows the full embedding space, while the right one shows a zoomed-in view of one region (color–coded). For more plots, see the supplementary material.

5、2–D embedding of the learned phrase representation. The top left one shows the full representation space (5000 randomly selected points), while the other three figures show the zoomed-in view of specific regions (color–coded).

 

Encoder-Decoder模型的关键步骤

1、E-D整体结构

2、E-D步骤解释

 

 

 

 

关注
打赏
1664196048
查看更多评论
立即登录/注册

微信扫码登录

0.0488s