(2016. 2) Exploring the Limits of Language Modeling
Submitted on 2016. 2
Rafal Jozefowicz, Oriol Vinyals, Mike Schuster, Noam Shazeer and Yonghui Wu
Simple Summary
explore recent advances in Recurrent Neural Networks for large scale Language Modeling, a task central to language understanding... perform an exhaustive study on techniques such as character Convolutional Neural Networks or Long-Short Term Memory
LSTM
CNN Inputs: character-level inputs to build word embeddings
Noise Contrastive Estimation (NCE) and Importance Sampling (IS)
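To make the importance-sampling idea concrete, here is a minimal sketch of a sampled-softmax loss for a single target word. It is not the paper's implementation: the function name `sampled_softmax_loss`, its arguments, and the use of plain Python lists are assumptions for illustration. The scores stand for unnormalized model scores s(w, h), and the q values stand for each word's probability under a noise (proposal) distribution such as a unigram distribution.

```python
import math


def sampled_softmax_loss(true_score, true_q, noise_scores, noise_q):
    """Sampled-softmax (importance sampling) loss for one target word.

    true_score, noise_scores: unnormalized model scores s(w, h).
    true_q, noise_q: probabilities of the same words under the noise
    distribution (hypothetical inputs, e.g. unigram frequencies).
    """
    # Correct every score by the log-probability of sampling that word.
    logits = [true_score - math.log(true_q)]
    logits += [s - math.log(q) for s, q in zip(noise_scores, noise_q)]

    # Numerically stable log-sum-exp over the true word and the noise samples.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(z - m) for z in logits))

    # Cross-entropy with the true word at index 0.
    return log_z - logits[0]


# Toy usage: one true word and three noise words drawn from a uniform proposal.
loss = sampled_softmax_loss(2.0, 0.25, [0.5, -1.0, 0.0], [0.25, 0.25, 0.25])
```

The normalizer is computed over only the sampled words rather than the full vocabulary, which is why this kind of approximation is attractive for very large vocabularies.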
CNN Softmax: produce $e_w$ with a CNN over the characters of $w$, as $e_w = \mathrm{CNN}(\mathit{chars}_w)$
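The relation e_w = CNN(chars_w) can be sketched in a few lines of pure Python. This is a toy illustration under stated assumptions, not the paper's architecture: the helper names (`char_cnn_embedding`, `cnn_softmax`), the tanh activation, max-over-time pooling, the filter widths, and the random toy parameters are all hypothetical choices. Each output word embedding is computed from the word's characters, and the logit for word w is the dot product of the context vector h with e_w.

```python
import math
import random


def char_cnn_embedding(word, char_emb, filters):
    """e_w = CNN(chars_w): 1-D convolutions over characters, max-pooled over time."""
    chars = [char_emb[c] for c in word]
    embedding = []
    for width, weights in filters:  # weights has shape width x char_dim
        # Zero-pad short words so that at least one window exists.
        pad = [[0.0] * len(weights[0])] * max(0, width - len(chars))
        seq = chars + pad
        activations = []
        for start in range(len(seq) - width + 1):
            window = seq[start:start + width]
            total = sum(
                w * x
                for w_row, x_row in zip(weights, window)
                for w, x in zip(w_row, x_row)
            )
            activations.append(math.tanh(total))  # assumed nonlinearity
        embedding.append(max(activations))  # one feature per filter
    return embedding


def cnn_softmax(h, vocab, char_emb, filters):
    """Softmax over the vocabulary with logits z_w = h . e_w."""
    logits = [
        sum(hi * ei for hi, ei in zip(h, char_cnn_embedding(w, char_emb, filters)))
        for w in vocab
    ]
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Toy parameters (random, for illustration only).
rng = random.Random(0)
CHAR_DIM = 4
char_emb = {c: [rng.uniform(-1, 1) for _ in range(CHAR_DIM)]
            for c in "abcdefghijklmnopqrstuvwxyz"}
filters = [(width, [[rng.uniform(-1, 1) for _ in range(CHAR_DIM)]
                    for _ in range(width)])
           for width in (1, 2, 3)]
vocab = ["cat", "cats", "dog"]
h = [0.5, -0.2, 0.1]  # context vector; its size must equal len(filters)
probs = cnn_softmax(h, vocab, char_emb, filters)
```

Because e_w is computed from characters, the number of softmax parameters no longer grows with the vocabulary size, and words with similar spellings such as "cat" and "cats" share filter responses.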
Char LSTM Prediction