Natural Language Processing
Distributed Representations of Words and Phrases and their Compositionality (2013. 10)
Word2Vec
,CBOW
,Skip-gram
GloVe: Global Vectors for Word Representation (2014)
Word2Vec
,GloVe
,Co-Occurrence
Text Understanding from Scratch (2015. 2)
CNN
,Character-level
A Neural Conversational Model (2015. 6)
Seq2Seq
,Conversation
Character-Aware Neural Language Models (2015. 8)
CNN
,Character-level
Incorporating Structural Alignment Biases into an Attentional Neural Translation Model (2016. 1)
Seq2Seq
,Attention with Structural Biases
,Translation
Long Short-Term Memory-Networks for Machine Reading (2016. 1)
LSTMN
,Intra-Attention
,RNN
Recurrent Memory Networks for Language Modeling (2016. 1)
RMN
,Memory Bank
Swivel: Improving Embeddings by Noticing What's Missing (2016. 2)
Word2Vec
,Swivel
,Co-Occurrence
Recurrent Neural Machine Translation (2016. 7)
Translation
,Attention (RNN)
Multiplicative LSTM for sequence modelling (2016. 10)
mLSTM
,Language Modeling
,Character-Level
Dynamic Coattention Networks For Question Answering (2016. 11)
QA
,DCN
,Coattention Encoder
,Machine Comprehension
A recurrent neural network without chaos (2016. 12)
RNN
,CFN
,Dynamic
,Chaos
Comparative Study of CNN and RNN for Natural Language Processing (2017. 2)
Systematic Comparison
,CNN vs RNN
Dynamic Word Embeddings for Evolving Semantic Discovery (2017. 3)
Word Embedding
,Temporal
,Alignment
Learning to Generate Reviews and Discovering Sentiment (2017. 4)
Sentiment
,Unsupervised
,OpenAI
Ask the Right Questions: Active Question Reformulation with Reinforcement Learning (2017. 5)
QA
,Active Question Answering
,RL
,Agent (Reformulate, Aggregate)
Reinforced Mnemonic Reader for Machine Reading Comprehension (2017. 5)
QA
,Mnemonic (Syntatic, Lexical)
,RL
,Machine Comprehension
Depthwise Separable Convolutions for Neural Machine Translation (2017. 6)
SliceNet
,Super-Separable Conv
,Depsewise + Conv 1x1
MEMEN: Multi-layer Embedding with Memory Networks for Machine Comprehension (2017. 7)
MEMEN
,QA(MC)
,Embedding(skip-gram)
,Full-Orientation Matching
On the State of the Art of Evaluation in Neural Language Models (2017. 7)
Standard LSTM
,Regularisation
,Hyperparemeter
Adversarial Examples for Evaluating Reading Comprehension Systems (2017. 7)
Concatenative Adversaries(AddSent, AddOneSent)
,SQuAD
Learned in Translation: Contextualized Word Vectors (2017. 8)
Word Embedding
,CoVe
,Context Vector
Unsupervised Neural Machine Translation (2017. 10)
Train with both direction (tandem)
,Shared Encoder
,Denoising Auto-Encoder
Word Translation Without Parallel Data (2017. 10)
Unsupervised
,Multilingual Embedding
,Parallel Dictionary Induction
Unsupervised Machine Translation Using Monolingual Corpora Only (2017. 11)
Unsupervised
,Adversarial
,Monolingual Corpora
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model (2017. 11)
MoS (Mixture of Softmaxes)
,Softmax Bottleneck
Neural Speed Reading via Skim-RNN (2017. 11)
Skim-RNN
,Speed Reading
,Big(Read)-Small(Skim)
,Dynamic
Generalization without systematicity: On the compositional skills of sequence-to-sequence recurrent networks (2017. 11)
SCAN
,Compositional
,Mix-and-Match
Hierarchical Text Generation and Planning for Strategic Dialogue (2017. 12)
End2End Strategic Dialogue
,Latent Sentence Representations
,Planning + RL
Recent Advances in Recurrent Neural Networks (2018. 1)
RNN
,Recent Advances
,Review
Personalizing Dialogue Agents: I have a dog, do you have pets too? (2018. 1)
Chit-chat
,Profile Memory
,Persona-Chat Dataset
,ParlAI
Generating Wikipedia by Summarizing Long Sequences (2018. 1)
Multi-Document Summarization
,Extractive-Abstractive Stage
,T-DMCA
,WikiSum
,Google Brain
MaskGAN: Better Text Generation via Filling in the__ (2018. 1)
MaskGAN
,Neural Text Generation
,RL Approach
Beyond Word Importance: Contextual Decomposition to Extract Interactions from LSTMs (2018. 1)
Contextual Decomposition (CD)
,Disambiguate interactions between Gates
DeepType: Multilingual Entity Linking by Neural Type System Evolution (2018. 2)
DeepType
,Symbolic Information
,Type System
,Open AI
Ranking Sentences for Extractive Summarization with Reinforcement Learning (2018. 2)
Document-Summarization
,Cross-Entropy vs RL
,Extractive
code2vec: Learning Distributed Representations of Code (2018. 3)
code2vec
,Code Embedding
,Predicting method name
Universal Sentence Encoder (2018. 3)
Transformer
,Deep Averaging Network (DAN)
,Transfer
An efficient framework for learning sentence representations (2018. 3)
Sentence Representation
,True Context
,Unsupervised
An Analysis of Neural Language Modeling at Multiple Scales (2018. 3)
LSTM vs QRNN
,Hyperparemeter
,AWD-QRNN
Analyzing Uncertainty in Neural Machine Translation (2018. 3)
Uncertainty
,Beam Search Degradation
,Copy Mode
An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling (2018. 3)
Temporal Convolutional Network (TCN)
,CNN vs RNN
Training Tips for the Transformer Model (2018. 4)
Transformer
,Hyperparameter
,Multiple GPU
QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension (2018. 4)
QA
,Conv - Self-Attention
,Backtranslation (Data Augmentation)
SimpleQuestions Nearly Solved: A New Upperbound and Baseline Approach (2018. 4)
Top-K Subject Recognitio
,Relation Classification
Delete, Retrieve, Generate: A Simple Approach to Sentiment and Style Transfer (2018. 4)
Sentiment Transfer
,Disentangle Attribute
,Unsupervised
Parsing Tweets into Universal Dependencies (2018. 4)
Universal Dependencies (UD)
,TWEEBANK v2
Subword Regularization: Improving Neural Network Translation Models with Multiple Subword Candidates (2018. 4)
SR
,Subword Sampling + Hyperparameter
,Segmentation (BPE, Unigram)
Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension (2018. 4)
PI-SQuAD
,Challenge
,Document Encoder
,Scalability
On the Practical Computational Power of Finite Precision RNNs for Language Recognition (2018. 5)
Unbounded counting
,IBFP-LSTM
Paper Abstract Writing through Editing Mechanism (2018. 5)
Writing-editing Network
,Attentive Revision Gate
A robust self-learning method for fully unsupervised cross-lingual mappings of word embeddings (2018. 5)
Unsupervised initialization scheme
,Robust self-leraning
Global-Locally Self-Attentive Dialogue State Tracker (2018. 5)
GLAD
,WoZ and DSTC2 Dataset
Learning to Ask Good Questions: Ranking Clarification Questions using Neural Expected Value of Perfect Information (2018, 5)
Dataset
,EVPI
,ACL 2018 Best Paper
Know What You Don't Know: Unanswerable Questions for SQuAD (2018, 6)
SQuAD 2.0
,Negative Example
,ACL 2018 Best Paper
The Natural Language Decathlon: Multitask Learning as Question Answering (2018, 6)
decaNLP
,Multitask Question Answering Network (MQAN)
,Transfer Learning
GLoMo: Unsupervisedly Learned Relational Graphs as Transferable Representations (2018, 6)
Transfer Learning Framework
,Structured Graphical Representations
Improving Language Understanding by Generative Pre-Training (2018, 6)
Transformer
,Generative Pre-Training
,Discriminative Fine-Tuning
Finding Syntax in Human Encephalography with Beam Search (2018, 6)
RNNG+beam search
,ACL 2018 Best Paper
Let's do it "again": A First Computational Approach to Detecting Adverbial Presupposition Triggers (2018, 6)
Task
,Dataset
,Weighted-Pooling (WP)
ACL 2018 Best Paper
QuAC : Question Answering in Context (2018. 8)
Information-Seeking dialog
,Challenge
,Without Evidence
CoQA: A Conversational Question Answering Challenge (2018. 8)
Abstractive with Extractive Rationale
,Challenge
,Coreference and Pragmatic Reasoning
Contextual Parameter Generation for Universal Neural Machine Translation (2018. 8)
Parameter Generation
,Language Embedding
,EMNLP 2018
Evaluating Theory of Mind in Question Answering (2018. 8)
Dataset
,Higher-order Beliefs
,EMNLP 2018
Open Domain Question Answering Using Early Fusion of Knowledge Bases and Text (2018. 9)
GRAFT-Net
,KB+Text Fusion
,EMNLP 2018
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering (2018. 9)
Dataset
,Multi-hop
,Sentence-level Supporting Fact
,EMNLP 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding (2018. 10)
BERT
,Discriminative
,Large Pretrained
,Transfer Learning
Last updated