[22] Transformers without Tears: Improving the Normalization of Self-Attention

Apr 21, 2022 Β· 2 min Β· long8v

[21] cosFormer: Rethinking Softmax in Attention

Apr 20, 2022 Β· 2 min Β· long8v

[20] Memorizing Transformer

Apr 7, 2022 Β· 3 min Β· long8v

[16] Counterfactual Memorization in Neural Language Models

Mar 25, 2022 Β· 3 min Β· long8v

[15] Quantifying Memorization Across Neural Language Models

Mar 24, 2022 Β· 3 min Β· long8v

[14] Longformer: The Long-Document Transformer

Feb 22, 2022 Β· 2 min Β· long8v

[12] BBPE: Neural Machine Translation with Byte-Level Subwords

Feb 18, 2022 Β· 2 min Β· long8v