TransferTransfo: A Transfer Learning Approach for Neural Network Based Conversational Agents Paper • 1901.08149 • Published Jan 23, 2019 • 3
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts Paper • 2202.01279 • Published Feb 2, 2022
Learning from others' mistakes: Avoiding dataset biases without modeling them Paper • 2012.01300 • Published Dec 2, 2020
A Hierarchical Multi-task Approach for Learning Embeddings from Semantic Tasks Paper • 1811.06031 • Published Nov 14, 2018
Unlocking the conversion of Web Screenshots into HTML Code with the WebSight Dataset Paper • 2403.09029 • Published Mar 14, 2024 • 56
OBELICS: An Open Web-Scale Filtered Dataset of Interleaved Image-Text Documents Paper • 2306.16527 • Published Jun 21, 2023 • 46
What Language Model to Train if You Have One Million GPU Hours? Paper • 2210.15424 • Published Oct 27, 2022 • 2
Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning Paper • 2109.04144 • Published Sep 9, 2021
Datasets: A Community Library for Natural Language Processing Paper • 2109.02846 • Published Sep 7, 2021 • 14
BLOOM: A 176B-Parameter Open-Access Multilingual Language Model Paper • 2211.05100 • Published Nov 9, 2022 • 32
Multitask Prompted Training Enables Zero-Shot Task Generalization Paper • 2110.08207 • Published Oct 15, 2021 • 2
HuggingFace's Transformers: State-of-the-art Natural Language Processing Paper • 1910.03771 • Published Oct 9, 2019 • 19
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter Paper • 1910.01108 • Published Oct 2, 2019 • 17