Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Jesse Dodge's picture
2 2

Jesse Dodge

JesseDodge
shuyuej's profile picture 0xLaszlo's profile picture 21world's profile picture
·
https://jessedodge.github.io/
  • JesseDodge

AI & ML interests

Reproducibility and Efficiency in NLP and ML.

Organizations

Ai2's profile picture

authored a paper 4 months ago

OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens

Paper • 2504.07096 • Published Apr 9 • 76
authored 5 papers over 1 year ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 64

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1, 2024 • 84

AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters

Paper • 2401.06408 • Published Jan 12, 2024 • 1

Catwalk: A Unified Language Model Evaluation Framework for Many Datasets

Paper • 2312.10253 • Published Dec 15, 2023 • 8

Paloma: A Benchmark for Evaluating Language Model Fit

Paper • 2312.10523 • Published Dec 16, 2023 • 13
authored a paper about 2 years ago

Evaluating the Social Impact of Generative AI Systems in Systems and Society

Paper • 2306.05949 • Published Jun 9, 2023 • 9
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs