AI & ML interests

None defined yet.

Recent Activity

sumuksย  updated a dataset about 3 hours ago
yourbench/llm-pdf-ingestion-demo
sumuksย  published a dataset about 4 hours ago
yourbench/llm-pdf-ingestion-demo
View all activity

YourBench is an open-source framework for generating zero-shot benchmarks from your own documents. It helps you test language models on custom domains using automated pipelines for ingestion, summarization, and question generation.

  • ๐Ÿ“š Build benchmarks from PDFs, HTML, or text files
  • ๐Ÿง  Generate both single-hop and multi-hop questions
  • ๐Ÿ” Evaluate top models and deploy leaderboards instantly
  • ๐Ÿ› ๏ธ Fully configurable via a single YAML file

Built with ๐Ÿค— by the OpenEvals team โ€” GitHub

models 0

None public yet