gpt-oss-safeguard Collection gpt-oss-safeguard-120b and gpt-oss-safeguard-20b are safety reasoning models built-upon gpt-oss β’ 2 items β’ Updated 4 days ago β’ 51
Glyph: Scaling Context Windows via Visual-Text Compression Paper β’ 2510.17800 β’ Published 13 days ago β’ 64
view article Article Building the Open Agent Ecosystem Together: Introducing OpenEnv 10 days ago β’ 106
βοΈ Liquid Nanos Collection Library of task-specific models: https://www.liquid.ai/blog/introducing-liquid-nanos-frontier-grade-performance-on-everyday-devices β’ 21 items β’ Updated 3 days ago β’ 88
π LLM pretraining datasets Collection A collection of datasets for LLM pretraining β’ 9 items β’ Updated May 5 β’ 13
view article Article Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models Mar 20, 2024 β’ 104
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. β’ 43 items β’ Updated Apr 12, 2024 β’ 143
view article Article The Missing Semester of AI for Organizations #1: LLM Security By huseyingulsin β’ Aug 6 β’ 10