DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 165
Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series By cgeorgiaw and 1 other • 1 day ago • 11
Adaptive Classifier: Dynamic Text Classification with Continuous Learning By codelion • 5 days ago • 11
The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) By fdaudens • about 22 hours ago • 9
Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness By jdelavande and 2 others • 13 days ago • 18
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 165
Accelerating AI for Drug Discovery: Ginkgo’s GDPx Functional Genomics and GDPa Antibody Developability Dataset Series By cgeorgiaw and 1 other • 1 day ago • 11
Adaptive Classifier: Dynamic Text Classification with Continuous Learning By codelion • 5 days ago • 11
The Anthropic Ruling: Why AI Training Just Got Legal (But Piracy Didn't) By fdaudens • about 22 hours ago • 9
Saying Thank You to a LLM Isn't Free — Measuring the Energy Cost of Politeness By jdelavande and 2 others • 13 days ago • 18