imflash217/proximal_policy_optimization_lunar_lander_v2 Reinforcement Learning • Updated Jan 13, 2023 • 11 • 1
mradermacher/CscSQL-Merge-Qwen2.5-Coder-1.5B-Instruct-GGUF Reinforcement Learning • 2B • Updated Jul 31 • 101 • 1
emiliodavola/french-solitaire-dqn-single-solution Reinforcement Learning • Updated about 22 hours ago • 56 • 1