Feynman Innovations

ajibawa-2023

AjinkyaBawase

AI & ML interests

LLM, RL, DL, ML, AGI. Developing LLMs (preferably fully fine-tuned) for various use cases.

Recent Activity

upvoted a paper 3 days ago

Diffusion Language Models are Super Data Learners

upvoted a paper 3 days ago

General Agentic Memory Via Deep Research

liked a model 4 days ago

deepseek-ai/DeepSeek-Math-V2

upvoted 2 papers 3 days ago

Diffusion Language Models are Super Data Learners

Paper • 2511.03276 • Published 29 days ago • 123

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 11 days ago • 151

upvoted 5 papers 6 days ago

MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling

Paper • 2511.11793 • Published 20 days ago • 156

ROOT: Robust Orthogonalized Optimizer for Neural Network Training

Paper • 2511.20626 • Published 9 days ago • 167

Lumine: An Open Recipe for Building Generalist Agents in 3D Open Worlds

Paper • 2511.08892 • Published 22 days ago • 193

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published 28 days ago • 208

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Paper • 2511.14993 • Published 15 days ago • 221

upvoted 13 papers 29 days ago

ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use

Paper • 2510.27363 • Published Oct 31 • 22

MR-Align: Meta-Reasoning Informed Factuality Alignment for Large Reasoning Models

Paper • 2510.24794 • Published Oct 27 • 31

Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning

Paper • 2510.27606 • Published Oct 31 • 27

UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible Feedback

Paper • 2511.01678 • Published about 1 month ago • 34

The Underappreciated Power of Vision Models for Graph Structural Understanding

Paper • 2510.24788 • Published Oct 27 • 35

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published Oct 29 • 41

Continuous Autoregressive Language Models

Paper • 2510.27688 • Published Oct 31 • 68

π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models

Paper • 2510.25889 • Published Oct 29 • 63

INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats

Paper • 2510.25602 • Published Oct 29 • 75

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

Paper • 2510.22115 • Published Oct 25 • 82

ThinkMorph: Emergent Properties in Multimodal Interleaved Chain-of-Thought Reasoning

Paper • 2510.27492 • Published Oct 30 • 80

OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows

Paper • 2510.24411 • Published Oct 28 • 70

VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation

Paper • 2511.02778 • Published 30 days ago • 101