ReasoningTrap

university

https://github.com/ReasoningTrap

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yjyjyj98 authored a paper 8 days ago

Music2Video: Automatic Generation of Music Video with fusion of audio and text

yjyjyj98 authored a paper 8 days ago

ReviewScore: Misinformed Peer Review Detection with Large Language Models

yjyjyj98 authored a paper 8 days ago

Meta-Awareness Enhances Reasoning Models: Self-Alignment Reinforcement Learning

View all activity

Organization Card

Community About org cards

Fine-grain evaluation & Large Reasoning Models that fails in reasoning due to reasoning rigidity.
ConditionedMath (AIME & MATH500) · PuzzleTrivial · Zero-shot pipelines

📜 Why ReasoningTrap?

Current RL-tuned Reasoning LLMs excel at producing answers but often ignore explicit user constraints.
ReasoningTrap surfaces these failure modes with carefully crafted, conditioned problems.

Modified from Famous MATH Reasoning Benchmark – AIME & MATH500 problems altered with minimal constraints to divert reasoning paths.
Puzzles Trivialized by Subtle Modifications - Well-known puzzles where a small change transforms a challenging problem into a trivial one.
Plug-and-play – evaluate any 🤗 Transformers model with vLLM in simple instructions.

models 0

None public yet

datasets 3

AI & ML interests

Recent Activity

Team members 2

📜 Why ReasoningTrap?

models 0

datasets 3 Sort: Recently updated

datasets 3