Gabriel C PRO

gabrielchua

https://gabrielchua.me

AI & ML interests

Large Language Models, AI Safety, Causal Inference

Recent Activity

updated a Space about 10 hours ago

govtech/rai-bench

liked a model about 13 hours ago

openai/gpt-oss-20b

liked a model about 13 hours ago

openai/gpt-oss-120b

authored a paper 9 days ago

Running in CIRCLE? A Simple Benchmark for LLM Code Interpreter Security

Paper • 2507.19399 • Published 12 days ago • 1

authored a paper 14 days ago

LionGuard 2: Building Lightweight, Data-Efficient & Localised Multilingual Content Moderators

Paper • 2507.15339 • Published 16 days ago

authored a paper 16 days ago

Toxicity-Aware Few-Shot Prompting for Low-Resource Singlish Translation

Paper • 2507.11966 • Published 21 days ago

authored a paper 21 days ago

Measuring What Matters: A Framework for Evaluating Safety Risks in Real-World LLM Applications

Paper • 2507.09820 • Published 23 days ago

authored a paper 28 days ago

RabakBench: Scaling Human Annotations to Construct Localized Multilingual Safety Benchmarks for Low-Resource Languages

Paper • 2507.05980 • Published 29 days ago • 1

authored a paper 5 months ago

MinorBench: A hand-built benchmark for content-based risks for children

Paper • 2503.10242 • Published Mar 13 • 5

authored a paper 9 months ago

A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection

Paper • 2411.12946 • Published Nov 20, 2024 • 23