Felix Friedrich

felfri

https://ml-research.github.io/people/ffriedrich/

AI & ML interests

Multimodal GenAI; AI Alignment; AI steerability

Recent Activity

liked a dataset 1 day ago

AIML-TUDA/t2i-diversity-gender-neutral-captions

updated a dataset 1 day ago

AIML-TUDA/t2i-diversity-gender-neutral-captions

new activity 2 days ago

AIML-TUDA/t2i-diversity-gender-neutral-captions:[bot] Conversion to Parquet

authored 17 papers 6 days ago

Class Attribute Inference Attacks: Inferring Sensitive Class Information by Diffusion-Based Attribute Manipulations

Paper • 2303.09289 • Published Mar 16, 2023 • 1

MultiFusion: Fusing Pre-Trained Models for Multi-Lingual, Multi-Modal Image Generation

Paper • 2305.15296 • Published May 24, 2023

Mitigating Inappropriateness in Image Generation: Can there be Value in Reflecting the World's Ugliness?

Paper • 2305.18398 • Published May 28, 2023 • 1

Interactively Providing Explanations for Transformer Language Models

Paper • 2110.02058 • Published Sep 2, 2021

Multilingual Text-to-Image Generation Magnifies Gender Stereotypes and Prompt Engineering May Not Help You

Paper • 2401.16092 • Published Jan 29, 2024

A Typology for Exploring the Mitigation of Shortcut Behavior

Paper • 2203.03668 • Published Mar 4, 2022

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 43

ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming

Paper • 2404.08676 • Published Apr 6, 2024 • 3

LLavaGuard: VLM-based Safeguards for Vision Dataset Curation and Safety Assessment

Paper • 2406.05113 • Published Jun 7, 2024 • 2

SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Paper • 2411.07122 • Published Nov 11, 2024 • 1

AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons

Paper • 2503.05731 • Published Feb 19 • 1

EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition

Paper • 2505.20033 • Published about 1 month ago • 3

EmoNet-Voice: A Fine-Grained, Expert-Verified Benchmark for Speech Emotion Detection

Paper • 2506.09827 • Published 15 days ago • 17

authored a paper 5 months ago

MSTS: A Multimodal Safety Test Suite for Vision-Language Models

Paper • 2501.10057 • Published Jan 17 • 9

authored a paper 6 months ago

LLMs Lost in Translation: M-ALERT uncovers Cross-Linguistic Safety Gaps

Paper • 2412.15035 • Published Dec 19, 2024 • 4

authored a paper over 1 year ago

LEDITS++: Limitless Image Editing using Text-to-Image Models

Paper • 2311.16711 • Published Nov 28, 2023 • 24
