Generate web application code from descriptions
Challenge LLMs with riddles!
Search and submit code models for evaluation