---
title: Merlin AI Coach
emoji: 🧙
colorFrom: purple
colorTo: red
sdk: gradio
sdk_version: 5.33.1
app_file: app_merlin_ai_coach.py
pinned: false
license: mit
short_description: Merlin is an AI Coach for goal planning
tags:
- agent-demo-track
- Agents-MCP-Hackathon
---

# 🧙‍♂️ Merlin AI Coach

[YouTube Preview](https://www.youtube.com/watch?v=2tPf6CM68yk)

Merlin AI Coach is an interactive planning assistant that helps you reach your goals with clarity, structure, and deep context. Whether you're working on personal growth, research, fitness, or any complex project, Merlin guides you through each step.

---

![Merlin AI Coach Main UI](screenshots/main_ui.png)

## 🚀 Key Features

### 1. Deep Planning Capabilities

Merlin helps you break ambitious goals down into actionable steps and supports detailed, multi-stage planning for any objective.

### 2. Interactive Clarification

Merlin doesn't just take instructions: it asks clarifying questions and builds a tailored plan together with you, so that your needs are fully understood.

![Clarification Chat](screenshots/clarification.png)

### 3. Timestamped Notes & Conclusions

All key notes and conclusions are timestamped, giving you a clear timeline of your journey and supporting long-context workflows.

![Session Notes](screenshots/note_taking_for_user_and_long_context_support.png)

### 4. Progress Checklist

Track your progress through a LangChain-driven checklist of conversation stages:

- **Goal Setting**
- **Research**
- **Planning**
- **Execution**
- **Review**

![Checklist](screenshots/langchain_structured_conversation_stage_flow.png)

### 5. Dynamic Task Management

Tasks are generated automatically from your plan. Use the controls to mark them as **Done** or **To Do**; each change triggers further plan development and status tracking.

![Task Management](screenshots/task.png)

### 6. Powerful Abilities

Merlin can:

- Search the web for up-to-date information
- Read and analyze Google Sheets
- Summarize research papers
- Perform mathematical calculations
- Create and manage user tasks
- Maintain and update state
- Query Wikipedia

![Web Research](screenshots/research_through_web.png)
![Google Sheets Example](screenshots/sheet_example.png)
![Google Sheets Integration](screenshots/read_from_external_google_sheet.png)

### 7. Advanced Local Tool Calls

Merlin uses self-built local tool calls for added flexibility and performance.

### 8. Flexible Backend

Powered by **LangChain**, **Nebius**, and **Modal**, Merlin delivers reliable, scalable, and context-aware AI planning.

---

## 🧑‍🎤 Choose Your Avatar

Personalize your coaching experience by choosing one of three avatars, each with its own style:

- **Grandma:** Your sweet, encouraging coach who supports you with warmth and wisdom.
- **Default:** The classic Merlin: professional, balanced, and always helpful.
- **Drill Instructor:** A strict coach who pushes you to excel, even scolding you when needed for extra motivation!

![Avatar Selection](screenshots/planning_with_coach_avatar_optional_tts.png)

---

## 🗣️ Natural Voice Interaction (Optional)

Plan hands-free with Merlin's optional voice features:

- **Whisper-powered Audio Input:** Speak your instructions or ideas; Merlin transcribes and understands them.
- **TTS Output:** Merlin can reply with synthesized speech, making your planning sessions more interactive and accessible.

---

## 🌟 Why Choose Merlin AI Coach?

- **Personalized Guidance:** Merlin adapts to your workflow and goals.
- **Context Awareness:** Never lose track: Merlin remembers your progress and builds on it.
- **Seamless Integration:** Works with external tools and data sources such as web search, Google Sheets, and Wikipedia.
- **Continuous Improvement:** Each interaction refines your plan and its execution.

## POST DEADLINE COMMIT CHANGES:

- **gTTS replaces TTS:** In the last commit before the deadline, the optional text-to-speech feature was manually disabled to work around a dependency issue that the TTS package caused in the HF Space. The commit *"fix tts with gts"* replaces the TTS package with gTTS. The change is confined to the **synthesize_speech(text)** method, which now uses gTTS instead of TTS. The temporary disabling of text-to-speech has been removed, so the feature is available again.

---
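
For readers curious about the gTTS swap described in the post-deadline notes, here is a minimal sketch of what a gTTS-based `synthesize_speech` might look like. It is not the code from the repository: the `lang` and `tts_factory` parameters are assumptions added for illustration (the README only names `synthesize_speech(text)`), and returning a temporary MP3 file path is likewise an assumed convention.

```python
import os
import tempfile


def synthesize_speech(text, lang="en", tts_factory=None):
    """Write `text` as speech to a temporary MP3 file and return its path.

    Returns None when there is nothing to say. `lang` and `tts_factory`
    are hypothetical parameters for this sketch, not part of the
    original method signature.
    """
    if not text or not text.strip():
        return None

    if tts_factory is None:
        # Imported lazily so the rest of the app can start even if
        # gTTS is not installed (text-to-speech is an optional feature).
        from gtts import gTTS
        tts_factory = gTTS

    # Reserve a temp file name, then let the TTS engine write to it.
    fd, path = tempfile.mkstemp(suffix=".mp3")
    os.close(fd)
    tts_factory(text=text, lang=lang).save(path)
    return path
```

Note that gTTS sends the text to Google's online text-to-speech service, so the call needs network access at runtime. The injectable `tts_factory` is only there so the function can be exercised offline with a stand-in class.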