Update README.md
Browse files
README.md
CHANGED
@@ -15,4 +15,46 @@ tags:
|
|
15 |
short_description: 'Turn any document into Interactive presentation '
|
16 |
---
|
17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|
|
|
15 |
short_description: 'Turn any document into Interactive presentation '
|
16 |
---
|
17 |
|
18 |
+
# 🤖 SlideDeck AI
|
19 |
+
|
20 |
+
### An autonomous AI agent that turns your raw documents into stunning presentations.
|
21 |
+
|
22 |
+
---
|
23 |
+
|
24 |
+
### 🚀 Watch the Demo!
|
25 |
+
|
26 |
+
* **🎥 Watch the Video Walkthrough on YouTube:** [[Link to your YouTube Video](https://www.youtube.com/watch?v=qBfyChj9_Q0&ab_channel=Revanth)]
|
27 |
+
|
28 |
+
|
29 |
+
---
|
30 |
+
|
31 |
+
This project is a submission for the **Hugging Face & Gradio Agents & MCP Hackathon**. It demonstrates a powerful, multi-tool agentic pipeline that handles everything from creative direction to asset generation.
|
32 |
+
|
33 |
+
---
|
34 |
+
|
35 |
+
## ✨ Key Features
|
36 |
+
|
37 |
+
SlideDeck AI acts as your personal creative director, designer, and narrator. It's not just a summarizer; it's a content creator.
|
38 |
+
|
39 |
+
* **🧠 Intelligent Document Analysis:** Upload multiple documents (`PDF`, `DOCX`, `TXT`), and the agent uses **LlamaParse** to understand and synthesize the core information.
|
40 |
+
* **🎨 AI Creative Direction:** The agent analyzes your topic and generates a complete visual theme, including a color palette, font pairings, and a compelling title for your presentation.
|
41 |
+
* **🖼️ Contextual AI Image Generation:** For high-impact slides, the agent writes its own detailed prompts and uses them to generate beautiful, custom images that match the slide's content.
|
42 |
+
* **🎤 AI-Narrated Speaker Notes:** Every slide comes with a full set of speaker notes, which are then converted into high-quality audio narration for you to listen to.
|
43 |
+
* **📊 Dynamic CSS Visualizations:** When an image isn't needed, the agent builds data-driven visualizations (like charts and diagrams) using pure HTML and CSS.
|
44 |
+
* **🚀 One-Click Export:** Download your final, beautifully designed presentation as a self-contained HTML file or a shareable PDF.
|
45 |
+
|
46 |
+
---
|
47 |
+
|
48 |
+
## ⚙️ How It Works: The Agentic Pipeline
|
49 |
+
|
50 |
+
SlideDeck AI uses a chain of specialized tools and AI models to achieve its results. Each step passes its output to the next in a seamless agentic workflow:
|
51 |
+
|
52 |
+
1. **Parse Tool (`LlamaParse`):** The agent takes your uploaded files and uses LlamaParse to convert them into clean, structured Markdown text.
|
53 |
+
2. **Creative Plan Tool (`Sambanova`):** This Markdown text is fed to a powerful LLM on Sambanova. The LLM acts as a Creative Director, outputting a detailed JSON "master plan" that defines the entire presentation—slide titles, key points, speaker notes, and even prompts for image generation.
|
54 |
+
3. **Image Asset Tool (`Nebius`):** The agent extracts the image prompts from the JSON plan and sends them to the Flux-Schnell model via Nebius to generate visual assets.
|
55 |
+
4. **Audio Asset Tool (`Modal`):** The speaker notes from the JSON plan are sent to a custom Text-to-Speech model (https://huggingface.co/hexgrad/Kokoro-82M) deployed on Modal, which returns audio files for each slide.
|
56 |
+
5. **HTML Builder Tool (`Sambanova`):** Finally, the agent combines the JSON plan (now updated with image and audio URLs) and feeds it to another powerful LLM on Nebius. This model writs the complete, final HTML and CSS for the presentation.
|
57 |
+
|
58 |
+
---
|
59 |
+
|
60 |
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
|