MultiAgent_System_for_Screenplay_Creation

Running

App Files Files Community

luke9705 commited on Jun 7

Commit

c4a371d

1 Parent(s): 2ddbac2

Enhance Gradio interface: Update title to 'Scriptura: A MultiAgent System for Screenplay Creation and Editing' and add detailed description for improved user understanding.

Browse files

Files changed (1) hide show

app.py +21 -2

app.py CHANGED Viewed

@@ -13,6 +13,7 @@ import openai
 from openai import OpenAI
 import pdfplumber
 import numpy as np
 ## utilties and class definition
@@ -300,19 +301,37 @@ def initialize_agent():
     return agent
 ## gradio interface
 global agent
 agent = initialize_agent()
 demo = gr.ChatInterface(
                     fn=respond,
                     type='messages',
                     multimodal=True,
-                    title='MultiAgent System for Screenplay Creation and Editing',
                     show_progress='full',
                     fill_height=True,
                     fill_width=True,
                     save_history=True,
                     autoscroll=True,
-                    #css = css_snippet,
                     additional_inputs=[
                         gr.Checkbox(value=False, label="Web Search",
                                 info="Enable web search to find information online. If disabled, the agent will only use the provided files and images.",

 from openai import OpenAI
 import pdfplumber
 import numpy as np
+import textwrap
 ## utilties and class definition
     return agent
 ## gradio interface
+description = textwrap.dedent("""**Scriptura** is a multi-agent AI framework based on HF-SmolAgents that streamlines the creation of screenplays, storyboards,
+and soundtracks by automating the stages of analysis, summarization, and multimodal enrichment, freeing authors to focus on pure creativity.
+At its heart:
+- **Qwen3-32B** serves as the primary orchestrating agent, coordinating workflows and managing high-level reasoning across the system.
+- **Gemma-3-27B-IT** acts as a specialized assistant for multimodal tasks, supporting both text and audio inputs to refine narrative elements and prepare them for downstream generation.
+For media generation, Scriptura integrates:
+- **MusicGen** models (per the AudioCraft MusicGen specification), deployed via Hugging Face Spaces,
+enabling the agent to produce original soundtracks and sound effects from text prompts or combined text + audio samples.
+- **FLUX (black-forest-labs/FLUX.1-dev)** for on-the-fly image creation—ideal for storyboards, concept art, and
+visual references that seamlessly tie into the narrative flow.
+Optionally, Scriptura can query external sources (e.g., via a DuckDuckGo API integration) to pull in reference scripts, sound samples, or research materials,
+ensuring that every draft is not only creatively rich but also contextually informed.
+For more information: [README.md](https://huggingface.co/spaces/Lab9705/MultiAgent_System_for_Screenplay_Creation/blob/main/README.md)
+""")
 global agent
 agent = initialize_agent()
 demo = gr.ChatInterface(
                     fn=respond,
                     type='messages',
                     multimodal=True,
+                    title='Scriptura: A MultiAgent System for Screenplay Creation and Editing',
+                    description=description,
                     show_progress='full',
                     fill_height=True,
                     fill_width=True,
                     save_history=True,
                     autoscroll=True,
                     additional_inputs=[
                         gr.Checkbox(value=False, label="Web Search",
                                 info="Enable web search to find information online. If disabled, the agent will only use the provided files and images.",