BladeSzaSza committed
Commit 19b4b73 · 1 Parent(s): b5223b4

moved to streamlit
.claude/settings.local.json CHANGED
@@ -22,7 +22,8 @@
     "Bash(git reset:*)",
     "WebFetch(domain:github.com)",
     "Bash(timeout:*)",
-    "Bash(git rm:*)"
+    "Bash(git rm:*)",
+    "Bash(chmod:*)"
   ],
   "deny": []
 }
CLAUDE.md CHANGED
@@ -4,34 +4,33 @@ This file provides guidance to Claude Code (claude.ai/code) when working with co
 
 ## Project Overview
 
-DigiPal is an advanced AI-powered virtual monster companion application built with Gradio 5.34.2, featuring deep AI conversations using Qwen 2.5 models, comprehensive monster care systems, sophisticated evolution mechanics, and 3D model generation capabilities. This is a complex multi-component system designed for deployment on Hugging Face Spaces with Zero GPU optimization.
+DigiPal is an advanced AI-powered virtual monster companion application built with Streamlit, featuring deep AI conversations using Qwen 2.5 models, Kyutai STT speech recognition, comprehensive monster care systems, sophisticated evolution mechanics, and cutting-edge 3D model generation via the OmniGen2 → Hunyuan3D-2.1 → UniRig pipeline. This is a streamlined multi-component system designed for modern deployment with HuggingFace integration.
 
 ## Architecture
 
 ### Core Technologies
-- **Frontend**: Gradio 5.34.2 with custom CSS and streaming support
-- **AI Models**: Qwen 2.5-1.5B-Instruct for conversations, Faster Whisper for speech
-- **3D Pipeline**: Hunyuan3D and open-source alternatives for 3D model generation
+- **Frontend**: Streamlit with modern cyberpunk UI design
+- **Backend**: FastAPI with WebSocket support for real-time updates
+- **AI Models**: Qwen 2.5-1.5B-Instruct for conversations, Kyutai STT-2.6b-en for speech
+- **3D Pipeline**: OmniGen2 → Hunyuan3D-2.1 → UniRig (text-to-image-to-3D-to-rigged)
 - **Framework**: Python 3.11+ with asyncio for concurrent operations
 - **Database**: SQLite for monster persistence with async operations
-- **Deployment**: Hugging Face Spaces with Zero GPU optimization, Docker support
+- **Deployment**: Modern architecture with HuggingFace integration, Docker support
 
 ### Component Structure
 ```
 src/
 ├── ai/                                    # AI processing components
 │   ├── qwen_processor.py                  # Qwen 2.5 conversation engine
-│   └── speech_engine.py                   # Whisper speech recognition
+│   └── speech_engine.py                   # Kyutai STT speech recognition
 ├── core/                                  # Core game logic
 │   ├── monster_engine.py                  # Monster stats, evolution, persistence
 │   ├── monster_engine_dw1.py              # DW1-aligned monster mechanics (reference)
-│   ├── evolution_system.py                # Evolution mechanics
-│   └── monster_3d_hunyuan_integration.py  # Hunyuan3D specific integration
+│   └── evolution_system.py                # Evolution mechanics
 ├── pipelines/                             # 3D generation pipelines
-│   ├── hunyuan3d_pipeline.py              # Hunyuan3D integration
-│   └── opensource_3d_pipeline_v2.py       # Enhanced 3D pipeline with MCP
+│   └── opensource_3d_pipeline_v2.py       # Production 3D pipeline: OmniGen2 → Hunyuan3D → UniRig
 ├── ui/                                    # User interface
-│   ├── gradio_interface.py                # Main Gradio interface
+│   ├── streamlit_interface.py             # Modern Streamlit interface
 │   └── state_manager.py                   # Browser state management
 ├── deployment/                            # Deployment optimization
 │   └── zero_gpu_optimizer.py              # Zero GPU resource management
@@ -43,14 +42,22 @@ src/
 
 ### Running the Application
 ```bash
-# Run the unified backend with API server
+# Run the complete application (FastAPI + Streamlit)
+python run_digipal.py
+
+# Or run components separately:
+
+# Run the FastAPI backend server
 python app.py
 
+# Run the Streamlit frontend (in another terminal)
+streamlit run src/ui/streamlit_interface.py
+
 # Run with debug logging
 LOG_LEVEL=DEBUG python app.py
 
 # Run with specific configuration
-SERVER_PORT=8080 API_PORT=8081 SHARE=true python app.py
+API_PORT=8081 python app.py
 
 # Run with MCP enabled
 MCP_ENDPOINT=https://your-mcp-server MCP_API_KEY=your-key python app.py
@@ -110,17 +117,20 @@ pytest
 
 ### AI Conversation System
 - **Qwen 2.5 integration** with quantization support (8-bit) for GPU efficiency
+- **Kyutai STT-2.6b-en** for high-quality speech-to-text conversion
 - **Context-aware conversations** with personality-based system prompts
 - **Mood-responsive dialogue** based on current monster stats
 - **Conversation history management** with automatic truncation
 - **Flash Attention 2** optimization when available
 
 ### 3D Generation Pipeline
-- **Multiple model providers**: Hunyuan3D, open-source models via Hugging Face, MCP protocol
-- **Text-to-3D conversion**: Generate 3D models from text descriptions
+- **OmniGen2**: Advanced text-to-image generation with multi-view consistency
+- **Hunyuan3D-2.1**: State-of-the-art image-to-3D conversion via the official HuggingFace Space API
+- **UniRig**: Automatic 3D model rigging via HuggingFace integration
+- **Complete Pipeline**: text → multi-view images → 3D mesh → rigged model
+- **Fallback Systems**: Graceful degradation when APIs are unavailable
 - **Model caching**: Efficient reuse of generated 3D assets
 - **Async generation**: Non-blocking 3D model creation
-- **MCP integration**: Access to external model services via Model Context Protocol
 
 ### State Management
 - **Async SQLite operations** for monster persistence
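For the 8-bit quantization mentioned in the AI Conversation System bullets above, here is a minimal sketch of how such loading typically looks with transformers and bitsandbytes (both pinned in requirements.txt). The actual wiring inside qwen_processor.py and its config objects may differ; this is illustrative only.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

# 8-bit quantization roughly halves VRAM versus fp16 at a small quality cost.
MODEL_ID = "Qwen/Qwen2.5-1.5B-Instruct"

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",  # place layers on GPU when one is available
)

# Chat-style generation using the model's built-in chat template.
messages = [{"role": "user", "content": "Say hello to your caretaker!"}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(inputs, max_new_tokens=64)
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```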
CLAUDE_SVELTE_FRONTEND_GUIDE.md DELETED
@@ -1,186 +0,0 @@
-# **Claude Development Guide: DigiPal Svelte Frontend**
-
-This document outlines the plan for the complete UI overhaul of DigiPal, moving from Gradio to a custom SvelteKit application with voice-first interaction.
-
-## **Status: Implementation Complete ✅**
-
-All major components have been implemented:
-- ✅ Unified backend with FastAPI + WebSocket support
-- ✅ SvelteKit frontend structure
-- ✅ Voice-first interaction system
-- ✅ DigiVice-style UI components
-- ✅ 3D rendering with Threlte
-- ✅ MCP integration
-- ✅ Real-time WebSocket updates
-- ✅ Cyberpunk-retro styling
-
-## **1. Project Architecture**
-
-### **Backend (Python)**
-- **Main Application**: `app.py` - Unified application with all features enabled
-- **API Server**: FastAPI on port 7861
-- **Gradio Admin**: Running on port 7860 as fallback/admin interface
-- **WebSocket**: Real-time stat updates and model changes
-- **MCP Support**: Configurable via environment variables
-
-### **Frontend (SvelteKit)**
-Located in `/frontend` directory:
-- **Framework**: SvelteKit with TypeScript
-- **3D Rendering**: Threlte (Three.js wrapper for Svelte)
-- **Styling**: Tailwind CSS with custom cyberpunk-retro theme
-- **Voice**: Web Speech API with intent parsing
-- **State**: Svelte stores for reactive state management
-
-## **2. Running the Application**
-
-### **Backend Setup**
-```bash
-# Install Python dependencies
-pip install -r requirements.txt
-
-# Run the unified backend
-python app.py
-
-# Or with MCP enabled
-MCP_ENDPOINT=https://your-mcp-endpoint MCP_API_KEY=your-key python app.py
-```
-
-### **Frontend Setup**
-```bash
-# Navigate to frontend directory
-cd frontend
-
-# Install dependencies
-npm install
-
-# Run development server
-npm run dev
-
-# Build for production
-npm run build
-```
-
-## **3. API Endpoints**
-
-The backend exposes these REST endpoints on port 7861:
-
-- `GET /api/monsters` - List all monsters
-- `POST /api/monsters` - Create new monster
-- `GET /api/monsters/{id}` - Get monster details
-- `POST /api/monsters/{id}/action` - Perform care action
-- `POST /api/monsters/{id}/talk` - Send message
-- `POST /api/monsters/{id}/generate-3d` - Generate 3D model
-- `WS /api/monsters/{id}/ws` - WebSocket for real-time updates
-
-## **4. Voice Commands**
-
-The system recognizes these voice intents:
-
-### **Care Actions**
-- "Feed [food type]" → `feed` action
-- "Train [skill]" → `train` action
-- "Play with monster" → `play` action
-- "Clean monster" → `clean` action
-- "Heal monster" → `heal` action
-- "Let monster rest" → `rest` action
-- "Discipline monster" → `discipline` action
-
-### **3D Generation**
-- "Generate 3D model" → triggers 3D generation
-- "Create a [description]" → generates with description
-
-### **Conversation**
-- Any other speech → sent as conversation to monster
-
-## **5. Component Structure**
-
-### **Core Components**
-- `Device.svelte` - Main device container
-- `Screen.svelte` - 3D display with CRT effect
-- `MonsterScene.svelte` - Three.js scene for monster
-- `Dpad.svelte` - Directional pad control
-- `ActionButton.svelte` - A/B buttons
-- `VoiceButton.svelte` - Voice activation
-- `HolographicStats.svelte` - Stats overlay
-
-### **Services**
-- `api.ts` - Backend communication
-- `voice.ts` - Speech recognition & intent parsing
-- `mcp.ts` - Model Context Protocol integration
-
-### **Stores**
-- `monsterStore.ts` - Monster state management
-- `voiceStore.ts` - Voice input state
-
-## **6. Styling Guide**
-
-The UI uses a cyberpunk-retro aesthetic:
-
-### **Color Palette**
-- DigiPal Orange: `#FF6B00`
-- DigiPal Teal: `#00CED1`
-- DigiPal Gray: `#2D2D2D`
-- Neon Magenta: `#FF00FF`
-- Neon Cyan: `#00FFFF`
-
-### **Effects**
-- CRT scanlines animation
-- Holographic glitch effect
-- Neon glow shadows
-- Pixel fonts for UI text
-
-## **7. MCP Integration**
-
-When MCP is configured, the system will:
-1. Use MCP for AI conversations instead of local Qwen
-2. Use MCP for 3D generation instead of local pipeline
-3. Optionally use MCP for speech-to-text
-
-Configure via environment variables:
-```bash
-MCP_ENDPOINT=https://your-mcp-server.com
-MCP_API_KEY=your-api-key
-```
-
-## **8. Development Tips**
-
-### **Adding New Voice Commands**
-1. Update intent parsing in `voice.ts`
-2. Add handler in `voiceStore.ts`
-3. Implement action in `monsterStore.ts`
-
-### **Adding New UI Components**
-1. Create component in `src/lib/components/`
-2. Apply cyberpunk-retro styling
-3. Connect to appropriate stores
-
-### **Extending 3D Features**
-1. Update `MonsterScene.svelte` for new animations
-2. Add model loading logic
-3. Implement interactive features
-
-## **9. Production Deployment**
-
-### **Build Process**
-```bash
-# Backend
-docker build -t digipal .
-
-# Frontend
-cd frontend && npm run build
-```
-
-### **Environment Variables**
-- `API_PORT`: Backend API port (default: 7861)
-- `SERVER_PORT`: Gradio port (default: 7860)
-- `MCP_ENDPOINT`: MCP server URL
-- `MCP_API_KEY`: MCP authentication
-
-## **10. Future Enhancements**
-
-- [ ] PWA support for mobile devices
-- [ ] Offline voice recognition
-- [ ] Multiplayer monster interactions
-- [ ] AR mode using device camera
-- [ ] Custom shader effects for monsters
-- [ ] Voice synthesis for monster responses
DEPLOYMENT.md ADDED
@@ -0,0 +1,213 @@
+# DigiPal Deployment Guide
+
+## Quick Start
+
+### Prerequisites
+- Python 3.11+
+- Node.js 18+ (for Svelte frontend, if using)
+- Git
+
+### Installation
+
+1. **Clone the repository:**
+```bash
+git clone <repository-url>
+cd digiPal
+```
+
+2. **Install Python dependencies:**
+```bash
+pip install -r requirements.txt
+```
+
+3. **Set up environment variables (optional):**
+```bash
+export HF_TOKEN="your_huggingface_token"  # For private models/spaces
+export MCP_ENDPOINT="your_mcp_endpoint"   # For MCP integration
+export MCP_API_KEY="your_mcp_key"
+```
+
+### Running DigiPal
+
+#### Option 1: Complete Application (Recommended)
+```bash
+python run_digipal.py
+```
+This starts both the FastAPI backend and Streamlit frontend.
+
+**Access:**
+- **Streamlit UI**: http://localhost:8501
+- **API Backend**: http://localhost:7861
+
+#### Option 2: Manual Startup
+Terminal 1 (Backend):
+```bash
+python app.py
+```
+
+Terminal 2 (Frontend):
+```bash
+streamlit run src/ui/streamlit_interface.py
+```
+
+#### Option 3: Svelte Frontend (Advanced)
+```bash
+# Terminal 1: Start backend
+python app.py
+
+# Terminal 2: Start Svelte frontend
+cd frontend
+npm install
+npm run dev
+```
+
+## Architecture Overview
+
+### Technology Stack
+- **Frontend**: Streamlit (modern cyberpunk UI)
+- **Backend**: FastAPI with WebSocket support
+- **AI Models**:
+  - Qwen 2.5-1.5B-Instruct (conversations)
+  - Kyutai STT-2.6b-en (speech recognition)
+- **3D Pipeline**: OmniGen2 → Hunyuan3D-2.1 → UniRig
+- **Database**: SQLite with async operations
+
+### API Endpoints
+
+**Monster Management:**
+- `GET /api/monsters` - List all monsters
+- `POST /api/monsters` - Create new monster
+- `GET /api/monsters/{id}` - Get monster details
+- `POST /api/monsters/{id}/action` - Perform care action
+- `POST /api/monsters/{id}/talk` - Send message to monster
+- `POST /api/monsters/{id}/generate-3d` - Generate 3D model
+
+**WebSocket:**
+- `WS /api/monsters/{id}/ws` - Real-time updates
+
+## Configuration
+
+### Environment Variables
+
+| Variable | Description | Default |
+|----------|-------------|---------|
+| `LOG_LEVEL` | Logging level | `INFO` |
+| `API_PORT` | FastAPI backend port | `7861` |
+| `HF_TOKEN` | HuggingFace API token | None |
+| `MCP_ENDPOINT` | MCP service endpoint | None |
+| `MCP_API_KEY` | MCP API key | None |
+
+### Hardware Requirements
+
+**Minimum:**
+- 8GB RAM
+- 4GB free disk space
+- Internet connection (for HuggingFace APIs)
+
+**Recommended:**
+- 16GB RAM
+- NVIDIA GPU with 8GB+ VRAM
+- SSD storage
+- High-speed internet
+
+## 3D Generation Pipeline
+
+The application uses a modern 3D generation pipeline:
+
+1. **Text Input** → User describes their monster
+2. **OmniGen2** → Generates multi-view images
+3. **Hunyuan3D-2.1** → Converts images to 3D mesh
+4. **UniRig** → Automatically rigs the 3D model
+5. **Output** → Fully rigged 3D model ready for animation
+
+### API Integration
+- **OmniGen2**: Via transformers/diffusers pipeline
+- **Hunyuan3D-2.1**: Via official HuggingFace Space API
+- **UniRig**: Via HuggingFace model repository
+
+## Deployment Options
+
+### Local Development
+Use the quick start guide above.
+
+### Docker (Future)
+```bash
+docker build -t digipal .
+docker run -p 7861:7861 -p 8501:8501 digipal
+```
+
+### HuggingFace Spaces
+1. Fork/upload repository to HuggingFace Spaces
+2. Set Space type to "Streamlit"
+3. Configure secrets for HF_TOKEN if needed
+4. Space will auto-deploy
+
+## Troubleshooting
+
+### Common Issues
+
+**Port Already in Use:**
+```bash
+# Change ports
+API_PORT=8081 python app.py
+streamlit run src/ui/streamlit_interface.py --server.port 8502
+```
+
+**Missing Dependencies:**
+```bash
+pip install -r requirements.txt --upgrade
+```
+
+**3D Generation Fails:**
+- Check internet connection
+- Verify HF_TOKEN if using private models
+- Pipeline includes fallback mechanisms
+
+**Streamlit Not Starting:**
+```bash
+pip install streamlit --upgrade
+streamlit --version
+```
+
+### Performance Optimization
+
+**For GPU Systems:**
+- Ensure CUDA is properly installed
+- Models will automatically use GPU when available
+
+**For CPU-Only Systems:**
+- Increase timeout values for 3D generation
+- Consider using smaller model variants
+
+## Monitoring
+
+### Logs
+- Application logs: `logs/digipal.log`
+- Streamlit logs: Console output
+- FastAPI logs: Console output with timestamps
+
+### Health Check
+```bash
+curl http://localhost:7861/health
+```
+
+## Support
+
+For issues and questions:
+1. Check this deployment guide
+2. Review `CLAUDE.md` for development details
+3. Check console logs for error messages
+
+## New Tech Stack Summary
+
+**Replaced:**
+- Gradio → Streamlit (modern UI)
+- Faster Whisper → Kyutai STT-2.6b-en (better accuracy)
+- Complex 3D pipeline → Streamlined OmniGen2 → Hunyuan3D → UniRig
+
+**Benefits:**
+- Modern, responsive UI with cyberpunk theme
+- Better speech recognition quality
+- State-of-the-art 3D generation pipeline
+- Simplified deployment and maintenance
+- Better separation of frontend/backend
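To illustrate the `WS /api/monsters/{id}/ws` endpoint listed above, here is a minimal client sketch using the `websockets` package (already pinned in requirements.txt). The message schema is an assumption; the backend defines what it actually pushes:

```python
import asyncio
import json
import websockets

async def watch_monster(monster_id: str) -> None:
    """Print live updates pushed by the DigiPal backend for one monster."""
    uri = f"ws://localhost:7861/api/monsters/{monster_id}/ws"
    async with websockets.connect(uri) as ws:
        async for raw in ws:
            update = json.loads(raw)  # assumed to be a JSON stat/model update
            print(update)

if __name__ == "__main__":
    asyncio.run(watch_monster("example-monster-id"))  # hypothetical ID
```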
DIGIPAL_V2_GUIDE.md DELETED
@@ -1,355 +0,0 @@
-# DigiPal V2 - Complete Development Guide
-
-## Overview
-
-DigiPal V2 is a revolutionary digital monster companion application that combines:
-- **Authentic Digimon World 1 mechanics** based on comprehensive reverse engineering
-- **Modern AI conversations** using Qwen 2.5 models
-- **Optional 3D generation** using cutting-edge open-source pipeline (Flux → Sparc3D → UniRig)
-- **Real-time care simulation** with mortality and complex evolution
-
-## Architecture
-
-### Core Components
-
-```
-digiPal/
-├── src/
-│   ├── core/
-│   │   ├── monster_engine_dw1.py      # DW1-accurate monster mechanics
-│   │   └── monster_3d_integration.py  # 3D model management
-│   ├── pipelines/
-│   │   ├── text_to_3d_pipeline.py         # Commercial 3D pipeline (Meshy AI)
-│   │   └── opensource_3d_pipeline_v2.py   # Open-source pipeline (Flux/Sparc3D/UniRig)
-│   ├── ui/
-│   │   └── gradio_interface_v2.py     # Enhanced Gradio interface
-│   └── ai/
-│       └── qwen_processor.py          # AI conversation system
-├── app_v2.py                          # Main application entry
-└── README.md                          # HuggingFace Spaces config
-```
-
-## Key Features Implementation
-
-### 1. DW1-Accurate Monster System
-
-The `DW1Monster` class implements authentic mechanics:
-
-```python
-# Lifespan system with mortality
-LIFESPAN_DAYS = {
-    "BABY": 1, "CHILD": 3, "ADULT": 5,
-    "PERFECT": 6, "ULTIMATE": 7, "MEGA": 8
-}
-
-# Complex stat system
-- HP/MP (max 9999)
-- Offense/Defense/Speed/Brains (max 999)
-- Weight (5-99, affects evolution)
-- Nature (0-100, affects techniques)
-
-# Care mechanics
-- Hunger (depletes faster during activity)
-- Energy (restored by sleep)
-- Toilet needs (every 3 hours)
-- Happiness (affects training)
-- Discipline (affects obedience)
-- Sickness system (Cold, Injury, Fatigue, Stomach)
-
-# Evolution system
-- Time-based progression
-- Stat requirements
-- Care quality thresholds
-- Weight requirements
-- Special conditions (perfect week, tournament wins)
-```
-
-### 2. Training System
-
-Based on DW1's gym mechanics:
-```python
-# Training types match DW1 locations
-- HP Training (Green Gym)
-- MP Training (Beetle Land)
-- Offense Training (Dojo)
-- Defense Training (Ice Gym)
-- Speed Training (Speed Gym)
-- Brains Training (Library)
-
-# Performance-based gains
-- PERFECT: 150% gain
-- GREAT: 120% gain
-- GOOD: 100% gain
-- MISS: 50% gain + care mistake
-```
-
-### 3. Evolution Mechanics
-
-Implements DW1's complex branching:
-```python
-# Baby → Child (6 hours)
-- Personality-based (Brave→Agumon, Calm→Gabumon)
-- Care quality affects outcome
-
-# Child → Adult (24 hours)
-- Stat requirements (total > 200)
-- Weight requirements (15-35)
-- Care mistakes < 5
-- Highest stat determines form
-
-# Adult → Perfect (72 hours)
-- High stats (total > 500)
-- Excellent care (mistakes < 3)
-- Battle experience (15+ wins)
-
-# Perfect → Ultimate (96 hours)
-- Perfect care week required
-- Very high stats (total > 800)
-- Extensive battles (30+ wins)
-```
-
-### 4. 3D Generation Pipeline
-
-Two implementation options:
-
-#### Commercial Pipeline (Meshy AI)
-```python
-# Fast, high-quality, API-based
-- Text → Meshy AI → 3D Model
-- 2-12 credits per generation
-- Built-in rigging support
-- PBR textures included
-```
-
-#### Open-Source Pipeline (Production Ready)
-```python
-# Free, cutting-edge, locally runnable
-1. Text → Flux (HF Spaces API)
-   - Multi-view generation (6 views)
-   - Consistent character design
-
-2. Images → Sparc3D (1024³ resolution)
-   - Ultra-high quality mesh
-   - Texture projection
-
-3. Mesh → UniRig (Auto-rigging)
-   - Procedural skeleton generation
-   - Weight painting
-   - Animation-ready output
-```
-
-## Setup Instructions
-
-### Basic Setup (No 3D)
-
-```bash
-# Clone repository
-git clone https://github.com/yourusername/digiPal
-cd digiPal
-
-# Install dependencies
-pip install -r requirements.txt
-
-# Run application
-python app_v2.py
-```
-
-### Full Setup (With 3D)
-
-```bash
-# Install additional dependencies
-pip install gradio_client trimesh pillow
-
-# Clone required repositories
-git clone https://github.com/lizhihao6/Sparc3D
-git clone https://github.com/VAST-AI-Research/UniRig
-
-# Configure API keys (optional)
-export HF_TOKEN="your_huggingface_token"
-
-# Run with 3D enabled
-ENABLE_3D=true python app_v2.py
-```
-
-### HuggingFace Spaces Deployment
-
-The application is optimized for Spaces:
-
-1. **YAML Configuration** (in README.md):
-```yaml
-suggested_hardware: zero-a10g
-suggested_storage: medium
-```
-
-2. **Environment Variables**:
-```
-ENABLE_3D=true
-ENABLE_AI=true
-LOG_LEVEL=INFO
-```
-
-## Usage Guide
-
-### Creating Your First Monster
-
-1. **Choose Species Type**: Affects evolution paths
-   - Data: Mechanical/Digital evolutions
-   - Vaccine: Holy/Light evolutions
-   - Virus: Dark/Chaos evolutions
-   - Free: Balanced evolutions
-
-2. **Select Personality**: Affects behavior and growth
-   - Brave: +Offense, -Defense
-   - Calm: +Defense, -Speed
-   - Energetic: +Speed, -MP
-   - Clever: +Brains/MP, -HP
-   - Friendly: Balanced, easier care
-
-### Care Guidelines
-
-1. **Feeding**:
-   - Feed when hunger > 50
-   - Don't overfeed (causes care mistakes)
-   - Different foods have different effects
-
-2. **Training**:
-   - Train when energy > 50
-   - Match training to desired evolution
-   - Perfect performance gives bonus gains
-
-3. **Health**:
-   - Check toilet needs every 3 hours
-   - Heal sickness immediately
-   - Let monster sleep at night (8PM-8AM)
-
-### Evolution Tips
-
-1. **For Strong Evolutions**:
-   - Minimize care mistakes (< 3)
-   - Balanced stat growth
-   - Regular training
-   - Win battles
-
-2. **Special Evolutions**:
-   - Perfect care for 7 days
-   - Win tournaments
-   - Specific stat distributions
-   - Weight requirements
-
-## Technical Details
-
-### Performance Optimization
-
-1. **ZeroGPU Support**:
-```python
-@spaces.GPU(duration=60)
-def generate_3d_model():
-    # GPU-intensive operations
-```
-
-2. **Async Operations**:
-   - Database operations
-   - 3D generation pipeline
-   - Multi-view image generation
-
-3. **Memory Management**:
-   - Model quantization (8-bit)
-   - Texture compression
-   - Mesh decimation
-
-### Customization Options
-
-1. **Monster Definitions**:
-   Edit `monster_engine_dw1.py` to add:
-   - New evolution paths
-   - Custom techniques
-   - Special forms
-
-2. **3D Styles**:
-   Modify prompts in `_generate_3d_prompt()`:
-   - Art style preferences
-   - Detail levels
-   - Color schemes
-
-3. **UI Themes**:
-   Edit `CUSTOM_CSS` in interface:
-   - Color schemes
-   - Fonts
-   - Animations
-
-## Troubleshooting
-
-### Common Issues
-
-1. **"No GPU detected"**
-   - Application works on CPU but slower
-   - 3D generation may timeout
-
-2. **"3D pipeline not available"**
-   - Check Sparc3D/UniRig installation
-   - Verify CUDA installation
-   - Try commercial pipeline instead
-
-3. **"Monster died unexpectedly"**
-   - Check care history
-   - Monitor lifespan remaining
-   - Reduce care mistakes
-
-### Performance Tips
-
-1. **For Faster 3D Generation**:
-   - Use "draft" quality
-   - Reduce texture resolution
-   - Enable model caching
-
-2. **For Better Evolution Rates**:
-   - Train regularly but don't overtrain
-   - Keep happiness > 70
-   - Minimize toilet accidents
-
-## Future Enhancements
-
-### Planned Features
-
-1. **Battle System**:
-   - Turn-based combat
-   - Technique learning
-   - PvP battles
-
-2. **World Exploration**:
-   - File City building
-   - NPC recruitment
-   - Item collection
-
-3. **Breeding System**:
-   - Genetic inheritance
-   - Hybrid forms
-   - Egg management
-
-4. **Mini-Games**:
-   - Training challenges
-   - Fishing
-   - Treasure hunting
-
-## Credits
-
-- **DW1 Reverse Engineering**: SydMontague, Vicen04, Romsstar
-- **3D Pipeline**: Flux (Black Forest Labs), Sparc3D (lizhihao6), UniRig (VAST-AI)
-- **AI Models**: Qwen 2.5 (Alibaba)
-- **Framework**: Gradio (HuggingFace)
-
-## Philosophy
-
-Built following Rick Rubin's creative philosophy:
-- **Strip to essentials**: Focus on core monster-raising mechanics
-- **Amplify emotion**: Every interaction should create connection
-- **Respect the source**: Authentic to Digimon World 1's spirit
-- **Push boundaries**: Modern tech serving classic gameplay
-
-## License
-
-MIT License - See LICENSE file for details
-
----
-
-*"The goal is not to recreate the past, but to capture what made it special and express it through modern possibilities."*
Dockerfile CHANGED
@@ -27,12 +27,12 @@ COPY . .
 # Create necessary directories
 RUN mkdir -p data/saves data/models data/cache logs config
 
-# Expose ports for both API and Gradio
-EXPOSE 7860 7861
+# Expose ports for FastAPI backend and Streamlit frontend
+EXPOSE 7861 8501
 
 # Health check - check API server on port 7861
 HEALTHCHECK --interval=30s --timeout=30s --start-period=60s --retries=3 \
     CMD curl -f http://localhost:7861/health || exit 1
 
-# Run the application
-CMD ["python", "app.py"]
+# Run the complete application (FastAPI + Streamlit)
+CMD ["python", "run_digipal.py"]
QUICK_START.md DELETED
@@ -1,169 +0,0 @@
-# DigiPal Quick Start Guide
-
-## 🚀 Current Status: FULLY FUNCTIONAL
-
-**✅ All major issues resolved!** The application is now ready for deployment.
-
-## 📋 What's Working Right Now
-
-### **✅ Core Systems**
-- Monster creation and management
-- Evolution system with requirements
-- State persistence (SQLite database)
-- Performance tracking
-- Zero GPU optimization
-
-### **✅ AI Systems**
-- Qwen 2.5 text generation (with fallbacks)
-- Speech processing (voice-to-text)
-- Multiple 3D generation pipelines
-- Emotional impact calculation
-
-### **✅ Deployment Ready**
-- Hugging Face Spaces compatible
-- Zero GPU support
-- Automatic resource detection
-- Graceful fallbacks
-
-### **✅ Interfaces**
-- FastAPI backend (Port 7861)
-- Gradio admin panel (Port 7860)
-- WebSocket real-time updates
-- RESTful API endpoints
-
-## 🎯 How to Use Right Now
-
-### **1. Start the Application**
-```bash
-python app.py
-```
-
-### **2. Access the Interfaces**
-- **Gradio Admin**: http://localhost:7860
-- **API Documentation**: http://localhost:7861/docs
-- **Health Check**: http://localhost:7861/health
-
-### **3. Create Your First Monster**
-1. Go to http://localhost:7860
-2. Click "Create New Monster"
-3. Enter name and personality
-4. Start interacting!
-
-## 🔧 What Each Port Does
-
-| Port | Service | Purpose |
-|------|---------|---------|
-| 7860 | Gradio | Admin panel, monster creation, debugging |
-| 7861 | FastAPI | REST API, WebSocket, 3D generation |
-| 5173 | Svelte | Modern web UI (when implemented) |
-
-## 🎮 Current Features
-
-### **Monster Management**
-- ✅ Create monsters with personalities
-- ✅ Feed, train, and care for monsters
-- ✅ Real-time stat updates
-- ✅ Evolution tracking
-- ✅ Conversation history
-
-### **AI Interactions**
-- ✅ Text conversations with AI responses
-- ✅ Personality-aware responses
-- ✅ Emotional impact on monster stats
-- ✅ Fallback responses when AI unavailable
-
-### **3D Generation**
-- ✅ Multiple pipeline support
-- ✅ Hugging Face Spaces integration
-- ✅ Automatic model optimization
-- ✅ Texture generation
-
-### **System Features**
-- ✅ Automatic saving
-- ✅ Performance monitoring
-- ✅ Error handling
-- ✅ Resource optimization
-
-## 🚀 Next Steps
-
-### **Immediate (Ready Now)**
-1. **Deploy to Hugging Face Spaces**
-```bash
-git add .
-git commit -m "Ready for deployment"
-git push origin main
-```
-
-2. **Test on Spaces**
-   - Monitor logs for any issues
-   - Verify GPU allocation
-   - Test monster creation
-
-### **Future Enhancements**
-1. **Svelte Frontend** (Port 5173)
-   - Modern web interface
-   - 3D model viewer
-   - Voice chat interface
-
-2. **Advanced Features**
-   - Mini-games implementation
-   - Breeding system
-   - Advanced evolution paths
-
-## 🔍 Troubleshooting
-
-### **Common Issues**
-1. **Import Errors**: All dependencies installed ✅
-2. **Dataclass Errors**: Fixed with `field(default_factory=...)` ✅
-3. **GPU Issues**: Zero GPU optimization implemented ✅
-4. **Port Conflicts**: All ports properly configured ✅
-
-### **If Something Goes Wrong**
-1. Check logs in terminal output
-2. Verify all dependencies: `pip install -r requirements.txt`
-3. Clear cache: `rm -rf data/cache/*`
-4. Restart: `python app.py`
-
-## 📊 Performance Notes
-
-### **Local Development**
-- Fast startup (< 30 seconds)
-- Low memory usage
-- CPU-optimized AI models
-- Real-time responses
-
-### **Hugging Face Spaces**
-- Automatic GPU detection
-- Memory optimization
-- Graceful CPU fallback
-- Spaces.GPU decorators applied
-
-## 🎯 Key Achievements
-
-### **✅ Unified Architecture**
-- Single entry point (`app.py`)
-- Shared state across components
-- Consistent data flow
-- Modular design
-
-### **✅ Production Ready**
-- Error handling
-- Logging
-- Performance monitoring
-- Graceful fallbacks
-
-### **✅ Zero GPU Compatible**
-- Dynamic resource detection
-- CPU optimization
-- Memory management
-- Spaces integration
-
-## 🚀 Ready for Deployment!
-
-The application is now **fully functional** and ready for:
-- ✅ Local development
-- ✅ Hugging Face Spaces deployment
-- ✅ Production use
-- ✅ Further development
-
-**All the effort has matured the codebase into a robust, scalable, and maintainable system!** 🎉
QUICK_UI_TEST.md ADDED
@@ -0,0 +1,63 @@
+# 🎨 Quick UI Test Guide
+
+## See the New DigiPal UI Now!
+
+### Option 1: UI Only Preview (Fastest)
+```bash
+python test_ui.py
+```
+This shows you the new cyberpunk Streamlit interface without needing the backend.
+
+### Option 2: Full Application
+```bash
+python run_digipal.py
+```
+This runs both backend and frontend for full functionality.
+
+## What You'll See
+
+### 🎨 **Modern Cyberpunk Theme:**
+- Dark gradient backgrounds with neon accents
+- Glowing cyan and magenta color scheme
+- Orbitron and Rajdhani fonts for sci-fi feel
+- Animated neon effects on titles and buttons
+
+### 🖥️ **Interface Features:**
+- **Welcome Screen**: Feature overview with holographic styling
+- **Sidebar**: Monster management with neon buttons
+- **Monster Stats**: Holographic containers with progress bars
+- **Chat Interface**: Cyberpunk-styled conversation area
+- **3D Generation**: Modern controls for model creation
+
+### 🚀 **Interactive Elements:**
+- Hover effects on buttons with glow animations
+- Gradient backgrounds that shift and pulse
+- Neon text effects with shadows
+- Holographic containers with backdrop blur
+
+## Access URLs
+
+After starting:
+- **Streamlit UI**: http://localhost:8501
+- **API Backend**: http://localhost:7861 (if running full app)
+
+## Notes
+
+- The UI test mode shows the interface but backend features won't work
+- Create a monster in the sidebar to see the full interface
+- All the cyberpunk styling and animations will be visible
+- The design is optimized for both desktop and tablet viewing
+
+## Troubleshooting
+
+**If Streamlit won't start:**
+```bash
+pip install streamlit --upgrade
+```
+
+**If you see port conflicts:**
+```bash
+STREAMLIT_PORT=8502 python test_ui.py
+```
+
+Enjoy the new futuristic DigiPal experience! 🐉✨
README.md CHANGED
@@ -1,16 +1,19 @@
 ---
 title: DigiPal Advanced Monster Companion
-emoji: 🐾
+emoji: 🐉
 colorFrom: purple
 colorTo: blue
-sdk: gradio
-sdk_version: 5.34.2
-app_file: app.py
+sdk: streamlit
+sdk_version: 1.25.0
+app_file: streamlit_app.py
 pinned: false
 license: mit
 models:
 - Qwen/Qwen2.5-1.5B-Instruct
-- openai/whisper-base
+- kyutai/stt-2.6b-en
+- shitao/OmniGen-v1
+- tencent/Hunyuan3D-2.1
+- VAST-AI/UniRig
 datasets: []
 tags:
 - gaming
@@ -20,147 +23,105 @@ tags:
 - speech-recognition
 - 3d-generation
 - text-to-3d
+- cyberpunk
+- streamlit
 suggested_hardware: zero-a10g
 suggested_storage: medium
 ---
 
-# 🐾 DigiPal - Advanced AI Monster Companion
-
-The next generation of virtual monster companions powered by **Qwen 2.5**, **Whisper**, advanced AI technologies, and **3D model generation**. Experience deep emotional connections with your digital pet through natural conversation, comprehensive care systems, sophisticated evolution mechanics, and bring your monsters to life in 3D!
-
-## ✨ Features
-
-### 🧠 Advanced AI Personality System
-- **Qwen 2.5-powered conversations** with contextual memory
-- **Dynamic personality traits** that evolve with care
-- **Emotional state recognition** and appropriate responses
-- **Voice chat support** with Whisper speech recognition
-
-### 🎮 Comprehensive Monster Care
-- **Six-dimensional care system** (health, happiness, hunger, energy, discipline, cleanliness)
-- **Real-time stat degradation** that continues even when offline
-- **Complex evolution requirements** inspired by classic monster-raising games
-- **Training mini-games** that affect monster development
-- **DW1-aligned mechanics** option for authentic Digimon World 1 experience
-
-### 🎨 3D Model Generation (V2)
-- **Text-to-3D conversion** using Hunyuan3D and open-source models
-- **Multiple model providers** including HuggingFace, local models, and MCP protocol
-- **Real-time 3D visualization** of your monster companions
-- **Async generation pipeline** for smooth user experience
-- **Model caching** for efficient reuse of generated assets
-
-### 🌟 Next-Generation Features
-- **Cross-session persistence** with browser state management
-- **Real-time streaming updates** using Gradio 5.34.2
-- **Zero GPU optimization** for efficient resource usage
-- **Advanced breeding system** with genetic inheritance
-- **MCP (Model Context Protocol)** support for flexible model deployment
-
-## 🚀 Technology Stack
-
-- **HuggingFace Transformers v4.52.4** with Flash Attention 2
-- **Gradio 5.34.2** with modern state management
-- **Qwen 2.5 models** optimized for conversation
-- **Faster Whisper** for efficient speech processing
-- **Hunyuan3D** for high-quality 3D model generation
-- **Zero GPU deployment** for scalable AI inference
-- **Docker support** for containerized deployment
-
-## 🎯 Getting Started
-
-### Option 1: Basic Version (V1)
-1. **Create Your Monster**: Choose a name and personality type
-2. **Start Caring**: Feed, train, and interact with your companion
-3. **Build Relationships**: Use voice or text chat to bond
-4. **Watch Evolution**: Meet requirements to unlock new forms
-
-### Option 2: Enhanced Version with 3D (V2)
-1. **All V1 features** plus:
-2. **Generate 3D Models**: Create visual representations of your monsters
-3. **Customize Appearance**: Use text descriptions to shape your monster's look
-4. **View in 3D**: Interact with generated 3D models in real-time
-
-## 🛠️ Running Locally
-
-### Requirements
-- Python 3.11+
-- CUDA-capable GPU (recommended) or CPU
-- 8GB+ RAM (16GB+ recommended for 3D features)
-
-### Installation
-```bash
-# Clone the repository
-git clone https://github.com/yourusername/digipal.git
-cd digipal
-
-# Install dependencies
-pip install -r requirements.txt
-
-# Run V1 (basic features)
-python app.py
-
-# Run V2 (with 3D generation)
-python app_v2.py
-```
-
-### Docker Deployment
-```bash
-# Build the Docker image
-docker build -t digipal .
-
-# Run the container
-docker run -p 7860:7860 -v $(pwd)/data:/app/data digipal
-```
-
-### Configuration Options
-```bash
-# Enable debug mode
-LOG_LEVEL=DEBUG python app.py
-
-# Disable specific features (V2 only)
-ENABLE_3D=false python app_v2.py
-
-# Custom port and sharing
-SERVER_PORT=8080 SHARE=true python app.py
-```
-
-## 💡 Tips for Best Experience
-
-- **Regular interaction** builds stronger relationships
-- **Balanced care** prevents evolution mistakes
-- **Voice chat** creates deeper emotional connections
-- **Training variety** unlocks special evolution paths
-- **3D generation** works best with detailed descriptions
-- **Save frequently** to preserve your monster's progress
-
-## 🔧 Advanced Features
-
-### MCP Integration
-DigiPal supports Model Context Protocol for flexible AI model deployment:
-- Configure external model services via `MCP_ENDPOINT` and `MCP_API_KEY`
-- Access various AI models through a standardized protocol
-- Enable MCP server mode for integration with other tools
-
-### Development Mode
-- **Code formatting**: `black src/`
-- **Linting**: `ruff src/`
-- **Testing**: `pytest` (test suite in development)
-
-## 📚 Documentation
-
-- [CLAUDE.md](CLAUDE.md) - Development guide for Claude AI assistance
-- [DIGIPAL_V2_GUIDE.md](DIGIPAL_V2_GUIDE.md) - Detailed V2 features guide
-- [docs/HUNYUAN3D_INTEGRATION.md](docs/HUNYUAN3D_INTEGRATION.md) - 3D pipeline documentation
-
-## 🤝 Contributing
-
-Contributions are welcome! Please read our contributing guidelines and submit pull requests to our repository.
-
-## 📄 License
-
-This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
+# 🐉 DigiPal - Advanced AI Monster Companion
+
+**The most advanced AI-powered virtual monster companion with cutting-edge 3D generation!**
+
+## 🚀 Revolutionary Features
+
+- 🤖 **Advanced AI Conversations** with Qwen 2.5-1.5B-Instruct
+- 🎤 **High-Quality Speech Recognition** with Kyutai STT-2.6b-en
+- 🎨 **State-of-the-Art 3D Generation** via OmniGen2 → Hunyuan3D-2.1 → UniRig
+- 📊 **Complex Care System** inspired by Digimon World mechanics
+- 🧬 **Dynamic Evolution** based on care quality and interaction
+- 💬 **Personality-Driven Responses** with emotional intelligence
+- 🎮 **Cyberpunk UI** with neon effects and holographic styling
+
+## 🛠️ Technology Stack
+
+### AI Models
+- **Conversations**: Qwen 2.5-1.5B-Instruct (quantized for efficiency)
+- **Speech-to-Text**: Kyutai STT-2.6b-en (latest Kyutai STT model)
+- **Text-to-Image**: OmniGen2 (multi-view generation)
+- **Image-to-3D**: Hunyuan3D-2.1 (official Tencent model)
+- **3D Rigging**: UniRig (automatic model rigging)
+
+### Architecture
+- **Frontend**: Streamlit with cyberpunk theme
+- **Backend**: Integrated FastAPI services
+- **Database**: SQLite with async operations
+- **3D Pipeline**: Complete text → image → 3D → rigged workflow
+
+## 🎯 3D Generation Pipeline
+
+The crown jewel of DigiPal is its revolutionary 3D generation system:
+
+1. **Text Description** → User describes their monster
+2. **OmniGen2** → Generates consistent multi-view images
+3. **Hunyuan3D-2.1** → Converts images to high-quality 3D mesh
+4. **UniRig** → Automatically rigs the model for animation
+5. **Result** → Fully rigged 3D model ready for games/animation
+
+## 🎮 How to Use
+
+1. **Create Your Monster**: Choose a name and personality type
+2. **Care & Interact**: Feed, train, play, and talk with your companion
+3. **Watch Evolution**: Your monster grows based on care quality
+4. **Generate 3D Model**: Create a unique 3D representation
+5. **Download & Use**: Get your rigged model for other applications
+
+## 🎨 Monster Care System
+
+- **Six Core Stats**: Health, Happiness, Hunger, Energy, Discipline, Cleanliness
+- **Real-Time Degradation**: Stats change even when you're away
+- **Evolution Stages**: Egg → Baby → Child → Adult → Champion → Ultimate
+- **Personality Types**: Friendly, Energetic, Calm, Curious, Brave
+- **Complex Requirements**: Age, level, care quality all matter
+
+## 💫 Technical Highlights
+
+- **Zero GPU Optimization**: Efficient model loading and inference
+- **Graceful Fallbacks**: Pipeline continues even if some APIs fail
+- **Real-Time Updates**: WebSocket integration for live stat changes
+- **Model Caching**: Intelligent reuse of generated assets
+- **Cross-Platform**: Works on desktop, tablet, and mobile
+
+## 🔧 Development
+
+### Local Setup
+```bash
+git clone <repository>
+cd digiPal
+pip install -r requirements.txt
+
+# Run complete application
+python run_digipal.py
+
+# Or run Streamlit only
+streamlit run streamlit_app.py
+```
+
+### Environment Variables
+```bash
+HF_TOKEN=your_token         # For private models
+MCP_ENDPOINT=your_endpoint  # For MCP integration
+LOG_LEVEL=INFO              # Logging level
+```
+
+## 📝 License
+
+MIT License - Feel free to use, modify, and distribute!
+
+## 🤝 Contributing
+
+Contributions welcome! This project pushes the boundaries of AI companions and 3D generation.
 
 ---
 
-*Experience the future of AI companionship with DigiPal - where virtual monsters come to life in conversation and 3D!*
+*Experience the future of AI companions with DigiPal! 🐉✨*
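The "Graceful Fallbacks" highlight in the new README describes the pipeline continuing even when an API stage fails. A minimal sketch of that pattern, with stub stages standing in for the OmniGen2, Hunyuan3D-2.1, and UniRig calls; the real stage functions live in opensource_3d_pipeline_v2.py and are not reproduced here:

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("digipal.pipeline")

def run_pipeline(prompt, stages):
    """Run (name, fn) stages in order; keep the last good artifact on failure."""
    artifact = prompt
    for name, stage in stages:
        try:
            artifact = stage(artifact)
            logger.info("%s completed", name)
        except Exception as exc:
            logger.warning("%s unavailable (%s); degrading gracefully", name, exc)
            break  # stop here but return whatever earlier stages produced
    return artifact

# Stub stages; real ones would call the image, mesh, and rigging services.
result = run_pipeline("a small teal dragon", [
    ("omnigen2", lambda p: f"{p}.png"),
    ("hunyuan3d-2.1", lambda img: img.replace(".png", ".glb")),
    ("unirig", lambda mesh: mesh.replace(".glb", "_rigged.glb")),
])
print(result)  # "a small teal dragon_rigged.glb" when every stage succeeds
```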
app.py CHANGED
@@ -16,9 +16,7 @@ from fastapi import FastAPI, WebSocket, WebSocketDisconnect, HTTPException
16
  from fastapi.middleware.cors import CORSMiddleware
17
  from fastapi.responses import JSONResponse
18
  from pydantic import BaseModel
19
- import gradio as gr
20
  import torch
21
- import spaces
22
  from contextlib import asynccontextmanager
23
 
24
  # Add src to path
@@ -35,7 +33,7 @@ logger = logging.getLogger(__name__)
35
  ENV_CONFIG = {
36
  "LOG_LEVEL": os.getenv("LOG_LEVEL", "INFO"),
37
  "SERVER_NAME": os.getenv("SERVER_NAME", "0.0.0.0"),
38
- "SERVER_PORT": int(os.getenv("SERVER_PORT", "7860")),
39
  "API_PORT": int(os.getenv("API_PORT", "7861")),
40
  "SHARE": os.getenv("SHARE", "false").lower() == "true",
41
  "DEBUG": os.getenv("DEBUG", "false").lower() == "true",
@@ -71,14 +69,13 @@ try:
71
  from src.ai.speech_engine import AdvancedSpeechEngine as SpeechEngine, SpeechConfig
72
  from src.ui.state_manager import AdvancedStateManager as StateManager
73
  from src.deployment.zero_gpu_optimizer import get_optimal_device
74
- from src.pipelines.hunyuan3d_pipeline import Hunyuan3DClient
75
  from src.pipelines.opensource_3d_pipeline_v2 import (
76
  ProductionPipeline,
77
  ProductionConfig
78
  )
79
 
80
- # UI imports
81
- from src.ui.gradio_interface import create_interface
82
  except ImportError as e:
83
  logger.error(f"Failed to import required modules: {e}")
84
  sys.exit(1)
@@ -127,13 +124,14 @@ class AppState:
127
 
128
  self.qwen_processor = QwenProcessor(qwen_config)
129
 
130
- # Create speech engine config
131
  speech_config = SpeechConfig(
132
- model_size="base", # Conservative model size for Spaces
133
  device="auto", # Auto-detect device
134
- compute_type="int8", # Use int8 for better compatibility
135
  use_vad=True,
136
- vad_aggressiveness=2
 
137
  )
138
 
139
  self.speech_engine = SpeechEngine(speech_config)
@@ -432,11 +430,8 @@ async def websocket_endpoint(websocket: WebSocket, monster_id: str):
432
  except WebSocketDisconnect:
433
  manager.disconnect(monster_id)
434
 
435
- # Gradio interface for fallback/admin
436
- def create_gradio_interface():
437
- """Create Gradio interface as admin panel"""
438
- interface = create_interface()
439
- return interface
440
 
441
  # Main entry point
442
  if __name__ == "__main__":
@@ -451,46 +446,23 @@ if __name__ == "__main__":
451
  logger.info("DigiPal - Advanced AI Monster Companion")
452
  logger.info("=" * 60)
453
  logger.info(f"Environment: {'HuggingFace Spaces' if IS_SPACES else 'Local'}")
454
- logger.info(f"API Port: {ENV_CONFIG['API_PORT']}")
455
- logger.info(f"Gradio Port: {ENV_CONFIG['SERVER_PORT']}")
456
  logger.info(f"MCP Enabled: {bool(ENV_CONFIG['MCP_ENDPOINT'])}")
457
  logger.info("=" * 60)
458
 
459
- # Run both FastAPI and Gradio
460
- async def run_servers():
461
- # Create Gradio interface
462
- gr_interface = create_gradio_interface()
463
-
464
- # Launch Gradio in a separate thread (non-blocking)
465
- import threading
466
- gradio_thread = threading.Thread(
467
- target=gr_interface.launch,
468
- kwargs={
469
- "server_name": ENV_CONFIG["SERVER_NAME"],
470
- "server_port": ENV_CONFIG["SERVER_PORT"],
471
- "share": ENV_CONFIG["SHARE"],
472
- "max_threads": ENV_CONFIG["MAX_THREADS"],
473
- "show_error": True,
474
- "prevent_thread_lock": True # Important for running alongside FastAPI
475
- }
476
- )
477
- gradio_thread.daemon = True
478
- gradio_thread.start()
479
-
480
- # Give Gradio a moment to start
481
- await asyncio.sleep(2)
482
-
483
- # Start FastAPI server
484
- config = uvicorn.Config(
485
- app,
486
- host=ENV_CONFIG["SERVER_NAME"],
487
- port=ENV_CONFIG["API_PORT"],
488
- log_level=ENV_CONFIG["LOG_LEVEL"].lower()
489
- )
490
- server = uvicorn.Server(config)
491
-
492
- # Run FastAPI server (this will block)
493
- await server.serve()
494
 
495
- # Run the servers
496
- asyncio.run(run_servers())
 
  from fastapi.middleware.cors import CORSMiddleware
  from fastapi.responses import JSONResponse
  from pydantic import BaseModel
  import torch
  from contextlib import asynccontextmanager

  # Add src to path
...
  ENV_CONFIG = {
      "LOG_LEVEL": os.getenv("LOG_LEVEL", "INFO"),
      "SERVER_NAME": os.getenv("SERVER_NAME", "0.0.0.0"),
+     "STREAMLIT_PORT": int(os.getenv("STREAMLIT_PORT", "8501")),
      "API_PORT": int(os.getenv("API_PORT", "7861")),
      "SHARE": os.getenv("SHARE", "false").lower() == "true",
      "DEBUG": os.getenv("DEBUG", "false").lower() == "true",
...
      from src.ai.speech_engine import AdvancedSpeechEngine as SpeechEngine, SpeechConfig
      from src.ui.state_manager import AdvancedStateManager as StateManager
      from src.deployment.zero_gpu_optimizer import get_optimal_device
      from src.pipelines.opensource_3d_pipeline_v2 import (
          ProductionPipeline,
          ProductionConfig
      )

+     # UI imports - now using Streamlit (separate process)
+     # from src.ui.streamlit_interface import main as streamlit_main
  except ImportError as e:
      logger.error(f"Failed to import required modules: {e}")
      sys.exit(1)
...
          self.qwen_processor = QwenProcessor(qwen_config)

+         # Create speech engine config for Kyutai STT
          speech_config = SpeechConfig(
+             model_name="kyutai/stt-2.6b-en",  # Kyutai STT model
              device="auto",  # Auto-detect device
+             torch_dtype="float32",  # Use float32 for better compatibility
              use_vad=True,
+             vad_aggressiveness=2,
+             use_pipeline=True  # Use pipeline for easier integration
          )

          self.speech_engine = SpeechEngine(speech_config)
...
      except WebSocketDisconnect:
          manager.disconnect(monster_id)

+ # Streamlit interface runs separately
+ # Use: streamlit run src/ui/streamlit_interface.py

  # Main entry point
  if __name__ == "__main__":
...
      logger.info("DigiPal - Advanced AI Monster Companion")
      logger.info("=" * 60)
      logger.info(f"Environment: {'HuggingFace Spaces' if IS_SPACES else 'Local'}")
+     logger.info(f"FastAPI Backend Port: {ENV_CONFIG['API_PORT']}")
+     logger.info(f"Streamlit UI: Run separately on port {ENV_CONFIG['STREAMLIT_PORT']}")
      logger.info(f"MCP Enabled: {bool(ENV_CONFIG['MCP_ENDPOINT'])}")
      logger.info("=" * 60)

+     # Start FastAPI server only
+     # Streamlit interface runs separately via: streamlit run src/ui/streamlit_interface.py
+     logger.info("Starting FastAPI backend server...")
+     logger.info(f"Streamlit UI: Run 'streamlit run src/ui/streamlit_interface.py' in another terminal")
+
+     config = uvicorn.Config(
+         app,
+         host=ENV_CONFIG["SERVER_NAME"],
+         port=ENV_CONFIG["API_PORT"],
+         log_level=ENV_CONFIG["LOG_LEVEL"].lower()
+     )
+     server = uvicorn.Server(config)

+     # Run FastAPI server
+     asyncio.run(server.serve())
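
With Gradio removed, `app.py` now serves only the FastAPI backend. A minimal smoke test for the split setup — a sketch assuming the default `API_PORT` above; `/docs` is FastAPI's built-in OpenAPI page, so it should respond unless explicitly disabled:

```python
# Check that the backend came up on its own, independent of the Streamlit UI.
# Assumes the default API_PORT (7861) from ENV_CONFIG above.
import urllib.request

API_PORT = 7861

with urllib.request.urlopen(f"http://localhost:{API_PORT}/docs", timeout=5) as resp:
    print("FastAPI backend reachable:", resp.status == 200)
```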
requirements.txt CHANGED
@@ -2,7 +2,8 @@
  transformers>=4.52.4  # Latest stable, supports Qwen 2.5
  torch>=2.2.0  # PyTorch 2.0+ for torch.compile
  torchaudio>=2.2.0
- gradio>=5.34.2  # Latest Gradio 5.x series
+ diffusers>=0.30.0  # For OmniGen and other diffusion models
+ # gradio>=5.34.2  # Replaced with Streamlit

  # Qwen 2.5 Optimization Stack
  # auto-gptq>=0.7.1  # Removed - not needed, using BitsAndBytesConfig instead
@@ -11,23 +12,23 @@ accelerate>=0.26.1
  bitsandbytes>=0.42.0
  # FlashAttention2 will be installed at runtime if GPU is available

- # Enhanced Audio Processing
- faster-whisper>=1.0.0
- librosa>=0.10.1
+ # Enhanced Audio Processing - Kyutai STT
  soundfile>=0.12.1
  webrtcvad>=2.0.10
+ # Note: transformers and torch/torchaudio above provide Kyutai STT support

  # Production Backend
  fastapi>=0.108.0
  uvicorn[standard]>=0.25.0
  pydantic>=2.5.0
  websockets>=12.0
+ streamlit>=1.28.0  # Modern UI framework replacing Gradio

  # Advanced State Management
  apscheduler>=3.10.4
  aiosqlite>=0.19.0

- # Zero GPU Optimization
+ # Zero GPU Optimization (kept for speech engine compatibility)
  spaces>=0.28.0

  # 3D Generation Pipeline Dependencies
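
A quick import check that the dependency swap (Gradio out; Streamlit and diffusers in; faster-whisper and librosa out) resolves in a fresh environment — a sketch; the expected versions mirror the pins above:

```python
# Sanity-check the new dependency set after `pip install -r requirements.txt`.
import diffusers
import streamlit
import torchaudio
import transformers

print("streamlit", streamlit.__version__)        # expect >= 1.28.0
print("diffusers", diffusers.__version__)        # expect >= 0.30.0
print("transformers", transformers.__version__)  # expect >= 4.52.4
print("torchaudio", torchaudio.__version__)      # used for Kyutai STT audio I/O
```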
run_digipal.py ADDED
@@ -0,0 +1,80 @@
+ #!/usr/bin/env python3
+ """
+ DigiPal Launcher Script
+ Starts both FastAPI backend and Streamlit frontend
+ """
+
+ import subprocess
+ import time
+ import sys
+ import os
+ import threading
+ import logging
+
+ # Configure logging
+ logging.basicConfig(
+     level=logging.INFO,
+     format='%(asctime)s - %(name)s - %(levelname)s - %(message)s'
+ )
+ logger = logging.getLogger(__name__)
+
+ def start_fastapi():
+     """Start FastAPI backend server"""
+     logger.info("Starting FastAPI backend server...")
+     try:
+         subprocess.run([sys.executable, "app.py"], check=True)
+     except subprocess.CalledProcessError as e:
+         logger.error(f"FastAPI server failed: {e}")
+     except KeyboardInterrupt:
+         logger.info("FastAPI server stopped")
+
+ def start_streamlit():
+     """Start Streamlit frontend"""
+     logger.info("Starting Streamlit frontend...")
+     try:
+         port = os.getenv("STREAMLIT_PORT", "8501")
+         subprocess.run([
+             sys.executable, "-m", "streamlit", "run",
+             "src/ui/streamlit_interface.py",
+             "--server.port", port,
+             "--server.address", "0.0.0.0"
+         ], check=True)
+     except subprocess.CalledProcessError as e:
+         logger.error(f"Streamlit frontend failed: {e}")
+     except KeyboardInterrupt:
+         logger.info("Streamlit frontend stopped")
+
+ def main():
+     """Main launcher function"""
+     logger.info("🐉 DigiPal - Advanced AI Monster Companion")
+     logger.info("=" * 60)
+     logger.info("Starting both FastAPI backend and Streamlit frontend...")
+     api_port = os.getenv("API_PORT", "7861")
+     streamlit_port = os.getenv("STREAMLIT_PORT", "8501")
+     logger.info(f"FastAPI Backend: http://localhost:{api_port}")
+     logger.info(f"Streamlit Frontend: http://localhost:{streamlit_port}")
+     logger.info("=" * 60)
+
+     # Create necessary directories
+     os.makedirs("data/saves", exist_ok=True)
+     os.makedirs("data/models", exist_ok=True)
+     os.makedirs("data/cache", exist_ok=True)
+     os.makedirs("logs", exist_ok=True)
+
+     try:
+         # Start FastAPI in a separate thread
+         fastapi_thread = threading.Thread(target=start_fastapi, daemon=True)
+         fastapi_thread.start()
+
+         # Give FastAPI time to start
+         time.sleep(3)
+
+         # Start Streamlit (this will block)
+         start_streamlit()
+
+     except KeyboardInterrupt:
+         logger.info("Shutting down DigiPal...")
+         sys.exit(0)
+
+ if __name__ == "__main__":
+     main()
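
The launcher reads the same `API_PORT` and `STREAMLIT_PORT` environment variables as `app.py`, so `API_PORT=8000 STREAMLIT_PORT=8600 python run_digipal.py` relocates both services. One caveat: because `start_fastapi` blocks inside a daemon thread on `subprocess.run`, the backend child process can outlive a Ctrl-C. A `Popen`-based variant — an alternative sketch, not part of this commit — keeps handles to both children so they can be terminated together:

```python
# Alternative launcher sketch: hold Popen handles so Ctrl-C tears down both
# the FastAPI backend and the Streamlit frontend.
import os
import subprocess
import sys

api = subprocess.Popen([sys.executable, "app.py"])
ui = subprocess.Popen([
    sys.executable, "-m", "streamlit", "run",
    "src/ui/streamlit_interface.py",
    "--server.port", os.getenv("STREAMLIT_PORT", "8501"),
    "--server.address", "0.0.0.0",
])

try:
    ui.wait()  # Streamlit is the foreground process, as in run_digipal.py
finally:
    api.terminate()
    ui.terminate()
```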
src/ai/speech_engine.py CHANGED
@@ -1,7 +1,8 @@
  import asyncio
  import numpy as np
- from faster_whisper import WhisperModel
  import torch
+ import torchaudio
+ from transformers import pipeline, AutoModelForSpeechSeq2Seq, AutoProcessor
  import webrtcvad
  import logging
  from typing import Dict, List, Optional, Tuple, Any
@@ -13,29 +14,32 @@ import spaces

  @dataclass
  class SpeechConfig:
-     model_size: str = "base"  # tiny, base, small, medium, large-v3
+     model_name: str = "kyutai/stt-2.6b-en"  # Kyutai STT model
      device: str = "auto"
-     compute_type: str = "float16"
+     torch_dtype: str = "float16"
      use_vad: bool = True
      vad_aggressiveness: int = 2  # 0-3, higher = more aggressive
      chunk_duration_ms: int = 30  # VAD chunk size
      sample_rate: int = 16000
+     use_pipeline: bool = True  # Use transformers pipeline for easier integration

  class AdvancedSpeechEngine:
      def __init__(self, config: SpeechConfig):
          self.config = config
          self.logger = logging.getLogger(__name__)

-         # Model configurations optimized for gaming
-         self.model_configs = {
-             "tiny": {"memory_gb": 1, "speed": "fastest", "accuracy": "basic"},
-             "base": {"memory_gb": 2, "speed": "fast", "accuracy": "good"},
-             "small": {"memory_gb": 3, "speed": "medium", "accuracy": "better"},
-             "medium": {"memory_gb": 6, "speed": "slower", "accuracy": "high"},
-             "large-v3": {"memory_gb": 12, "speed": "slowest", "accuracy": "best"}
+         # Kyutai STT model configurations
+         self.model_info = {
+             "name": "Kyutai STT-2.6B-EN",
+             "description": "Multilingual speech-to-text model optimized for English",
+             "memory_gb": 6,  # Approximate memory requirement for 2.6B model
+             "speed": "fast",
+             "accuracy": "high"
          }

-         self.whisper_model = None
+         self.speech_pipeline = None
+         self.model = None
+         self.processor = None
          self.vad_model = None

          # Performance tracking
@@ -47,11 +51,11 @@ class AdvancedSpeechEngine:
          self.is_processing = False

      async def initialize(self):
-         """Initialize the speech recognition system"""
+         """Initialize the Kyutai STT speech recognition system"""
          try:
              # Enhanced device detection for local vs Spaces environments
              device = self.config.device
-             compute_type = self.config.compute_type
+             torch_dtype = self.config.torch_dtype

              if device == "auto":
                  # For Zero GPU environments, try GPU first, fallback to CPU
@@ -65,48 +69,93 @@ class AdvancedSpeechEngine:
              except Exception as cuda_error:
                  # CUDA not properly accessible, use CPU
                  device = "cpu"
-                 if compute_type == "float16":
-                     compute_type = "int8"
-                 self.logger.info(f"CUDA not accessible ({cuda_error}), using CPU with int8")
+                 if torch_dtype == "float16":
+                     torch_dtype = "float32"
+                 self.logger.info(f"CUDA not accessible ({cuda_error}), using CPU with float32")
              else:
                  device = "cpu"
-                 if compute_type == "float16":
-                     compute_type = "int8"
-                 self.logger.info("CUDA not available, using CPU with int8")
+                 if torch_dtype == "float16":
+                     torch_dtype = "float32"
+                 self.logger.info("CUDA not available, using CPU with float32")

-             # Adjust compute_type for CPU
-             if device == "cpu" and compute_type == "float16":
-                 compute_type = "int8"  # Use int8 for CPU instead of float16
-                 self.logger.info("CPU detected, switching from float16 to int8 compute type")
+             # Adjust torch_dtype for CPU
+             if device == "cpu" and torch_dtype == "float16":
+                 torch_dtype = "float32"  # Use float32 for CPU instead of float16
+                 self.logger.info("CPU detected, switching from float16 to float32 dtype")

-             # Initialize Faster Whisper with proper error handling
+             # Convert string dtype to torch dtype
+             dtype_map = {
+                 "float16": torch.float16,
+                 "float32": torch.float32,
+                 "bfloat16": torch.bfloat16
+             }
+             torch_dtype_obj = dtype_map.get(torch_dtype, torch.float32)
+
+             # Initialize Kyutai STT with proper error handling
              try:
-                 self.whisper_model = WhisperModel(
-                     self.config.model_size,
-                     device=device,
-                     compute_type=compute_type,
-                     download_root="data/models/"
-                 )
-                 self.logger.info(f"Whisper model loaded on {device} with {compute_type}")
+                 if self.config.use_pipeline:
+                     # Use transformers pipeline for easier integration
+                     self.speech_pipeline = pipeline(
+                         "automatic-speech-recognition",
+                         model=self.config.model_name,
+                         torch_dtype=torch_dtype_obj,
+                         device=device,
+                         cache_dir="data/models/"
+                     )
+                     self.logger.info(f"Kyutai STT pipeline loaded on {device} with {torch_dtype}")
+                 else:
+                     # Load model and processor separately for more control
+                     self.model = AutoModelForSpeechSeq2Seq.from_pretrained(
+                         self.config.model_name,
+                         torch_dtype=torch_dtype_obj,
+                         device_map="auto" if device == "cuda" else None,
+                         cache_dir="data/models/"
+                     )
+                     self.processor = AutoProcessor.from_pretrained(
+                         self.config.model_name,
+                         cache_dir="data/models/"
+                     )
+
+                     if device == "cuda" and not hasattr(self.model, 'device_map'):
+                         self.model = self.model.to(device)
+
+                     self.logger.info(f"Kyutai STT model and processor loaded on {device} with {torch_dtype}")
+
              except Exception as model_error:
                  # Final fallback to CPU with basic settings
                  self.logger.warning(f"Failed to load on {device}, falling back to CPU: {model_error}")
-                 self.whisper_model = WhisperModel(
-                     self.config.model_size,
-                     device="cpu",
-                     compute_type="int8",
-                     download_root="data/models/"
-                 )
-                 self.logger.info("Whisper model loaded on CPU (fallback)")
+
+                 if self.config.use_pipeline:
+                     self.speech_pipeline = pipeline(
+                         "automatic-speech-recognition",
+                         model=self.config.model_name,
+                         torch_dtype=torch.float32,
+                         device="cpu",
+                         cache_dir="data/models/"
+                     )
+                 else:
+                     self.model = AutoModelForSpeechSeq2Seq.from_pretrained(
+                         self.config.model_name,
+                         torch_dtype=torch.float32,
+                         device_map=None,
+                         cache_dir="data/models/"
+                     )
+                     self.processor = AutoProcessor.from_pretrained(
+                         self.config.model_name,
+                         cache_dir="data/models/"
+                     )
+                     self.model = self.model.to("cpu")
+
+                 self.logger.info("Kyutai STT model loaded on CPU (fallback)")

              # Initialize VAD if enabled
              if self.config.use_vad:
                  self.vad_model = webrtcvad.Vad(self.config.vad_aggressiveness)

-             self.logger.info(f"Speech engine initialized: {self.config.model_size} on {device}")
+             self.logger.info(f"Kyutai STT speech engine initialized: {self.config.model_name} on {device}")

          except Exception as e:
-             self.logger.error(f"Failed to initialize speech engine: {e}")
+             self.logger.error(f"Failed to initialize Kyutai STT speech engine: {e}")
              raise

      async def process_audio_stream(self, audio_data: np.ndarray) -> Dict[str, Any]:
@@ -135,35 +184,55 @@ class AdvancedSpeechEngine:
                      "has_speech": False
                  }

-             # Transcribe with Faster Whisper
-             segments, info = self.whisper_model.transcribe(
-                 audio_data,
-                 language="en",
-                 beam_size=1,  # Faster inference
-                 temperature=0.0,
-                 condition_on_previous_text=True,
-                 compression_ratio_threshold=2.4,
-                 log_prob_threshold=-1.0,
-                 no_speech_threshold=0.6
-             )
-
-             # Combine segments
-             transcription = ""
-             avg_confidence = 0.0
-             segment_count = 0
-
-             for segment in segments:
-                 transcription += segment.text + " "
-                 avg_confidence += segment.avg_logprob
-                 segment_count += 1
-
-             transcription = transcription.strip()
-
-             if segment_count > 0:
-                 avg_confidence = avg_confidence / segment_count
-                 confidence = self._logprob_to_confidence(avg_confidence)
+             # Transcribe with Kyutai STT
+             if self.config.use_pipeline and self.speech_pipeline:
+                 # Use pipeline for simpler transcription
+                 result = self.speech_pipeline(
+                     audio_data,
+                     generate_kwargs={
+                         "language": "en",
+                         "task": "transcribe",
+                         "max_new_tokens": 256
+                     }
+                 )
+
+                 transcription = result["text"].strip()
+                 # Pipeline doesn't provide confidence scores directly
+                 confidence = 0.8  # Default confidence for pipeline
+
              else:
-                 confidence = 0.0
+                 # Use model and processor for more control
+                 # Prepare inputs
+                 inputs = self.processor(
+                     audio_data,
+                     sampling_rate=self.config.sample_rate,
+                     return_tensors="pt"
+                 )
+
+                 # Move inputs to device
+                 device = next(self.model.parameters()).device
+                 inputs = {k: v.to(device) for k, v in inputs.items()}
+
+                 # Generate transcription
+                 with torch.no_grad():
+                     generated_tokens = self.model.generate(
+                         **inputs,
+                         language="en",
+                         task="transcribe",
+                         max_new_tokens=256,
+                         num_beams=1,  # Faster inference
+                         do_sample=False,
+                         temperature=1.0
+                     )
+
+                 # Decode transcription
+                 transcription = self.processor.batch_decode(
+                     generated_tokens,
+                     skip_special_tokens=True
+                 )[0].strip()
+
+                 # Calculate confidence (simplified)
+                 confidence = 0.8  # Default confidence

              processing_time = time.time() - start_time
              self.transcription_times.append(processing_time)
@@ -178,8 +247,9 @@ class AdvancedSpeechEngine:
                  "processing_time": processing_time,
                  "has_speech": True,
                  "speech_analysis": speech_analysis,
-                 "detected_language": info.language if hasattr(info, 'language') else "en",
-                 "language_probability": info.language_probability if hasattr(info, 'language_probability') else 1.0
+                 "detected_language": "en",  # Kyutai model is optimized for English
+                 "language_probability": 1.0,
+                 "model": "kyutai-stt-2.6b-en"
              }

          except Exception as e:
@@ -292,18 +362,31 @@ class AdvancedSpeechEngine:
              }

      async def batch_transcribe(self, audio_files: List[str]) -> List[Dict[str, Any]]:
-         """Batch transcribe multiple audio files"""
+         """Batch transcribe multiple audio files using Kyutai STT"""
          results = []

          for audio_file in audio_files:
              try:
-                 # Load audio file
-                 import librosa
-                 audio_data, _ = librosa.load(audio_file, sr=self.config.sample_rate)
+                 # Load audio file - use torchaudio for better PyTorch integration
+                 audio_data, sample_rate = torchaudio.load(audio_file)
+
+                 # Convert to numpy and ensure mono
+                 audio_data = audio_data.numpy()
+                 if len(audio_data.shape) > 1:
+                     audio_data = audio_data.mean(axis=0)  # Convert to mono
+
+                 # Resample if necessary
+                 if sample_rate != self.config.sample_rate:
+                     # Use torchaudio for resampling
+                     audio_tensor = torch.from_numpy(audio_data).unsqueeze(0)
+                     resampler = torchaudio.transforms.Resample(sample_rate, self.config.sample_rate)
+                     audio_tensor = resampler(audio_tensor)
+                     audio_data = audio_tensor.squeeze(0).numpy()

                  # Process
                  result = await self.process_audio_stream(audio_data)
                  result["file_path"] = audio_file
+                 result["original_sample_rate"] = sample_rate

                  results.append(result)

@@ -334,42 +417,54 @@ class AdvancedSpeechEngine:
          }

      def optimize_for_hardware(self, available_vram_gb: float) -> SpeechConfig:
-         """Optimize speech config based on available hardware"""
-         if available_vram_gb >= 12:
+         """Optimize Kyutai STT config based on available hardware"""
+         # Kyutai STT-2.6B requires about 6GB VRAM for optimal performance
+         if available_vram_gb >= 8:
              return SpeechConfig(
-                 model_size="large-v3",
+                 model_name="kyutai/stt-2.6b-en",
                  device="cuda",
-                 compute_type="float16",
-                 use_vad=True
+                 torch_dtype="float16",
+                 use_vad=True,
+                 use_pipeline=True
              )
          elif available_vram_gb >= 6:
              return SpeechConfig(
-                 model_size="medium",
+                 model_name="kyutai/stt-2.6b-en",
                  device="cuda",
-                 compute_type="float16",
-                 use_vad=True
+                 torch_dtype="float32",
+                 use_vad=True,
+                 use_pipeline=True
              )
-         elif available_vram_gb >= 3:
+         elif available_vram_gb >= 4:
              return SpeechConfig(
-                 model_size="small",
+                 model_name="kyutai/stt-2.6b-en",
                  device="cuda",
-                 compute_type="int8",
-                 use_vad=True
+                 torch_dtype="float32",
+                 use_vad=True,
+                 use_pipeline=False  # More memory efficient without pipeline
              )
          else:
              return SpeechConfig(
-                 model_size="base",
+                 model_name="kyutai/stt-2.6b-en",
                  device="cpu",
-                 compute_type="int8",
-                 use_vad=True
+                 torch_dtype="float32",
+                 use_vad=True,
+                 use_pipeline=True
              )

  # Apply GPU decorator to methods after class definition for ZeroGPU compatibility
  try:
      import os
      if os.getenv("SPACE_ID") is not None:
-         # We're in Spaces environment, apply GPU decorator
-         AdvancedSpeechEngine.process_audio_stream = spaces.GPU(AdvancedSpeechEngine.process_audio_stream)
+         # We're in Spaces environment, apply GPU decorator for Kyutai STT
+         AdvancedSpeechEngine.process_audio_stream = spaces.GPU(
+             AdvancedSpeechEngine.process_audio_stream,
+             duration=120  # Kyutai STT may take longer than Whisper
+         )
+         AdvancedSpeechEngine.batch_transcribe = spaces.GPU(
+             AdvancedSpeechEngine.batch_transcribe,
+             duration=300  # Batch processing may take longer
+         )
  except (ImportError, NotImplementedError, AttributeError) as e:
      # GPU decorator not available or failed, continue without it
      pass
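
End to end, the new engine is driven the same way as the old Whisper version; only the config fields changed. A minimal usage sketch against the API defined in this file (`data/sample.wav` is a placeholder path; result keys are read defensively since only some appear in the hunks above). Note that webrtcvad accepts only 10/20/30 ms frames, so the default `chunk_duration_ms=30` at a 16 kHz sample rate means 480-sample VAD chunks:

```python
# Minimal driver for the Kyutai-backed AdvancedSpeechEngine.
import asyncio

from src.ai.speech_engine import AdvancedSpeechEngine, SpeechConfig

async def main():
    config = SpeechConfig(
        model_name="kyutai/stt-2.6b-en",
        device="auto",
        torch_dtype="float32",  # safe default that also covers the CPU fallback
        use_pipeline=True,
    )
    engine = AdvancedSpeechEngine(config)
    await engine.initialize()

    # batch_transcribe loads each file with torchaudio, down-mixes to mono,
    # and resamples to SpeechConfig.sample_rate (16 kHz) before transcribing.
    results = await engine.batch_transcribe(["data/sample.wav"])
    for r in results:
        print(r.get("transcription"), r.get("confidence"), r.get("model"))

asyncio.run(main())
```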
src/core/monster_3d_hunyuan_integration.py DELETED
@@ -1,326 +0,0 @@
- """
- Enhanced Monster 3D Integration using Hunyuan3D Pipeline
- Ensures consistent visual style across all evolution stages
- """
-
- import asyncio
- import json
- import logging
- from pathlib import Path
- from typing import Dict, List, Optional, Any
- from dataclasses import dataclass
-
- from ..pipelines.hunyuan3d_pipeline import (
-     DigiPalHunyuan3DIntegration,
-     Hunyuan3DConfig,
-     GenerationMode
- )
-
- logger = logging.getLogger(__name__)
-
- @dataclass
- class Monster3DProfile:
-     """Visual profile for consistent 3D generation"""
-
-     # Core design elements that remain consistent
-     base_design: str  # e.g., "dragon-like", "wolf-like", "humanoid"
-     primary_features: List[str]  # e.g., ["wings", "horn", "tail"]
-     color_scheme: str  # e.g., "blue crystalline", "dark purple", "golden"
-     texture_style: str  # e.g., "metallic", "organic", "ethereal"
-
-     # Elements that change with evolution
-     size_modifier: Dict[str, str]  # stage -> size description
-     detail_level: Dict[str, str]  # stage -> detail description
-     aura_intensity: Dict[str, str]  # stage -> aura/effects
-
- class ConsistentMonster3DManager:
-     """Manages 3D generation with consistent style across evolutions"""
-
-     def __init__(self, config: Optional[Hunyuan3DConfig] = None):
-         self.integration = DigiPalHunyuan3DIntegration(config)
-         self.profiles = {}  # Cache monster visual profiles
-         self.style_guide = self._load_style_guide()
-
-     def _load_style_guide(self) -> Dict[str, Any]:
-         """Load visual style guidelines"""
-
-         return {
-             "render_style": "AAA 3D render, well lit, white void background",
-             "model_requirements": "T-pose, game character model, consistent design language",
-             "quality_specs": "professional 3D game asset, pbr materials, clean topology",
-
-             # Species-specific base designs
-             "species_designs": {
-                 "data": {
-                     "base": "biomechanical creature",
-                     "features": ["crystalline segments", "circuit patterns", "data streams"],
-                     "materials": "holographic metal with glass accents"
-                 },
-                 "vaccine": {
-                     "base": "angelic creature",
-                     "features": ["feathered wings", "armor plating", "holy symbols"],
-                     "materials": "white metal with gold trim"
-                 },
-                 "virus": {
-                     "base": "corrupted beast",
-                     "features": ["shadow tendrils", "void armor", "energy cores"],
-                     "materials": "dark metal with purple energy"
-                 },
-                 "free": {
-                     "base": "elemental creature",
-                     "features": ["natural armor", "elemental crystals", "organic patterns"],
-                     "materials": "stone and wood with living elements"
-                 }
-             },
-
-             # Personality influences on pose/expression
-             "personality_modifiers": {
-                 "brave": "heroic stance, fierce expression, forward-leaning pose",
-                 "calm": "balanced stance, serene expression, centered pose",
-                 "energetic": "dynamic stance, excited expression, action-ready pose",
-                 "clever": "tactical stance, focused expression, observant pose",
-                 "friendly": "open stance, warm expression, welcoming pose"
-             }
-         }
-
-     async def create_monster_profile(self, monster: Any) -> Monster3DProfile:
-         """Create consistent visual profile for a monster"""
-
-         # Determine base design from species and initial traits
-         species_design = self.style_guide["species_designs"].get(
-             monster.species_type.value,
-             self.style_guide["species_designs"]["data"]
-         )
-
-         # Create base design combining species and personality
-         if monster.personality.value == "brave":
-             base = f"heroic {species_design['base']}"
-         elif monster.personality.value == "calm":
-             base = f"wise {species_design['base']}"
-         elif monster.personality.value == "energetic":
-             base = f"agile {species_design['base']}"
-         elif monster.personality.value == "clever":
-             base = f"cunning {species_design['base']}"
-         else:
-             base = f"gentle {species_design['base']}"
-
-         # Define consistent features based on stats
-         features = species_design["features"].copy()
-         if monster.battle_stats.offense > 70:
-             features.append("prominent claws or weapons")
-         if monster.battle_stats.defense > 70:
-             features.append("reinforced armor sections")
-         if monster.battle_stats.speed > 70:
-             features.append("aerodynamic body shape")
-
-         # Create profile with evolution variations
-         profile = Monster3DProfile(
-             base_design=base,
-             primary_features=features,
-             color_scheme=self._determine_color_scheme(monster),
-             texture_style=species_design["materials"],
-
-             # Size progression - maintains adult-like proportions
-             size_modifier={
-                 "egg": "dormant embryonic",
-                 "baby": "compact juvenile",
-                 "child": "developing adolescent",
-                 "adult": "fully grown",
-                 "perfect": "enhanced mature",
-                 "ultimate": "transcendent apex",
-                 "mega": "legendary titan"
-             },
-
-             # Detail progression
-             detail_level={
-                 "egg": "simplified form within crystalline shell",
-                 "baby": "basic features emerging",
-                 "child": "defined features developing",
-                 "adult": "fully detailed form",
-                 "perfect": "intricate details and enhancements",
-                 "ultimate": "complex ornate details",
-                 "mega": "impossibly detailed divine form"
-             },
-
-             # Aura/effects progression
-             aura_intensity={
-                 "egg": "faint inner glow",
-                 "baby": "soft ambient glow",
-                 "child": "visible energy aura",
-                 "adult": "strong power aura",
-                 "perfect": "intense energy field",
-                 "ultimate": "overwhelming power presence",
-                 "mega": "reality-warping energy storm"
-             }
-         )
-
-         # Cache the profile
-         self.profiles[monster.id] = profile
-         return profile
-
-     def _determine_color_scheme(self, monster: Any) -> str:
-         """Determine consistent color scheme based on monster attributes"""
-
-         # Base colors by species
-         species_colors = {
-             "data": ["blue", "cyan", "white"],
-             "vaccine": ["gold", "white", "silver"],
-             "virus": ["purple", "black", "red"],
-             "free": ["green", "brown", "orange"]
-         }
-
-         # Get base colors
-         colors = species_colors.get(monster.species_type.value, ["gray"])
-
-         # Modify based on personality
-         if monster.personality.value == "brave":
-             colors.append("crimson accents")
-         elif monster.personality.value == "calm":
-             colors.append("pearl highlights")
-         elif monster.personality.value == "energetic":
-             colors.append("electric highlights")
-
-         return f"{colors[0]} and {colors[1]} with {colors[2]}"
-
-     async def generate_monster_model(self,
-                                      monster: Any,
-                                      stage: Optional[str] = None,
-                                      mode: Optional[GenerationMode] = None) -> Dict[str, Any]:
-         """Generate 3D model with consistent style"""
-
-         # Get or create visual profile
-         if monster.id not in self.profiles:
-             profile = await self.create_monster_profile(monster)
-         else:
-             profile = self.profiles[monster.id]
-
-         # Use current stage if not specified
-         if stage is None:
-             stage = monster.evolution.current_stage.value
-
-         # Build consistent prompt
-         prompt = self._build_consistent_prompt(monster, profile, stage)
-
-         # Create monster data with enhanced prompt
-         monster_data = {
-             "name": monster.name,
-             "stage": stage,
-             "personality": monster.personality.value,
-             "species_type": monster.species_type.value,
-             "stats": {
-                 "offense": monster.battle_stats.offense,
-                 "defense": monster.battle_stats.defense,
-                 "speed": monster.battle_stats.speed,
-                 "brains": monster.battle_stats.brains
-             },
-             "_custom_prompt": prompt  # Override default prompt generation
-         }
-
-         # Generate model
-         result = await self.integration.pipeline.generate_monster_for_digipal(
-             monster_data, mode
-         )
-
-         return result
-
-     def _build_consistent_prompt(self,
-                                  monster: Any,
-                                  profile: Monster3DProfile,
-                                  stage: str) -> str:
-         """Build prompt ensuring visual consistency"""
-
-         # Get stage-specific modifiers
-         size = profile.size_modifier.get(stage, "medium sized")
-         detail = profile.detail_level.get(stage, "detailed")
-         aura = profile.aura_intensity.get(stage, "glowing")
-
-         # Get personality modifier
-         personality_mod = self.style_guide["personality_modifiers"].get(
-             monster.personality.value,
-             "neutral stance"
-         )
-
-         # Build the prompt
-         prompt = (
-             f"{self.style_guide['render_style']}, "
-             f"{size} {profile.base_design}, "
-             f"{', '.join(profile.primary_features)}, "
-             f"{profile.color_scheme} coloration, "
-             f"{profile.texture_style}, "
-             f"{detail}, "
-             f"{aura}, "
-             f"{personality_mod}, "
-             f"{self.style_guide['model_requirements']}, "
-             f"{self.style_guide['quality_specs']}"
-         )
-
-         # Add stage-specific quality hints
-         if stage in ["ultimate", "mega", "perfect"]:
-             prompt += ", hero character quality, highly detailed, epic presence"
-         elif stage in ["baby", "egg"]:
-             prompt += ", cute proportions, simplified details, approachable design"
-
-         return prompt
-
-     async def generate_evolution_sequence(self,
-                                           monster: Any,
-                                           stages: Optional[List[str]] = None) -> List[Dict[str, Any]]:
-         """Generate complete evolution sequence with consistent design"""
-
-         # Default to all possible stages
-         if stages is None:
-             stages = ["baby", "child", "adult", "perfect", "ultimate"]
-
-         # Ensure we have a profile
-         if monster.id not in self.profiles:
-             await self.create_monster_profile(monster)
-
-         # Generate all stages
-         results = []
-         for stage in stages:
-             # Use faster generation for early stages
-             mode = GenerationMode.TURBO if stage in ["baby", "child"] else GenerationMode.FAST
-
-             result = await self.generate_monster_model(monster, stage, mode)
-             results.append(result)
-
-             # Add small delay between generations to avoid rate limits
-             await asyncio.sleep(1)
-
-         return results
-
-     async def generate_showcase_views(self,
-                                       monster: Any,
-                                       include_action_poses: bool = True) -> Dict[str, Any]:
-         """Generate multiple views/poses for showcase"""
-
-         base_result = await self.generate_monster_model(monster)
-
-         if not base_result["success"] or not include_action_poses:
-             return {"base": base_result}
-
-         # Generate additional views with modified prompts
-         views = {"base": base_result}
-
-         # Action pose
-         action_prompt = base_result.get("prompt", "").replace("T-pose", "dynamic action pose")
-         # Battle pose
-         battle_prompt = base_result.get("prompt", "").replace("T-pose", "battle-ready stance")
-
-         # TODO: Implement multi-pose generation when API supports it
-
-         return views
-
- # Convenience function for integration
- async def setup_hunyuan3d_for_monster(monster: Any) -> ConsistentMonster3DManager:
-     """Setup Hunyuan3D manager for a specific monster"""
-
-     config = Hunyuan3DConfig(
-         output_dir=Path(f"./3d_models/{monster.name}"),
-         cache_dir=Path("./model_cache")
-     )
-
-     manager = ConsistentMonster3DManager(config)
-     await manager.create_monster_profile(monster)
-
-     return manager
src/pipelines/hunyuan3d_pipeline.py DELETED
@@ -1,958 +0,0 @@
1
- """
2
- Hunyuan3D-2.1 Integration Pipeline for DigiPal
3
- Implements Tencent's state-of-the-art 3D generation model
4
- Built with Rick Rubin philosophy: Focus on the core, eliminate friction
5
- """
6
-
7
- import asyncio
8
- import base64
9
- import json
10
- import logging
11
- import os
12
- import time
13
- from dataclasses import dataclass, field
14
- from enum import Enum
15
- from pathlib import Path
16
- from typing import Dict, List, Optional, Tuple, Any, Union
17
- import aiohttp
18
- import numpy as np
19
- from PIL import Image
20
- import torch
21
- import trimesh
22
- from gradio_client import Client, handle_file
23
- import tempfile
24
- import shutil
25
-
26
- # Configure logging
27
- logging.basicConfig(level=logging.INFO)
28
- logger = logging.getLogger(__name__)
29
-
30
- class GenerationMode(Enum):
31
- """Hunyuan3D generation speed/quality modes"""
32
- TURBO = "Turbo (Fast)" # Fastest, lower quality
33
- FAST = "Fast" # Balanced speed/quality
34
- STANDARD = "Standard" # Best quality, slower
35
-
36
- class TextureMethod(Enum):
37
- """Texture generation methods"""
38
- RGB = "RGB" # Standard color texture
39
- PBR = "PBR" # Physically-based rendering (metallic, roughness, normal)
40
-
41
- class ExportFormat(Enum):
42
- """3D model export formats"""
43
- GLB = "glb" # GLTF binary format (recommended)
44
- OBJ = "obj" # Wavefront OBJ format
45
- PLY = "ply" # Polygon File Format
46
- STL = "stl" # Stereolithography format
47
-
48
- @dataclass
49
- class Hunyuan3DConfig:
50
- """Configuration for Hunyuan3D pipeline"""
51
- # API Configuration
52
- space_id: str = "Tencent/Hunyuan3D-2" # Official Hugging Face Space
53
- use_auth_token: Optional[str] = None # HF auth token if needed
54
- max_retries: int = 3
55
- timeout: int = 600 # 10 minutes max per generation
56
-
57
- # Generation Settings
58
- default_mode: GenerationMode = GenerationMode.FAST
59
- texture_method: TextureMethod = TextureMethod.RGB
60
- export_format: ExportFormat = ExportFormat.GLB
61
-
62
- # Quality Settings
63
- shape_resolution: int = 256 # Internal shape generation resolution
64
- texture_resolution: int = 1024 # Output texture resolution
65
- remove_bg: bool = True # Remove background from input images
66
- foreground_ratio: float = 0.85 # Object size in frame
67
-
68
- # Optimization Settings
69
- enable_optimization: bool = True
70
- target_polycount: int = 30000 # Higher quality than other pipelines
71
- simplify_ratio: float = 0.5 # Mesh simplification ratio
72
-
73
- # File Management
74
- output_dir: Path = Path("./hunyuan3d_output")
75
- cache_dir: Path = Path("./hunyuan3d_cache")
76
- temp_dir: Path = Path("./hunyuan3d_temp")
77
- keep_intermediates: bool = False # Keep intermediate files for debugging
78
-
79
- # Multi-view Settings
80
- enable_multi_view: bool = True
81
- views: List[str] = field(default_factory=lambda: ["front", "back", "left", "right"])
82
-
83
- class Hunyuan3DClient:
84
- """Client for interacting with Hunyuan3D Space API"""
85
-
86
- def __init__(self, config: Hunyuan3DConfig):
87
- self.config = config
88
- self.client = None
89
- self._initialize_client()
90
-
91
- def _initialize_client(self):
92
- """Initialize Gradio client for Hunyuan3D Space"""
93
- try:
94
- logger.info(f"Connecting to Hunyuan3D Space: {self.config.space_id}")
95
- self.client = Client(
96
- self.config.space_id,
97
- hf_token=self.config.use_auth_token
98
- )
99
- logger.info("Successfully connected to Hunyuan3D Space")
100
- except Exception as e:
101
- logger.error(f"Failed to connect to Hunyuan3D Space: {e}")
102
- raise
103
-
104
- async def generate_from_image(self,
105
- image_path: Union[str, Path],
106
- mode: GenerationMode,
107
- remove_bg: bool = True,
108
- foreground_ratio: float = 0.85) -> Dict[str, Any]:
109
- """Generate 3D model from single image"""
110
-
111
- logger.info(f"Generating 3D model from image: {image_path}")
112
-
113
- try:
114
- # Prepare parameters
115
- params = {
116
- "image": handle_file(str(image_path)),
117
- "mode": mode.value,
118
- "remove_bg": remove_bg,
119
- "foreground_ratio": foreground_ratio,
120
- "texture_method": self.config.texture_method.value
121
- }
122
-
123
- # Stage 1: Shape generation
124
- logger.info("Stage 1: Generating 3D shape...")
125
- shape_result = await self._run_generation(
126
- self.client.predict,
127
- "/generate_shape",
128
- **params
129
- )
130
-
131
- # Stage 2: Texture generation
132
- logger.info("Stage 2: Generating textures...")
133
- texture_result = await self._run_generation(
134
- self.client.predict,
135
- "/generate_texture",
136
- shape_data=shape_result,
137
- **params
138
- )
139
-
140
- # Process results
141
- return self._process_results(shape_result, texture_result)
142
-
143
- except Exception as e:
144
- logger.error(f"Generation failed: {e}")
145
- raise
146
-
147
- async def generate_from_multi_view(self,
148
- image_paths: Dict[str, Union[str, Path]],
149
- mode: GenerationMode) -> Dict[str, Any]:
150
- """Generate 3D model from multiple view images"""
151
-
152
- logger.info(f"Generating 3D model from {len(image_paths)} views")
153
-
154
- try:
155
- # Validate views
156
- required_views = ["front", "back", "left", "right"]
157
- for view in required_views:
158
- if view not in image_paths:
159
- raise ValueError(f"Missing required view: {view}")
160
-
161
- # Prepare multi-view parameters
162
- params = {
163
- "front_image": handle_file(str(image_paths["front"])),
164
- "back_image": handle_file(str(image_paths["back"])),
165
- "left_image": handle_file(str(image_paths["left"])),
166
- "right_image": handle_file(str(image_paths["right"])),
167
- "mode": mode.value,
168
- "texture_method": self.config.texture_method.value,
169
- "use_multi_view": True
170
- }
171
-
172
- # Multi-view generation
173
- logger.info("Generating from multi-view inputs...")
174
- result = await self._run_generation(
175
- self.client.predict,
176
- "/generate_multi_view",
177
- **params
178
- )
179
-
180
- return self._process_multi_view_results(result)
181
-
182
- except Exception as e:
183
- logger.error(f"Multi-view generation failed: {e}")
184
- raise
185
-
186
- async def _run_generation(self, func, endpoint: str, **kwargs) -> Any:
187
- """Run generation with retries and error handling"""
188
-
189
- for attempt in range(self.config.max_retries):
190
- try:
191
- # Run prediction
192
- result = await asyncio.get_event_loop().run_in_executor(
193
- None,
194
- lambda: func(endpoint, **kwargs)
195
- )
196
- return result
197
-
198
- except Exception as e:
199
- if attempt < self.config.max_retries - 1:
200
- wait_time = 2 ** attempt # Exponential backoff
201
- logger.warning(f"Attempt {attempt + 1} failed, retrying in {wait_time}s...")
202
- await asyncio.sleep(wait_time)
203
- else:
204
- raise e
205
-
206
- def _process_results(self, shape_result: Any, texture_result: Any) -> Dict[str, Any]:
207
- """Process generation results"""
208
-
209
- # Extract file paths from results
210
- model_files = {}
211
-
212
- # Handle different result formats
213
- if isinstance(texture_result, tuple):
214
- # Multiple outputs (model, textures, etc.)
215
- model_files["model"] = texture_result[0]
216
- if len(texture_result) > 1:
217
- model_files["texture"] = texture_result[1]
218
- if len(texture_result) > 2:
219
- model_files["preview"] = texture_result[2]
220
- else:
221
- model_files["model"] = texture_result
222
-
223
- # Extract metadata
224
- metadata = {
225
- "generation_mode": self.config.default_mode.value,
226
- "texture_method": self.config.texture_method.value,
227
- "timestamp": time.time()
228
- }
229
-
230
- return {
231
- "success": True,
232
- "files": model_files,
233
- "metadata": metadata
234
- }
235
-
236
- def _process_multi_view_results(self, result: Any) -> Dict[str, Any]:
237
- """Process multi-view generation results"""
238
-
239
- model_files = {}
240
-
241
- if isinstance(result, dict):
242
- model_files = result
243
- elif isinstance(result, tuple):
244
- model_files["model"] = result[0]
245
- if len(result) > 1:
246
- model_files["texture"] = result[1]
247
- else:
248
- model_files["model"] = result
249
-
250
- return {
251
- "success": True,
252
- "files": model_files,
253
- "metadata": {
254
- "generation_type": "multi_view",
255
- "views_used": self.config.views
256
- }
257
- }
258
-
259
- class Hunyuan3DProcessor:
260
- """Post-processing for Hunyuan3D outputs"""
261
-
262
- def __init__(self, config: Hunyuan3DConfig):
263
- self.config = config
264
-
265
- async def process_model(self,
266
- raw_model_path: Path,
267
- creature_name: str) -> Dict[str, Any]:
268
- """Process and optimize raw 3D model"""
269
-
270
- logger.info(f"Processing model: {raw_model_path}")
271
-
272
- # Load model
273
- mesh = trimesh.load(raw_model_path)
274
- if isinstance(mesh, trimesh.Scene):
275
- mesh = mesh.dump(concatenate=True)
276
-
277
- original_stats = {
278
- "vertices": len(mesh.vertices),
279
- "faces": len(mesh.faces),
280
- "bounds": mesh.bounds.tolist()
281
- }
282
-
283
- # Optimize if enabled
284
- if self.config.enable_optimization:
285
- mesh = self._optimize_mesh(mesh)
286
-
287
- # Export in requested format
288
- output_path = self.config.output_dir / f"{creature_name}_processed.{self.config.export_format.value}"
289
- mesh.export(output_path)
290
-
291
- # Generate preview
292
- preview_path = await self._generate_preview(mesh, creature_name)
293
-
294
- return {
295
- "processed_model": output_path,
296
- "preview": preview_path,
297
- "stats": {
298
- "original": original_stats,
299
- "optimized": {
300
- "vertices": len(mesh.vertices),
301
- "faces": len(mesh.faces)
302
- }
303
- }
304
- }
305
-
306
- def _optimize_mesh(self, mesh: trimesh.Trimesh) -> trimesh.Trimesh:
307
- """Optimize mesh for game use"""
308
-
309
- logger.info(f"Optimizing mesh: {len(mesh.faces)} faces -> {self.config.target_polycount}")
310
-
311
- # Clean up mesh
312
- mesh.remove_degenerate_faces()
313
- mesh.remove_duplicate_faces()
314
- mesh.remove_unreferenced_vertices()
315
-
316
- # Simplify if needed
317
- if len(mesh.faces) > self.config.target_polycount:
318
- mesh = mesh.simplify_quadric_decimation(self.config.target_polycount)
319
-
320
- # Ensure watertight
321
- mesh.fill_holes()
322
-
323
- # Smooth normals
324
- mesh.vertex_normals
325
-
326
- return mesh
327
-
328
- async def _generate_preview(self, mesh: trimesh.Trimesh, name: str) -> Path:
329
- """Generate preview image of mesh"""
330
-
331
- # Create scene
332
- scene = mesh.scene()
333
-
334
- # Set camera angle
335
- angles = [(np.pi / 6, np.pi / 4)] # 30 degrees elevation, 45 degrees azimuth
336
-
337
- for elevation, azimuth in angles:
338
- camera_transform = scene.camera.look_at(
339
- mesh.bounds.mean(axis=0),
340
- distance=mesh.scale * 2.5,
341
- elevation=elevation,
342
- azimuth=azimuth
343
- )
344
- scene.camera_transform = camera_transform
345
-
346
- # Render
347
- try:
348
- data = scene.save_image(resolution=[1024, 1024], visible=False)
349
- preview_path = self.config.output_dir / f"{name}_preview.png"
350
-
351
- if data:
352
- with open(preview_path, 'wb') as f:
353
- f.write(data.getvalue())
354
- return preview_path
355
- except:
356
- pass
357
-
358
- # Fallback preview
359
- preview_path = self.config.output_dir / f"{name}_preview.png"
360
- img = Image.new('RGBA', (1024, 1024), (200, 200, 200, 255))
361
- img.save(preview_path)
362
- return preview_path
363
-
364
- class Hunyuan3DPipeline:
365
- """
366
- Complete Hunyuan3D integration pipeline for DigiPal
367
- Philosophy: Leverage cutting-edge AI while maintaining simplicity
368
- """
369
-
370
- def __init__(self, config: Optional[Hunyuan3DConfig] = None):
371
- self.config = config or Hunyuan3DConfig()
372
-
373
- # Initialize components
374
- self.client = Hunyuan3DClient(self.config)
375
- self.processor = Hunyuan3DProcessor(self.config)
376
-
377
- # Create directories
378
- self.config.output_dir.mkdir(parents=True, exist_ok=True)
379
- self.config.cache_dir.mkdir(parents=True, exist_ok=True)
380
- self.config.temp_dir.mkdir(parents=True, exist_ok=True)
381
-
382
- # Cache for generated models
383
- self.cache = self._load_cache()
384
-
385
- logger.info("Hunyuan3D pipeline initialized")
386
-
387
- def _load_cache(self) -> Dict[str, Any]:
388
- """Load generation cache"""
389
- cache_file = self.config.cache_dir / "generation_cache.json"
390
- if cache_file.exists():
391
- with open(cache_file, 'r') as f:
392
- return json.load(f)
393
- return {}
394
-
395
- def _save_cache(self):
396
- """Save generation cache"""
397
- cache_file = self.config.cache_dir / "generation_cache.json"
398
- with open(cache_file, 'w') as f:
399
- json.dump(self.cache, f, indent=2)
400
-
401
- async def generate_from_text(self,
402
- prompt: str,
403
- name: str,
404
- mode: Optional[GenerationMode] = None,
405
- style: Optional[str] = None) -> Dict[str, Any]:
406
- """Generate 3D model from text prompt"""
407
-
408
- start_time = time.time()
409
- mode = mode or self.config.default_mode
410
-
411
- # Check cache
412
- cache_key = f"{prompt}_{mode.value}_{style or 'default'}"
413
- if cache_key in self.cache:
414
- logger.info(f"Using cached model for: {prompt}")
415
- return self.cache[cache_key]
416
-
417
- try:
418
- # Step 1: Generate concept image from text
419
- logger.info(f"Generating concept image for: {prompt}")
420
- concept_image = await self._generate_concept_image(prompt, style)
421
-
422
- # Step 2: Generate 3D from image
423
- logger.info("Converting to 3D with Hunyuan3D...")
424
- generation_result = await self.client.generate_from_image(
425
- concept_image,
426
- mode,
427
- self.config.remove_bg,
428
- self.config.foreground_ratio
429
- )
430
-
431
- if not generation_result["success"]:
432
- raise Exception("3D generation failed")
433
-
434
- # Step 3: Process and optimize
435
- logger.info("Processing generated model...")
436
- process_result = await self.processor.process_model(
437
- Path(generation_result["files"]["model"]),
438
- name
439
- )
440
-
441
- # Step 4: Handle textures
442
- texture_paths = {}
443
- if "texture" in generation_result["files"]:
444
- texture_paths["diffuse"] = generation_result["files"]["texture"]
445
-
446
- # Compile results
447
- result = {
448
- "success": True,
449
- "name": name,
450
- "prompt": prompt,
451
- "mode": mode.value,
452
- "style": style,
453
- "pipeline": "hunyuan3d",
454
- "paths": {
455
- "concept_image": str(concept_image),
456
- "raw_model": generation_result["files"]["model"],
457
- "processed_model": str(process_result["processed_model"]),
458
- "preview": str(process_result["preview"])
459
- },
460
- "textures": texture_paths,
461
- "stats": {
462
- "generation_time": time.time() - start_time,
463
- **process_result["stats"]
464
- },
465
- "metadata": generation_result["metadata"]
466
- }
467
-
468
- # Cache result
469
- self.cache[cache_key] = result
470
- self._save_cache()
471
-
472
- # Save metadata
473
- metadata_path = self.config.output_dir / f"{name}_metadata.json"
474
- with open(metadata_path, 'w') as f:
475
- json.dump(result, f, indent=2)
476
-
477
- logger.info(f"Generation completed in {result['stats']['generation_time']:.2f}s")
478
- return result
479
-
480
- except Exception as e:
481
- logger.error(f"Pipeline failed: {e}")
482
- return {
483
- "success": False,
484
- "error": str(e),
485
- "name": name,
486
- "prompt": prompt
487
- }
488
-
489
- async def generate_from_image(self,
490
- image_path: Union[str, Path],
491
- name: str,
492
- mode: Optional[GenerationMode] = None) -> Dict[str, Any]:
493
- """Generate 3D model from single image"""
494
-
495
- start_time = time.time()
496
- mode = mode or self.config.default_mode
497
-
498
- try:
499
- # Generate 3D from image
500
- logger.info(f"Generating 3D from image: {image_path}")
501
- generation_result = await self.client.generate_from_image(
502
- image_path,
503
- mode,
504
- self.config.remove_bg,
505
- self.config.foreground_ratio
506
- )
507
-
508
- if not generation_result["success"]:
509
- raise Exception("3D generation failed")
510
-
511
- # Process model
512
- process_result = await self.processor.process_model(
513
- Path(generation_result["files"]["model"]),
514
- name
515
- )
516
-
517
- # Compile results
518
- result = {
519
- "success": True,
520
- "name": name,
521
- "input_image": str(image_path),
522
- "mode": mode.value,
523
- "pipeline": "hunyuan3d",
524
- "paths": {
525
- "input_image": str(image_path),
526
- "raw_model": generation_result["files"]["model"],
527
- "processed_model": str(process_result["processed_model"]),
528
- "preview": str(process_result["preview"])
529
- },
530
- "stats": {
531
- "generation_time": time.time() - start_time,
532
- **process_result["stats"]
533
- },
534
- "metadata": generation_result["metadata"]
535
- }
536
-
537
- return result
538
-
539
- except Exception as e:
540
- logger.error(f"Pipeline failed: {e}")
541
- return {
542
- "success": False,
543
- "error": str(e),
544
- "name": name,
545
- "input_image": str(image_path)
546
- }
547
-
548
- async def generate_from_multi_view(self,
549
- image_paths: Dict[str, Union[str, Path]],
550
- name: str,
551
- mode: Optional[GenerationMode] = None) -> Dict[str, Any]:
552
- """Generate 3D model from multiple view images"""
553
-
554
- start_time = time.time()
555
- mode = mode or self.config.default_mode
556
-
557
- try:
558
- # Generate 3D from multi-view
559
- logger.info(f"Generating 3D from {len(image_paths)} views")
560
- generation_result = await self.client.generate_from_multi_view(
561
- image_paths,
562
- mode
563
- )
564
-
565
- if not generation_result["success"]:
566
- raise Exception("Multi-view generation failed")
567
-
568
- # Process model
569
- process_result = await self.processor.process_model(
570
- Path(generation_result["files"]["model"]),
571
- name
572
- )
573
-
574
- # Compile results
575
- result = {
576
- "success": True,
577
- "name": name,
578
- "input_views": {k: str(v) for k, v in image_paths.items()},
579
- "mode": mode.value,
580
- "pipeline": "hunyuan3d_multiview",
581
- "paths": {
582
- "raw_model": generation_result["files"]["model"],
583
- "processed_model": str(process_result["processed_model"]),
584
- "preview": str(process_result["preview"])
585
- },
586
- "stats": {
587
- "generation_time": time.time() - start_time,
588
- **process_result["stats"]
589
- },
590
- "metadata": generation_result["metadata"]
591
- }
592
-
593
- return result
594
-
595
- except Exception as e:
596
- logger.error(f"Multi-view pipeline failed: {e}")
597
- return {
598
- "success": False,
599
- "error": str(e),
600
- "name": name,
601
- "input_views": {k: str(v) for k, v in image_paths.items()}
602
- }
603
-
604
- async def _generate_concept_image(self, prompt: str, style: Optional[str] = None) -> Path:
605
- """Generate concept image from text (placeholder - integrate with your image gen)"""
606
-
607
- # This is a placeholder - integrate with your preferred text-to-image model
608
- # For example, could use SDXL, Flux, or other models
609
-
610
- logger.info("Generating concept image...")
611
-
612
- # For now, create a placeholder image
613
- img = Image.new('RGB', (1024, 1024), (100, 100, 100))
614
-
615
- # Add some text to indicate it's a placeholder
616
- from PIL import ImageDraw, ImageFont
617
- draw = ImageDraw.Draw(img)
618
- text = f"Concept: {prompt[:50]}..."
619
-
620
- try:
621
- # Try to use a font
622
- font = ImageFont.truetype("/usr/share/fonts/truetype/dejavu/DejaVuSans.ttf", 40)
623
- except:
624
- font = ImageFont.load_default()
625
-
626
- # Draw text
627
- bbox = draw.textbbox((0, 0), text, font=font)
628
- text_width = bbox[2] - bbox[0]
629
- text_height = bbox[3] - bbox[1]
630
- position = ((1024 - text_width) // 2, (1024 - text_height) // 2)
631
- draw.text(position, text, fill='white', font=font)
632
-
633
- # Save image
634
- concept_path = self.config.temp_dir / f"concept_{hash(prompt)}.png"
635
- img.save(concept_path)
636
-
637
- return concept_path
638
-
639
- async def generate_monster_for_digipal(self,
640
- monster_data: Dict[str, Any],
641
- mode: Optional[GenerationMode] = None) -> Dict[str, Any]:
642
- """Generate 3D model specifically for DigiPal monster"""
643
-
644
- # Extract monster characteristics
645
- name = monster_data.get("name", "Monster")
646
- stage = monster_data.get("stage", "child")
647
- personality = monster_data.get("personality", "friendly")
648
- species_type = monster_data.get("species_type", "data")
649
-
650
- # Create detailed prompt
651
- prompt = self._create_monster_prompt(monster_data)
652
-
653
- # Determine generation mode based on stage
654
- if mode is None:
655
- stage_modes = {
656
- "baby": GenerationMode.TURBO,
657
- "child": GenerationMode.FAST,
658
- "adult": GenerationMode.STANDARD,
659
- "perfect": GenerationMode.STANDARD,
660
- "ultimate": GenerationMode.STANDARD
661
- }
662
- mode = stage_modes.get(stage, GenerationMode.FAST)
663
-
664
- # Generate model
665
- result = await self.generate_from_text(
666
- prompt=prompt,
667
- name=f"{name}_{stage}",
668
- mode=mode,
669
- style=personality
670
- )
671
-
672
- # Add monster-specific metadata
673
- if result["success"]:
674
- result["monster_metadata"] = monster_data
675
-
676
- return result
677
-
678
- def _create_monster_prompt(self, monster_data: Dict[str, Any]) -> str:
679
- """Create detailed prompt from monster data with consistent style"""
680
-
681
- # Core visual style - consistent across all stages
682
- base_style = "AAA 3D render, well lit, white void background"
683
-
684
- # Size descriptors that maintain similar adult proportions
685
- stage_size = {
686
- "egg": "small egg-shaped",
687
- "baby": "compact young",
688
- "child": "medium-sized adolescent",
689
- "adult": "full-sized mature",
690
- "perfect": "large powerful",
691
- "ultimate": "imposing legendary",
692
- "mega": "colossal mythical"
693
- }
694
-
695
- # Stage modifiers - subtle changes while maintaining core design
696
- stage_modifier = {
697
- "egg": "dormant crystalline",
698
- "baby": "newly formed",
699
- "child": "developing",
700
- "adult": "fully formed",
701
- "perfect": "enhanced",
702
- "ultimate": "transcendent",
703
- "mega": "apex"
704
- }
705
-
706
- # Personality traits - affects posture and expression
707
- personality_traits = {
708
- "brave": "fierce stance, determined expression",
709
- "calm": "serene posture, wise countenance",
710
- "energetic": "dynamic pose, alert expression",
711
- "clever": "calculating gaze, strategic stance",
712
- "friendly": "approachable posture, gentle expression"
713
- }
714
-
715
- # Species visual themes - consistent design language
716
- species_theme = {
717
- "data": "digital creature with crystalline segments, circuit patterns, holographic accents",
718
- "vaccine": "angelic creature with white armor plating, golden trim, light aura",
719
- "virus": "dark creature with shadowy armor, purple energy, corrupted data streams",
720
- "free": "natural creature with organic armor, earth tones, elemental energy"
721
- }
722
-
723
- # Extract attributes
724
- stage = monster_data.get("stage", "child")
725
- personality = monster_data.get("personality", "friendly")
726
- species = monster_data.get("species_type", "data")
727
- name = monster_data.get("name", "DigiPal")
728
-
729
- # Build consistent prompt
730
- size = stage_size.get(stage, "medium-sized")
731
- modifier = stage_modifier.get(stage, "")
732
- traits = personality_traits.get(personality, "neutral stance")
733
- theme = species_theme.get(species, "digital creature")
734
-
735
- # Stats-based features - subtle additions
736
- features = []
737
- stats = monster_data.get("stats", {})
738
- if stats.get("offense", 0) > 70:
739
- features.append("reinforced claws")
740
- if stats.get("defense", 0) > 70:
741
- features.append("enhanced armor")
742
- if stats.get("speed", 0) > 70:
743
- features.append("streamlined form")
744
- if stats.get("brains", 0) > 70:
745
- features.append("glowing intelligence core")
746
-
747
- feature_str = f", {', '.join(features)}" if features else ""
748
-
749
- # Construct final prompt with consistent style
750
- prompt = (
751
- f"{base_style}, {size} {modifier} {theme}, "
752
- f"{traits}{feature_str}, "
753
- f"T-pose, game character model, consistent design language, "
754
- f"professional 3D game asset, pbr materials, clean topology"
755
- )
756
-
757
- # Add stage-specific quality hints
758
- if stage in ["ultimate", "mega", "perfect"]:
759
- prompt += ", hero character quality, highly detailed"
760
- elif stage in ["baby", "egg"]:
761
- prompt += ", simplified design, cute proportions"
762
-
763
- return prompt
764
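
For reference, running the tables above for a child-stage, brave, data-type monster with no stat over 70 yields this prompt (reconstructed by hand from the code, not pipeline output):

    AAA 3D render, well lit, white void background, medium-sized adolescent developing digital creature with crystalline segments, circuit patterns, holographic accents, fierce stance, determined expression, T-pose, game character model, consistent design language, professional 3D game asset, pbr materials, clean topology
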
-
765
- async def batch_generate(self,
766
- generation_tasks: List[Dict[str, Any]],
767
- max_concurrent: int = 2) -> List[Dict[str, Any]]:
768
- """Batch generate multiple models"""
769
-
770
- semaphore = asyncio.Semaphore(max_concurrent)
771
-
772
- async def generate_with_limit(task: Dict[str, Any]):
773
- async with semaphore:
774
- if "prompt" in task:
775
- return await self.generate_from_text(
776
- prompt=task["prompt"],
777
- name=task["name"],
778
- mode=task.get("mode"),
779
- style=task.get("style")
780
- )
781
- elif "image" in task:
782
- return await self.generate_from_image(
783
- image_path=task["image"],
784
- name=task["name"],
785
- mode=task.get("mode")
786
- )
787
- elif "images" in task:
788
- return await self.generate_from_multi_view(
789
- image_paths=task["images"],
790
- name=task["name"],
791
- mode=task.get("mode")
792
- )
793
- else:
794
- return {
795
- "success": False,
796
- "error": "Invalid task format",
797
- "task": task
798
- }
799
-
800
- tasks = [generate_with_limit(task) for task in generation_tasks]
801
- results = await asyncio.gather(*tasks)
802
-
803
- return results
804
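
The three branches above define the accepted task shapes; a sketch of a mixed batch (file paths are illustrative):

    tasks = [
        {"prompt": "cute blue dragon", "name": "Dragon_A", "mode": GenerationMode.FAST},
        {"image": "concepts/dragon_b.png", "name": "Dragon_B"},
        {"images": {"front": "front.png", "back": "back.png"}, "name": "Dragon_C"},
    ]
    results = await pipeline.batch_generate(tasks, max_concurrent=2)
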
-
805
- # Integration with DigiPal's monster system
806
- class DigiPalHunyuan3DIntegration:
807
- """Integration layer for DigiPal monster system"""
808
-
809
- def __init__(self, config: Optional[Hunyuan3DConfig] = None):
810
- self.pipeline = Hunyuan3DPipeline(config)
811
-
812
- async def generate_monster_model(self,
813
- monster: Any, # DW1Monster type
814
- force_regenerate: bool = False) -> Dict[str, Any]:
815
- """Generate 3D model for DigiPal monster"""
816
-
817
- # Convert monster to data dict
818
- monster_data = {
819
- "name": monster.name,
820
- "stage": monster.evolution.current_stage.value,
821
- "personality": monster.personality.value,
822
- "species_type": monster.species_type.value,
823
- "stats": {
824
- "offense": monster.battle_stats.offense,
825
- "defense": monster.battle_stats.defense,
826
- "speed": monster.battle_stats.speed,
827
- "brains": monster.battle_stats.brains
828
- }
829
- }
830
-
831
- # Check cache unless forced
832
- cache_key = f"{monster.id}_{monster.evolution.current_stage.value}"
833
- if not force_regenerate and cache_key in self.pipeline.cache:
834
- logger.info(f"Using cached model for {monster.name}")
835
- return self.pipeline.cache[cache_key]
836
-
837
- # Generate model
838
- result = await self.pipeline.generate_monster_for_digipal(monster_data)
839
-
840
- # Cache if successful
841
- if result["success"]:
842
- self.pipeline.cache[cache_key] = result
843
- self.pipeline._save_cache()
844
-
845
- return result
846
-
847
- async def generate_evolution_sequence(self,
848
- monster: Any,
849
- stages: List[str]) -> List[Dict[str, Any]]:
850
- """Generate models for evolution sequence"""
851
-
852
- tasks = []
853
- for stage in stages:
854
- monster_data = {
855
- "name": monster.name,
856
- "stage": stage,
857
- "personality": monster.personality.value,
858
- "species_type": monster.species_type.value,
859
- "stats": {
860
- "offense": monster.battle_stats.offense,
861
- "defense": monster.battle_stats.defense,
862
- "speed": monster.battle_stats.speed,
863
- "brains": monster.battle_stats.brains
864
- }
865
- }
866
-
867
- tasks.append({
868
- "monster_data": monster_data,
869
- "mode": GenerationMode.FAST if stage in ["baby", "child"] else GenerationMode.STANDARD
870
- })
871
-
872
- # Generate all stages
873
- results = []
874
- for task in tasks:
875
- result = await self.pipeline.generate_monster_for_digipal(
876
- task["monster_data"],
877
- task["mode"]
878
- )
879
- results.append(result)
880
-
881
- return results
882
-
883
- # Convenience functions
884
- def create_hunyuan3d_config(output_path: str = "hunyuan3d_config.json"):
885
- """Create configuration template"""
886
-
887
- config = {
888
- "space_id": "Tencent/Hunyuan3D-2",
889
- "use_auth_token": None,
890
- "max_retries": 3,
891
- "timeout": 600,
892
- "default_mode": "Fast",
893
- "texture_method": "RGB",
894
- "export_format": "glb",
895
- "shape_resolution": 256,
896
- "texture_resolution": 1024,
897
- "remove_bg": True,
898
- "foreground_ratio": 0.85,
899
- "enable_optimization": True,
900
- "target_polycount": 30000,
901
- "simplify_ratio": 0.5,
902
- "output_dir": "./hunyuan3d_output",
903
- "cache_dir": "./hunyuan3d_cache",
904
- "temp_dir": "./hunyuan3d_temp",
905
- "keep_intermediates": False,
906
- "enable_multi_view": True,
907
- "views": ["front", "back", "left", "right"]
908
- }
909
-
910
- with open(output_path, 'w') as f:
911
- json.dump(config, f, indent=2)
912
-
913
- logger.info(f"Config template created at: {output_path}")
914
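
A round-trip sketch for the template, assuming Hunyuan3DConfig accepts these keys as constructor kwargs (not verified here):

    import json

    # load the template written above back into a config object (assumption:
    # Hunyuan3DConfig takes these fields as keyword arguments)
    with open("hunyuan3d_config.json") as f:
        config = Hunyuan3DConfig(**json.load(f))
    pipeline = Hunyuan3DPipeline(config)
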
-
915
- # Example usage
916
- if __name__ == "__main__":
917
- import asyncio
918
-
919
- async def example():
920
- # Initialize pipeline
921
- config = Hunyuan3DConfig()
922
- pipeline = Hunyuan3DPipeline(config)
923
-
924
- # Example 1: Generate from text
925
- result = await pipeline.generate_from_text(
926
- prompt="cute blue dragon with big eyes, small wings, friendly expression",
927
- name="BlueDragon",
928
- mode=GenerationMode.FAST
929
- )
930
-
931
- if result["success"]:
932
- print(f"Generated model: {result['paths']['processed_model']}")
933
- print(f"Generation time: {result['stats']['generation_time']:.2f}s")
934
- else:
935
- print(f"Generation failed: {result['error']}")
936
-
937
- # Example 2: Generate from image
938
- # result = await pipeline.generate_from_image(
939
- # image_path="dragon_concept.png",
940
- # name="DragonFromImage",
941
- # mode=GenerationMode.STANDARD
942
- # )
943
-
944
- # Example 3: Multi-view generation
945
- # views = {
946
- # "front": "dragon_front.png",
947
- # "back": "dragon_back.png",
948
- # "left": "dragon_left.png",
949
- # "right": "dragon_right.png"
950
- # }
951
- # result = await pipeline.generate_from_multi_view(
952
- # image_paths=views,
953
- # name="DragonMultiView",
954
- # mode=GenerationMode.STANDARD
955
- # )
956
-
957
- # Run example
958
- # asyncio.run(example())
 
 
src/pipelines/opensource_3d_pipeline_v2.py CHANGED
@@ -1,6 +1,6 @@
1
  """
2
  Production-Ready Open-Source Text-to-Rigged-3D Pipeline
3
- Uses HuggingFace Spaces API for Flux, Sparc3D implementation, and UniRig models
4
  Rick Rubin philosophy: Strip complexity, amplify creativity
5
  """
6
 
@@ -33,14 +33,14 @@ logger = logging.getLogger(__name__)
33
  class ProductionConfig:
34
  """Production configuration for open-source pipeline"""
35
 
36
- # Text-to-image model options
37
- text_to_image_model: str = "omnigen2" # omnigen2 or flux
38
- omnigen2_repo: str = "OmniGen2/OmniGen2"
39
  flux_space: str = "black-forest-labs/FLUX.1-dev" # Kept for fallback
40
- sparc3d_space: Optional[str] = "lizhihao6/Sparc3D" # If available as Space
41
 
42
- # Local model paths
43
- sparc3d_repo: str = "https://github.com/lizhihao6/Sparc3D"
 
44
  unirig_repo: str = "https://github.com/VAST-AI-Research/UniRig"
45
  unirig_hf_model: str = "VAST-AI/UniRig"
46
 
@@ -51,7 +51,7 @@ class ProductionConfig:
51
  inference_steps: int = 28
52
 
53
  # 3D settings
54
- sparc3d_resolution: int = 1024 # Up to 1024³ as per paper
55
  target_polycount: int = 30000
56
  texture_resolution: int = 2048
57
 
@@ -71,35 +71,56 @@ class ProductionConfig:
71
  enable_cpu_offload: bool = True # For VRAM optimization
72
 
73
  class OmniGen2MultiViewGenerator:
74
- """Generate multi-view images using OmniGen2"""
75
 
76
  def __init__(self, config: ProductionConfig):
77
  self.config = config
78
  self.model = None
79
  self.tokenizer = None
80
- logger.info(f"Initializing OmniGen2 from: {config.omnigen2_repo}")
81
 
82
  def _load_model(self):
83
- """Lazy load the model to save memory"""
84
  if self.model is None:
85
  try:
86
- # Load OmniGen2 model from HuggingFace
87
- self.tokenizer = AutoTokenizer.from_pretrained(
88
- self.config.omnigen2_repo,
89
- trust_remote_code=True
90
- )
91
- self.model = AutoModel.from_pretrained(
92
  self.config.omnigen2_repo,
93
  trust_remote_code=True,
94
  torch_dtype=torch.float16 if self.config.device == "cuda" else torch.float32,
95
  device_map="auto" if self.config.enable_cpu_offload else None
96
  )
 
97
  if not self.config.enable_cpu_offload:
98
  self.model = self.model.to(self.config.device)
99
- logger.info("OmniGen2 model loaded successfully")
 
 
100
  except Exception as e:
101
- logger.error(f"Failed to load OmniGen2 model: {str(e)}")
102
- raise
 
 
103
 
104
  async def generate_creature_views(self, base_prompt: str,
105
  creature_name: str) -> Dict[str, Path]:
@@ -153,34 +174,59 @@ class OmniGen2MultiViewGenerator:
153
  f"consistent lighting, monster character design"
154
  )
155
 
156
- # Generate image using OmniGen2
157
- inputs = self.tokenizer(full_prompt, return_tensors="pt", truncation=True)
158
-
159
  with torch.no_grad():
160
- # Use OmniGen2's generation method
161
- outputs = self.model.generate(
162
- **inputs,
163
- max_pixels=self.config.max_pixels,
164
- guidance_scale=self.config.text_guidance_scale,
165
- num_inference_steps=self.config.inference_steps,
166
- width=self.config.image_resolution,
167
- height=self.config.image_resolution
168
- )
 
 
169
 
170
  # Save generated image
171
  output_path = self.config.output_dir / f"{creature_name}_{view_name}_view.png"
172
  output_path.parent.mkdir(parents=True, exist_ok=True)
173
 
174
- # Convert tensor to PIL Image and save
175
- if isinstance(outputs, torch.Tensor):
176
- # Convert tensor to PIL Image
177
- image_array = outputs.cpu().numpy().squeeze()
178
- if image_array.ndim == 3 and image_array.shape[0] == 3:
179
- image_array = np.transpose(image_array, (1, 2, 0))
180
- image_array = (image_array * 255).astype(np.uint8)
181
- image = Image.fromarray(image_array)
182
- else:
183
- image = outputs # Assume it's already a PIL Image
184
 
185
  image.save(output_path)
186
  output_paths[view_name] = output_path
@@ -400,316 +446,312 @@ class FluxMultiViewGenerator:
400
 
401
  return output_path
402
 
403
- class Sparc3DProcessor:
404
- """Sparc3D implementation for ultra-high resolution 3D generation"""
405
 
406
  def __init__(self, config: ProductionConfig):
407
  self.config = config
408
- self.model = None
409
- self._setup_sparc3d()
410
-
411
- def _setup_sparc3d(self):
412
- """Setup Sparc3D from GitHub repository"""
413
-
414
- sparc3d_path = Path("./Sparc3D")
415
-
416
- if not sparc3d_path.exists():
417
- logger.info("Cloning Sparc3D repository...")
418
- subprocess.run([
419
- "git", "clone", self.config.sparc3d_repo, str(sparc3d_path)
420
- ], check=True)
421
-
422
- # Install requirements
423
- requirements_file = sparc3d_path / "requirements.txt"
424
- if requirements_file.exists():
425
- subprocess.run([
426
- "pip", "install", "-r", str(requirements_file)
427
- ], check=True)
428
-
429
- # Add to Python path
430
- import sys
431
- sys.path.insert(0, str(sparc3d_path))
432
 
433
- try:
434
- # Import Sparc3D modules based on repository structure
435
- from sparc3d import Sparc3DPipeline
436
-
437
- # Initialize model
438
- self.model = Sparc3DPipeline(
439
- device=self.config.device,
440
- resolution=self.config.sparc3d_resolution
441
- )
442
- logger.info(f"Sparc3D initialized at {self.config.sparc3d_resolution}³ resolution")
443
-
444
- except ImportError as e:
445
- logger.warning(f"Could not import Sparc3D: {e}")
446
- logger.info("Using simplified implementation")
447
- self.model = SimplifiedSparc3D(self.config)
 
 
 
448
 
449
  async def generate_3d_from_views(self, view_paths: Dict[str, Path],
450
  creature_name: str) -> Dict[str, Any]:
451
- """Generate 3D model from multi-view images"""
452
-
453
- logger.info(f"Generating 3D model from {len(view_paths)} views")
454
 
455
- # Load and preprocess images
456
- processed_views = self._preprocess_views(view_paths)
457
 
458
- # Generate 3D using Sparc3D
459
  start_time = time.time()
460
 
461
  try:
462
- # Run Sparc3D pipeline
463
- mesh_data = await self._run_sparc3d_pipeline(processed_views)
 
 
 
 
464
 
465
- # Convert to trimesh
466
- mesh = self._create_trimesh(mesh_data)
467
 
468
- # Optimize mesh
469
- mesh = self._optimize_mesh(mesh)
470
 
471
- # Generate textures
472
- texture_data = await self._generate_textures(mesh, processed_views, creature_name)
473
-
474
- # Save outputs
475
- mesh_path = self.config.output_dir / f"{creature_name}_mesh.glb"
476
- mesh.export(mesh_path)
477
-
478
- generation_time = time.time() - start_time
 
 
479
 
480
- return {
481
- "success": True,
482
- "mesh_path": mesh_path,
483
- "texture_path": texture_data["path"],
484
- "statistics": {
485
- "vertices": len(mesh.vertices),
486
- "faces": len(mesh.faces),
487
- "generation_time": generation_time,
488
- "resolution": self.config.sparc3d_resolution,
489
- "texture_size": texture_data["size"]
490
- }
491
- }
492
 
493
  except Exception as e:
494
- logger.error(f"3D generation failed: {e}")
495
  return {
496
  "success": False,
497
  "error": str(e)
498
  }
499
 
500
- def _preprocess_views(self, view_paths: Dict[str, Path]) -> Dict[str, np.ndarray]:
501
- """Preprocess views for Sparc3D input"""
502
-
503
- processed = {}
504
-
505
- for view_name, path in view_paths.items():
506
- if view_name == "concept_sheet":
507
- continue # Skip concept sheet
508
-
509
- # Load image
510
- img = Image.open(path)
511
-
512
- # Ensure square aspect ratio
513
- if img.width != img.height:
514
- size = min(img.width, img.height)
515
- img = img.crop((
516
- (img.width - size) // 2,
517
- (img.height - size) // 2,
518
- (img.width + size) // 2,
519
- (img.height + size) // 2
520
- ))
521
-
522
- # Resize to expected resolution
523
- img = img.resize((512, 512), Image.Resampling.LANCZOS)
524
-
525
- # Convert to array and normalize
526
- img_array = np.array(img).astype(np.float32) / 255.0
527
-
528
- processed[view_name] = img_array
529
-
530
- return processed
531
-
532
- async def _run_sparc3d_pipeline(self, views: Dict[str, np.ndarray]) -> Dict[str, Any]:
533
- """Run Sparc3D pipeline on preprocessed views"""
534
-
535
- if hasattr(self.model, 'generate_3d'):
536
- # Use actual Sparc3D implementation
537
- return await self.model.generate_3d(views)
538
- else:
539
- # Use simplified implementation
540
- return self.model.generate_mesh(views)
541
-
542
- def _create_trimesh(self, mesh_data: Dict[str, Any]) -> trimesh.Trimesh:
543
- """Convert Sparc3D output to trimesh object"""
544
-
545
- vertices = np.array(mesh_data["vertices"])
546
- faces = np.array(mesh_data["faces"])
547
-
548
- # Create mesh
549
- mesh = trimesh.Trimesh(vertices=vertices, faces=faces)
550
-
551
- # Center and normalize scale
552
- mesh.vertices -= mesh.centroid
553
- max_extent = np.max(mesh.extents)
554
- mesh.vertices *= 2.0 / max_extent
555
-
556
- return mesh
557
-
558
- def _optimize_mesh(self, mesh: trimesh.Trimesh) -> trimesh.Trimesh:
559
- """Optimize mesh for game use"""
560
 
561
- logger.info(f"Optimizing mesh: {len(mesh.faces)} faces")
562
 
563
- # Clean up
564
- mesh.remove_duplicate_faces()
565
- mesh.remove_unreferenced_vertices()
566
- mesh.fill_holes()
567
 
568
- # Simplify if needed
569
- if len(mesh.faces) > self.config.target_polycount:
570
- target = self.config.target_polycount
571
- logger.info(f"Simplifying to {target} faces...")
572
- mesh = mesh.simplify_quadric_decimation(target)
573
 
574
- # Ensure manifold
575
- if not mesh.is_watertight:
576
- mesh.fill_holes()
 
 
 
577
 
578
- # Smooth normals
579
- mesh.vertex_normals
 
 
580
 
581
- return mesh
582
-
583
- async def _generate_textures(self, mesh: trimesh.Trimesh,
584
- views: Dict[str, np.ndarray],
585
- creature_name: str) -> Dict[str, Any]:
586
- """Generate PBR textures from input views"""
587
-
588
- # Generate UV mapping
589
- if not hasattr(mesh.visual, 'uv') or mesh.visual.uv is None:
590
- logger.info("Generating UV mapping...")
591
- # Use xatlas or similar for UV unwrapping
592
- uv_coords = self._generate_smart_uvs(mesh)
593
- mesh.visual = trimesh.visual.TextureVisuals(uv=uv_coords)
594
-
595
- # Create texture from views
596
- texture_size = self.config.texture_resolution
597
-
598
- # Initialize texture maps
599
- albedo = np.ones((texture_size, texture_size, 3), dtype=np.uint8) * 200
600
- normal = np.ones((texture_size, texture_size, 3), dtype=np.uint8) * 128
601
- normal[:, :, 2] = 255 # Default normal pointing up
602
-
603
- # Project views onto texture
604
- # In production, use proper texture projection algorithms
605
- if "front" in views:
606
- front_img = (views["front"] * 255).astype(np.uint8)
607
- front_img = Image.fromarray(front_img)
608
- front_img = front_img.resize((texture_size, texture_size), Image.Resampling.LANCZOS)
609
- albedo = np.array(front_img)
610
-
611
- # Save textures
612
- texture_path = self.config.output_dir / f"{creature_name}_albedo.png"
613
- Image.fromarray(albedo).save(texture_path)
614
-
615
- normal_path = self.config.output_dir / f"{creature_name}_normal.png"
616
- Image.fromarray(normal).save(normal_path)
617
-
618
- # Apply to mesh
619
- mesh.visual.material.image = Image.fromarray(albedo)
620
 
621
  return {
622
- "path": texture_path,
623
- "size": texture_size,
624
- "maps": {
625
- "albedo": str(texture_path),
626
- "normal": str(normal_path)
 
 
 
 
 
627
  }
628
  }
629
-
630
- def _generate_smart_uvs(self, mesh: trimesh.Trimesh) -> np.ndarray:
631
- """Generate smart UV coordinates"""
632
-
633
- # Try using xatlas if available
634
- try:
635
- import xatlas
636
- vmapping, indices, uvs = xatlas.parametrize(mesh.vertices, mesh.faces)
637
- return uvs
638
- except ImportError:
639
- # Fallback to simple cylindrical mapping
640
- vertices = mesh.vertices
641
- theta = np.arctan2(vertices[:, 0], vertices[:, 2])
642
- height = vertices[:, 1]
643
-
644
- u = (theta + np.pi) / (2 * np.pi)
645
- v = (height - height.min()) / (height.max() - height.min())
646
-
647
- return np.column_stack([u, v])
648
 
649
  class UniRigProcessor:
650
- """UniRig integration using HuggingFace models"""
651
 
652
  def __init__(self, config: ProductionConfig):
653
  self.config = config
654
  self.model_path = None
655
- self._setup_unirig()
656
-
657
- def _setup_unirig(self):
658
- """Setup UniRig from HuggingFace and GitHub"""
659
 
660
- # Download UniRig models from HuggingFace
661
- logger.info("Downloading UniRig models from HuggingFace...")
662
 
663
  try:
664
- self.model_path = snapshot_download(
665
- repo_id=self.config.unirig_hf_model,
666
- cache_dir=self.config.cache_dir,
667
- token=self.config.hf_token
668
- )
669
- logger.info(f"UniRig models downloaded to: {self.model_path}")
670
 
671
- except Exception as e:
672
- logger.warning(f"Could not download UniRig from HF: {e}")
673
-
674
- # Fallback to GitHub
675
- unirig_path = Path("./UniRig")
676
- if not unirig_path.exists():
677
- logger.info("Cloning UniRig from GitHub...")
678
- subprocess.run([
679
- "git", "clone", self.config.unirig_repo, str(unirig_path)
680
- ], check=True)
 
681
 
682
- self.model_path = unirig_path
 
 
 
683
 
684
  async def auto_rig_creature(self, mesh_path: Path, creature_name: str,
685
  creature_type: str = "biped") -> Dict[str, Any]:
686
- """Apply automatic rigging using UniRig"""
687
 
688
- logger.info(f"Auto-rigging {creature_name} as {creature_type}")
 
 
 
 
689
 
690
  try:
691
- # Determine rigging approach based on creature type
692
- if creature_type == "biped":
693
- skeleton_config = "configs/skeleton_biped.json"
694
- elif creature_type == "quadruped":
695
- skeleton_config = "configs/skeleton_quadruped.json"
696
- else:
697
- skeleton_config = "configs/skeleton_generic.json"
698
 
699
- # Run UniRig pipeline
700
- rigged_data = await self._run_unirig_pipeline(
701
- mesh_path,
702
- skeleton_config,
703
- creature_name
704
- )
705
 
706
- return rigged_data
 
 
 
707
 
708
  except Exception as e:
709
  logger.error(f"UniRig failed: {e}")
710
  # Fallback to procedural rigging
711
  return await self._procedural_rigging_fallback(mesh_path, creature_name, creature_type)
712
 
 
 
 
713
  async def _run_unirig_pipeline(self, mesh_path: Path,
714
  skeleton_config: str,
715
  creature_name: str) -> Dict[str, Any]:
@@ -999,8 +1041,8 @@ class UniRigProcessor:
999
  "method": "procedural_fallback"
1000
  }
1001
 
1002
- class SimplifiedSparc3D:
1003
- """Simplified 3D generation when Sparc3D is not available"""
1004
 
1005
  def __init__(self, config: ProductionConfig):
1006
  self.config = config
@@ -1076,7 +1118,7 @@ class SimplifiedSparc3D:
1076
  class ProductionPipeline:
1077
  """
1078
  Complete production-ready open-source pipeline
1079
- Flux -> Sparc3D -> UniRig
1080
  """
1081
 
1082
  def __init__(self, config: Optional[ProductionConfig] = None):
@@ -1101,7 +1143,7 @@ class ProductionPipeline:
1101
  else:
1102
  self.image_generator = FluxMultiViewGenerator(self.config)
1103
 
1104
- self.sparc3d = Sparc3DProcessor(self.config)
1105
  self.unirig = UniRigProcessor(self.config)
1106
 
1107
  logger.info("Production pipeline ready!")
@@ -1141,14 +1183,14 @@ class ProductionPipeline:
1141
  "outputs": {k: str(v) for k, v in views.items()}
1142
  }
1143
 
1144
- # Stage 2: 3D generation with Sparc3D
1145
  logger.info("=" * 50)
1146
- logger.info("Stage 2: Generating 3D model with Sparc3D")
1147
  stage_start = time.time()
1148
 
1149
- model_data = await self.sparc3d.generate_3d_from_views(views, name)
1150
 
1151
- results["pipeline_stages"]["sparc3d"] = {
1152
  "success": model_data["success"],
1153
  "duration": time.time() - stage_start,
1154
  "outputs": model_data
@@ -1182,13 +1224,14 @@ class ProductionPipeline:
1182
  results["final_outputs"] = {
1183
  "concept_sheet": str(views.get("concept_sheet", "")),
1184
  "mesh": str(model_data["mesh_path"]),
1185
- "texture": str(model_data["texture_path"]),
1186
  "rigged_model": str(rig_data["rigged_path"]),
1187
  "statistics": {
1188
- "vertices": model_data["statistics"]["vertices"],
1189
- "faces": model_data["statistics"]["faces"],
1190
- "bones": rig_data["bone_count"],
1191
- "texture_resolution": model_data["statistics"]["texture_size"]
 
1192
  }
1193
  }
1194
 
@@ -1224,7 +1267,7 @@ def create_production_config():
1224
  "num_views": 6,
1225
  "guidance_scale": 3.5,
1226
  "inference_steps": 28,
1227
- "sparc3d_resolution": 512,
1228
  "target_polycount": 30000,
1229
  "texture_resolution": 2048,
1230
  "output_dir": "./digipal_3d_output",
 
1
  """
2
  Production-Ready Open-Source Text-to-Rigged-3D Pipeline
3
+ Uses HuggingFace Spaces API for Flux, Hunyuan3D implementation, and UniRig models
4
  Rick Rubin philosophy: Strip complexity, amplify creativity
5
  """
6
 
 
33
  class ProductionConfig:
34
  """Production configuration for open-source pipeline"""
35
 
36
+ # Text-to-image model options
37
+ text_to_image_model: str = "omnigen2" # omnigen2 is primary, flux as fallback
38
+ omnigen2_repo: str = "Shitao/OmniGen-v1" # Updated to working repo (canonical casing)
39
  flux_space: str = "black-forest-labs/FLUX.1-dev" # Kept for fallback
 
40
 
41
+ # 3D Generation models
42
+ hunyuan3d_model: str = "tencent/Hunyuan3D-2.1" # Updated to latest version
43
+ hunyuan3d_space: str = "tencent/Hunyuan3D-2.1" # Official Gradio Space
44
  unirig_repo: str = "https://github.com/VAST-AI-Research/UniRig"
45
  unirig_hf_model: str = "VAST-AI/UniRig"
46
 
 
51
  inference_steps: int = 28
52
 
53
  # 3D settings
54
+ hunyuan3d_resolution: int = 1024
55
  target_polycount: int = 30000
56
  texture_resolution: int = 2048
57
 
 
71
  enable_cpu_offload: bool = True # For VRAM optimization
72
 
73
  class OmniGen2MultiViewGenerator:
74
+ """Generate multi-view images using OmniGen"""
75
 
76
  def __init__(self, config: ProductionConfig):
77
  self.config = config
78
  self.model = None
79
  self.tokenizer = None
80
+ logger.info(f"Initializing OmniGen from: {config.omnigen2_repo}")
81
 
82
  def _load_model(self):
83
+ """Lazy load the OmniGen model to save memory"""
84
  if self.model is None:
85
  try:
86
+ # Import OmniGen specific modules
87
+ from diffusers import DiffusionPipeline
88
+
89
+ # Load OmniGen model using diffusers pipeline
90
+ self.model = DiffusionPipeline.from_pretrained(
 
91
  self.config.omnigen2_repo,
92
  trust_remote_code=True,
93
  torch_dtype=torch.float16 if self.config.device == "cuda" else torch.float32,
94
  device_map="auto" if self.config.enable_cpu_offload else None
95
  )
96
+
97
  if not self.config.enable_cpu_offload:
98
  self.model = self.model.to(self.config.device)
99
+
100
+ # Enable memory efficient attention if available
101
+ if hasattr(self.model, 'enable_attention_slicing'):
102
+ self.model.enable_attention_slicing()
103
+ if hasattr(self.model, 'enable_vae_slicing'):
104
+ self.model.enable_vae_slicing()
105
+
106
+ logger.info("OmniGen model loaded successfully")
107
  except Exception as e:
108
+ logger.error(f"Failed to load OmniGen model: {str(e)}")
109
+ logger.info("Trying alternative loading method...")
110
+ try:
111
+ # Fallback: try loading as a generic model
112
+ self.model = AutoModel.from_pretrained(
113
+ self.config.omnigen2_repo,
114
+ trust_remote_code=True,
115
+ torch_dtype=torch.float16 if self.config.device == "cuda" else torch.float32,
116
+ device_map="auto" if self.config.enable_cpu_offload else None
117
+ )
118
+ if not self.config.enable_cpu_offload:
119
+ self.model = self.model.to(self.config.device)
120
+ logger.info("OmniGen model loaded via fallback method")
121
+ except Exception as fallback_e:
122
+ logger.error(f"Fallback loading also failed: {fallback_e}")
123
+ raise
124
 
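
The load path above can be exercised in isolation; a minimal sketch, assuming diffusers resolves the repo's custom pipeline through trust_remote_code and that the pipeline accepts the standard text-to-image kwargs:

    import torch
    from diffusers import DiffusionPipeline

    # assumption: the repo ships a custom diffusers pipeline loadable this way
    pipe = DiffusionPipeline.from_pretrained(
        "Shitao/OmniGen-v1",
        trust_remote_code=True,
        torch_dtype=torch.float16,
    )
    image = pipe(prompt="front view of a small dragon, white background",
                 guidance_scale=3.5, num_inference_steps=28).images[0]
    image.save("front_view.png")
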
125
  async def generate_creature_views(self, base_prompt: str,
126
  creature_name: str) -> Dict[str, Path]:
 
174
  f"consistent lighting, monster character design"
175
  )
176
 
177
+ # Generate image using OmniGen
 
 
178
  with torch.no_grad():
179
+ if hasattr(self.model, '__call__'):
180
+ # Standard diffusers pipeline call
181
+ result = self.model(
182
+ prompt=full_prompt,
183
+ width=self.config.image_resolution,
184
+ height=self.config.image_resolution,
185
+ guidance_scale=self.config.text_guidance_scale,
186
+ num_inference_steps=self.config.inference_steps,
187
+ generator=torch.Generator(device=self.config.device).manual_seed(42 + len(output_paths))
188
+ )
189
+
190
+ # Extract the image from the result
191
+ if hasattr(result, 'images'):
192
+ image = result.images[0]
193
+ elif isinstance(result, list):
194
+ image = result[0]
195
+ else:
196
+ image = result
197
+
198
+ elif hasattr(self.model, 'generate'):
199
+ # Alternative generation method for different model types
200
+ result = self.model.generate(
201
+ prompt=full_prompt,
202
+ image_size=(self.config.image_resolution, self.config.image_resolution),
203
+ guidance_scale=self.config.text_guidance_scale,
204
+ num_inference_steps=self.config.inference_steps
205
+ )
206
+
207
+ if isinstance(result, torch.Tensor):
208
+ # Convert tensor to PIL Image
209
+ image_array = result.cpu().numpy().squeeze()
210
+ if image_array.ndim == 3 and image_array.shape[0] == 3:
211
+ image_array = np.transpose(image_array, (1, 2, 0))
212
+ image_array = (image_array * 255).astype(np.uint8)
213
+ image = Image.fromarray(image_array)
214
+ else:
215
+ image = result
216
+ else:
217
+ raise ValueError("Unknown model interface - cannot generate image")
218
 
219
  # Save generated image
220
  output_path = self.config.output_dir / f"{creature_name}_{view_name}_view.png"
221
  output_path.parent.mkdir(parents=True, exist_ok=True)
222
 
223
+ # Ensure image is a PIL Image and save
224
+ if not isinstance(image, Image.Image):
225
+ if isinstance(image, np.ndarray):
226
+ image = Image.fromarray((image * 255).astype(np.uint8))
227
+ else:
228
+ logger.warning(f"Unexpected image type: {type(image)}")
229
+ continue
 
 
 
230
 
231
  image.save(output_path)
232
  output_paths[view_name] = output_path
 
446
 
447
  return output_path
448
 
449
+ class Hunyuan3DProcessor:
450
+ """Hunyuan3D-2.1 implementation using official Gradio Space API"""
451
 
452
  def __init__(self, config: ProductionConfig):
453
  self.config = config
454
+ self.client = None
455
+ logger.info(f"Initializing Hunyuan3D-2.1 from space: {config.hunyuan3d_space}")
 
 
 
 
456
 
457
+ def _initialize_client(self):
458
+ """Initialize Gradio client for Hunyuan3D Space"""
459
+ if self.client is None:
460
+ try:
461
+ from gradio_client import Client
462
+
463
+ # Connect to the official Hunyuan3D Space
464
+ self.client = Client(
465
+ src=self.config.hunyuan3d_space,
466
+ hf_token=self.config.hf_token
467
+ )
468
+ logger.info(f"Connected to Hunyuan3D-2.1 Space: {self.config.hunyuan3d_space}")
469
+
470
+ except Exception as e:
471
+ logger.error(f"Failed to connect to Hunyuan3D Space: {e}")
472
+ logger.info("Will try local fallback if available")
473
+ # Don't raise here, let the generation method handle fallback
474
+ self.client = None
475
 
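
Because the api_name used below is an assumption, gradio_client can enumerate what the Space actually exposes; a one-off check:

    from gradio_client import Client

    client = Client("tencent/Hunyuan3D-2.1")
    client.view_api()  # prints the callable endpoints and their parameters
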
476
  async def generate_3d_from_views(self, view_paths: Dict[str, Path],
477
  creature_name: str) -> Dict[str, Any]:
478
+ """Generate 3D model from image using Hunyuan3D-2.1 Space API"""
 
 
479
 
480
+ self._initialize_client()
 
481
 
482
+ logger.info(f"Generating 3D model from {len(view_paths)} views using Hunyuan3D-2.1")
483
  start_time = time.time()
484
 
485
  try:
486
+ # Use the front view as primary input for Hunyuan3D
487
+ primary_view = None
488
+ if "front" in view_paths:
489
+ primary_view = view_paths["front"]
490
+ elif view_paths:
491
+ primary_view = next(iter(view_paths.values()))
492
 
493
+ if not primary_view:
494
+ raise ValueError("No input images provided")
495
 
496
+ if not primary_view.exists():
497
+ raise ValueError(f"Input image not found: {primary_view}")
498
 
499
+ # Try using the official Hunyuan3D Space API
500
+ if self.client:
501
+ try:
502
+ logger.info("Using official Hunyuan3D-2.1 Space API...")
503
+
504
+ # Call the Hunyuan3D Space API
505
+ # Based on the official interface, it typically takes an image input
506
+ result = self.client.predict(
507
+ image=str(primary_view), # Input image path
508
+ api_name="/generate_3d" # API endpoint name (may vary)
509
+ )
510
+
511
+ # Handle the result - typically returns file paths or URLs
512
+ if isinstance(result, (list, tuple)) and len(result) > 0:
513
+ # Extract the 3D model file
514
+ model_result = result[0] if isinstance(result[0], str) else result
515
+
516
+ # Download or copy the result to our output directory
517
+ mesh_path = self.config.output_dir / f"{creature_name}_hunyuan3d.glb"
518
+ mesh_path.parent.mkdir(parents=True, exist_ok=True)
519
+
520
+ if isinstance(model_result, str) and os.path.exists(model_result):
521
+ # Copy from local path
522
+ shutil.copy(model_result, mesh_path)
523
+ elif isinstance(model_result, str) and model_result.startswith('http'):
524
+ # Download from URL
525
+ response = requests.get(model_result)
526
+ with open(mesh_path, 'wb') as f:
527
+ f.write(response.content)
528
+ else:
529
+ raise ValueError(f"Unexpected result format: {type(model_result)}")
530
+
531
+ generation_time = time.time() - start_time
532
+
533
+ # Get basic file statistics
534
+ file_size = mesh_path.stat().st_size if mesh_path.exists() else 0
535
+
536
+ return {
537
+ "success": True,
538
+ "mesh_path": mesh_path,
539
+ "texture_path": mesh_path, # Same file for GLB
540
+ "statistics": {
541
+ "file_size_mb": file_size / (1024 * 1024),
542
+ "generation_time": generation_time,
543
+ "model": "Hunyuan3D-2.1",
544
+ "input_views": len(view_paths),
545
+ "method": "official_space_api"
546
+ }
547
+ }
548
+
549
+ else:
550
+ raise ValueError("Invalid result from Hunyuan3D Space API")
551
+
552
+ except Exception as api_error:
553
+ logger.error(f"Hunyuan3D Space API failed: {api_error}")
554
+ logger.info("Falling back to alternative method...")
555
+ # Fall through to local fallback
556
 
557
+ # Fallback: Use local processing or placeholder
558
+ logger.info("Using local fallback for 3D generation...")
559
+ return await self._local_3d_fallback(primary_view, creature_name, start_time)
 
 
 
560
 
561
  except Exception as e:
562
+ logger.error(f"Hunyuan3D generation failed: {e}")
563
  return {
564
  "success": False,
565
  "error": str(e)
566
  }
567
 
568
+ async def _local_3d_fallback(self, image_path: Path, creature_name: str,
569
+ start_time: float) -> Dict[str, Any]:
570
+ """Fallback method for 3D generation when Space API is unavailable"""
 
 
 
 
571
 
572
+ logger.info("Generating placeholder 3D model...")
573
 
574
+ # Create a simple cube mesh as placeholder
575
+ import trimesh
 
 
576
 
577
+ # Generate a basic cube mesh
578
+ mesh = trimesh.creation.box(extents=[1.0, 1.0, 1.0])
 
 
 
579
 
580
+ # Apply basic coloring based on input image
581
+ try:
582
+ input_image = Image.open(image_path).convert("RGB")
583
+ # Get dominant color from image
584
+ avg_color = np.array(input_image).mean(axis=(0, 1)) / 255.0
585
+
586
+ # Apply color to mesh
587
+ if hasattr(mesh.visual, 'vertex_colors'):
588
+ mesh.visual.vertex_colors = (np.tile(
+ [*avg_color, 1.0], (len(mesh.vertices), 1)
+ ) * 255).astype(np.uint8)
591
+
592
+ except Exception as color_error:
593
+ logger.warning(f"Failed to apply coloring: {color_error}")
594
 
595
+ # Save the mesh
596
+ mesh_path = self.config.output_dir / f"{creature_name}_fallback_3d.glb"
597
+ mesh_path.parent.mkdir(parents=True, exist_ok=True)
598
+ mesh.export(str(mesh_path))
599
 
600
+ generation_time = time.time() - start_time
 
 
 
 
601
 
602
  return {
603
+ "success": True,
604
+ "mesh_path": mesh_path,
605
+ "texture_path": mesh_path,
606
+ "statistics": {
607
+ "vertices": len(mesh.vertices),
608
+ "faces": len(mesh.faces),
609
+ "generation_time": generation_time,
610
+ "model": "fallback_cube",
611
+ "input_views": 1,
612
+ "method": "local_fallback"
613
  }
614
  }
 
 
 
 
615
 
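
Either output (Space download or fallback cube) can be sanity-checked with trimesh before handing it to the rigging stage; a sketch with an illustrative filename:

    import trimesh

    loaded = trimesh.load("digipal_3d_output/MyPal_hunyuan3d.glb")
    # GLB files usually load as a Scene; flatten to a single mesh for stats
    mesh = loaded.dump(concatenate=True) if isinstance(loaded, trimesh.Scene) else loaded
    print(len(mesh.vertices), len(mesh.faces), mesh.is_watertight)
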
616
  class UniRigProcessor:
617
+ """UniRig integration using HuggingFace models and inference API"""
618
 
619
  def __init__(self, config: ProductionConfig):
620
  self.config = config
621
  self.model_path = None
622
+ self.client = None
623
+ logger.info(f"Initializing UniRig from HuggingFace: {config.unirig_hf_model}")
 
 
624
 
625
+ def _setup_unirig(self):
626
+ """Setup UniRig using HuggingFace models and API"""
627
 
628
  try:
629
+ # Try to use HuggingFace Inference API first
630
+ from gradio_client import Client
 
 
 
 
631
 
632
+ # Check if there's a UniRig Space available
633
+ try:
634
+ # This would be the ideal approach if there's a UniRig Space
635
+ self.client = Client(
636
+ src=self.config.unirig_hf_model, # or a specific space
637
+ hf_token=self.config.hf_token
638
+ )
639
+ logger.info("Connected to UniRig via HuggingFace Space/API")
640
+ return
641
+ except Exception:
642
+ logger.info("No UniRig Space found, trying direct model download...")
643
 
644
+ # Fallback: Download models from HuggingFace
645
+ try:
646
+ self.model_path = snapshot_download(
647
+ repo_id=self.config.unirig_hf_model,
648
+ cache_dir=self.config.cache_dir,
649
+ token=self.config.hf_token,
650
+ allow_patterns=["*.py", "*.yaml", "*.json", "*.bin", "*.safetensors"]
651
+ )
652
+ logger.info(f"UniRig models downloaded to: {self.model_path}")
653
+
654
+ except Exception as download_error:
655
+ logger.warning(f"Could not download UniRig from HF: {download_error}")
656
+ logger.info("UniRig will use procedural fallback method")
657
+ self.model_path = None
658
+
659
+ except Exception as e:
660
+ logger.error(f"Failed to setup UniRig: {e}")
661
+ logger.info("UniRig will use procedural fallback method")
662
+ self.model_path = None
663
 
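
A cheap pre-flight for the download branch above, using huggingface_hub's HfApi (assumes network access; repo id taken from ProductionConfig):

    from huggingface_hub import HfApi

    if HfApi().repo_exists("VAST-AI/UniRig"):
        print("UniRig weights reachable; snapshot_download path is viable")
    else:
        print("Repo unreachable; procedural fallback will be used")
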
664
  async def auto_rig_creature(self, mesh_path: Path, creature_name: str,
665
  creature_type: str = "biped") -> Dict[str, Any]:
666
+ """Apply automatic rigging using UniRig via HuggingFace"""
667
 
668
+ logger.info(f"Auto-rigging {creature_name} as {creature_type} using UniRig")
669
+
670
+ # Setup UniRig if not already done
671
+ if self.model_path is None and self.client is None:
672
+ self._setup_unirig()
673
 
674
  try:
675
+ # Try using HuggingFace Space/API first
676
+ if self.client:
677
+ return await self._rig_via_hf_api(mesh_path, creature_name, creature_type)
 
 
 
 
678
 
679
+ # Try using downloaded models
680
+ elif self.model_path:
681
+ return await self._rig_via_local_models(mesh_path, creature_name, creature_type)
 
 
 
682
 
683
+ # Fallback to procedural rigging
684
+ else:
685
+ logger.info("No UniRig models available, using procedural fallback")
686
+ return await self._procedural_rigging_fallback(mesh_path, creature_name, creature_type)
687
 
688
  except Exception as e:
689
  logger.error(f"UniRig failed: {e}")
690
  # Fallback to procedural rigging
691
  return await self._procedural_rigging_fallback(mesh_path, creature_name, creature_type)
692
 
693
+ async def _rig_via_hf_api(self, mesh_path: Path, creature_name: str,
694
+ creature_type: str) -> Dict[str, Any]:
695
+ """Rig using HuggingFace Space API"""
696
+
697
+ logger.info("Using UniRig HuggingFace Space API...")
698
+
699
+ try:
700
+ # Call the UniRig Space API
701
+ result = self.client.predict(
702
+ mesh_file=str(mesh_path),
703
+ creature_type=creature_type,
704
+ api_name="/auto_rig" # This would be the API endpoint
705
+ )
706
+
707
+ # Handle the result
708
+ if isinstance(result, (list, tuple)) and len(result) > 0:
709
+ rigged_file = result[0]
710
+
711
+ # Copy result to our output directory
712
+ output_dir = self.config.output_dir / "rigged"
713
+ output_dir.mkdir(exist_ok=True)
714
+ rigged_path = output_dir / f"{creature_name}_rigged.glb"
715
+
716
+ if isinstance(rigged_file, str) and os.path.exists(rigged_file):
717
+ shutil.copy(rigged_file, rigged_path)
718
+ elif isinstance(rigged_file, str) and rigged_file.startswith('http'):
719
+ # Download from URL
720
+ response = requests.get(rigged_file)
721
+ with open(rigged_path, 'wb') as f:
722
+ f.write(response.content)
723
+ else:
724
+ raise ValueError(f"Unexpected result format: {type(rigged_file)}")
725
+
726
+ return {
727
+ "success": True,
728
+ "rigged_path": rigged_path,
729
+ "bone_count": "unknown", # API doesn't provide this
730
+ "method": "hf_space_api"
731
+ }
732
+ else:
733
+ raise ValueError("Invalid result from UniRig Space API")
734
+
735
+ except Exception as api_error:
736
+ logger.error(f"UniRig Space API failed: {api_error}")
737
+ raise
738
+
739
+ async def _rig_via_local_models(self, mesh_path: Path, creature_name: str,
740
+ creature_type: str) -> Dict[str, Any]:
741
+ """Rig using locally downloaded UniRig models"""
742
+
743
+ logger.info("Using local UniRig models...")
744
+
745
+ try:
746
+ # This would require implementing the UniRig inference pipeline
747
+ # For now, fall back to procedural method
748
+ logger.info("Local UniRig inference not yet implemented, using procedural fallback")
749
+ return await self._procedural_rigging_fallback(mesh_path, creature_name, creature_type)
750
+
751
+ except Exception as e:
752
+ logger.error(f"Local UniRig inference failed: {e}")
753
+ raise
754
+
755
  async def _run_unirig_pipeline(self, mesh_path: Path,
756
  skeleton_config: str,
757
  creature_name: str) -> Dict[str, Any]:
 
1041
  "method": "procedural_fallback"
1042
  }
1043
 
1044
+ class SimplifiedHunyuan3D:
1045
+ """Simplified 3D generation when Hunyuan3D is not available"""
1046
 
1047
  def __init__(self, config: ProductionConfig):
1048
  self.config = config
 
1118
  class ProductionPipeline:
1119
  """
1120
  Complete production-ready open-source pipeline
1121
+ OmniGen2/Flux -> Hunyuan3D -> UniRig
1122
  """
1123
 
1124
  def __init__(self, config: Optional[ProductionConfig] = None):
 
1143
  else:
1144
  self.image_generator = FluxMultiViewGenerator(self.config)
1145
 
1146
+ self.hunyuan3d = Hunyuan3DProcessor(self.config)
1147
  self.unirig = UniRigProcessor(self.config)
1148
 
1149
  logger.info("Production pipeline ready!")
 
1183
  "outputs": {k: str(v) for k, v in views.items()}
1184
  }
1185
 
1186
+ # Stage 2: 3D generation with Hunyuan3D
1187
  logger.info("=" * 50)
1188
+ logger.info("Stage 2: Generating 3D model with Hunyuan3D")
1189
  stage_start = time.time()
1190
 
1191
+ model_data = await self.hunyuan3d.generate_3d_from_views(views, name)
1192
 
1193
+ results["pipeline_stages"]["hunyuan3d"] = {
1194
  "success": model_data["success"],
1195
  "duration": time.time() - stage_start,
1196
  "outputs": model_data
 
1224
  results["final_outputs"] = {
1225
  "concept_sheet": str(views.get("concept_sheet", "")),
1226
  "mesh": str(model_data["mesh_path"]),
1227
+ "texture": str(model_data.get("texture_path", model_data["mesh_path"])),
1228
  "rigged_model": str(rig_data["rigged_path"]),
1229
  "statistics": {
1230
+ "vertices": model_data.get("statistics", {}).get("vertices", "unknown"),
1231
+ "faces": model_data.get("statistics", {}).get("faces", "unknown"),
1232
+ "bones": rig_data.get("bone_count", "unknown"),
1233
+ "generation_time": model_data.get("statistics", {}).get("generation_time", 0),
1234
+ "model_method": model_data.get("statistics", {}).get("method", "unknown")
1235
  }
1236
  }
1237
 
 
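
An end-to-end smoke test of the reworked pipeline; the entry-point name is not visible in these hunks, so generate_creature below is a hypothetical stand-in:

    import asyncio
    from src.pipelines.opensource_3d_pipeline_v2 import ProductionPipeline

    async def main():
        pipeline = ProductionPipeline()
        # hypothetical entry point; substitute the pipeline's real method
        results = await pipeline.generate_creature("small friendly dragon", name="Flamey")
        print(results["final_outputs"]["rigged_model"])

    asyncio.run(main())
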
1267
  "num_views": 6,
1268
  "guidance_scale": 3.5,
1269
  "inference_steps": 28,
1270
+ "hunyuan3d_resolution": 512,
1271
  "target_polycount": 30000,
1272
  "texture_resolution": 2048,
1273
  "output_dir": "./digipal_3d_output",
src/ui/gradio_interface.py DELETED
@@ -1,1263 +0,0 @@
1
- import gradio as gr
2
- import asyncio
3
- import logging
4
- import json
5
- import time
6
- from typing import Dict, List, Optional, Any, Tuple
7
- from datetime import datetime, timedelta
8
- import numpy as np
9
-
10
- # Core imports
11
- from ..core.monster_engine import Monster, MonsterPersonalityType, EmotionalState
12
- from ..ai.qwen_processor import QwenProcessor, ModelConfig
13
- from ..ai.speech_engine import AdvancedSpeechEngine, SpeechConfig
14
- from ..utils.performance_tracker import PerformanceTracker
15
- from .state_manager import AdvancedStateManager
16
- from ..deployment.zero_gpu_optimizer import ZeroGPUOptimizer
17
- import spaces
18
-
19
- def create_interface():
20
- """Create and return the Gradio interface"""
21
- interface = ModernDigiPalInterface()
22
- return interface.create_interface()
23
-
24
- class StreamingComponents:
25
- """Helper class for streaming components"""
26
- def __init__(self):
27
- self.logger = logging.getLogger(__name__)
28
-
29
- class ModernDigiPalInterface:
30
- def __init__(self):
31
- self.logger = logging.getLogger(__name__)
32
-
33
- # Initialize core systems
34
- self.state_manager = AdvancedStateManager()
35
- self.streaming = StreamingComponents()
36
- self.gpu_optimizer = ZeroGPUOptimizer()
37
-
38
- # AI Systems (will be initialized based on available resources)
39
- self.qwen_processor = None
40
- self.speech_engine = None
41
-
42
- # Performance tracking
43
- self.performance_metrics = {
44
- "total_interactions": 0,
45
- "average_response_time": 0.0,
46
- "user_satisfaction": 0.0
47
- }
48
-
49
- # UI State
50
- self.current_monster = None
51
- self.ui_theme = "soft"
52
-
53
- async def initialize(self):
54
- """Initialize the interface with optimized configurations"""
55
- try:
56
- # Detect available resources
57
- resources = await self.gpu_optimizer.detect_available_resources()
58
-
59
- # Initialize AI processors based on resources
60
- await self._initialize_ai_systems(resources)
61
-
62
- # Initialize state management
63
- await self.state_manager.initialize()
64
-
65
- self.logger.info("DigiPal interface initialized successfully")
66
-
67
- except Exception as e:
68
- self.logger.error(f"Failed to initialize interface: {e}")
69
- raise
70
-
71
- async def _initialize_ai_systems(self, resources: Dict[str, Any]):
72
- """Initialize AI systems based on available resources"""
73
-
74
- # Initialize Qwen processor with fallback handling
75
- try:
76
- # Configure Qwen processor
77
- if resources["gpu_memory_gb"] >= 8:
78
- model_config = ModelConfig(
79
- model_name="Qwen/Qwen2.5-3B-Instruct",
80
- max_memory_gb=resources["gpu_memory_gb"],
81
- inference_speed="quality"
82
- )
83
- elif resources["gpu_memory_gb"] >= 4:
84
- model_config = ModelConfig(
85
- model_name="Qwen/Qwen2.5-1.5B-Instruct",
86
- max_memory_gb=resources["gpu_memory_gb"],
87
- inference_speed="balanced"
88
- )
89
- else:
90
- model_config = ModelConfig(
91
- model_name="Qwen/Qwen2.5-0.5B-Instruct",
92
- max_memory_gb=resources["gpu_memory_gb"],
93
- inference_speed="fast"
94
- )
95
-
96
- self.qwen_processor = QwenProcessor(model_config)
97
- await self.qwen_processor.initialize()
98
- self.logger.info("Qwen processor initialized successfully")
99
-
100
- except Exception as e:
101
- self.logger.error(f"Failed to initialize Qwen processor: {e}")
102
- self.logger.info("Continuing with fallback responses")
103
- self.qwen_processor = None
104
-
105
- # Initialize speech engine with fallback handling
106
- try:
107
- speech_config = SpeechConfig()
108
- if resources["gpu_memory_gb"] >= 6:
109
- speech_config.model_size = "medium"
110
- speech_config.device = "cuda"
111
- elif resources["gpu_memory_gb"] >= 3:
112
- speech_config.model_size = "small"
113
- speech_config.device = "cuda"
114
- else:
115
- speech_config.model_size = "base"
116
- speech_config.device = "cpu"
117
-
118
- self.speech_engine = AdvancedSpeechEngine(speech_config)
119
- await self.speech_engine.initialize()
120
- self.logger.info("Speech engine initialized successfully")
121
-
122
- except Exception as e:
123
- self.logger.error(f"Failed to initialize speech engine: {e}")
124
- self.logger.info("Continuing without speech capabilities")
125
- self.speech_engine = None
126
-
127
- def create_interface(self) -> gr.Blocks:
128
- """Create the main Gradio interface"""
129
-
130
- # Custom CSS for modern monster game UI
131
- custom_css = """
132
- /* Modern Dark Theme */
133
- .gradio-container {
134
- background: linear-gradient(135deg, #1a1a2e 0%, #16213e 50%, #0f3460 100%);
135
- font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
136
- }
137
-
138
- /* Monster Display */
139
- .monster-display {
140
- background: linear-gradient(145deg, #2a2a4e, #1e1e3c);
141
- border: 3px solid #4a9eff;
142
- border-radius: 20px;
143
- padding: 20px;
144
- text-align: center;
145
- box-shadow: 0 10px 30px rgba(74, 158, 255, 0.3);
146
- backdrop-filter: blur(10px);
147
- min-height: 400px;
148
- }
149
-
150
- /* Stat Bars */
151
- .stat-bar {
152
- background: #1e1e3c;
153
- border-radius: 15px;
154
- overflow: hidden;
155
- margin: 8px 0;
156
- height: 25px;
157
- border: 2px solid #333;
158
- }
159
-
160
- .stat-fill {
161
- height: 100%;
162
- border-radius: 12px;
163
- transition: width 0.8s ease-in-out;
164
- background: linear-gradient(90deg, #ff6b6b, #4ecdc4, #45b7d1);
165
- }
166
-
167
- /* Care Action Buttons */
168
- .care-button {
169
- background: linear-gradient(145deg, #4a9eff, #357abd);
170
- border: none;
171
- color: white;
172
- padding: 12px 24px;
173
- border-radius: 12px;
174
- font-weight: bold;
175
- transition: all 0.3s ease;
176
- box-shadow: 0 4px 15px rgba(74, 158, 255, 0.4);
177
- }
178
-
179
- .care-button:hover {
180
- transform: translateY(-3px);
181
- box-shadow: 0 8px 25px rgba(74, 158, 255, 0.6);
182
- background: linear-gradient(145deg, #5aa7ff, #4a9eff);
183
- }
184
-
185
- /* Conversation Area */
186
- .conversation-container {
187
- background: rgba(30, 30, 60, 0.8);
188
- border: 2px solid #4a9eff;
189
- border-radius: 15px;
190
- backdrop-filter: blur(10px);
191
- }
192
-
193
- /* Mini-game Container */
194
- .mini-game-area {
195
- background: linear-gradient(145deg, #2d1b69, #1a1a2e);
196
- border: 2px solid #8b5cf6;
197
- border-radius: 15px;
198
- padding: 20px;
199
- margin: 10px 0;
200
- }
201
-
202
- /* Status Indicators */
203
- .status-indicator {
204
- display: inline-block;
205
- width: 12px;
206
- height: 12px;
207
- border-radius: 50%;
208
- margin-right: 8px;
209
- }
210
-
211
- .status-healthy { background: #4ade80; }
212
- .status-warning { background: #fbbf24; }
213
- .status-critical { background: #ef4444; }
214
-
215
- /* Responsive Design */
216
- @media (max-width: 768px) {
217
- .monster-display {
218
- padding: 15px;
219
- margin: 10px;
220
- }
221
-
222
- .care-button {
223
- padding: 10px 20px;
224
- margin: 5px;
225
- }
226
- }
227
- """
228
-
229
- with gr.Blocks(
230
- css=custom_css,
231
- title="DigiPal - Advanced Monster Companion",
232
- theme=gr.themes.Soft()
233
- ) as interface:
234
-
235
- # Header
236
- gr.HTML("""
237
- <div style="text-align: center; padding: 20px;">
238
- <h1 style="color: #4a9eff; font-size: 2.5em; margin: 0;">🐾 DigiPal</h1>
239
- <p style="color: #8b5cf6; font-size: 1.2em;">Advanced AI Monster Companion</p>
240
- </div>
241
- """)
242
-
243
- # State Management - Modern Gradio 5.34.2 patterns
244
- with gr.Row():
- # Session State for current monster
- current_monster_state = gr.State(None)
-
- # Conversation State
- conversation_state = gr.State([])
-
- # UI State
- ui_state = gr.State({
- "last_action": None,
- "current_tab": "care",
- "mini_game_active": False
- })
-
- # Main Interface Layout
- with gr.Row(equal_height=True):
-
- # Left Column - Monster Display and Stats
- with gr.Column(scale=3):
-
- # Monster Display Area
- monster_display = gr.HTML(
- value=self._get_default_monster_display(),
- elem_classes="monster-display"
- )
-
- # Monster Management Controls
- with gr.Row():
- create_monster_btn = gr.Button(
- "🥚 Create New Monster",
- variant="primary",
- elem_classes="care-button"
- )
- load_monster_btn = gr.Button(
- "📂 Load Monster",
- elem_classes="care-button"
- )
- save_progress_btn = gr.Button(
- "💾 Save Progress",
- elem_classes="care-button"
- )
-
- # New Monster Creation
- with gr.Group(visible=False) as monster_creation_group:
- monster_name_input = gr.Textbox(
- label="Monster Name",
- placeholder="Enter your monster's name...",
- max_lines=1
- )
-
- personality_type = gr.Dropdown(
- choices=[p.value for p in MonsterPersonalityType],
- label="Personality Type",
- value="playful"
- )
-
- confirm_creation_btn = gr.Button(
- "✨ Create Monster",
- variant="primary"
- )
-
- # Middle Column - Care Actions and Training
- with gr.Column(scale=2):
-
- with gr.Tabs() as care_tabs:
-
- # Care Tab
- with gr.TabItem("🍼 Care", id=0):
-
- # Feeding Section
- with gr.Group():
- gr.Markdown("### 🍽️ Feeding")
-
- food_type = gr.Dropdown(
- choices=[
- "meat", "fish", "fruit", "vegetables",
- "medicine", "supplement", "treat"
- ],
- value="meat",
- label="Food Type"
- )
-
- feed_btn = gr.Button(
- "🍖 Feed Monster",
- elem_classes="care-button"
- )
-
- # Training Section
- with gr.Group():
- gr.Markdown("### 💪 Training")
-
- training_type = gr.Dropdown(
- choices=[
- "strength", "endurance", "intelligence",
- "dexterity", "spirit", "technique"
- ],
- value="strength",
- label="Training Focus"
- )
-
- training_intensity = gr.Slider(
- minimum=1,
- maximum=5,
- value=3,
- step=1,
- label="Training Intensity"
- )
-
- train_btn = gr.Button(
- "🏋️ Start Training",
- elem_classes="care-button"
- )
-
- # Care Actions
- with gr.Group():
- gr.Markdown("### 🧼 Care Actions")
-
- with gr.Row():
- clean_btn = gr.Button("🚿 Clean", elem_classes="care-button")
- play_btn = gr.Button("🎮 Play", elem_classes="care-button")
- rest_btn = gr.Button("😴 Rest", elem_classes="care-button")
- discipline_btn = gr.Button("📚 Discipline", elem_classes="care-button")
-
- # Evolution Tab
- with gr.TabItem("🦋 Evolution", id=1):
-
- evolution_status = gr.HTML(
- value="<p>No monster loaded</p>"
- )
-
- evolution_requirements = gr.JSON(
- label="Evolution Requirements",
- value={}
- )
-
- trigger_evolution_btn = gr.Button(
- "🌟 Trigger Evolution",
- variant="primary",
- interactive=False
- )
-
- # Breeding Tab
- with gr.TabItem("💕 Breeding", id=2):
-
- gr.Markdown("### Find a Breeding Partner")
-
- partner_search = gr.Dropdown(
- choices=[],
- label="Available Partners",
- interactive=False
- )
-
- breeding_compatibility = gr.HTML(
- value="<p>Select a partner to see compatibility</p>"
- )
-
- start_breeding_btn = gr.Button(
- "💖 Start Breeding",
- variant="primary",
- interactive=False
- )
-
- # Right Column - Conversation and Mini-games
- with gr.Column(scale=3):
-
- with gr.Tabs():
-
- # Conversation Tab
- with gr.TabItem("💬 Talk", id=0):
-
- # Conversation Display
- chatbot = gr.Chatbot(
- value=[],
- height=350,
- label="Conversation with your Monster",
- elem_classes="conversation-container",
- avatar_images=("👤", "🐾"),
- type="messages"
- )
-
- # Text Input
- with gr.Row():
- text_input = gr.Textbox(
- label="Message",
- placeholder="Talk to your monster...",
- scale=4,
- max_lines=3
- )
- send_btn = gr.Button("💬", scale=1)
-
- # Voice Input
- with gr.Group():
- gr.Markdown("### 🎤 Voice Chat")
-
- with gr.Row():
- audio_input = gr.Audio(
- sources=["microphone"],
- type="numpy",
- label="Voice Input",
- streaming=False
- )
-
- voice_btn = gr.Button("🗣️ Send Voice")
-
- # Real-time audio streaming (Gradio 5.34.2 feature)
- with gr.Row():
- start_stream_btn = gr.Button("🎙️ Start Live Chat")
- stop_stream_btn = gr.Button("⏹️ Stop", interactive=False)
-
- # Mini-games Tab
- with gr.TabItem("🎯 Games", id=1):
-
- mini_game_display = gr.HTML(
- value=self._get_mini_game_display(),
- elem_classes="mini-game-area"
- )
-
- with gr.Row():
- reaction_game_btn = gr.Button("⚡ Reaction Training")
- memory_game_btn = gr.Button("🧠 Memory Challenge")
- rhythm_game_btn = gr.Button("🎵 Rhythm Game")
- puzzle_game_btn = gr.Button("🧩 Logic Puzzle")
-
- game_score_display = gr.JSON(
- label="Game Statistics",
- value={}
- )
-
- # Stats Tab
- with gr.TabItem("📊 Statistics", id=2):
-
- detailed_stats = gr.JSON(
- label="Detailed Monster Statistics",
- value={}
- )
-
- performance_charts = gr.Plot(
- label="Performance Over Time"
- )
-
- achievement_display = gr.HTML(
- value="<p>No achievements yet</p>"
- )
-
- # Global Status Bar
- with gr.Row():
- status_display = gr.HTML(
- value="<p>Ready to start your monster care journey!</p>",
- elem_id="status-bar"
- )
-
- auto_save_indicator = gr.HTML(
- value="<span style='color: green;'>● Auto-save: ON</span>",
- elem_id="auto-save-status"
- )
-
- # Hidden components for data flow
- action_result = gr.Textbox(visible=False)
- background_timer = gr.Timer(value=30, active=True) # 30-second updates
-
- # Event Handlers with Modern Async Patterns
-
- # Monster Creation Flow
- create_monster_btn.click(
- fn=lambda: gr.update(visible=True),
- outputs=monster_creation_group
- )
-
- confirm_creation_btn.click(
- fn=lambda name, personality: safe_create_monster(self, name, personality),
- inputs=[monster_name_input, personality_type],
- outputs=[current_monster_state, monster_display, monster_creation_group]
- )
-
- # Feeding handlers
- feed_btn.click(
- fn=lambda monster_state, food: asyncio.run(self.feed_monster(monster_state, food)),
- inputs=[current_monster_state, food_type],
- outputs=[current_monster_state, monster_display, action_result, chatbot]
- )
-
- # Training handlers
- train_btn.click(
- fn=lambda monster_state, training, intensity: asyncio.run(self.train_monster(monster_state, training, intensity)),
- inputs=[current_monster_state, training_type, training_intensity],
- outputs=[current_monster_state, monster_display, action_result]
- )
-
- # Conversation handlers
- send_btn.click(
- fn=lambda monster_state, message, history: safe_handle_conversation(self, monster_state, message, history),
- inputs=[current_monster_state, text_input, conversation_state],
- outputs=[chatbot, text_input, conversation_state, current_monster_state]
- )
-
- text_input.submit(
- fn=lambda monster_state, message, history: safe_handle_conversation(self, monster_state, message, history),
- inputs=[current_monster_state, text_input, conversation_state],
- outputs=[chatbot, text_input, conversation_state, current_monster_state]
- )
-
- # Voice handlers
- voice_btn.click(
- fn=lambda monster_state, audio, history: safe_handle_voice(self, monster_state, audio, history),
- inputs=[current_monster_state, audio_input, conversation_state],
- outputs=[chatbot, conversation_state, current_monster_state, action_result]
- )
-
- # Streaming handlers
- start_stream_btn.click(
- fn=self.start_voice_streaming,
- outputs=[start_stream_btn, stop_stream_btn]
- )
-
- # Care Actions
- feed_btn.click(
- fn=self.feed_monster,
- inputs=[current_monster_state, food_type],
- outputs=[current_monster_state, monster_display, action_result, chatbot]
- )
-
- train_btn.click(
- fn=self.train_monster,
- inputs=[current_monster_state, training_type, training_intensity],
- outputs=[current_monster_state, monster_display, action_result]
- )
-
- # Conversation Handlers
- send_btn.click(
- fn=lambda monster_state, message, history: safe_handle_conversation(self, monster_state, message, history),
- inputs=[current_monster_state, text_input, conversation_state],
- outputs=[chatbot, text_input, conversation_state, current_monster_state]
- )
-
- text_input.submit(
- fn=lambda monster_state, message, history: safe_handle_conversation(self, monster_state, message, history),
- inputs=[current_monster_state, text_input, conversation_state],
- outputs=[chatbot, text_input, conversation_state, current_monster_state]
- )
-
- voice_btn.click(
- fn=lambda monster_state, audio, history: safe_handle_voice(self, monster_state, audio, history),
- inputs=[current_monster_state, audio_input, conversation_state],
- outputs=[chatbot, conversation_state, current_monster_state, action_result]
- )
-
- # Real-time streaming (Gradio 5.34.2)
- start_stream_btn.click(
- fn=self.start_voice_streaming,
- outputs=[start_stream_btn, stop_stream_btn]
- )
-
- stop_stream_btn.click(
- fn=self.stop_voice_streaming,
- outputs=[start_stream_btn, stop_stream_btn]
- )
-
- # Background Updates
- background_timer.tick(
- fn=self.background_update,
- inputs=[current_monster_state],
- outputs=[current_monster_state, monster_display, auto_save_indicator]
- )
-
- # Care action handlers
- def clean_action(monster_state):
- return asyncio.run(self.perform_care_action(monster_state, "clean"))
-
- def play_action(monster_state):
- return asyncio.run(self.perform_care_action(monster_state, "play"))
-
- def rest_action(monster_state):
- return asyncio.run(self.perform_care_action(monster_state, "rest"))
-
- def discipline_action(monster_state):
- return asyncio.run(self.perform_care_action(monster_state, "discipline"))
-
- clean_btn.click(
- fn=clean_action,
- inputs=[current_monster_state],
- outputs=[current_monster_state, monster_display, action_result]
- )
-
- play_btn.click(
- fn=play_action,
- inputs=[current_monster_state],
- outputs=[current_monster_state, monster_display, action_result]
- )
-
- rest_btn.click(
- fn=rest_action,
- inputs=[current_monster_state],
- outputs=[current_monster_state, monster_display, action_result]
- )
-
- discipline_btn.click(
- fn=discipline_action,
- inputs=[current_monster_state],
- outputs=[current_monster_state, monster_display, action_result]
- )
-
- # Mini-game handlers
- for btn, game in [(reaction_game_btn, "reaction"), (memory_game_btn, "memory"),
- (rhythm_game_btn, "rhythm"), (puzzle_game_btn, "puzzle")]:
- btn.click(
- fn=lambda monster_state, game=game: self.start_mini_game(monster_state, game),
- inputs=[current_monster_state],
- outputs=[mini_game_display, game_score_display]
- )
-
- return interface
-
- # Implementation methods continue...
-
- async def create_new_monster(self, name: str, personality: str) -> Tuple:
- """Create a new monster with specified parameters"""
- try:
- if not name.strip():
- return None, self._get_default_monster_display(), gr.update(visible=True)
-
- # Create monster with personality
- monster = Monster(
- name=name.strip(),
- species="Botamon" # Starting species
- )
-
- # Set personality
- monster.personality.primary_type = MonsterPersonalityType(personality)
-
- # Randomize personality traits based on type
- trait_modifiers = {
- "playful": {"extraversion": 0.8, "openness": 0.7, "agreeableness": 0.6},
- "serious": {"conscientiousness": 0.8, "neuroticism": 0.3, "extraversion": 0.4},
- "curious": {"openness": 0.9, "extraversion": 0.6, "conscientiousness": 0.5},
- "gentle": {"agreeableness": 0.9, "neuroticism": 0.2, "extraversion": 0.5},
- "energetic": {"extraversion": 0.9, "openness": 0.6, "neuroticism": 0.3},
- "calm": {"neuroticism": 0.1, "conscientiousness": 0.7, "agreeableness": 0.7},
- "mischievous": {"openness": 0.8, "extraversion": 0.7, "conscientiousness": 0.3},
- "loyal": {"agreeableness": 0.8, "conscientiousness": 0.9, "neuroticism": 0.2}
- }
-
- modifiers = trait_modifiers.get(personality, {})
- for trait, value in modifiers.items():
- if hasattr(monster.personality, trait):
- setattr(monster.personality, trait, value)
-
- # Save monster
- await self.state_manager.save_monster(monster)
-
- # Generate display
- display_html = self._generate_monster_display(monster)
-
- self.current_monster = monster
-
- return (
- monster.dict(),
- display_html,
- gr.update(visible=False)
- )
-
- except Exception as e:
- self.logger.error(f"Monster creation failed: {e}")
- return None, self._get_error_display(str(e)), gr.update(visible=True)
-
- def _get_default_monster_display(self) -> str:
- """Get default monster display when no monster is loaded"""
- return """
- <div style="text-align: center; padding: 40px;">
- <div style="font-size: 4em; margin-bottom: 20px;">🥚</div>
- <h2 style="color: #4a9eff;">No Monster Loaded</h2>
- <p style="color: #8b5cf6;">Create a new monster to begin your journey!</p>
- </div>
- """
-
- def _generate_monster_display(self, monster: Monster) -> str:
- """Generate HTML display for the monster"""
- # Monster sprite based on species and stage
- sprite_map = {
- "Botamon": {"egg": "🥚", "baby": "🐣", "child": "🐾", "adult": "🐲"},
- # Add more species...
- }
-
- sprite = sprite_map.get(monster.species, {}).get(monster.lifecycle.stage.value, "🐾")
-
- # Emotional state emoji
- emotion_emojis = {
- "ecstatic": "🤩", "happy": "😊", "content": "😌", "neutral": "😐",
- "melancholy": "😔", "sad": "😢", "angry": "😠", "sick": "🤒",
- "excited": "😆", "tired": "😴"
- }
-
- emotion_emoji = emotion_emojis.get(monster.emotional_state.value, "😐")
-
- # Calculate stat colors
- def get_stat_color(value: int) -> str:
- if value >= 80: return "#4ade80" # Green
- elif value >= 60: return "#fbbf24" # Yellow
- elif value >= 40: return "#fb923c" # Orange
- else: return "#ef4444" # Red
-
- # Age display
- age_days = monster.lifecycle.age_minutes / 1440
- age_display = f"{age_days:.1f} days"
-
- return f"""
- <div style="text-align: center; padding: 20px;">
-
- <!-- Monster Sprite -->
- <div style="font-size: 6em; margin: 20px 0;">{sprite}</div>
-
- <!-- Monster Info -->
- <h2 style="color: #4a9eff; margin: 10px 0;">{monster.name} {emotion_emoji}</h2>
- <p style="color: #8b5cf6; margin: 5px 0;">
- <strong>{monster.species}</strong> | {monster.lifecycle.stage.value.title()} | {age_display}
- </p>
-
- <!-- Mood and Activity -->
- <p style="color: #a78bfa; margin: 10px 0;">
- Feeling {monster.emotional_state.value} while {monster.current_activity}
- </p>
-
- <!-- Care Stats -->
- <div style="margin: 20px 0;">
- <h3 style="color: #4a9eff;">Care Status</h3>
-
- <div style="text-align: left; max-width: 300px; margin: 0 auto;">
- <div style="margin: 8px 0;">
- <span style="color: white;">Health</span>
- <div class="stat-bar">
- <div class="stat-fill" style="width: {monster.stats.health}%; background: {get_stat_color(monster.stats.health)};"></div>
- </div>
- <span style="color: #888; font-size: 0.9em;">{monster.stats.health}/100</span>
- </div>
-
- <div style="margin: 8px 0;">
- <span style="color: white;">Happiness</span>
- <div class="stat-bar">
- <div class="stat-fill" style="width: {monster.stats.happiness}%; background: {get_stat_color(monster.stats.happiness)};"></div>
- </div>
- <span style="color: #888; font-size: 0.9em;">{monster.stats.happiness}/100</span>
- </div>
-
- <div style="margin: 8px 0;">
- <span style="color: white;">Hunger</span>
- <div class="stat-bar">
- <div class="stat-fill" style="width: {monster.stats.hunger}%; background: {get_stat_color(monster.stats.hunger)};"></div>
- </div>
- <span style="color: #888; font-size: 0.9em;">{monster.stats.hunger}/100</span>
- </div>
-
- <div style="margin: 8px 0;">
- <span style="color: white;">Energy</span>
- <div class="stat-bar">
- <div class="stat-fill" style="width: {monster.stats.energy}%; background: {get_stat_color(monster.stats.energy)};"></div>
- </div>
- <span style="color: #888; font-size: 0.9em;">{monster.stats.energy}/100</span>
- </div>
- </div>
- </div>
-
- <!-- Battle Stats -->
- <div style="margin: 20px 0;">
- <h3 style="color: #8b5cf6;">Battle Power</h3>
- <div style="display: grid; grid-template-columns: 1fr 1fr; gap: 10px; max-width: 300px; margin: 0 auto;">
- <div>Life: <strong style="color: #4ade80;">{monster.stats.life}</strong></div>
- <div>MP: <strong style="color: #60a5fa;">{monster.stats.mp}</strong></div>
- <div>Offense: <strong style="color: #f87171;">{monster.stats.offense}</strong></div>
- <div>Defense: <strong style="color: #34d399;">{monster.stats.defense}</strong></div>
- <div>Speed: <strong style="color: #fbbf24;">{monster.stats.speed}</strong></div>
- <div>Brains: <strong style="color: #a78bfa;">{monster.stats.brains}</strong></div>
- </div>
- </div>
-
- <!-- Generation and Care Info -->
- <div style="margin: 15px 0; font-size: 0.9em; color: #888;">
- Generation {monster.lifecycle.generation} |
- Care Mistakes: {monster.lifecycle.care_mistakes} |
- Relationship: {monster.personality.relationship_level}/100
- </div>
-
- </div>
- """
-
- def _get_mini_game_display(self) -> str:
- """Get mini-game display HTML"""
- return """
- <div style="text-align: center; padding: 20px;">
- <h3 style="color: #8b5cf6;">Mini-Games Training Center</h3>
- <p style="color: #a78bfa;">Select a mini-game to train your monster!</p>
- <div style="margin-top: 20px;">
- <p>⚡ Reaction: Improve Speed & Reflexes</p>
- <p>🧠 Memory: Enhance Intelligence</p>
- <p>🎵 Rhythm: Boost Spirit & Happiness</p>
- <p>🧩 Logic: Develop Problem-Solving</p>
- </div>
- </div>
- """
-
- def _get_error_display(self, error: str) -> str:
- """Get error display HTML"""
- return f"""
- <div style="text-align: center; padding: 40px;">
- <div style="font-size: 3em; margin-bottom: 20px;">❌</div>
- <h2 style="color: #ef4444;">Error Occurred</h2>
- <p style="color: #f87171;">{error}</p>
- </div>
- """
-
- async def feed_monster(self, monster_state: Dict, food_type: str) -> Tuple:
- """Feed the monster"""
- if not monster_state:
- return monster_state, self._get_default_monster_display(), "No monster loaded!", []
-
- try:
- monster = Monster(**monster_state)
-
- # Food effects
- food_effects = {
- "meat": {"hunger": 30, "happiness": 10},
- "fish": {"hunger": 25, "happiness": 15, "health": 5},
- "fruit": {"hunger": 20, "happiness": 20},
- "vegetables": {"hunger": 25, "happiness": 5, "health": 10},
- "medicine": {"health": 50, "happiness": -10},
- "supplement": {"energy": 20, "happiness": 5},
- "treat": {"happiness": 30, "hunger": 10}
- }
-
- effects = food_effects.get(food_type, food_effects["meat"])
-
- # Apply effects
- for stat, value in effects.items():
- current = getattr(monster.stats, stat)
- setattr(monster.stats, stat, max(0, min(100, current + value)))
-
- # Update emotional state
- monster.emotional_state = monster.calculate_emotional_state()
-
- # Save monster
- await self.state_manager.save_monster(monster)
-
- # Generate response
- response = f"{monster.name} enjoyed the {food_type}! 😋"
-
- # Create message format for chatbot
- messages = [
- {"role": "user", "content": f"Fed {food_type}"},
- {"role": "assistant", "content": response}
- ]
-
- return (
- monster.dict(),
- self._generate_monster_display(monster),
- response,
- messages
- )
-
- except Exception as e:
- self.logger.error(f"Feeding failed: {e}")
- return monster_state, self._get_error_display(str(e)), str(e), []
-
- async def train_monster(self, monster_state: Dict, training_type: str, intensity: int) -> Tuple:
- """Train the monster"""
- if not monster_state:
- return monster_state, self._get_default_monster_display(), "No monster loaded!"
-
- try:
- monster = Monster(**monster_state)
-
- # Check if monster can train
- if monster.stats.energy < 20:
- return monster_state, self._generate_monster_display(monster), f"{monster.name} is too tired to train! 😴"
-
- # Training effects
- training_effects = {
- "strength": {"offense": 5 * intensity, "life": 20 * intensity},
- "endurance": {"defense": 5 * intensity, "life": 30 * intensity},
- "intelligence": {"brains": 8 * intensity, "mp": 10 * intensity},
- "dexterity": {"speed": 6 * intensity},
- "spirit": {"mp": 15 * intensity, "happiness": 5},
- "technique": {"offense": 3 * intensity, "defense": 3 * intensity}
- }
-
- effects = training_effects.get(training_type, {})
-
- # Apply stat increases
- for stat, increase in effects.items():
- if hasattr(monster.stats, stat):
- current = getattr(monster.stats, stat)
- setattr(monster.stats, stat, current + increase)
-
- # Update training progress
- if training_type in monster.stats.training_progress:
- monster.stats.training_progress[training_type] += 10 * intensity
-
- # Training costs
- monster.stats.energy = max(0, monster.stats.energy - (15 * intensity))
- monster.stats.hunger = max(0, monster.stats.hunger - (10 * intensity))
-
- # Update emotional state
- monster.emotional_state = monster.calculate_emotional_state()
- monster.current_activity = "training"
-
- # Save monster
- await self.state_manager.save_monster(monster)
-
- response = f"{monster.name} completed {training_type} training! 💪"
-
- return (
- monster.dict(),
- self._generate_monster_display(monster),
- response
- )
-
- except Exception as e:
- self.logger.error(f"Training failed: {e}")
- return monster_state, self._get_error_display(str(e)), str(e)
-
- async def handle_text_conversation(self, monster_state: Dict, message: str, conversation_history: List) -> Tuple:
- """Handle text conversation with monster"""
- if not monster_state or not message.strip():
- return conversation_history, "", conversation_history, monster_state if monster_state else {}
-
- try:
- monster = Monster(**monster_state)
-
- # Generate AI response with fallback
- if self.qwen_processor:
- response_data = await self.qwen_processor.generate_monster_response(
- monster.dict(),
- message,
- conversation_history
- )
- response = response_data["response"]
- else:
- # Fallback response when AI is not available
- response = self._get_fallback_response(monster, message)
- response_data = {
- "response": response,
- "emotional_impact": {"happiness": 0.1, "bonding": 0.02},
- "inference_time": 0.0
- }
-
- # Update conversation history
- conversation_history.append([message, response])
-
- # Convert to messages format for Gradio chatbot with type='messages'
- messages_format = []
- for msg in conversation_history:
- if isinstance(msg, list) and len(msg) == 2:
- messages_format.append({"role": "user", "content": msg[0]})
- messages_format.append({"role": "assistant", "content": msg[1]})
-
- # Update monster state based on interaction
- monster.conversation.total_conversations += 1
- monster.conversation.last_interaction = datetime.now()
- monster.stats.happiness = min(100, monster.stats.happiness + 2)
- monster.personality.relationship_level = min(100, monster.personality.relationship_level + 1)
-
- # Apply emotional impact
- emotional_impact = response_data.get("emotional_impact", {})
- for emotion, value in emotional_impact.items():
- if emotion == "happiness":
- monster.stats.happiness = max(0, min(100, monster.stats.happiness + int(value * 10)))
- elif emotion == "bonding":
- monster.personality.relationship_level = min(100, monster.personality.relationship_level + int(value * 5))
-
- # Save monster
- await self.state_manager.save_monster(monster)
-
- return messages_format, "", conversation_history, monster.dict()
-
- except Exception as e:
- self.logger.error(f"Conversation failed: {e}")
- return conversation_history, "", conversation_history, monster_state if monster_state else {}
-
- async def handle_voice_input(self, monster_state: Dict, audio_data, conversation_history: List) -> Tuple:
- """Handle voice input"""
- if not monster_state or audio_data is None:
- return conversation_history, conversation_history, monster_state, ""
-
- try:
- # Process speech with fallback
- if self.speech_engine:
- speech_result = await self.speech_engine.process_audio_stream(audio_data[1])
-
- if not speech_result["success"]:
- return conversation_history, conversation_history, monster_state, "Speech processing failed"
-
- transcribed_text = speech_result["transcription"]
- if not transcribed_text.strip():
- return conversation_history, conversation_history, monster_state, "No speech detected"
- else:
- return conversation_history, conversation_history, monster_state, "Speech processing not available"
-
- # Process as text conversation
- new_history, _, updated_history, updated_monster = await self.handle_text_conversation(
- monster_state, transcribed_text, conversation_history
- )
-
- return new_history, updated_history, updated_monster, f"Heard: \"{transcribed_text}\""
-
- except Exception as e:
- self.logger.error(f"Voice input failed: {e}")
- return conversation_history, conversation_history, monster_state, str(e)
-
- async def perform_care_action(self, monster_state: Dict, action: str) -> Tuple:
- """Perform care action on monster"""
- if not monster_state:
- return monster_state, self._get_default_monster_display(), "No monster loaded!"
-
- try:
- monster = Monster(**monster_state)
-
- care_effects = {
- "clean": {"cleanliness": 50, "happiness": 10},
- "play": {"happiness": 25, "energy": -15, "relationship": 5},
- "rest": {"energy": 40, "happiness": 5},
- "discipline": {"discipline": 20, "happiness": -10}
- }
-
- effects = care_effects.get(action, {})
-
- # Apply effects
- for stat, value in effects.items():
- if stat == "relationship":
- monster.personality.relationship_level = min(100, monster.personality.relationship_level + value)
- elif hasattr(monster.stats, stat):
- current = getattr(monster.stats, stat)
- setattr(monster.stats, stat, max(0, min(100, current + value)))
-
- # Update activity
- monster.current_activity = action
- monster.emotional_state = monster.calculate_emotional_state()
-
- # Save monster
- await self.state_manager.save_monster(monster)
-
- response = f"{monster.name} is now {action}ing! ✨"
-
- return (
- monster.dict(),
- self._generate_monster_display(monster),
- response
- )
-
- except Exception as e:
- self.logger.error(f"Care action failed: {e}")
- return monster_state, self._get_error_display(str(e)), str(e)
-
- async def background_update(self, monster_state: Dict) -> Tuple:
- """Background update for time-based effects"""
- if not monster_state:
- return monster_state, self._get_default_monster_display(), gr.update()
-
- try:
- monster = Monster(**monster_state)
-
- # Calculate time elapsed
- time_elapsed = (datetime.now() - monster.last_update).total_seconds() / 60 # minutes
-
- # Apply time effects
- monster.apply_time_effects(time_elapsed)
-
- # Save monster
- await self.state_manager.save_monster(monster)
-
- # Update save indicator
- save_indicator = f"<span style='color: green;'>● Auto-saved at {datetime.now().strftime('%H:%M:%S')}</span>"
-
- return (
- monster.dict(),
- self._generate_monster_display(monster),
- save_indicator
- )
-
- except Exception as e:
- self.logger.error(f"Background update failed: {e}")
- return monster_state, self._get_error_display(str(e)), gr.update()
-
- def start_mini_game(self, monster_state: Dict, game_type: str) -> Tuple:
- """Start a mini-game"""
- if not monster_state:
- return self._get_mini_game_display(), {}
-
- # Placeholder for mini-game implementation
- game_display = f"""
- <div style="text-align: center; padding: 20px;">
- <h3 style="color: #8b5cf6;">{game_type.title()} Training</h3>
- <p>Mini-game implementation coming soon!</p>
- </div>
- """
-
- game_stats = {
- "game_type": game_type,
- "status": "not_implemented"
- }
-
- return game_display, game_stats
-
- def start_voice_streaming(self) -> Tuple:
- """Start voice streaming"""
- return gr.update(interactive=False), gr.update(interactive=True)
-
- def stop_voice_streaming(self) -> Tuple:
- """Stop voice streaming"""
- return gr.update(interactive=True), gr.update(interactive=False)
-
- def _get_fallback_response(self, monster: 'Monster', message: str) -> str:
- """Get fallback response when AI is not available"""
- import random
-
- # Simple rule-based responses
- message_lower = message.lower()
-
- greetings = ["hello", "hi", "hey", "good morning", "good afternoon", "good evening"]
- questions = ["how", "what", "when", "where", "why", "who"]
- positive_words = ["good", "great", "love", "happy", "fun", "wonderful", "amazing"]
- negative_words = ["bad", "sad", "angry", "hate", "terrible", "awful"]
-
- responses = []
-
- # Greeting responses
- if any(greeting in message_lower for greeting in greetings):
- responses = [
- f"Hello there! {monster.name} is happy to see you! 😊",
- f"Hi! {monster.name} waves excitedly! 👋",
- f"Hey! {monster.name} is ready to play! 🎮"
- ]
-
- # Question responses
- elif any(q in message_lower for q in questions):
- responses = [
- f"*{monster.name} tilts head thoughtfully* 🤔",
- f"That's a great question! {monster.name} is thinking... 💭",
- f"*{monster.name} looks curious* Tell me more! 👀"
- ]
-
- # Positive responses
- elif any(word in message_lower for word in positive_words):
- responses = [
- f"{monster.name} is so happy! 😄",
- f"*{monster.name} does a little dance* 💃",
- f"That makes {monster.name} feel great! ✨"
- ]
-
- # Negative responses
- elif any(word in message_lower for word in negative_words):
- responses = [
- f"*{monster.name} looks concerned* 😟",
- f"{monster.name} wants to help make things better! 💗",
- f"*{monster.name} offers a gentle hug* 🤗"
- ]
-
- # Default responses
- else:
- responses = [
- f"*{monster.name} makes a happy sound* 😊",
- f"{monster.name} is listening! 👂",
- f"*{monster.name} nods understandingly* 🙂",
- f"Tell {monster.name} more! 💬"
- ]
-
- return random.choice(responses)
-
- def launch(self, **kwargs):
- """Launch the Gradio interface with optimized settings"""
- loop = asyncio.new_event_loop()
- asyncio.set_event_loop(loop)
-
- # Initialize async components
- loop.run_until_complete(self.initialize())
-
- # Create interface
- interface = self.create_interface()
-
- # Launch with production settings
- launch_config = {
- "server_name": "0.0.0.0",
- "server_port": 7860,
- "share": False,
- "debug": False,
- "show_error": True,
- "quiet": False,
- "favicon_path": None,
- "ssl_keyfile": None,
- "ssl_certfile": None,
- "ssl_keyfile_password": None,
- "max_threads": 40,
- **kwargs
- }
-
- self.logger.info("Launching DigiPal interface...")
- return interface.launch(**launch_config)
-
- # ZeroGPU wrapper functions for CPU-intensive operations
- def safe_create_monster(interface, name: str, personality: str):
- """Safe wrapper for monster creation"""
- import asyncio
- return asyncio.run(interface.create_new_monster(name, personality))
-
- def safe_handle_conversation(interface, monster_state: Dict, message: str, conversation_history: List):
- """Safe wrapper for conversation handling"""
- import asyncio
- return asyncio.run(interface.handle_text_conversation(monster_state, message, conversation_history))
-
- def safe_handle_voice(interface, monster_state: Dict, audio_data, conversation_history: List):
- """Safe wrapper for voice input handling"""
- import asyncio
- return asyncio.run(interface.handle_voice_input(monster_state, audio_data, conversation_history))
-
- # Apply GPU decorators only in Spaces environment for specific operations
- try:
- import os
- if os.getenv("SPACE_ID") is not None:
- # Only decorate the most GPU-intensive operations
- safe_handle_conversation = spaces.GPU(duration=120)(safe_handle_conversation)
- safe_handle_voice = spaces.GPU(duration=120)(safe_handle_voice)
- except (ImportError, NotImplementedError, AttributeError) as e:
- # GPU decorator not available or failed, continue without it
- pass
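
The removed wrappers above drive the async handlers with a bare `asyncio.run()`, which raises `RuntimeError` whenever an event loop is already running in the calling thread. A minimal sketch of a more defensive variant (the `run_sync` helper below is hypothetical, not part of this codebase):

```python
# Hypothetical helper: run a coroutine from synchronous callback code,
# tolerating the case where an event loop is already running.
import asyncio
import concurrent.futures
from typing import Any, Coroutine


def run_sync(coro: Coroutine) -> Any:
    """Drive `coro` to completion from sync code."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        # No loop in this thread: asyncio.run() is safe here.
        return asyncio.run(coro)
    # A loop is already running; hand the coroutine to a fresh loop
    # in a worker thread instead of crashing with RuntimeError.
    with concurrent.futures.ThreadPoolExecutor(max_workers=1) as pool:
        return pool.submit(asyncio.run, coro).result()
```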
src/ui/streamlit_interface.py ADDED
@@ -0,0 +1,565 @@
+ """
+ Streamlit Interface for DigiPal - Modern AI Monster Companion
+ Replaces Gradio with Streamlit for better user experience
+ """
+
+ import streamlit as st
+ import asyncio
+ import logging
+ import json
+ import time
+ import requests
+ import io
+ from typing import Dict, List, Optional, Any, Tuple
+ from datetime import datetime, timedelta
+ import numpy as np
+ from PIL import Image
+ import threading
+
+ # Configure logging
+ logging.basicConfig(level=logging.INFO)
+ logger = logging.getLogger(__name__)
+
+ # API Configuration
+ API_BASE_URL = "http://localhost:7861" # FastAPI backend
+
+ class StreamlitDigiPalInterface:
+ """Modern Streamlit interface for DigiPal"""
+
+ def __init__(self):
+ self.logger = logging.getLogger(__name__)
+
+ # Initialize session state
+ if 'current_monster' not in st.session_state:
+ st.session_state.current_monster = None
+ if 'monster_stats' not in st.session_state:
+ st.session_state.monster_stats = {}
+ if 'conversation_history' not in st.session_state:
+ st.session_state.conversation_history = []
+ if 'available_monsters' not in st.session_state:
+ st.session_state.available_monsters = []
+
+ def run(self):
+ """Main Streamlit application"""
+
+ # Page configuration
+ st.set_page_config(
+ page_title="DigiPal - AI Monster Companion",
+ page_icon="🐉",
+ layout="wide",
+ initial_sidebar_state="expanded"
+ )
+
+ # Custom CSS for cyberpunk theme
+ self._apply_custom_css()
+
+ # Header with cyberpunk styling
+ st.markdown('<h1 class="digipal-title">🐉 DigiPal</h1>', unsafe_allow_html=True)
+ st.markdown('<p style="text-align: center; font-family: Rajdhani, sans-serif; font-size: 1.2rem; color: #00ffff; margin-top: -1rem;">Advanced AI Monster Companion with 3D Generation</p>', unsafe_allow_html=True)
+
+ # Sidebar for monster management
+ self._render_sidebar()
+
+ # Main content area
+ if st.session_state.current_monster:
+ self._render_monster_interface()
+ else:
+ self._render_welcome_screen()
+
+ def _apply_custom_css(self):
+ """Apply custom CSS for cyberpunk theme"""
+ st.markdown("""
+ <style>
+ /* Import Google Fonts */
+ @import url('https://fonts.googleapis.com/css2?family=Orbitron:wght@400;700;900&family=Rajdhani:wght@300;400;500;600;700&display=swap');
+
+ /* Main background */
+ .main {
+ background: linear-gradient(135deg, #0a0a0a 0%, #1a0d2e 25%, #16213e 50%, #0f3460 75%, #0e4b99 100%);
+ color: #e0e0e0;
+ font-family: 'Rajdhani', sans-serif;
+ }
+
+ /* Headers */
+ h1, h2, h3 {
+ font-family: 'Orbitron', monospace;
+ color: #00ffff;
+ text-shadow: 0 0 20px #00ffff40;
+ }
+
+ /* Sidebar */
+ .css-1d391kg {
+ background: linear-gradient(180deg, #1a1a2e 0%, #16213e 100%);
+ border-right: 2px solid #00ffff40;
+ }
+
+ /* Buttons */
+ .stButton > button {
+ background: linear-gradient(45deg, #ff0080, #00ffff);
+ color: #0a0a0a;
+ border: none;
+ border-radius: 25px;
+ padding: 0.75rem 1.5rem;
+ font-weight: bold;
+ font-family: 'Orbitron', monospace;
+ font-size: 0.9rem;
+ box-shadow: 0 0 20px rgba(255, 0, 128, 0.3);
+ transition: all 0.3s ease;
+ text-transform: uppercase;
+ letter-spacing: 1px;
+ }
+ .stButton > button:hover {
+ transform: translateY(-3px) scale(1.05);
+ box-shadow: 0 0 30px rgba(0, 255, 255, 0.6);
+ background: linear-gradient(45deg, #00ffff, #ff0080);
+ }
+
+ /* Input fields */
+ .stTextInput > div > div > input, .stTextArea > div > div > textarea, .stSelectbox > div > div > select {
+ background: rgba(0, 0, 0, 0.8);
+ border: 2px solid #00ffff40;
+ border-radius: 10px;
+ color: #e0e0e0;
+ font-family: 'Rajdhani', sans-serif;
+ }
+ .stTextInput > div > div > input:focus, .stTextArea > div > div > textarea:focus {
+ border-color: #00ffff;
+ box-shadow: 0 0 15px #00ffff40;
+ }
+
+ /* Monster stats container */
+ .monster-stats {
+ background: linear-gradient(135deg, rgba(0, 255, 255, 0.1) 0%, rgba(255, 0, 128, 0.1) 100%);
+ border-radius: 15px;
+ padding: 1.5rem;
+ backdrop-filter: blur(10px);
+ border: 2px solid rgba(0, 255, 255, 0.3);
+ box-shadow: 0 0 30px rgba(0, 255, 255, 0.2);
+ }
+
+ /* Progress bars */
+ .stProgress > div > div > div {
+ background: linear-gradient(90deg, #ff0080 0%, #00ffff 100%);
+ }
+
+ /* Metrics */
+ [data-testid="metric-container"] {
+ background: rgba(0, 0, 0, 0.6);
+ border: 1px solid #00ffff40;
+ border-radius: 10px;
+ padding: 1rem;
+ box-shadow: 0 0 15px rgba(0, 255, 255, 0.1);
+ }
+
+ /* Chat messages */
+ .stChatMessage {
+ background: rgba(0, 0, 0, 0.7);
+ border-radius: 15px;
+ border-left: 4px solid #00ffff;
+ margin: 0.5rem 0;
+ }
+
+ /* Success/Error messages */
+ .stSuccess {
+ background: rgba(0, 255, 0, 0.1);
+ border: 1px solid #00ff00;
+ color: #00ff00;
+ }
+ .stError {
+ background: rgba(255, 0, 0, 0.1);
+ border: 1px solid #ff0000;
+ color: #ff6666;
+ }
+
+ /* Neon text */
+ .neon-text {
+ color: #00ffff;
+ text-shadow: 0 0 10px #00ffff, 0 0 20px #00ffff, 0 0 30px #00ffff;
+ font-family: 'Orbitron', monospace;
+ font-weight: 700;
+ }
+
+ /* DigiPal title effect */
+ .digipal-title {
+ background: linear-gradient(45deg, #ff0080, #00ffff, #ff0080);
+ background-size: 200% 200%;
+ -webkit-background-clip: text;
+ -webkit-text-fill-color: transparent;
+ animation: neon-glow 2s ease-in-out infinite alternate;
+ font-family: 'Orbitron', monospace;
+ font-weight: 900;
+ font-size: 3rem;
+ text-align: center;
+ margin: 1rem 0;
+ }
+
+ @keyframes neon-glow {
+ from { background-position: 0% 50%; }
+ to { background-position: 100% 50%; }
+ }
+
+ /* Holographic effect for containers */
+ .holo-container {
+ background: linear-gradient(135deg,
+ rgba(0, 255, 255, 0.1) 0%,
+ rgba(0, 255, 255, 0.05) 25%,
+ rgba(255, 0, 128, 0.05) 50%,
+ rgba(255, 0, 128, 0.1) 75%,
+ rgba(0, 255, 255, 0.1) 100%);
+ border: 2px solid;
+ border-image: linear-gradient(45deg, #00ffff, #ff0080, #00ffff) 1;
+ border-radius: 15px;
+ padding: 1.5rem;
+ backdrop-filter: blur(15px);
+ box-shadow:
+ 0 0 20px rgba(0, 255, 255, 0.3),
+ inset 0 0 20px rgba(255, 0, 128, 0.1);
+ }
+ </style>
+ """, unsafe_allow_html=True)
+
+ def _render_sidebar(self):
+ """Render sidebar with monster management"""
+ with st.sidebar:
+ st.header("🎮 Monster Management")
+
+ # Load available monsters
+ if st.button("🔄 Refresh Monsters"):
+ self._load_available_monsters()
+
+ # Monster selection
+ if st.session_state.available_monsters:
+ selected_monster = st.selectbox(
+ "Select Monster:",
+ options=["None"] + [m["name"] for m in st.session_state.available_monsters],
+ index=0
+ )
+
+ if selected_monster != "None":
+ if st.button("🐾 Load Monster"):
+ self._load_monster(selected_monster)
+
+ # Create new monster
+ st.subheader("🆕 Create New Monster")
+ with st.form("create_monster"):
+ new_name = st.text_input("Monster Name:")
+ personality = st.selectbox(
+ "Personality:",
+ ["FRIENDLY", "ENERGETIC", "CALM", "CURIOUS", "BRAVE"]
+ )
+
+ if st.form_submit_button("🥚 Create Monster"):
+ self._create_monster(new_name, personality)
+
+ # Current monster info
+ if st.session_state.current_monster:
+ st.subheader(f"🐉 {st.session_state.current_monster['name']}")
+ st.write(f"**Stage:** {st.session_state.current_monster.get('stage', 'Unknown')}")
+ st.write(f"**Personality:** {st.session_state.current_monster.get('personality', 'Unknown')}")
+
+ def _render_welcome_screen(self):
+ """Render welcome screen when no monster is selected"""
+ col1, col2 = st.columns([2, 1])
+
+ with col1:
+ st.markdown("""
+ <div class="holo-container">
+ <h2 class="neon-text">Welcome to DigiPal! 🐉</h2>
+
+ <p style="font-size: 1.3rem; color: #e0e0e0; margin-bottom: 1.5rem;">
+ <strong>The most advanced AI monster companion experience</strong>
+ </p>
+
+ <h3 style="color: #ff0080;">🚀 Revolutionary Features:</h3>
+ <ul style="font-size: 1.1rem; line-height: 1.8;">
+ <li>🤖 <strong style="color: #00ffff;">Advanced AI Conversations</strong> with Qwen 2.5</li>
+ <li>🎤 <strong style="color: #00ffff;">Voice Interaction</strong> with Kyutai STT-2.6b</li>
+ <li>🎨 <strong style="color: #00ffff;">3D Model Generation</strong> with OmniGen2 → Hunyuan3D → UniRig</li>
+ <li>📊 <strong style="color: #00ffff;">Complex Care System</strong> inspired by Digimon World</li>
+ <li>🧬 <strong style="color: #00ffff;">Dynamic Evolution</strong> based on care quality</li>
+ <li>💬 <strong style="color: #00ffff;">Personality-driven Responses</strong></li>
+ </ul>
+
+ <h3 style="color: #ff0080; margin-top: 2rem;">⚡ Getting Started:</h3>
+ <ol style="font-size: 1.1rem; line-height: 1.8;">
+ <li><span style="color: #00ffff;">Create a new monster</span> in the sidebar</li>
+ <li><span style="color: #00ffff;">Talk to your monster</span> and watch it grow</li>
+ <li><span style="color: #00ffff;">Generate a unique 3D model</span></li>
+ <li><span style="color: #00ffff;">Care for your digital companion!</span></li>
+ </ol>
+ </div>
+ """, unsafe_allow_html=True)
+
+ with col2:
+ st.markdown("""
+ <div class="holo-container" style="text-align: center;">
+ <h3 class="neon-text">🔮 Your AI Companion Awaits</h3>
+ <div style="background: linear-gradient(45deg, #ff0080, #00ffff);
+ border-radius: 20px;
+ padding: 2rem;
+ margin: 1rem 0;
+ box-shadow: 0 0 30px rgba(0, 255, 255, 0.5);">
+ <p style="font-size: 4rem; margin: 0; animation: neon-glow 2s ease-in-out infinite alternate;">🐉</p>
+ <p style="font-size: 1.2rem; margin: 0.5rem 0; color: #0a0a0a; font-weight: bold;">DigiPal</p>
+ </div>
+ <p style="color: #e0e0e0; font-style: italic;">Ready to create your perfect digital companion?</p>
+ </div>
+ """, unsafe_allow_html=True)
+
+ def _render_monster_interface(self):
+ """Render main monster interaction interface"""
+ monster = st.session_state.current_monster
+
+ # Main layout
+ col1, col2 = st.columns([2, 1])
+
+ with col1:
+ # Conversation area
+ self._render_conversation_area()
+
+ # Action buttons
+ self._render_action_buttons()
+
+ with col2:
+ # Monster stats and 3D model
+ self._render_monster_stats()
+ self._render_3d_model_section()
+
+ def _render_conversation_area(self):
+ """Render conversation interface"""
+ st.subheader("💬 Talk to Your Monster")
+
+ # Chat history
+ chat_container = st.container()
+ with chat_container:
+ for message in st.session_state.conversation_history:
+ if message["role"] == "user":
+ st.chat_message("user").write(message["content"])
+ else:
+ st.chat_message("assistant").write(message["content"])
+
+ # Chat input
+ user_input = st.chat_input("Say something to your monster...")
+
+ if user_input:
+ self._send_message(user_input)
+
+ def _render_action_buttons(self):
+ """Render care action buttons"""
+ st.subheader("🎮 Care Actions")
+
+ col1, col2, col3 = st.columns(3)
+
+ with col1:
+ if st.button("🍖 Feed"):
+ self._perform_action("feed")
+ if st.button("🏃 Train"):
+ self._perform_action("train")
+
+ with col2:
+ if st.button("🎲 Play"):
+ self._perform_action("play")
+ if st.button("🧼 Clean"):
+ self._perform_action("clean")
+
+ with col3:
+ if st.button("💊 Heal"):
+ self._perform_action("heal")
+ if st.button("😴 Rest"):
+ self._perform_action("rest")
+
+ def _render_monster_stats(self):
+ """Render monster statistics"""
+ st.subheader("📊 Monster Stats")
+
+ if 'stats' in st.session_state.current_monster:
+ stats = st.session_state.current_monster['stats']
+
+ # Create visual stat bars
+ for stat_name, value in stats.items():
+ if isinstance(value, (int, float)):
+ # Normalize to 0-100 for progress bar
+ normalized_value = min(100, max(0, value))
+ st.metric(
+ label=stat_name.title(),
+ value=f"{value:.1f}",
+ delta=None
+ )
+ st.progress(normalized_value / 100)
+ else:
+ st.info("Load a monster to see stats")
+
+ def _render_3d_model_section(self):
+ """Render 3D model generation section"""
+ st.subheader("🎨 3D Model Generation")
+
+ # Model display area
+ if st.session_state.current_monster and st.session_state.current_monster.get('model_url'):
+ st.success("3D Model Ready!")
+ st.write(f"Model: {st.session_state.current_monster['model_url']}")
+ else:
+ st.info("No 3D model generated yet")
+
+ # Generation controls
+ with st.form("generate_3d"):
+ description = st.text_area(
+ "Custom Description (optional):",
+ placeholder="A cute dragon with blue scales and friendly eyes..."
+ )
+
+ if st.form_submit_button("🎨 Generate 3D Model"):
+ self._generate_3d_model(description)
+
+ def _load_available_monsters(self):
+ """Load list of available monsters from API"""
+ try:
+ response = requests.get(f"{API_BASE_URL}/api/monsters", timeout=5)
+ if response.status_code == 200:
+ data = response.json()
+ st.session_state.available_monsters = data.get("monsters", [])
+ st.success(f"Found {len(st.session_state.available_monsters)} monsters")
+ else:
+ st.error("Failed to load monsters")
+ except requests.exceptions.RequestException as e:
+ st.warning("🔧 Backend not connected")
+ st.info("💡 To enable full functionality, start backend: `python app.py`")
+ # Add some demo monsters for UI preview
+ st.session_state.available_monsters = [
+ {"name": "Demo Dragon", "id": "demo1", "stage": "Adult"},
+ {"name": "Cyber Wolf", "id": "demo2", "stage": "Champion"}
+ ]
+
+ def _load_monster(self, monster_name: str):
+ """Load a specific monster"""
+ try:
+ # Find monster by name
+ monster_data = None
+ for monster in st.session_state.available_monsters:
+ if monster["name"] == monster_name:
+ monster_data = monster
+ break
+
+ if monster_data:
+ # Load full monster data from API
+ response = requests.get(f"{API_BASE_URL}/api/monsters/{monster_data['id']}")
+ if response.status_code == 200:
+ st.session_state.current_monster = response.json()
+ st.session_state.conversation_history = st.session_state.current_monster.get('conversation_history', [])
+ st.success(f"Loaded {monster_name}!")
+ st.rerun()
+ else:
+ st.error("Failed to load monster details")
+ except Exception as e:
+ st.error(f"Error loading monster: {str(e)}")
+
+ def _create_monster(self, name: str, personality: str):
+ """Create a new monster"""
+ if not name:
+ st.error("Please enter a monster name")
+ return
+
+ try:
+ response = requests.post(
+ f"{API_BASE_URL}/api/monsters",
+ json={"name": name, "personality": personality}
+ )
+ if response.status_code == 200:
+ monster_data = response.json()
+ st.session_state.current_monster = monster_data
+ st.session_state.conversation_history = []
+ st.success(f"Created {name}!")
+ st.rerun()
+ else:
+ st.error("Failed to create monster")
+ except Exception as e:
+ st.error(f"Error creating monster: {str(e)}")
+
+ def _send_message(self, message: str):
+ """Send message to monster"""
+ if not st.session_state.current_monster:
+ return
+
+ try:
+ # Add user message to history
+ st.session_state.conversation_history.append({
+ "role": "user",
+ "content": message,
+ "timestamp": datetime.now().isoformat()
+ })
+
+ # Send to API
+ response = requests.post(
+ f"{API_BASE_URL}/api/monsters/{st.session_state.current_monster['id']}/talk",
+ json={"message": message}
+ )
+
+ if response.status_code == 200:
+ data = response.json()
+ # Add AI response to history
+ st.session_state.conversation_history.append({
+ "role": "assistant",
+ "content": data["response"],
+ "timestamp": datetime.now().isoformat()
+ })
+
+ # Update monster stats
+ st.session_state.current_monster['stats'] = data.get("stats", {})
+ st.rerun()
+ else:
+ st.error("Failed to send message")
+ except Exception as e:
+ st.error(f"Error sending message: {str(e)}")
+
+ def _perform_action(self, action: str):
+ """Perform care action on monster"""
+ if not st.session_state.current_monster:
+ return
+
+ try:
+ response = requests.post(
+ f"{API_BASE_URL}/api/monsters/{st.session_state.current_monster['id']}/action",
+ json={"action": action}
+ )
+
+ if response.status_code == 200:
+ data = response.json()
+ st.session_state.current_monster['stats'] = data.get("stats", {})
+ st.success(f"Performed {action}!")
+ st.rerun()
+ else:
+ st.error(f"Failed to perform {action}")
+ except Exception as e:
+ st.error(f"Error performing {action}: {str(e)}")
+
+ def _generate_3d_model(self, description: str = ""):
+ """Generate 3D model for monster"""
+ if not st.session_state.current_monster:
+ return
+
+ try:
+ with st.spinner("Generating 3D model... This may take a few minutes."):
+ response = requests.post(
+ f"{API_BASE_URL}/api/monsters/{st.session_state.current_monster['id']}/generate-3d",
+ json={"description": description}
+ )
+
+ if response.status_code == 200:
+ data = response.json()
+ if data["success"]:
+ st.session_state.current_monster['model_url'] = data["model_url"]
+ st.success("3D model generated successfully!")
+ st.rerun()
+ else:
+ st.error("3D generation failed")
+ else:
+ st.error("Failed to generate 3D model")
+ except Exception as e:
+ st.error(f"Error generating 3D model: {str(e)}")
+
+ def main():
+ """Main entry point for Streamlit app"""
+ interface = StreamlitDigiPalInterface()
+ interface.run()
+
+ if __name__ == "__main__":
+ main()
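
The new Streamlit client expects a FastAPI backend on port 7861 exposing `/api/monsters`, `/api/monsters/{id}`, `/api/monsters/{id}/talk`, `/api/monsters/{id}/action`, and `/api/monsters/{id}/generate-3d`. A minimal stub sketching that contract, assuming the real routes live in `app.py` (all names and canned data below are illustrative only, not the actual backend):

```python
# Hypothetical FastAPI stub matching the routes this Streamlit client calls.
# Useful for exercising the UI when the real app.py backend is unavailable.
import uuid

from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()
MONSTERS: dict = {}  # in-memory store, keyed by monster id


class CreateRequest(BaseModel):
    name: str
    personality: str


class TalkRequest(BaseModel):
    message: str


@app.get("/api/monsters")
def list_monsters():
    # Shape matches data.get("monsters", []) in _load_available_monsters()
    return {"monsters": [{"id": m["id"], "name": m["name"], "stage": m["stage"]}
                         for m in MONSTERS.values()]}


@app.post("/api/monsters")
def create_monster(req: CreateRequest):
    mid = uuid.uuid4().hex
    MONSTERS[mid] = {"id": mid, "name": req.name, "personality": req.personality,
                     "stage": "Rookie", "stats": {"health": 100.0, "happiness": 80.0}}
    return MONSTERS[mid]


@app.get("/api/monsters/{mid}")
def get_monster(mid: str):
    return MONSTERS[mid]


@app.post("/api/monsters/{mid}/talk")
def talk(mid: str, req: TalkRequest):
    # Shape matches data["response"] / data.get("stats", {}) in _send_message()
    return {"response": f"{MONSTERS[mid]['name']} chirps back!",
            "stats": MONSTERS[mid]["stats"]}
```

Saved as `stub.py`, this can be served with `uvicorn stub:app --port 7861` so the sidebar and chat flow work against canned data.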
streamlit_app.py ADDED
@@ -0,0 +1,55 @@
+ #!/usr/bin/env python3
+ """
+ DigiPal Streamlit App - HuggingFace Spaces Entry Point
+ Unified Streamlit application that includes embedded FastAPI functionality
+ """
+
+ import streamlit as st
+ import asyncio
+ import threading
+ import time
+ import logging
+ import sys
+ import os
+ from pathlib import Path
+
+ # Add src to path
+ sys.path.insert(0, os.path.join(os.path.dirname(__file__), 'src'))
+
+ # Configure logging
+ logging.basicConfig(level=logging.INFO)
+ logger = logging.getLogger(__name__)
+
+ # Import our Streamlit interface
+ from src.ui.streamlit_interface import StreamlitDigiPalInterface
+
+ def start_background_services():
+ """Start background services needed for DigiPal"""
+ try:
+ # Create necessary directories
+ os.makedirs("data/saves", exist_ok=True)
+ os.makedirs("data/models", exist_ok=True)
+ os.makedirs("data/cache", exist_ok=True)
+ os.makedirs("logs", exist_ok=True)
+
+ # For Spaces deployment, we'll run a simplified version
+ # that doesn't require separate FastAPI server
+ logger.info("DigiPal background services initialized")
+
+ except Exception as e:
+ logger.error(f"Failed to initialize background services: {e}")
+
+ def main():
+ """Main Streamlit application entry point"""
+
+ # Initialize background services
+ if 'services_initialized' not in st.session_state:
+ start_background_services()
+ st.session_state.services_initialized = True
+
+ # Create and run the interface
+ interface = StreamlitDigiPalInterface()
+ interface.run()
+
+ if __name__ == "__main__":
+ main()
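
Note that `streamlit_app.py` is meant to be launched by the Streamlit runtime (`streamlit run streamlit_app.py`), not with plain `python`. Its `start_background_services()` currently only creates directories; if an embedded FastAPI backend is ever needed alongside Streamlit, one possible approach is a daemon-thread server (a sketch only; `api_app` stands in for whatever FastAPI app the backend exposes):

```python
# Hypothetical sketch: serve a FastAPI app from a daemon thread so the
# Streamlit process stays in charge of the main thread. Not part of the
# current codebase; `api_app` is a placeholder for the backend's app object.
import threading

import uvicorn


def start_api_in_background(api_app, port: int = 7861) -> threading.Thread:
    """Run uvicorn in a background thread; dies with the Streamlit process."""
    config = uvicorn.Config(api_app, host="0.0.0.0", port=port, log_level="info")
    server = uvicorn.Server(config)
    thread = threading.Thread(target=server.run, daemon=True)
    thread.start()
    return thread
```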
test_ui.py ADDED
@@ -0,0 +1,50 @@
+ #!/usr/bin/env python3
+ """
+ Quick UI Test Script
+ Run this to see the new Streamlit UI without the full backend
+ """
+
+ import subprocess
+ import sys
+ import os
+ import logging
+
+ # Configure logging
+ logging.basicConfig(level=logging.INFO, format='%(asctime)s - %(levelname)s - %(message)s')
+ logger = logging.getLogger(__name__)
+
+ def main():
+ """Test the Streamlit UI"""
+ logger.info("🎨 Testing DigiPal Streamlit UI")
+ logger.info("=" * 50)
+ logger.info("This will show you the new UI interface")
+ logger.info("Note: Backend features won't work without running the API")
+ logger.info("=" * 50)
+
+ # Create necessary directories
+ os.makedirs("data/saves", exist_ok=True)
+ os.makedirs("data/models", exist_ok=True)
+ os.makedirs("data/cache", exist_ok=True)
+ os.makedirs("logs", exist_ok=True)
+
+ try:
+ port = os.getenv("STREAMLIT_PORT", "8501")
+ logger.info(f"Starting Streamlit UI on port {port}...")
+ logger.info(f"Open your browser to: http://localhost:{port}")
+
+ subprocess.run([
+ sys.executable, "-m", "streamlit", "run",
+ "streamlit_app.py",
+ "--server.port", port,
+ "--server.address", "0.0.0.0",
+ "--server.headless", "false"
+ ], check=True)
+
+ except subprocess.CalledProcessError as e:
+ logger.error(f"Failed to start Streamlit: {e}")
+ logger.info("Make sure you have streamlit installed: pip install streamlit")
+ except KeyboardInterrupt:
+ logger.info("UI test stopped")
+
+ if __name__ == "__main__":
+ main()
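
Because `test_ui.py` launches the UI without the backend, the sidebar falls back to the demo monsters defined in `_load_available_monsters()`. A small pre-flight sketch for checking whether the FastAPI backend is reachable before launching (assumes only the `/api/monsters` route already used by the client):

```python
# Hypothetical pre-flight check: verify the FastAPI backend responds before
# starting the Streamlit UI, so demo mode can be flagged up front.
import requests


def backend_available(base_url: str = "http://localhost:7861") -> bool:
    """Return True if the DigiPal API answers on its monsters route."""
    try:
        return requests.get(f"{base_url}/api/monsters", timeout=2).status_code == 200
    except requests.exceptions.RequestException:
        return False


if __name__ == "__main__":
    if backend_available():
        print("Backend reachable - full functionality available")
    else:
        print("Backend down - UI will fall back to demo monsters")
```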