Spaces:

akhaliq
/

anycoder

Running

App Files Files Community

akhaliq HF Staff commited on about 18 hours ago

Commit

f777467

1 Parent(s): 9912408

add gemini models

Browse files

Files changed (2) hide show

README.md +5 -1
app.py +22 -0

README.md CHANGED Viewed

@@ -19,7 +19,7 @@ AnyCoder is an AI-powered code generator that helps you create applications by d
 ## Features
-- **Multi-Model Support**: Choose from Moonshot Kimi-K2, DeepSeek V3, DeepSeek R1, ERNIE-4.5-VL, MiniMax M1, Qwen3-235B-A22B, Qwen3-30B-A3B-Instruct-2507, Qwen3-30B-A3B-Thinking-2507, SmolLM3-3B, and GLM-4.1V-9B-Thinking
 - **Flexible Input**: Describe your app in text, upload a UI design image (for multimodal models), provide a reference file (PDF, TXT, MD, CSV, DOCX, or image), or enter a website URL for redesign
 - **Web Search Integration**: Enable real-time web search (Tavily, with advanced search depth) to enhance code generation with up-to-date information and best practices
 - **Code Generation**: Generate code in HTML, Python, JS, and more. Special support for transformers.js apps (outputs index.html, index.js, style.css)
@@ -46,6 +46,7 @@ export HF_TOKEN="your_huggingface_token"
 export TAVILY_API_KEY="your_tavily_api_key"  # Optional, for web search feature
 export DASHSCOPE_API_KEY="your_dashscope_api_key"  # Required for Qwen3-30B models via DashScope
 export POE_API_KEY="your_poe_api_key"  # Required for GPT-5 and Grok-4 via Poe
 ```
 ## Usage
@@ -82,6 +83,8 @@ python app.py
 - GLM-4.1V-9B-Thinking (multimodal)
 - GPT-5 (via Poe)
 - Grok-4 (via Poe)
 ## Input Options
@@ -123,6 +126,7 @@ python app.py
 - `HF_TOKEN`: Your Hugging Face API token (required)
 - `TAVILY_API_KEY`: Your Tavily API key (optional, for web search)
 ## Project Structure

 ## Features
+- **Multi-Model Support**: Choose from Moonshot Kimi-K2, DeepSeek V3, DeepSeek R1, ERNIE-4.5-VL, MiniMax M1, Qwen3-235B-A22B, Qwen3-30B-A3B-Instruct-2507, Qwen3-30B-A3B-Thinking-2507, SmolLM3-3B, GLM-4.1V-9B-Thinking, Gemini 2.5 Flash and Gemini 2.5 Pro (OpenAI-compatible)
 - **Flexible Input**: Describe your app in text, upload a UI design image (for multimodal models), provide a reference file (PDF, TXT, MD, CSV, DOCX, or image), or enter a website URL for redesign
 - **Web Search Integration**: Enable real-time web search (Tavily, with advanced search depth) to enhance code generation with up-to-date information and best practices
 - **Code Generation**: Generate code in HTML, Python, JS, and more. Special support for transformers.js apps (outputs index.html, index.js, style.css)
 export TAVILY_API_KEY="your_tavily_api_key"  # Optional, for web search feature
 export DASHSCOPE_API_KEY="your_dashscope_api_key"  # Required for Qwen3-30B models via DashScope
 export POE_API_KEY="your_poe_api_key"  # Required for GPT-5 and Grok-4 via Poe
+export GEMINI_API_KEY="your_gemini_api_key"  # Required for Gemini models
 ```
 ## Usage
 - GLM-4.1V-9B-Thinking (multimodal)
 - GPT-5 (via Poe)
 - Grok-4 (via Poe)
+ - Gemini 2.5 Flash (OpenAI-compatible)
+ - Gemini 2.5 Pro (OpenAI-compatible)
 ## Input Options
 - `HF_TOKEN`: Your Hugging Face API token (required)
 - `TAVILY_API_KEY`: Your Tavily API key (optional, for web search)
+ - `GEMINI_API_KEY`: Your Google Gemini API key (required to use Gemini models)
 ## Project Structure

app.py CHANGED Viewed

@@ -512,6 +512,16 @@ AVAILABLE_MODELS = [
         "id": "codestral-2508",
         "description": "Mistral Codestral model - specialized for code generation and programming tasks"
     },
     {
         "name": "GPT-OSS-120B",
         "id": "openai/gpt-oss-120b",
@@ -653,6 +663,18 @@ def get_inference_client(model_id, provider="auto"):
     elif model_id == "codestral-2508":
         # Use Mistral client for Codestral model
         return Mistral(api_key=os.getenv("MISTRAL_API_KEY"))
     elif model_id == "openai/gpt-oss-120b":
         provider = "cerebras"
     elif model_id == "openai/gpt-oss-20b":

         "id": "codestral-2508",
         "description": "Mistral Codestral model - specialized for code generation and programming tasks"
     },
+    {
+        "name": "Gemini 2.5 Flash",
+        "id": "gemini-2.5-flash",
+        "description": "Google Gemini 2.5 Flash via OpenAI-compatible API"
+    },
+    {
+        "name": "Gemini 2.5 Pro",
+        "id": "gemini-2.5-pro",
+        "description": "Google Gemini 2.5 Pro via OpenAI-compatible API"
+    },
     {
         "name": "GPT-OSS-120B",
         "id": "openai/gpt-oss-120b",
     elif model_id == "codestral-2508":
         # Use Mistral client for Codestral model
         return Mistral(api_key=os.getenv("MISTRAL_API_KEY"))
+    elif model_id == "gemini-2.5-flash":
+        # Use Google Gemini (OpenAI-compatible) client
+        return OpenAI(
+            api_key=os.getenv("GEMINI_API_KEY"),
+            base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
+        )
+    elif model_id == "gemini-2.5-pro":
+        # Use Google Gemini Pro (OpenAI-compatible) client
+        return OpenAI(
+            api_key=os.getenv("GEMINI_API_KEY"),
+            base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
+        )
     elif model_id == "openai/gpt-oss-120b":
         provider = "cerebras"
     elif model_id == "openai/gpt-oss-20b":