Spaces:

Agents-MCP-Hackathon
/

KnowledgeBridge

Running

App Files Files Community

KnowledgeBridge / docs /archive /nebius-setup.md

fazeel007

initial commit

7c012de 18 days ago

preview code

raw

history blame contribute delete

3.94 kB

	# Nebius Configuration Guide

	## 🚀 Quick Setup

	Your app is now configured to use Nebius AI instead of OpenAI! Here's how to get it running:

	### 1. Set Environment Variables

	Create a `.env` file in the root directory:

	```bash
	# Required: Nebius AI API Key
	NEBIUS_API_KEY=your_nebius_api_key_here

	# Optional: Modal for advanced processing
	MODAL_TOKEN_ID=your_modal_token_id
	MODAL_TOKEN_SECRET=your_modal_token_secret

	# Optional: GitHub for repository search
	GITHUB_TOKEN=your_github_token_here
	```

	### 2. Supported Models

	The app now uses these Nebius models:

	#### 🧠 Default Model: DeepSeek-R1-0528
	- Model ID: `deepseek-ai/DeepSeek-R1-0528`
	- Use Cases: Chat completions, document analysis, query enhancement
	- Features: Advanced reasoning, code understanding, research synthesis

	#### 🌟 Alternative Models Available:
	- Qwen: `Qwen/Qwen3-235B-A22B`
	- Llama Nemotron: `nvidia/Llama-3_1-Nemotron-Ultra-253B-v1`

	### 3. Start the Application

	```bash
	npm run dev
	```

	The app will start at `http://localhost:5000`

	## 🔧 What Changed

	### ✅ Removed OpenAI Dependencies
	- ❌ `OPENAI_API_KEY` no longer required
	- ❌ `openai` package dependency removed from routes
	- ✅ All AI operations now use Nebius

	### ✅ Enhanced AI Capabilities
	- Document Analysis: Summary, classification, key points, quality scoring
	- Query Enhancement: Smart query improvement with intent analysis
	- Research Synthesis: Multi-document analysis and insights
	- Embeddings: Vector generation for semantic search
	- Chat Completions: Conversational AI for explanations

	### ✅ API Endpoints Updated
	- `/api/explain` - Uses DeepSeek for document explanations
	- `/api/ai-search` - Enhanced AI-powered search
	- `/api/analyze-document` - Document analysis with DeepSeek
	- `/api/enhance-query` - Query improvement
	- `/api/embeddings` - Vector embeddings via Nebius
	- `/api/health` - Health check includes Nebius status

	## 🎯 Features Now Available

	### AI-Enhanced Search Interface
	- Smart query enhancement with purple sparkles ✨
	- Real-time AI suggestions and improvements
	- Dual search modes: Traditional + AI-enhanced

	### Document Analysis Tools
	- Summary: Concise document summaries
	- Classification: Automatic document categorization
	- Key Points: Extract important findings
	- Quality Score: Assess document credibility

	### Advanced AI Features
	- Research Synthesis: Analyze multiple documents together
	- Embedding Generation: Create vector representations
	- Relevance Scoring: AI-powered result ranking

	## 🔍 Model Selection

	### When to Use Each Model:

	#### DeepSeek-R1-0528 (Default)
	- General-purpose tasks
	- Code analysis and programming
	- Research and academic content
	- Reasoning-heavy tasks

	#### Qwen3-235B-A22B
	- Multilingual content
	- Large-scale document processing
	- Complex reasoning tasks

	#### Llama-3_1-Nemotron-Ultra-253B-v1
	- Creative writing and content generation
	- Instruction following
	- Advanced conversational AI

	## 🚨 Troubleshooting

	### API Key Issues
	```bash
	# Check if your Nebius API key is working
	curl -H "Authorization: Bearer YOUR_API_KEY" \
	-H "Content-Type: application/json" \
	https://api.studio.nebius.ai/v1/models
	```

	### Health Check
	Visit `http://localhost:5000/api/health` to verify all services are working.

	### Common Issues
	1. Missing API Key: Ensure `NEBIUS_API_KEY` is set in your `.env` file
	2. Rate Limits: Nebius may have rate limits - check your usage
	3. Model Names: Use exact model IDs as shown above

	## 📊 Cost Benefits

	- Lower Costs: Nebius often provides better pricing than OpenAI
	- Better Performance: DeepSeek models are optimized for reasoning tasks
	- More Control: Choose specific models for different use cases

	Your KnowledgeBridge app is now powered by cutting-edge Nebius AI models! 🎉