FLUX1 TensorRT-RTX: DEV-Fp8 π¨ Building
Optimized TensorRT-RTX engines for FLUX1 on Fp8 architecture with DEV quantization.
π― This Repository
One variant, one download - only get exactly what you need!
- Model: FLUX1
 - Architecture: Fp8 (Compute Capability 8.0+)
 - Quantization: DEV
 - Memory: TBD
 - Speed: TBD for 1024x1024 generation
 
π Quick Start
Automatic (Recommended)
# ImageAI server downloads automatically
curl -X POST "http://localhost:8001/generate" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "a beautiful landscape",
    "model": "flux1-tensorrt_rtx:dev",
    "width": 1024,
    "height": 1024
  }'
Manual Download
from huggingface_hub import snapshot_download
# Download this specific variant only
engines_path = snapshot_download(
    repo_id="imgailab/flux1-trtx-dev-fp8-blackwell"
)
# Engines are in: engines_path/engines/*.plan
Direct Integration
from imageai_server.tensorrt.nvidia_sdxl_pipeline import NVIDIASDXLPipeline
pipeline = NVIDIASDXLPipeline()
pipeline.load_engines(
    engine_dir=f"{engines_path}/engines",
    framework_model_dir=f"{engines_path}/framework",  
    onnx_dir=f"{engines_path}/onnx"
)
pipeline.activate_engines()
images, time_ms = pipeline.infer(
    prompt="a serene mountain landscape",
    height=1024,
    width=1024
)
π Performance
| Metric | Value | 
|---|---|
| Memory Usage | TBD | 
| Inference Speed | TBD | 
| Resolution | 1024x1024 (optimized) | 
| Batch Size | 1 (optimized) | 
| Precision | DEV | 
π§ Requirements
Hardware
- GPU: Fp8 architecture
- Ampere: RTX 3090, A100, etc.
 - Ada Lovelace: RTX 4090, etc.
 - Blackwell: H200, etc.
 
 - VRAM: TBD minimum
 - Compute Capability: 8.0+
 
Software
- TensorRT-RTX: 1.0.0.21+
 - CUDA: 12.0+
 - Python: 3.8+
 
π Repository Structure
flux1-trtx-dev-fp8-blackwell/
βββ engines/           # TensorRT engine files
β   βββ *.plan        # Optimized engines
βββ config.json       # Configuration metadata
βββ README.md         # This file
π Related Repositories
Other variants for FLUX1:
- Ampere BF16\n- Ada FP8\n- Ada BF16\n- Blackwell FP4\n- Blackwell FP8\n- Blackwell BF16\n
 
π License
Inherits license from base model: black-forest-labs/FLUX.1-dev
π Updates
- 2025-08-12: Initial release
 - Optimized for single-variant downloads
 
Part of the ImageAI TensorRT-RTX engine collection
- Downloads last month
 - 3
 
	Inference Providers
	NEW
	
	
	This model isn't deployed by any Inference Provider.
	π
			
		Ask for provider support
Model tree for imgailab/flux1-trtx-dev-fp8-blackwell
Base model
black-forest-labs/FLUX.1-dev