Emby
    DocsCompliancePricing

    Unlimited AI
    All the AI You Need

    Supercharge your engineering team with fast EU-hosted open-source AI, predictable daily capacity, and enterprise-grade provider routing.

    A drop-in replacement for OpenAI, Claude, and Cerebras-style coding speeds with no lock-in and no throttling surprises.

    emby_sk_1a2b3c4d5e6f7g8h9i0j
    Get StartedDocumentation

    We work with

    CursorVS CodeClaude CodeContinue.devOpenCodeClineRoo CodeGoose
    View installation guides
    curl https://dev.emby.ai/v1/chat/completions \
    -H "Authorization: Bearer YOUR_API_KEY" \
    -H "Content-Type: application/json" \
    -d '{
    "model": "emby/kimi-k2",
    "messages": [{"role": "user", "content": "Hello!"}]
    }'

    Available Models

    ModelInput/MOutput/MRegion
    $2.50$10.00NL
    $3.00$15.00US
    $0.80$0.80NL
    $0.45$0.45NL
    $0.60$0.60US
    $0.15$0.60NL
    View all models
    SDK Support

    Build with Your Favorite Language

    Production-ready SDKs for all major programming languages

    JS
    JavaScript
    PY
    Python
    TS
    TypeScript
    GO
    Go
    RB
    Ruby
    JV
    Java
    PHP
    PHP
    RS
    Rust
    Now in Beta

    Real-time cost & usage tracking

    Track revenue, costs & profits in realtime. Understand your business performance like never before.

    Activity

    azure/gpt-4o

    Azure OpenAI • 14:23:45

    €0.0045

    1.2K

    bedrock/claude-3.5-sonnet

    AWS Bedrock • 14:23:40

    €0.0089

    2.1K

    nebius/llama-3.3-70b

    Nebius AI • 14:23:37

    €0.0012

    892

    deepinfra/qwen-2.5-72b

    DeepInfra • 14:23:33

    €0.0023

    1.5K

    groq/llama-3.3-70b

    Groq • 14:23:23

    €0.0015

    1.1M

    Emby Pro Plans

    For developers who need predictable capacity, not throttled "unlimited" promises.

    Emby Pro

    5M tokens/day

    €60€100/ dev / mo

    Daily stable capacity for coding, agents, debugging, and Cursor sessions.

    Popular

    Emby Pro+

    10M tokens/day

    €100€167/ dev / mo

    For heavier dev cycles, large refactors, monorepos, and agentic workflows.

    Emby Ultra

    20M tokens/day

    €140€233/ dev / mo

    For automation, multi-agent flows, CI agents, and teams that ship fast.

    Looking to save even more?

    We deploy custom OSS models, private clusters, and dedicated EU inference.

    enterprise@emby.ai•WhatsApp (fastest reply)

    Overflow Protection

    Never break mid-generation. Choose how to handle capacity limits.

    Automatic Spillover (Optional)

    When you hit your daily limit, additional tokens are charged at provider cost + 5% routing fee. OSS models have no routing fee.

    Auto-Upgrade Option

    Teams may choose automatic plan upgrades if consistently exceeding limits. Ensures your builds, agents, or coding sessions never break mid-generation.

    Routing Fee Details: 5% fee applies only to non-OSS models (Azure, Bedrock, Vertex, Together, Groq). OSS models = no routing fee.

    Bring Your Own Keys (BYOK)

    Connect your own API keys or custom inference endpoints for complete control over your AI infrastructure.

    €50

    per org/month

    Supported Providers

    AzureBedrockVertexGroq

    Custom Endpoints

    Private API servers

    On-prem clusters

    Sovereign infrastructure

    Feature Comparison

    FeatureEmbyOpenAIClaude
    Developer subscription
    Default daily capacity5M / 10M / 20M
    Compliance 🇪🇺
    Zero data retention (default)
    OpenAI-Compatible
    Claude-Compatible
    Vertex / Bedrock / Azure routing
    OSS models included
    Multiple providers
    Team Billing
    IAM (model limiting per key)
    Custom providers
    Caching included
    Agent supportLimitedLimited
    BYOK per org
    TTFT2.21s3.89s4.53s
    WhatsApp (human) support

    Trusted by Engineering Teams

    See what developers are saying about Emby

    Emby has completely transformed our AI workflow. The EU hosting ensures compliance, and the predictable capacity means no more throttling during critical deployments.

    SC
    Sarah Chen
    CTO at DevTools Inc

    Switching to Emby was seamless. The OpenAI-compatible API meant zero code changes, but we gained better performance and EU data residency for half the cost.

    MR
    Marcus Rodriguez
    Lead Engineer at CloudScale

    The multi-provider routing is a game changer. We can use Azure, Bedrock, and OSS models through one API. Plus, their support team responds in minutes via WhatsApp.

    EvdB
    Emma van der Berg
    VP Engineering at Fintech Solutions

    ENTERPRISE-GRADE SECURITY

    Compliance and access control

    We have ISO27001 and NEN7510 certified infrastructure for our OSS models and we only work with EU compliant partners/options.

    🇪🇺

    GDPR Compliance

    Strict European data privacy protections with full data residency guarantees you can trust.

    ISO
    27001

    ISO 27001 Certified

    Global standards for information security management, fully enforced and regularly audited.

    NEN
    7510

    NEN 7510 Certified

    Healthcare-grade security standards ensuring the highest level of data protection and compliance.

    Language Support

    Get started in minutes

    All 3 Examples
    curl https://dev.emby.ai/v1/chat/completions \
    -H "Authorization: Bearer YOUR_API_KEY" \
    -H "Content-Type: application/json" \
    -d '{
    "model": "emby/kimi-k2",
    "messages": [
    {"role": "user", "content": "Hello!"}
    ]
    }'

    Drop-in replacement for your favorite coding languages

    Use Emby with any OpenAI-compatible SDK. Simply point your existing code to our endpoint no library changes needed.

    Framework Adapters

    Integrate in under a minute

    All 13 Adapters
    ">// app/api/chat/route.ts
    600">import { openai } 600">from ';
    600">import { streamText } 600">from ';
     
    600">export 600">async 600">function POST(req: Request) {
    600">const { messages } = 600">await req.json();
    600">const result = 600">await streamText({
    model: openai('),
    messages,
    });
    600">return result.toUIMessageStreamResponse();
    }

    Next.js Adapter

    AI Inference made simple with Next.js

    Secure & Simple Integration
    Integrated Developer Portal
    Granular & Reliable Webhook Handler
    Global Infrastructure Partner

    Automation Tools

    No-code workflows

    All 5 Platforms
    {
    "nodes": [{
    "parameters": {
    "method": "POST",
    "url": "https://dev.emby.ai/v1/chat/completions",
    "authentication": "headerAuth",
    "headerAuth": {
    "name": "Authorization",
    "value": "Bearer {{$credentials.embyApiKey}}"
    },
    "jsonParameters": true,
    "options": {},
    "bodyParametersJson": {
    "model": "emby/kimi-k2",
    "messages": [
    {"role": "user", "content": "{{$json.prompt}}"}
    ]
    }
    },
    "type": "n8n-nodes-base.httpRequest",
    "name": "Emby AI Request"
    }]
    }

    Connect with automation platforms

    Integrate Emby into your n8n, Make.com, or Zapier workflows. Build powerful AI automations without writing code.

    IDE Extensions

    Code with AI

    All 13 Integrations
    {
    "models": {
    "chat": "emby/kimi-k2",
    "autocomplete": "emby/qwen-2.5-coder"
    },
    "openaiCompatible": {
    "endpoint": "https://dev.emby.ai/v1",
    "apiKey": "YOUR_API_KEY"
    },
    "enableTabCompletion": true,
    "enableInlineChat": true
    }

    AI-powered coding assistants

    Configure Cursor, VS Code, Continue, and other popular IDEs to use Emby. Get intelligent code completion and AI chat in your editor.

    Terminal Tools

    AI in your CLI

    All 6 Integrations
    # ~/.warp/ai_config.yaml
    provider: openai
    api_key: YOUR_API_KEY
    base_url: https://dev.emby.ai/v1
    model: emby/kimi-k2
    temperature: 0.7
    max_tokens: 4096

    Terminal AI assistants

    Bring AI to your terminal with Warp, Aider, and other CLI tools. Get AI assistance directly in your command line workflow.

    Frequently Asked Questions

    Everything you need to know about Emby Dev

    What is Emby Dev?

    A predictable, EU-hosted AI platform for developers using IDE agents, coding workflows, or multi-provider routing.

    How does pricing work?

    Choose a daily token capacity (5M, 10M, 20M). OSS is included. Routed models carry a 5% fee, unless BYOK is enabled.

    Is my data secure and GDPR compliant?

    Yes. ISO 27001 + NEN 7510 infrastructure, zero retention, EU residency.

    Which AI coding tools work with Emby?

    Cursor, Continue, Claude Code, Cline, Roo Code, VS Code, and any OpenAI-compatible tool.

    What models are available?

    We host top open-source models including DeepSeek V3, Qwen 2.5 72B, Llama 3.3 70B, Mistral Large, and more. Through our routing feature, you can also access GPT-4, Claude 3.5 Sonnet, Gemini Pro, and other proprietary models. New frontier models are added within 24 hours of release.

    How do I get started?

    Generate a test key above to make 5 free requests immediately and explore our models.

    When you're ready to scale, create an account to get unlimited access with automatic usage tracking, team management, and billing. Setup takes < 2 minutes.

    Search Models

    ModelAvailabilityInput Price/MOutput Price/MLatency P95
    99.9%
    €2.50€10.00456ms
    99.8%
    €3.00€15.00378ms
    99.7%
    €0.80€0.80589ms
    99.5%
    €0.60€0.60512ms
    97.2%
    €0.15€0.60298ms

    Live Activity

    ModelProviderTokensResponseSpeedCostStatusTime
    Azure OpenAI1.2K2.34s45 t/s€0.0045completed14:23:45
    AWS Bedrock2.1K1.89s52 t/s€0.0089completed14:23:40
    Nebius AI8923.12s38 t/s€0.0012completed14:23:37
    DeepInfra1.5K2.67s41 t/s€0.0023completed14:23:33
    Groq1.1M1.45s120 t/s€0.0015completed14:23:23
    Together AI743K2.12s55 t/s€0.0008completed14:23:18
    EmbyJoin Emby today

    © Emby Software, Inc. 2025

    Features

    • Models
    • Pricing
    • Documentation
    • Compliance

    Company

    • About Emby
    • GitHub
    • Brand Assets
    • Terms of Service
    • Privacy Policy

    Support

    • Docs
    • Contact
    • Service Status