Fireworks AI - Fast Inference Platform
Learn how to use Fireworks AI’s models with Kodus
How Fireworks AI works
Fireworks AI is the fastest inference platform for generative AI, designed to build and run magical AI applications in seconds. The platform provides serverless access to popular open-source models like DeepSeek, Llama, Qwen, and Mistral with optimized speed, high throughput, and minimal latency. Built for developers who need reliable, blazing-fast AI infrastructure without GPU management complexity.
Recommended Models
We recommend good coding models with competitive pricing and high context windows.
For the most updated information, please visit Fireworks AI’s pricing page.
Model | Pricing (1M tokens) | Context Window |
---|---|---|
Llama 4 Maverick recommended | 0.88 | ~131k tokens |
Llama 4 Scout | 0.60 | ~131k tokens |
DeepSeek V3 | $0.90 | ~128k tokens |
Qwen3 235B | 0.88 | ~131k tokens |
Creating API Key
Go directly to Fireworks AI Console to create a new API Key.
Or, follow these steps:
- Visit app.fireworks.ai and create an account or sign in
- Once logged in, navigate to the API Keys page in your account settings
- Click “Create API Key” button
- Give your key a descriptive name (e.g., ‘Kodus’ or any name you prefer)
- Click “Create” to generate the key
- Copy the API key immediately and save it somewhere secure - you won’t be able to see it again
New accounts come with $1 in free credits to get started with your projects.
How to use
System Requirements
Internet access is only required if you plan to connect with cloud-based Git services like GitHub, GitLab, or Bitbucket. For self-hosted Git tools within your network, external internet access is optional.
Domain Name Setup (Optional)
If you're planning to integrate Kodus with cloud-based Git providers (GitHub, GitLab, or Bitbucket), you'll need public-facing URLs for both the Kodus Web App and its API. This allows your server to receive webhooks for proper Code Review functionality and ensures correct application behavior.
We recommend setting up two subdomains:
- One for the Web Application, e.g.,
kodus-web.yourdomain.com
. - One for the API, e.g.,
kodus-api.yourdomain.com
.
Both subdomains should have DNS A records pointing to your server's IP address. Later in this guide, we will configure a reverse proxy (Nginx) to route requests to these subdomains to the correct internal services. This setup is essential for full functionality, including webhooks and authentication.
Note: If you're only connecting to self-hosted Git tools on your network and do not require public access or webhooks, you might be able to use a simpler setup, but this guide focuses on public-facing deployments.
Get the Kodus Installer
Clone our installer repository:
Configure Environment Variables
First, copy the example environment file:
Generate secure keys for the required environment variables using:
You'll need to generate values for these security keys:
- WEB_NEXTAUTH_SECRET (use openssl rand -base64 32)
- WEB_JWT_SECRET_KEY (use openssl rand -base64 32)
- API_CRYPTO_KEY (use openssl rand -hex 32)
- API_JWT_SECRET (use openssl rand -base64 32)
- API_JWT_REFRESHSECRET (use openssl rand -base64 32)
- CODE_MANAGEMENT_SECRET (use openssl rand -hex 32)
- CODE_MANAGEMENT_WEBHOOK_TOKEN (use openssl rand -base64 32 | tr -d '=' | tr '/+' '_-')
Never commit your .env
file to version control. Keep your API keys and
database credentials secure.
Then update your .env
file with the following required variables:
Configure Fireworks AI in Environment File
Edit your .env
file and configure the core settings. For LLM Integration, use Fireworks AI in Fixed Mode:
Fixed Mode is ideal for Fireworks AI because it provides OpenAI-compatible APIs with blazing-fast inference speeds and access to cutting-edge open-source models with zero setup time.
Run the Installation Script
Looking for more control? Check out our docker-compose file for manual deployment options.
Set the proper permissions for the installation script:
Run the script:
What the Installer Does
Our installer automates several important steps:
- Verifies Docker installation
- Creates networks for Kodus services
- Clones repositories and configures environment files
- Runs docker-compose to start all services
- Executes database migrations
- Seeds initial data
🎉 Success! When complete, Kodus Orchestrator API and Web Application should be running on your machine.
You can verify your installation by visiting http://localhost:3000
- you should see the Kodus Web Application interface.
Code Review features will not work yet unless you complete the reverse proxy setup. Without this configuration, external Git providers cannot send webhooks to your instance.
Set Up Reverse Proxy (For Production)
For webhooks and external access, configure Nginx:
Verify Fireworks AI Integration
Além da verificação básica da instalação, confirme que o Fireworks AI está funcionando:
For detailed information about SSL setup, monitoring, and advanced configurations, see our complete deployment guide.