Production fine-tuning, no infra required
Train, evaluate, and serve custom checkpoints on the same infrastructure that runs the hosted catalog. Full fine-tune or LoRA — both deploy to the same OpenAI-compatible endpoint.
Upload your dataset
JSONL with prompt/completion pairs, or any HuggingFace dataset spec. Up to 10 GB on hobby, unlimited on Team and above.
Pick a base model & method
Full fine-tune (recommended for >70B) or LoRA / QLoRA (recommended for cost-sensitive iteration). Hyperparameter recipes provided.
We train & evaluate
Distributed training on H100 / H200 clusters. Auto-evaluation on your held-out set + standard benchmarks (MMLU, GSM8K, HumanEval).
Deploy in one click
Resulting checkpoint is automatically pushed into the FireAttention runtime as a hosted model. New endpoint live in minutes.
# Upload dataset luminet datasets push ./training.jsonl \ --name customer-support-v1 # Launch a LoRA fine-tune on Llama 4 70B luminet finetune create \ --base meta/llama-4-70b \ --dataset customer-support-v1 \ --method lora \ --epochs 3 \ --lr 1e-4 # Output: # ✓ Job started: ft-job_8a2c10b # ✓ ETA: 47 minutes (4× H100) # ✓ Live logs: https://www.lumnt.com/dashboard/jobs/ft-job_8a2c10b # ✓ Auto-deploy on success: yourorg/llama-4-70b-support-v1
Methods supported
Iterating on style, persona, format. Cheap, fast.
Domain shift, new languages, deep behavior change.
Adding new knowledge to a base model on raw corpora.
Aligning model outputs with human or AI preferences.
Free tier for evaluation
Every account gets $50 in fine-tuning creditsto experiment. No card required. Upgrade when you're ready to ship.