Skip to content

Tinker: a training API for researchers and developers

Tinker lets you focus on what matters in LLM post-training — your data and algorithms — while we handle the heavy lifting of distributed training.

You write a simple loop that runs on your CPU-only machine, including the data or environment and the loss function. We figure out how to make the training work on a bunch of GPUs, doing the exact computation you specified, efficiently. To change the model you're working with, you only need to change a single string in your code.

Tinker gives you full control over the training loop and all the algorithmic details. It's not a magic black box that makes fine-tuning "easy". It's a clean abstraction that shields you from the complexity of distributed training while preserving your control.

Here's how the division of responsibilities works in practice:

You focus on You write We handle
Datasets and RL environments
Your custom training data
Simple Python script
Runs on your CPU
Efficient distributed training of large models
Qwen3.5-397B, Kimi K2.6
Training logic
Your loss functions, training loop, and evals
API calls
forward_backward()
optim_step()
sample()
save_state()
Reliability
Hardware failures handled transparently

Features

What the Tinker service currently supports:

  • Tinker lets you fine-tune open-weight models ranging from 1B to 1T+ parameters, including both dense and mixture-of-experts architectures. See Models & Pricing for the full list.
  • Tinker supports vision-language models (VLMs) like Qwen3.6-35B-A3B for image understanding tasks. See the Rendering tutorial for details.
  • Tinker implements low-rank adaptation (LoRA) fine-tuning, not full fine-tuning. However, we believe that LoRA gives the same performance as full fine-tuning for many important use cases, especially in RL (see LoRA Without Regret).
  • You can download the weights of your trained model to use outside of Tinker, for example with your inference provider of choice.

A quick look at functionality

Tinker's main functionality is contained in a few key functions:

  • forward_backward: feed in your data and loss function, and we'll compute and accumulate the gradients for you.
  • optim_step: update your model using the accumulated gradients
  • sample: Generate outputs from your trained model
  • other functions for saving and loading weights and optimizer state