tinker_cookbook.supervised.Config

class tinker_cookbook.supervised.Config(**)

Configuration for supervised fine-tuning.

Fields:

  • log_path (str)
  • model_name (str)
  • load_checkpoint_path (str | None) – Default: None
  • renderer_name (str | None) – Default: None
  • dataset_builder (SupervisedDatasetBuilder)
  • learning_rate (float) – Default: 0.0001
  • lr_schedule (LRSchedule) – Default: 'linear'
  • num_epochs (int) – Default: 1
  • lora_rank (int) – Default: 32
  • base_url (str | None) – Default: None
  • evaluator_builders (list[EvaluatorBuilder]) – First of the checkpointing and evaluation fields. For the *_every fields in this group, 0 disables the action. Default: []
  • infrequent_evaluator_builders (list[EvaluatorBuilder]) – Default: []
  • save_every (int) – Default: 20
  • eval_every (int) – Default: 10
  • infrequent_eval_every (int) – Default: 100
  • ttl_seconds (int | None) – Default: 604800 (7 days)
  • rolling_save_every (int) – Interval for rolling checkpoint saves. A rolling save skips the sampler-weight export, making it cheaper than a periodic checkpoint. Default: 0 (disabled)
  • rolling_ttl_seconds (int) – Default: 7200 (2 hours)
  • adam_beta1 (float) – Default: 0.9
  • adam_beta2 (float) – Default: 0.95
  • adam_eps (float) – Default: 1e-08
  • wandb_project (str | None) – Default: None
  • wandb_name (str | None) – Default: None
  • enable_trace (bool) – Default: False
  • span_chart_every (int) – Default: 0
  • max_steps (int | None) – Maximum number of training steps. If None, training runs for num_epochs * n_batches steps. Default: None
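
The interaction between max_steps, num_epochs, and the *_every intervals can be sketched in plain Python. This is an illustration only, not code from tinker_cookbook: the helper names total_steps and fires_at are hypothetical, and two behaviors are assumed rather than documented here, namely that max_steps acts as an upper bound on num_epochs * n_batches and that a periodic action fires on exact multiples of its interval, with steps counted from 1.

```python
from typing import Optional


def total_steps(num_epochs: int, n_batches: int, max_steps: Optional[int] = None) -> int:
    """Number of training steps implied by the fields above (hypothetical helper).

    Per the max_steps description, None means num_epochs * n_batches.
    Treating max_steps as a cap on that product is an assumption.
    """
    full_run = num_epochs * n_batches
    return full_run if max_steps is None else min(full_run, max_steps)


def fires_at(step: int, every: int) -> bool:
    """Whether a periodic action with interval `every` runs at `step` (hypothetical helper).

    0 disables the action, as stated for the *_every fields. Firing on exact
    multiples of the interval is an assumption of this sketch.
    """
    return every > 0 and step % every == 0


# One epoch over 250 batches, no step cap.
steps = total_steps(num_epochs=1, n_batches=250)

# Steps at which an action with interval 100 (the infrequent_eval_every default) would run.
infrequent_eval_steps = [s for s in range(1, steps + 1) if fires_at(s, every=100)]

# rolling_save_every defaults to 0, so no rolling saves are scheduled.
rolling_save_steps = [s for s in range(1, steps + 1) if fires_at(s, every=0)]

print(steps, infrequent_eval_steps, rolling_save_steps)
```

Under these assumptions, passing max_steps=120 to total_steps would shorten the same run to 120 steps, leaving a single infrequent evaluation at step 100.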