Skip to content

tinker_cookbook.rl.ActionExtra

class tinker_cookbook.rl.ActionExtra(TypedDict)

Extra metadata passed alongside an action to :meth:Env.step.

Fields:

  • stop_reason (tinker.StopReason) – Why sampling stopped — "stop" (hit a stop sequence) or "length" (hit max_tokens without a stop sequence).