tinker_cookbook.rl.ActionExtra
class tinker_cookbook.rl.ActionExtra(TypedDict)
Extra metadata passed alongside an action to :meth:Env.step.
Fields:
- stop_reason (tinker.StopReason) – Why sampling stopped —
"stop"(hit a stop sequence) or"length"(hit max_tokens without a stop sequence).