A BebelAgent owns an independent token transcript and decode cache while
sharing the loaded model weights. This mirrors upstream bebelm::agent::Agent.
Usage
bebel_agent(
model,
greedy = FALSE,
max_gen = NULL,
max_context = NULL,
max_think = NULL,
temperature = NULL,
top_k = NULL,
repeat_penalty = NULL
)