Generate a single ChatML assistant reply
Usage
bebel_chat(
model,
message,
greedy = FALSE,
on_event = bebel_console_event(),
check_interrupt = TRUE,
max_gen = NULL,
max_context = NULL,
max_think = NULL,
temperature = NULL,
top_k = NULL,
repeat_penalty = NULL
)Arguments
- model
A
BebelModelobject.- message
User message.
- greedy
Use deterministic greedy decoding.
- on_event
Event callback, named list of event-specific handlers, or
NULL. Event types arebebel_event_types(). Delta events containdelta,id, andindex; final events contain accumulatedcontentortext. Usebebel_console_event()for live console output.- check_interrupt
Check for Ctrl-C during prefill and before every decoded token.
- max_gen, max_context, max_think
Optional generation limits.
- temperature, top_k, repeat_penalty
Optional sampling settings.