Accurate token usage accounting #103
Labels
No labels
bug
commercial
documentation
duplicate
enhancement
feature
good first issue
help wanted
invalid
question
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
jasoncouture/llama-shears#103
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Today token counts come from rough estimation. Replace with actual usage as reported by the provider (per request: prompt tokens, completion tokens, total) and surface it everywhere the framework reasons about context size.
Why: compaction heuristics (#11, follow-up), budget enforcement (sibling issue, runaway prevention), UI dashboards, and cost reporting all depend on this. Estimates drift, especially across providers and tokenizers.
Scope:
Pairs with: smarter compaction (#11), compactor tool (sibling), token budget / runaway prevention (sibling).