* add check for timings field on final chunk to override usage data
* refactor: extract llama.cpp timings into reusable private method

  Move timings extraction into #extractTimings so it can be shared by both streaming (handleStream) and non-streaming (getChatCompletion) code paths.

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
* lint and cleanup

---------

Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>

| | | |
|---|---|---|
| __tests__ | ||
| endpoints | ||
| jobs | ||
| middleware | ||
| models | ||
| prisma | ||
| storage | ||
| swagger | ||
| utils | ||
| .env.example | ||
| .flowconfig | ||
| .gitignore | ||
| .nvmrc | ||
| index.js | ||
| jsconfig.json | ||
| nodemon.json | ||
| package.json | ||
| yarn.lock | ||
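
The commit message above describes pulling llama.cpp timings out of a response so that they can override the reported usage data in both streaming and non-streaming paths. A minimal sketch of that idea follows. Only the `#extractTimings` name comes from the commit message. The class name `TimingsSketch`, the `resolveUsage` helper, the returned shape, and the choice of the `prompt_n`, `predicted_n`, and `predicted_per_second` fields are assumptions made for illustration, not the repository's actual implementation.

```javascript
// Hypothetical sketch: class name, helper name, and return shape are
// assumptions. Only `#extractTimings` is named in the commit message.
class TimingsSketch {
  // Map a llama.cpp-style `timings` object onto usage-style metrics.
  // Returns null when the payload carries no timings object.
  #extractTimings(payload) {
    const timings = payload?.timings;
    if (!timings || typeof timings !== "object") return null;
    return {
      prompt_tokens: timings.prompt_n ?? 0,
      completion_tokens: timings.predicted_n ?? 0,
      outputTps: timings.predicted_per_second ?? 0,
    };
  }

  // Prefer timings when present; otherwise fall back to the `usage` field.
  // The same call could serve a final stream chunk or a full response body.
  resolveUsage(payload) {
    const fromTimings = this.#extractTimings(payload);
    if (fromTimings) return fromTimings;
    return {
      prompt_tokens: payload?.usage?.prompt_tokens ?? 0,
      completion_tokens: payload?.usage?.completion_tokens ?? 0,
      outputTps: 0,
    };
  }
}
```

Keeping the extraction in one private method means the streaming and non-streaming callers cannot drift apart in how they interpret the timings fields.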