merlyn/server/utils/AiProviders
jonathanortega2023 7a0c149d2e
fix: Use eval_duration for output TPS calculations in Ollama LLM provider (#4568)
* fix: Use eval_duration for output TPS calculations and add as a metric field

* refactor usage of eval_duration from ollama metrics

* move eval_duration to usage

* overwrite duration in ollama provider wip measureAsyncFunction optional param

* allow for overloaded duration in measureAsyncFunction

* simplify flow for duration tracking

---------

Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-11-20 13:02:47 -08:00
..
anthropic Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
apipie Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
azureOpenAi patch Azure image reading regressions 2025-05-12 11:10:35 -07:00
bedrock fix to support: feat2864 - using local credentials file with Amazon Bedrock (#3986) 2025-07-02 09:15:23 -07:00
cohere Reimplement Cohere models for basic chat (#4489) 2025-10-03 18:28:20 -07:00
cometapi Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
deepseek Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
dellProAiStudio Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
fireworksAi Sync models from remote for FireworksAI (#4475) 2025-10-02 12:34:05 -07:00
foundry Microsoft Foundry Local LLM provider & agent provider (#4435) 2025-10-01 20:04:13 -07:00
gemini Add stream options to Gemini LLM for usage tracking (#4466) 2025-10-01 14:00:26 -07:00
genericOpenAi Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
groq Automatic Context window detection (#3817) 2025-05-14 11:03:19 -07:00
huggingface LLM performance metric tracking (#2825) 2024-12-16 14:31:17 -08:00
koboldCPP Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
liteLLM Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
lmStudio Model context limit auto-detection for LM Studio and Ollama LLM Providers (#4468) 2025-10-02 11:54:19 -07:00
localAi Support historical message image inputs/attachments for n+1 queries (#2919) 2025-01-16 13:49:06 -08:00
mistral Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
modelMap Reimplement Cohere models for basic chat (#4489) 2025-10-03 18:28:20 -07:00
moonshotAi Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
novita Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
nvidiaNim Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
ollama fix: Use eval_duration for output TPS calculations in Ollama LLM provider (#4568) 2025-11-20 13:02:47 -08:00
openAi Patch OpenAI metrics (#4458) 2025-09-30 15:19:34 -07:00
openRouter Implement full chat and @agent chat user indentificiation for OpenRouter (#4668) 2025-11-20 12:38:43 -08:00
perplexity handling of citations in openRouter provider #3581 (#3620) 2025-04-15 10:57:09 -07:00
ppio Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
textGenWebUI Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
togetherAi Dynamic fetching of TogetherAI models (#3017) 2025-01-24 11:06:59 -08:00
xai Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00