merlyn/server/utils/AiProviders
Sean Hatfield c913a2d68c
Prompt caching for Anthropic LLM and Agent providers (#4488)
* prompt caching for anthropic llm and agent providers

* add UI for control of ENV
simplify implementation

---------

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-11-20 17:17:03 -08:00
anthropic Prompt caching for Anthropic LLM and Agent providers (#4488) 2025-11-20 17:17:03 -08:00
apipie Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
azureOpenAi patch Azure image reading regressions 2025-05-12 11:10:35 -07:00
bedrock feature: Support for AWS Bedrock API Keys (#4651) 2025-11-20 15:38:45 -08:00
cohere Reimplement Cohere models for basic chat (#4489) 2025-10-03 18:28:20 -07:00
cometapi Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
deepseek Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
dellProAiStudio Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
fireworksAi Sync models from remote for FireworksAI (#4475) 2025-10-02 12:34:05 -07:00
foundry Microsoft Foundry Local LLM provider & agent provider (#4435) 2025-10-01 20:04:13 -07:00
gemini Add stream options to Gemini LLM for usage tracking (#4466) 2025-10-01 14:00:26 -07:00
genericOpenAi Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
groq Automatic Context window detection (#3817) 2025-05-14 11:03:19 -07:00
huggingface LLM performance metric tracking (#2825) 2024-12-16 14:31:17 -08:00
koboldCPP Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
liteLLM Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
lmStudio Simplify cache condition for LMStudio and Ollama to prevent race condition (#4669) 2025-11-20 16:32:02 -08:00
localAi Support historical message image inputs/attachments for n+1 queries (#2919) 2025-01-16 13:49:06 -08:00
mistral Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
modelMap Z.ai LLM & agent provider (#4573) 2025-11-20 15:57:03 -08:00
moonshotAi Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
novita Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
nvidiaNim Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
ollama Simplify cache condition for LMStudio and Ollama to prevent race condition (#4669) 2025-11-20 16:32:02 -08:00
openAi Patch OpenAI metrics (#4458) 2025-09-30 15:19:34 -07:00
openRouter Implement full chat and @agent chat user identification for OpenRouter (#4668) 2025-11-20 12:38:43 -08:00
perplexity handling of citations in openRouter provider #3581 (#3620) 2025-04-15 10:57:09 -07:00
ppio Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
textGenWebUI Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
togetherAi Dynamic fetching of TogetherAI models (#3017) 2025-01-24 11:06:59 -08:00
xai Refactor Class Name Logging (#4426) 2025-09-25 15:34:19 -10:00
zai Z.ai LLM & agent provider (#4573) 2025-11-20 15:57:03 -08:00