Timothy Carambat
66b4bf2679
Add support for Anthropics /model endpoint (finally) ( #3376 )
...
* Add support for Anthropics /model endpoint (finally)
* dev
2025-02-28 13:29:43 -08:00
cnJasonZ
2aeb4c2961
Add new model provider PPIO ( #3211 )
...
* feat: add new model provider PPIO
* fix: fix ppio model fetching
* fix: code lint
* reorder LLM
update interface for streaming and chats to use valid keys
linting
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-02-27 10:53:00 -08:00
Skanda Kaashyap
d1354caccb
[FEAT] Add claude-3-7 ( #3337 )
...
* add claude 3-7 sonnet
* made all the changes everywhere
* add 3-7-sonnet-latest model
* lint
---------
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-02-25 12:52:17 -08:00
timothycarambat
12b43256a0
lint
2025-02-18 20:49:40 -08:00
Sushanth Srivatsa
3fd0fe8fc5
2749 ollama client auth token ( #3005 )
...
* ollama auth token provision
* auth token provision
* ollama auth provision
* ollama auth token
* ollama auth provision
* token input field css fix
* Fix provider handler not using key
sensible fallback to not break existing installs
re-order of input fields
null-check for API key and header optional insert on request
linting
* apply header and auth to agent invocations
* upgrading to ollama 5.10 for passing headers to constructor
* rename Auth systemSetting key to be more descriptive
linting and copy
* remove untracked files + update gitignore
* remove debug
* patch lockfile
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-02-18 16:00:17 -08:00
Timothy Carambat
cc3d619061
Add handling to reasoning models for Generic OpenAI connector ( #3183 )
...
* Add handling to resoning models for Generic OpenAI connector
resolves #3177
* linting
2025-02-12 10:28:44 -08:00
Timothy Carambat
1bfd461719
Patch PPLX streaming for timeouts ( #3130 )
...
Add in-text citations as well for PPLX token streaming
handle timeouts for stream/buffer hanging
2025-02-05 16:35:22 -08:00
Timothy Carambat
e76baacec4
Add reasoning flag for azure models with "default" fallback ( #3128 )
2025-02-05 15:22:45 -08:00
timothycarambat
a5ee6121e2
Add patch for o# models on Azure
...
connect #3023
Note: depends on user naming the deployment correctly.
2025-02-05 11:34:03 -08:00
timothycarambat
a4d5b2588f
o3 model patch
2025-02-03 14:19:21 -08:00
Timothy Carambat
d1ca16f7f8
Add tokenizer improvments via Singleton class and estimation ( #3072 )
...
* Add tokenizer improvments via Singleton class
linting
* dev build
* Estimation fallback when string exceeds a fixed byte size
* Add notice to tiktoken on backend
2025-01-30 17:55:03 -08:00
Timothy Carambat
e1af72daa7
parse streams from reasoning_content from DeepSeek API ( #3068 )
2025-01-30 14:55:20 -08:00
Timothy Carambat
8eac83bb82
Enable include_reasoning in request ( #3067 )
2025-01-30 14:32:38 -08:00
timothycarambat
6192080635
Update NVIDIA branding
2025-01-28 10:27:31 -08:00
Sean Hatfield
75790e7e90
Remove native LLM option ( #3024 )
...
* remove native llm
* remove node-llama-cpp from dockerfile
* remove unneeded items from dockerfile
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-27 13:42:52 -08:00
Jason
c757c3fb5f
feat: update novita AI logo and default model ( #3037 )
2025-01-27 08:41:12 -08:00
Timothy Carambat
2ca22abc9c
Add Version to AzureOpenAI ( #3023 )
2025-01-24 13:41:37 -08:00
Sean Hatfield
48dcb22b25
Dynamic fetching of TogetherAI models ( #3017 )
...
* implement dynamic fetching of togetherai models
* implement caching for togetherai models
* update gitignore for togetherai model caching
* Remove models.json from git tracking
* Remove .cached_at from git tracking
* lint
* revert unneeded change
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-24 11:06:59 -08:00
timothycarambat
273d116586
linting
2025-01-23 16:43:18 -08:00
Sean Hatfield
57f4f46a39
Bump perplexity models ( #3014 )
...
* bump perplexity models
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-23 16:35:38 -08:00
Timothy Carambat
c4f75feb08
Support historical message image inputs/attachments for n+1 queries ( #2919 )
...
* Support historical message image inputs/attachments for n+1 queries
* patch gemini
* OpenRouter vision support cleanup
* xai vision history support
* Mistral logging
---------
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-01-16 13:49:06 -08:00
Timothy Carambat
21af81085a
Add caching to Gemini /models ( #2969 )
...
rename file typo
2025-01-13 13:12:03 -08:00
timothycarambat
4b2bb529c9
enable leftover mlock setting
2024-12-28 17:48:24 -08:00
Timothy Carambat
a51de73aaa
update ollama performance mode ( #2874 )
2024-12-18 11:21:35 -08:00
Timothy Carambat
b082c8e441
Add support for gemini authenticated models endpoint ( #2868 )
...
* Add support for gemini authenticated models endpoint
add customModels entry
add un-authed fallback to default listing
separate models by expiermental status
resolves #2866
* add back improved logic for apiVersion decision making
2024-12-17 15:20:26 -08:00
Timothy Carambat
dd7c4675d3
LLM performance metric tracking ( #2825 )
...
* WIP performance metric tracking
* fix: patch UI trying to .toFixed() null metric
Anthropic tracking migraiton
cleanup logs
* Apipie implmentation, not tested
* Cleanup Anthropic notes, Add support for AzureOpenAI tracking
* bedrock token metric tracking
* Cohere support
* feat: improve default stream handler to track for provider who are actually OpenAI compliant in usage reporting
add deepseek support
* feat: Add FireworksAI tracking reporting
fix: improve handler when usage:null is reported (why?)
* Add token reporting for GenericOpenAI
* token reporting for koboldcpp + lmstudio
* lint
* support Groq token tracking
* HF token tracking
* token tracking for togetherai
* LiteLLM token tracking
* linting + Mitral token tracking support
* XAI token metric reporting
* native provider runner
* LocalAI token tracking
* Novita token tracking
* OpenRouter token tracking
* Apipie stream metrics
* textwebgenui token tracking
* perplexity token reporting
* ollama token reporting
* lint
* put back comment
* Rip out LC ollama wrapper and use official library
* patch images with new ollama lib
* improve ollama offline message
* fix image handling in ollama llm provider
* lint
* NVIDIA NIM token tracking
* update openai compatbility responses
* UI/UX show/hide metrics on click for user preference
* update bedrock client
---------
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2024-12-16 14:31:17 -08:00
wolfganghuse
d145602d5a
Add attachments to GenericOpenAI prompt ( #2831 )
...
* added attachments to genericopenai prompt
* add devnote
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-16 12:03:51 -08:00
Sean Hatfield
f651ca8628
APIPie LLM provider improvements ( #2695 )
...
* fix apipie streaming/sort by chat models
* lint
* linting
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-13 14:18:02 -08:00
timothycarambat
f8e91e1ffa
patch gemini-2.0-key
2024-12-11 16:52:31 -08:00
timothycarambat
69b672b625
add gemini 1206 and gemini-2.0-flash exp models
...
connect #2788
2024-12-11 09:04:29 -08:00
Timothy Carambat
a69997a715
update chat model filters for openai ( #2803 )
2024-12-11 08:55:10 -08:00
timothycarambat
4b09a06590
persist token window for NIM and not only on model change
2024-12-05 11:57:07 -08:00
Timothy Carambat
b2dd35fe15
Add Support for NVIDIA NIM ( #2766 )
...
* Add Support for NVIDIA NIM
* update README
* linting
2024-12-05 10:38:23 -08:00
timothycarambat
62be0cd0c5
add gemini-exp-1121 to expiermental set
2024-11-22 09:36:44 -08:00
timothycarambat
246152c024
Add gemini-exp-1121
...
resolves #2657
2024-11-21 11:02:43 -08:00
Sean Hatfield
55fc9cd6b1
TogetherAI Llama 3.2 vision models support ( #2666 )
...
* togetherai llama 3.2 vision models support
* remove console log
* fix listing to reflect what is on the chart
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-11-21 10:42:42 -08:00
Timothy Carambat
26e2d8cc3b
Add more expiermental models from Gemini ( #2663 )
2024-11-20 09:52:33 -08:00
timothycarambat
af16332c41
remove dupe key in ModelMap
2024-11-19 20:20:28 -08:00
Sean Hatfield
e29f054706
Bump TogetherAI models ( #2645 )
...
* bump together ai models
* Run post-bump command
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-11-18 13:08:26 -08:00
Sean Hatfield
27b07d46b3
Patch bad models endpoint path in LM Studio embedding engine ( #2628 )
...
* patch bad models endpoint path in lm studio embedding engine
* convert to OpenAI wrapper compatibility
* add URL force parser/validation for LMStudio connections
* remove comment
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-11-13 12:34:42 -08:00
timothycarambat
5aa79128f7
bump Anthropic models
2024-11-06 08:14:08 -08:00
Timothy Carambat
80565d79e0
2488 novita ai llm integration ( #2582 )
...
* feat: add new model provider: Novita AI
* feat: finished novita AI
* fix: code lint
* remove unneeded logging
* add back log for novita stream not self closing
* Clarify ENV vars for LLM/embedder seperation for future
Patch ENV check for workspace/agent provider
---------
Co-authored-by: Jason <ggbbddjm@gmail.com>
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2024-11-04 11:34:29 -08:00
Timothy Carambat
dd2756b570
add sessionToken validation connection auth for AWSbedrock ( #2554 )
2024-10-29 16:34:52 -07:00
Timothy Carambat
5bc96bca88
Add Grok/XAI support for LLM & agents ( #2517 )
...
* Add Grok/XAI support for LLM & agents
* forgot files
2024-10-21 16:32:49 -07:00
Timothy Carambat
446164d7b9
Add Groq vision preview support ( #2511 )
...
Adds support for only the llama3.2 vision models on groq. This comes with many conditionals and nuances to handle as Groqs vision implemention is quite bad right now
2024-10-21 12:37:39 -07:00
Timothy Carambat
7342839e77
Passthrough agentModel for LMStudio ( #2499 )
2024-10-18 11:44:48 -07:00
Timothy Carambat
93d7ce6d34
Handle Bedrock models that cannot use system prompts ( #2489 )
2024-10-16 12:31:04 -07:00
Sean Hatfield
fa528e0cf3
OpenAI o1 model support ( #2427 )
...
* support openai o1 models
* Prevent O1 use for agents
getter for isO1Model;
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-10-15 19:42:13 -07:00
Sean Hatfield
6674e5aab8
Support free-form input for workspace model for providers with no /models endpoint ( #2397 )
...
* support generic openai workspace model
* Update UI for free form input for some providers
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2024-10-15 15:24:44 -07:00
Timothy Carambat
bce7988683
Integrate Apipie support directly ( #2470 )
...
resolves #2464
resolves #989
Note: Streaming not supported
2024-10-15 12:36:06 -07:00