Commit Graph

589 Commits

Author SHA1 Message Date
timothycarambat
c2c4f63643 bump cdn 2025-02-05 10:30:43 -08:00
timothycarambat
a4d5b2588f o3 model patch 2025-02-03 14:19:21 -08:00
Timothy Carambat
d1ca16f7f8
Add tokenizer improvments via Singleton class and estimation (#3072)
* Add tokenizer improvments via Singleton class
linting

* dev build

* Estimation fallback when string exceeds a fixed byte size

* Add notice to tiktoken on backend
2025-01-30 17:55:03 -08:00
Timothy Carambat
e1af72daa7
parse streams from reasoning_content from DeepSeek API (#3068) 2025-01-30 14:55:20 -08:00
Timothy Carambat
8eac83bb82
Enable include_reasoning in request (#3067) 2025-01-30 14:32:38 -08:00
timothycarambat
6192080635 Update NVIDIA branding 2025-01-28 10:27:31 -08:00
Timothy Carambat
6aa1854155
Add ability to disable default agent skills (#3041)
* Add ability to disable default agent skills

* debug build
2025-01-27 16:52:43 -08:00
Sean Hatfield
75790e7e90
Remove native LLM option (#3024)
* remove native llm

* remove node-llama-cpp from dockerfile

* remove unneeded items from dockerfile

---------

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-27 13:42:52 -08:00
Jason
c757c3fb5f
feat: update novita AI logo and default model (#3037) 2025-01-27 08:41:12 -08:00
Timothy Carambat
2ca22abc9c
Add Version to AzureOpenAI (#3023) 2025-01-24 13:41:37 -08:00
Sean Hatfield
48dcb22b25
Dynamic fetching of TogetherAI models (#3017)
* implement dynamic fetching of togetherai models

* implement caching for togetherai models

* update gitignore for togetherai model caching

* Remove models.json from git tracking

* Remove .cached_at from git tracking

* lint

* revert unneeded change

---------

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-24 11:06:59 -08:00
timothycarambat
273d116586 linting 2025-01-23 16:43:18 -08:00
Sean Hatfield
57f4f46a39
Bump perplexity models (#3014)
* bump perplexity models

---------

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-23 16:35:38 -08:00
Timothy Carambat
d35b37b6ec
wip agent ui animation (#2999)
* wip agent ui animation

* WIP agent ui revision

* linting

* simplify css

* memoize agent responses

---------

Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-01-22 11:10:02 -08:00
Sean Hatfield
f8c72786df
Fix similarity score bug in lance/chroma dbs (#2986)
* fix similarity score bug in lance/chroma dbs

* batch lower bound case

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-01-17 18:27:54 -08:00
Timothy Carambat
c4f75feb08
Support historical message image inputs/attachments for n+1 queries (#2919)
* Support historical message image inputs/attachments for n+1 queries

* patch gemini

* OpenRouter vision support cleanup

* xai vision history support

* Mistral logging

---------

Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-01-16 13:49:06 -08:00
timothycarambat
a2264f46ff add index to response object for marshalling
resolves #2956
2025-01-14 14:43:00 -08:00
timothycarambat
4d5d8d4dec patch gemini embedding key preference
resolves #2972
2025-01-14 08:20:57 -08:00
Timothy Carambat
21af81085a
Add caching to Gemini /models (#2969)
rename file typo
2025-01-13 13:12:03 -08:00
ReindeerFlotilla
be886f7d61
feat: add support for voyage-3-large and voyage-code-3 embedding models (#2955)
* feat: add support for voyage-3-large and voyage-code-3 embedding models

- Add voyage-3-large and voyage-code-3 to VoyageAiOptions dropdown
- Update getMaxEmbeddingLength to support 32k context for new models
- Update .env.example with new model options

* unset env example

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-01-08 17:21:30 -08:00
timothycarambat
6134c15ca4 Add fallback URL for reranking model 2025-01-07 15:09:54 -08:00
timothycarambat
e192364d8d Migrate CDN download URL from S3 bucket 2025-01-07 12:09:14 -08:00
Sean Hatfield
c6547ec04d
Update example ENV files (#2931)
update example env files
2025-01-02 20:09:42 -08:00
Timothy Carambat
ad01df8790
Reranker option for RAG (#2929)
* Reranker WIP

* add cacheing and singleton loading

* Add field to workspaces for vectorSearchMode
Add UI for lancedb to change mode
update all search endpoints to pass in reranker prop if provider can use it

* update hint text

* When reranking, swap score to rerank score

* update optchain
2025-01-02 14:27:52 -08:00
Timothy Carambat
bb5c3b7e0d
make similarityResponse object arguments and not positional (#2930)
* make `similarityResponse` object arguments and not positional

* reuse client for qdrant
2025-01-02 12:03:26 -08:00
timothycarambat
f417ffd5e0 resolves #2926
floor ms value
2025-01-01 15:54:44 -08:00
timothycarambat
d52249aff8 support attachments via thread API chat/stream-chat endpoints 2024-12-31 09:52:21 -08:00
Timothy Carambat
ef71b95cf9
Return timestamps in OpenAI compat endpoints in proper format (#2921) 2024-12-31 09:35:52 -08:00
Chaiwat Saithongcum
fa3079bbbf
Add support for Google Generative AI (Gemini) embedder (#2895)
* Add support for Google Generative AI (Gemini) embedder

* Add missing example in docker
Fix UI key elements in options
Add Gemini to data handling section
Patch issues with chunk handling during embedding

* remove dupe in env

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-31 09:29:38 -08:00
Sean Hatfield
8d302c3f67
Patch custom models endpoint (#2903)
* prevent non admin users from changing llm settings via custom-models endpoint

* permission middleware to JSDOC

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-30 14:58:26 -08:00
Sean Hatfield
696af19c45
Patch unauthorized access to other user's pfps (#2904)
* patch unauthorized viewing of other user's pfps

* inline return responses

---------

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2024-12-30 14:36:51 -08:00
Sean Hatfield
0b7bf68f2c
Normalize paths on files uploaded to prevent arbitrary file writes (#2905)
* normalize paths on files uploaded to prevent arbitrary file writes

* force normalize path in string parse

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-30 14:29:10 -08:00
Sean Hatfield
99b6dedc8b
Fix typo in frontend for SearXNG agent setup (#2917)
fix typo in frontend for SearXNG agent setup
2024-12-30 14:11:20 -08:00
timothycarambat
4b2bb529c9 enable leftover mlock setting 2024-12-28 17:48:24 -08:00
Timothy Carambat
a51de73aaa
update ollama performance mode (#2874) 2024-12-18 11:21:35 -08:00
wolfganghuse
af703427c7
fix wrong metadata assignment in MilvusProvider (#2870)
fixed wrong metadata assignment
2024-12-18 10:33:18 -08:00
Timothy Carambat
d54b5dfc62
Remove fine-tuning flow (#2872)
remove fine-tuning flow
2024-12-18 10:24:02 -08:00
Timothy Carambat
b082c8e441
Add support for gemini authenticated models endpoint (#2868)
* Add support for gemini authenticated models endpoint
add customModels entry
add un-authed fallback to default listing
separate models by expiermental status
resolves #2866

* add back improved logic for apiVersion decision making
2024-12-17 15:20:26 -08:00
Sean Hatfield
f8885a43e4
Append writable fields to dev API new workspace endpoint (#2843)
* add writible fields to dev api new workspace endpoint

* lint

* implement validations for workspace model

* update swagger comments

* simplify validations for workspace on frontend and API

* cleanup validations

---------

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2024-12-16 15:12:20 -08:00
Timothy Carambat
dd7c4675d3
LLM performance metric tracking (#2825)
* WIP performance metric tracking

* fix: patch UI trying to .toFixed() null metric
Anthropic tracking migraiton
cleanup logs

* Apipie implmentation, not tested

* Cleanup Anthropic notes, Add support for AzureOpenAI tracking

* bedrock token metric tracking

* Cohere support

* feat: improve default stream handler to track for provider who are actually OpenAI compliant in usage reporting
add deepseek support

* feat: Add FireworksAI tracking reporting
fix: improve handler when usage:null is reported (why?)

* Add token reporting for GenericOpenAI

* token reporting for koboldcpp + lmstudio

* lint

* support Groq token tracking

* HF token tracking

* token tracking for togetherai

* LiteLLM token tracking

* linting + Mitral token tracking support

* XAI token metric reporting

* native provider runner

* LocalAI token tracking

* Novita token tracking

* OpenRouter token tracking

* Apipie stream metrics

* textwebgenui token tracking

* perplexity token reporting

* ollama token reporting

* lint

* put back comment

* Rip out LC ollama wrapper and use official library

* patch images with new ollama lib

* improve ollama offline message

* fix image handling in ollama llm provider

* lint

* NVIDIA NIM token tracking

* update openai compatbility responses

* UI/UX show/hide metrics on click for user preference

* update bedrock client

---------

Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2024-12-16 14:31:17 -08:00
Sean Hatfield
ae510619f0
Purge cached docs and remove docs from all workspaces on vectorDB/embedder changes (#2819)
* wip remove all docs clear vector db on embedder/vector db change

* purge all cached docs and remove docs from workspaces on vectordb/embedder change

* lint

* remove unneeded console log

* remove reset vector stores endpoint and move to server side updateENV with postUpdate check

* reset embed module

* remove unused import

* simplify deletion process
rescoped document deletion to be more general for speed, everything needs to be reset anyway
fixed issue where unembedded docs not in any workspaces, but cached, were not removed

* add back missing readme file
update warning text modals

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-16 12:16:20 -08:00
wolfganghuse
d145602d5a
Add attachments to GenericOpenAI prompt (#2831)
* added attachments to genericopenai prompt

* add devnote

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-16 12:03:51 -08:00
Timothy Carambat
ff024286d0
Add new workspace user management endpoint (#2842)
deprecate old endpoint which users workspace ID vs slug
resolves #2838
2024-12-16 11:50:34 -08:00
Sean Hatfield
79cdcd4e64
Add thread name to workspaces get endpoint (#2822)
add thread name to workspaces get endpoint

Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2024-12-13 17:53:54 -08:00
Sean Hatfield
f651ca8628
APIPie LLM provider improvements (#2695)
* fix apipie streaming/sort by chat models

* lint

* linting

---------

Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-13 14:18:02 -08:00
Timothy Carambat
a46eb2418a
add Gitlab to watchable documents (#2817) 2024-12-12 10:44:11 -08:00
Timothy Carambat
538078747d
Add vector search API endpoint (#2815)
* Add vector search API endpoint

* Add missing import

* Modify the data that is returned

* Change similarityThreshold to scoreThreshold

As this is what is actually returned by the search

* Removing logging (oops!)

* chore: regen swagger docs for new endpoint
fix: update function to sanity check values to prevent crashes during search

---------

Co-authored-by: Scott Bowler <scott@dcsdigital.co.uk>
2024-12-12 10:12:32 -08:00
timothycarambat
f8e91e1ffa patch gemini-2.0-key 2024-12-11 16:52:31 -08:00
timothycarambat
69b672b625 add gemini 1206 and gemini-2.0-flash exp models
connect #2788
2024-12-11 09:04:29 -08:00
Timothy Carambat
a69997a715
update chat model filters for openai (#2803) 2024-12-11 08:55:10 -08:00