Adam Setch
d63438fa61
chore: rename Github to GitHub ( #3199 )
...
* chore: rename Github to GitHub
Signed-off-by: Adam Setch <adam.setch@outlook.com>
* chore: rename Github to GitHub
Signed-off-by: Adam Setch <adam.setch@outlook.com>
* Undo some code changes for references
---------
Signed-off-by: Adam Setch <adam.setch@outlook.com>
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-02-13 10:45:43 -08:00
Sean Hatfield
e5f3fb0892
Agent flow builder ( #3077 )
...
* wip agent builder
* refactor structure for agent builder
* improve ui for add block menu and sidebar
* lint
* node ui improvement
* handle deleting variable in all nodes
* add headers and body to apiCall node
* lint
* Agent flow builder backend (#3078 )
* wip agent builder backend
* save/load agent tasks
* lint
* refactor agent task to use uuids instead of names
* placeholder for run task
* update frontend sidebar + separate backend to agent-tasks utils
* lint
* add deleting of agent tasks
* create AgentTasks class + wip load agent tasks into aibitat
* lint
* inject + call agent tasks
* wip call agent tasks
* add llm instruction + fix api calling blocks
* add ui + backend for editing/toggling agent tasks
* lint
* add back middlewares
* disable run task + add navigate to home on logo click
* implement normalizePath to prevent path traversal
* wip make api calling more consistent
* lint
* rename all references from task to flow
* patch load flow bug when on editing page
* remove unneeded files/comments
* lint
* fix delete endpoint + rename load flows
* add move block to ui + fix api-call backend + add telemetry
* lint
* add web scraping block
* only allow admin for agent builder
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
* Move AgentFlowManager flows to static
simplify UI states
Handle LLM prompt flow when provided non-string
* delete/edit menu for agent flow panel + update flow icon
* lint
* fix open builder button hidden bug
* add tooltips to move up/down block buttons
* add tooltip to delete block
* truncate block description to fit on blocklist component
* light mode agent builder sidebar
* light mode api call block
* fix light mode styles for agent builder blocks
* agent flow fetch in UI
* sync delete flow
* agent flow ui/ux improvements
* remove unused AgentSidebar component
* comment out /run
* UI changes and updates for flow builder
* format flow panel info
* update link handling
* ui tweaks to header menu
* remove unused import
* update doc links
update block icons
* bump readme
* Patch code block header oddity
resolves #3117
* bump dev image
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-02-12 16:50:43 -08:00
Timothy Carambat
cc3d619061
Add handling to reasoning models for Generic OpenAI connector ( #3183 )
...
* Add handling to reasoning models for Generic OpenAI connector
resolves #3177
* linting
2025-02-12 10:28:44 -08:00
Timothy Carambat
d90571fa33
Add improved logging and introspection to web-browsing ( #3140 )
2025-02-06 09:23:05 -08:00
Timothy Carambat
1bfd461719
Patch PPLX streaming for timeouts ( #3130 )
...
Add in-text citations as well for PPLX token streaming
handle timeouts for stream/buffer hanging
2025-02-05 16:35:22 -08:00
Timothy Carambat
df8d34d31e
Enable num_ctx to match defined chunk length in ollama embedder ( #3129 )
...
* Enable `num_ctx` to match defined chunk length in ollama embedder
* remove console
2025-02-05 15:46:39 -08:00
Timothy Carambat
e76baacec4
Add reasoning flag for azure models with "default" fallback ( #3128 )
2025-02-05 15:22:45 -08:00
timothycarambat
a5ee6121e2
Add patch for o# models on Azure
...
connect #3023
Note: depends on user naming the deployment correctly.
2025-02-05 11:34:03 -08:00
timothycarambat
c2c4f63643
bump cdn
2025-02-05 10:30:43 -08:00
timothycarambat
a4d5b2588f
o3 model patch
2025-02-03 14:19:21 -08:00
Timothy Carambat
d1ca16f7f8
Add tokenizer improvements via Singleton class and estimation ( #3072 )
...
* Add tokenizer improvements via Singleton class
linting
* dev build
* Estimation fallback when string exceeds a fixed byte size
* Add notice to tiktoken on backend
2025-01-30 17:55:03 -08:00
Timothy Carambat
e1af72daa7
parse streams from reasoning_content from DeepSeek API ( #3068 )
2025-01-30 14:55:20 -08:00
Timothy Carambat
8eac83bb82
Enable include_reasoning in request ( #3067 )
2025-01-30 14:32:38 -08:00
timothycarambat
6192080635
Update NVIDIA branding
2025-01-28 10:27:31 -08:00
Timothy Carambat
6aa1854155
Add ability to disable default agent skills ( #3041 )
...
* Add ability to disable default agent skills
* debug build
2025-01-27 16:52:43 -08:00
Sean Hatfield
75790e7e90
Remove native LLM option ( #3024 )
...
* remove native llm
* remove node-llama-cpp from dockerfile
* remove unneeded items from dockerfile
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-27 13:42:52 -08:00
Jason
c757c3fb5f
feat: update novita AI logo and default model ( #3037 )
2025-01-27 08:41:12 -08:00
Timothy Carambat
2ca22abc9c
Add Version to AzureOpenAI ( #3023 )
2025-01-24 13:41:37 -08:00
Sean Hatfield
48dcb22b25
Dynamic fetching of TogetherAI models ( #3017 )
...
* implement dynamic fetching of togetherai models
* implement caching for togetherai models
* update gitignore for togetherai model caching
* Remove models.json from git tracking
* Remove .cached_at from git tracking
* lint
* revert unneeded change
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-24 11:06:59 -08:00
timothycarambat
273d116586
linting
2025-01-23 16:43:18 -08:00
Sean Hatfield
57f4f46a39
Bump perplexity models ( #3014 )
...
* bump perplexity models
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-01-23 16:35:38 -08:00
Timothy Carambat
d35b37b6ec
wip agent ui animation ( #2999 )
...
* wip agent ui animation
* WIP agent ui revision
* linting
* simplify css
* memoize agent responses
---------
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-01-22 11:10:02 -08:00
Sean Hatfield
f8c72786df
Fix similarity score bug in lance/chroma dbs ( #2986 )
...
* fix similarity score bug in lance/chroma dbs
* batch lower bound case
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-01-17 18:27:54 -08:00
Timothy Carambat
c4f75feb08
Support historical message image inputs/attachments for n+1 queries ( #2919 )
...
* Support historical message image inputs/attachments for n+1 queries
* patch gemini
* OpenRouter vision support cleanup
* xai vision history support
* Mistral logging
---------
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-01-16 13:49:06 -08:00
timothycarambat
a2264f46ff
add index to response object for marshalling
...
resolves #2956
2025-01-14 14:43:00 -08:00
timothycarambat
4d5d8d4dec
patch gemini embedding key preference
...
resolves #2972
2025-01-14 08:20:57 -08:00
Timothy Carambat
21af81085a
Add caching to Gemini /models ( #2969 )
...
rename file typo
2025-01-13 13:12:03 -08:00
ReindeerFlotilla
be886f7d61
feat: add support for voyage-3-large and voyage-code-3 embedding models ( #2955 )
...
* feat: add support for voyage-3-large and voyage-code-3 embedding models
- Add voyage-3-large and voyage-code-3 to VoyageAiOptions dropdown
- Update getMaxEmbeddingLength to support 32k context for new models
- Update .env.example with new model options
* unset env example
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2025-01-08 17:21:30 -08:00
timothycarambat
6134c15ca4
Add fallback URL for reranking model
2025-01-07 15:09:54 -08:00
timothycarambat
e192364d8d
Migrate CDN download URL from S3 bucket
2025-01-07 12:09:14 -08:00
Timothy Carambat
ad01df8790
Reranker option for RAG ( #2929 )
...
* Reranker WIP
* add caching and singleton loading
* Add field to workspaces for vectorSearchMode
Add UI for lancedb to change mode
update all search endpoints to pass in reranker prop if provider can use it
* update hint text
* When reranking, swap score to rerank score
* update optchain
2025-01-02 14:27:52 -08:00
Timothy Carambat
bb5c3b7e0d
make similarityResponse object arguments and not positional ( #2930 )
...
* make `similarityResponse` object arguments and not positional
* reuse client for qdrant
2025-01-02 12:03:26 -08:00
timothycarambat
f417ffd5e0
resolves #2926
...
floor ms value
2025-01-01 15:54:44 -08:00
Timothy Carambat
ef71b95cf9
Return timestamps in OpenAI compat endpoints in proper format ( #2921 )
2024-12-31 09:35:52 -08:00
Chaiwat Saithongcum
fa3079bbbf
Add support for Google Generative AI (Gemini) embedder ( #2895 )
...
* Add support for Google Generative AI (Gemini) embedder
* Add missing example in docker
Fix UI key elements in options
Add Gemini to data handling section
Patch issues with chunk handling during embedding
* remove dupe in env
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-31 09:29:38 -08:00
Sean Hatfield
8d302c3f67
Patch custom models endpoint ( #2903 )
...
* prevent non admin users from changing llm settings via custom-models endpoint
* permission middleware to JSDOC
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-30 14:58:26 -08:00
Sean Hatfield
0b7bf68f2c
Normalize paths on files uploaded to prevent arbitrary file writes ( #2905 )
...
* normalize paths on files uploaded to prevent arbitrary file writes
* force normalize path in string parse
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-30 14:29:10 -08:00
timothycarambat
4b2bb529c9
enable leftover mlock setting
2024-12-28 17:48:24 -08:00
Timothy Carambat
a51de73aaa
update ollama performance mode ( #2874 )
2024-12-18 11:21:35 -08:00
wolfganghuse
af703427c7
fix wrong metadata assignment in MilvusProvider ( #2870 )
...
fixed wrong metadata assignment
2024-12-18 10:33:18 -08:00
Timothy Carambat
b082c8e441
Add support for gemini authenticated models endpoint ( #2868 )
...
* Add support for gemini authenticated models endpoint
add customModels entry
add un-authed fallback to default listing
separate models by experimental status
resolves #2866
* add back improved logic for apiVersion decision making
2024-12-17 15:20:26 -08:00
Timothy Carambat
dd7c4675d3
LLM performance metric tracking ( #2825 )
...
* WIP performance metric tracking
* fix: patch UI trying to .toFixed() null metric
Anthropic tracking migration
cleanup logs
* Apipie implementation, not tested
* Cleanup Anthropic notes, Add support for AzureOpenAI tracking
* bedrock token metric tracking
* Cohere support
* feat: improve default stream handler to track usage for providers that are actually OpenAI compliant in usage reporting
add deepseek support
* feat: Add FireworksAI tracking reporting
fix: improve handler when usage:null is reported (why?)
* Add token reporting for GenericOpenAI
* token reporting for koboldcpp + lmstudio
* lint
* support Groq token tracking
* HF token tracking
* token tracking for togetherai
* LiteLLM token tracking
* linting + Mistral token tracking support
* XAI token metric reporting
* native provider runner
* LocalAI token tracking
* Novita token tracking
* OpenRouter token tracking
* Apipie stream metrics
* textgenwebui token tracking
* perplexity token reporting
* ollama token reporting
* lint
* put back comment
* Rip out LC ollama wrapper and use official library
* patch images with new ollama lib
* improve ollama offline message
* fix image handling in ollama llm provider
* lint
* NVIDIA NIM token tracking
* update openai compatibility responses
* UI/UX show/hide metrics on click for user preference
* update bedrock client
---------
Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2024-12-16 14:31:17 -08:00
Sean Hatfield
ae510619f0
Purge cached docs and remove docs from all workspaces on vectorDB/embedder changes ( #2819 )
...
* wip remove all docs clear vector db on embedder/vector db change
* purge all cached docs and remove docs from workspaces on vectordb/embedder change
* lint
* remove unneeded console log
* remove reset vector stores endpoint and move to server side updateENV with postUpdate check
* reset embed module
* remove unused import
* simplify deletion process
rescoped document deletion to be more general for speed, everything needs to be reset anyway
fixed issue where unembedded docs not in any workspaces, but cached, were not removed
* add back missing readme file
update warning text modals
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-16 12:16:20 -08:00
wolfganghuse
d145602d5a
Add attachments to GenericOpenAI prompt ( #2831 )
...
* added attachments to genericopenai prompt
* add devnote
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-16 12:03:51 -08:00
Sean Hatfield
f651ca8628
APIPie LLM provider improvements ( #2695 )
...
* fix apipie streaming/sort by chat models
* lint
* linting
---------
Co-authored-by: timothycarambat <rambat1010@gmail.com>
2024-12-13 14:18:02 -08:00
timothycarambat
f8e91e1ffa
patch gemini-2.0-key
2024-12-11 16:52:31 -08:00
timothycarambat
69b672b625
add gemini 1206 and gemini-2.0-flash exp models
...
connect #2788
2024-12-11 09:04:29 -08:00
Timothy Carambat
a69997a715
update chat model filters for openai ( #2803 )
2024-12-11 08:55:10 -08:00
timothycarambat
4b09a06590
persist token window for NIM and not only on model change
2024-12-05 11:57:07 -08:00
Timothy Carambat
b2dd35fe15
Add Support for NVIDIA NIM ( #2766 )
...
* Add Support for NVIDIA NIM
* update README
* linting
2024-12-05 10:38:23 -08:00