michael/merlyn - merlyn - Honeywest

Author	SHA1	Message	Date
Timothy Carambat	018e0cffbd	Lazy load Lancedb (#4764 )	2025-12-11 09:50:52 -08:00
方程	90e474abcb	Support Gitee AI(LLM Provider) (#3361 ) * Support Gitee AI(LLM Provider) * refactor(server): refactor the GiteeAI model window limit feature; hardcode the window limits for now, with plans to use external API data and caching * updates for Gitee AI * use legacy lookup since gitee does not enable getting token context windows * add more missing records * reorder imports --------- Co-authored-by: 方程 <fangcheng@oschina.cn> Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 14:19:32 -08:00
Colin Perry	157e3e4b38	Feat/add openrouter embedding models (#4682 ) * implemented openrouter embedding model support * ran yarn lint * data handling entry --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 11:16:16 -08:00
Sean Hatfield	49c29fb968	Z.ai LLM & agent provider (#4573 ) * wip zai llm provider * cleanup + add zai agent provider * lint * change how caching works for failed models --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 15:57:03 -08:00
Timothy Carambat	cf76bad452	Implement full chat and `@agent` chat `user` indentificiation for OpenRouter (#4668 ) Implmenet chat and agentic chat user-id for OpenRouter resolves #4553 closes #4482	2025-11-20 12:38:43 -08:00
Timothy Carambat	22c619586b	Failover invalid vector db identifier to lanceDB (#4661 ) resolves #4640 closes #4626	2025-11-19 13:36:19 -08:00
Sean Hatfield	599a3fd8b8	Microsoft Foundry Local LLM provider & agent provider (#4435 ) * add microsoft foundry local llm and agent providers * minor change to fix early stop token + overloading of context window always use user defined window _unless_ it is larger than the models real contenxt window cache the context windows when we can from the API (0.7.)+ Unload model forcefully on model change to prevent resource hogging add back token preference since some models have very large windows and can crash a machine normalize cases --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-10-01 20:04:13 -07:00
TensorNull	5922349bb7	feat: Implement CometAPI integration for chat completions and model m… (#4379 ) * feat: Implement CometAPI integration for chat completions and model management - Added CometApiLLM class for handling chat completions using CometAPI. - Implemented model synchronization and caching mechanisms. - Introduced streaming support for chat responses with timeout handling. - Created CometApiProvider class for agent interactions with CometAPI. - Enhanced error handling and logging throughout the integration. - Established a structure for managing function calls and completions. * linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-09-16 14:38:49 -07:00
Sean Hatfield	c6e1b9c3e2	Chroma Cloud vector db provider (#4273 ) * add chroma cloud as new vector db provider * update docker example env * extend chroma class to chroma cloud * update readme --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-08-12 16:21:14 -07:00
Timothy Carambat	0fb33736da	Workspace Chat with documents overhaul (#4261 ) * Create parse endpoint in collector (#4212) * create parse endpoint in collector * revert cleanup temp util call * lint * remove unused cleanupTempDocuments function * revert slug change minor change for destinations --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> * Add parsed files table and parse server endpoints (#4222) * add workspace_parsed_files table + parse endpoints/models * remove dev api parse endpoint * remove unneeded imports * iterate over all files + remove unneeded update function + update telemetry debounce * Upload UI/UX context window check + frontend alert (#4230) * prompt user to embed if exceeds prompt window + handle embed + handle cancel * add tokenCountEstimate to workspace_parsed_files + optimizations * use util for path locations + use safeJsonParse * add modal for user decision on overflow of context window * lint * dynamic fetching of provider/model combo + inject parsed documents * remove unneeded comments * popup ui for attaching/removing files + warning to embed + wip fetching states on update * remove prop drilling, fetch files/limits directly in attach files popup * rework ux of FE + BE optimizations * fix ux of FE + BE optimizations * Implement bidirectional sync for parsed file states linting small changes and comments * move parse support to another endpoint file simplify calls and loading of records * button borders * enable default users to upload parsed files but NOT embed * delete cascade on user/workspace/thread deletion to remove parsedFileRecord * enable bgworker with "always" jobs and optional document sync jobs orphan document job: Will find any broken reference files to prevent overpollution of the storage folder. This will run 10s after boot and every 12hr after * change run timeout for orphan job to 1m to allow settling before spawning a worker * linting and cleanup pr --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com> * dev build * fix tooltip hiding during embedding overflow files * prevent crash log from ERRNO on parse files * unused import * update docs link * Migrate parsed-files to GET endpoint patch logic for grabbing models names from utils better handling for undetermined context windows (null instead of Pos_INIFI) UI placeholder for null context windows * patch URL --------- Co-authored-by: Sean Hatfield <seanhatfield5@gmail.com>	2025-08-11 09:26:19 -07:00
Sean Hatfield	6d6bd14622	Moonshot AI LLM & agent provider (#4178 ) * add moonshot ai LLM & agent provider * fix moonshot agent calling * handle attachments/fix moonshot llm provider * update docs/example env * add moonshot to onboarding privacy * add moonshot to onboarding llm preference * update privacy for moonshot ai * update logo higher res * remove caching and use modelmap	2025-07-22 09:56:51 -07:00
Timothy Carambat	c0d66e6c19	Enable UI/UX for model swapping in chat window (#3969 ) * Enable UI/UX for model swapping in chat window * forgot component * patch useGetProviders hook to set loading on change of provider * dev build * normalize translations * patch how model default is provided --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2025-06-09 09:59:17 -07:00
Timothy Carambat	378ceaecec	Support Dell Pro AI Studio provider (#3829 )	2025-05-14 15:10:48 -07:00
Timothy Carambat	e1b7f5820c	PGvector vector database support (#3788 ) * PGVector support for vector db storage * forgot files * comments * dev build * Add ENV connection and table schema validations for vector table add .reset call to drop embedding table when changing the AnythingLLM embedder update instrutions Add preCheck error reporting in UpdateENV add timeout to pg connection * update setup * update README * update doc	2025-05-09 12:27:11 -07:00
cnJasonZ	2aeb4c2961	Add new model provider PPIO (#3211 ) * feat: add new model provider PPIO * fix: fix ppio model fetching * fix: code lint * reorder LLM update interface for streaming and chats to use valid keys linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-02-27 10:53:00 -08:00
Sean Hatfield	75790e7e90	Remove native LLM option (#3024 ) * remove native llm * remove node-llama-cpp from dockerfile * remove unneeded items from dockerfile --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-01-27 13:42:52 -08:00
Timothy Carambat	ad01df8790	Reranker option for RAG (#2929 ) * Reranker WIP * add cacheing and singleton loading * Add field to workspaces for vectorSearchMode Add UI for lancedb to change mode update all search endpoints to pass in reranker prop if provider can use it * update hint text * When reranking, swap score to rerank score * update optchain	2025-01-02 14:27:52 -08:00
Chaiwat Saithongcum	fa3079bbbf	Add support for Google Generative AI (Gemini) embedder (#2895 ) * Add support for Google Generative AI (Gemini) embedder * Add missing example in docker Fix UI key elements in options Add Gemini to data handling section Patch issues with chunk handling during embedding * remove dupe in env --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-12-31 09:29:38 -08:00
Timothy Carambat	dd7c4675d3	LLM performance metric tracking (#2825 ) * WIP performance metric tracking * fix: patch UI trying to .toFixed() null metric Anthropic tracking migraiton cleanup logs * Apipie implmentation, not tested * Cleanup Anthropic notes, Add support for AzureOpenAI tracking * bedrock token metric tracking * Cohere support * feat: improve default stream handler to track for provider who are actually OpenAI compliant in usage reporting add deepseek support * feat: Add FireworksAI tracking reporting fix: improve handler when usage:null is reported (why?) * Add token reporting for GenericOpenAI * token reporting for koboldcpp + lmstudio * lint * support Groq token tracking * HF token tracking * token tracking for togetherai * LiteLLM token tracking * linting + Mitral token tracking support * XAI token metric reporting * native provider runner * LocalAI token tracking * Novita token tracking * OpenRouter token tracking * Apipie stream metrics * textwebgenui token tracking * perplexity token reporting * ollama token reporting * lint * put back comment * Rip out LC ollama wrapper and use official library * patch images with new ollama lib * improve ollama offline message * fix image handling in ollama llm provider * lint * NVIDIA NIM token tracking * update openai compatbility responses * UI/UX show/hide metrics on click for user preference * update bedrock client --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2024-12-16 14:31:17 -08:00
Sean Hatfield	ae510619f0	Purge cached docs and remove docs from all workspaces on vectorDB/embedder changes (#2819 ) * wip remove all docs clear vector db on embedder/vector db change * purge all cached docs and remove docs from workspaces on vectordb/embedder change * lint * remove unneeded console log * remove reset vector stores endpoint and move to server side updateENV with postUpdate check * reset embed module * remove unused import * simplify deletion process rescoped document deletion to be more general for speed, everything needs to be reset anyway fixed issue where unembedded docs not in any workspaces, but cached, were not removed * add back missing readme file update warning text modals --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-12-16 12:16:20 -08:00
Timothy Carambat	b2dd35fe15	Add Support for NVIDIA NIM (#2766 ) * Add Support for NVIDIA NIM * update README * linting	2024-12-05 10:38:23 -08:00
Sean Hatfield	9f38b9337b	Mistral embedding engine support (#2667 ) * add mistral embedding engine support * remove console log + fix data handling onboarding * update data handling description --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2024-11-21 11:05:55 -08:00
Timothy Carambat	80565d79e0	2488 novita ai llm integration (#2582 ) * feat: add new model provider: Novita AI * feat: finished novita AI * fix: code lint * remove unneeded logging * add back log for novita stream not self closing * Clarify ENV vars for LLM/embedder seperation for future Patch ENV check for workspace/agent provider --------- Co-authored-by: Jason <ggbbddjm@gmail.com> Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2024-11-04 11:34:29 -08:00
Timothy Carambat	5bc96bca88	Add Grok/XAI support for LLM & agents (#2517 ) * Add Grok/XAI support for LLM & agents * forgot files	2024-10-21 16:32:49 -07:00
Timothy Carambat	bce7988683	Integrate Apipie support directly (#2470 ) resolves #2464 resolves #989 Note: Streaming not supported	2024-10-15 12:36:06 -07:00
Sean Hatfield	7390bae6f6	Support DeepSeek (#2377 ) * add deepseek support * lint * update deepseek context length * add deepseek to onboarding --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2024-09-26 12:55:12 -07:00
Timothy Carambat	a30fa9b2ed	1943 add fireworksai support (#2300 ) * Issue #1943: Add support for LLM provider - Fireworks AI * Update UI selection boxes Update base AI keys for future embedder support if needed Add agent capabilites for FireworksAI * class only return --------- Co-authored-by: Aaron Van Doren <vandoren96+1@gmail.com>	2024-09-16 12:10:44 -07:00
Timothy Carambat	99f2c25b1c	Agent Context window + context window refactor. (#2126 ) * Enable agent context windows to be accurate per provider:model * Refactor model mapping to external file Add token count to document length instead of char-count refernce promptWindowLimit from AIProvider in central location * remove unused imports	2024-08-15 12:13:28 -07:00
Timothy Carambat	38fc181238	Add multimodality support (#2001 ) * Add multimodality support * Add Bedrock, KoboldCpp,LocalAI,and TextWebGenUI multi-modal * temp dev build * patch bad import * noscrolls for windows dnd * noscrolls for windows dnd * update README * update README * add multimodal check	2024-07-31 10:47:49 -07:00
Timothy Carambat	9366e69d88	Add AWS bedrock support for LLM + agents (#1935 ) add AWS bedrock support for LLM + agents	2024-07-23 16:35:37 -07:00
Timothy Carambat	0b845fbb1c	Deprecate `.isSafe` moderation (#1790 ) Add type defs to helpers	2024-06-28 15:32:30 -07:00
Sean Hatfield	e72fa8b370	[FEAT] Generic OpenAI embedding provider (#1664 ) * implement generic openai embedding provider * linting * comment & description update for generic openai embedding provider * fix privacy for generic --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-06-21 16:27:02 -07:00
Sean Hatfield	d29292ebd2	[FEAT] Add LiteLLM embedding provider support (#1579 ) * add liteLLM embedding provider support * update tooltip id --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-06-06 12:43:34 -07:00
Sean Hatfield	5bf4b4db58	[FEAT] Add support for Voyage AI embedder (#1401 ) * add support for voyageai embedder * remove unneeded import * linting * Add ENV examples Update how chunks are processed for Voyage use correct langchain import Add data handling --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2024-05-19 13:20:23 -05:00
Timothy Carambat	cae6cee1b5	Do not go through LLM to embed when embedding documents (#1428 )	2024-05-16 17:51:04 -07:00
Sean Hatfield	826ef00da3	[FEAT] LiteLLM provider support (#1424 ) * litellm LLM provider support * fix lint error * change import orders fix issue with model retrieval --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2024-05-16 13:56:28 -07:00
timothycarambat	a87978d1d9	Make LanceDB the vector database default provider in backend to prevent issues where somehow this key is not set by the user resulting in a Pinecone error even though they never said they wanted Pinecone to be their vector db	2024-05-13 12:22:53 -07:00
Sean Hatfield	977a07db86	[FEAT] Text Generation Web UI LLM provider support (#1279 ) * add text gen web ui LLM provider support * update README * README typo * update TextWebUI display name patch workspace<>model support for provider --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-05-08 11:56:30 -07:00
Sean Hatfield	fc77b46800	[FEAT] KoboldCPP LLM Support (#1268 ) * koboldcpp LLM support * update .env.examples for koboldcpp support * update LLM preference order update koboldcpp comments --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-05-02 12:12:44 -07:00
Sean Hatfield	3caebc47b4	[FEAT] Cohere LLM and embedder support (#1233 ) * getChatCompletion working WIP streaming * WIP * working streaming WIP abort stream * implement cohere embedder support * remove inputType option from cohere embedder * fix cohere LLM from not aborting stream when canceled by user * Patch Cohere implemention * add cohere to onboarding --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-05-02 10:35:50 -07:00
Timothy Carambat	df17fbda36	Add generic OpenAI endpoint support (#1178 ) * Add generic OpenAI endpoint support * allow any input for model in case provider does not support models endpoint	2024-04-23 13:06:07 -07:00
Timothy Carambat	c65f890afc	Add LMStudio embedding endpoint support (#1141 ) * Add LMStudio embedding endpoint support * update alive path check for HEAD remove commented JSX * update comment	2024-04-19 15:36:07 -07:00
Timothy Carambat	6f52a2b729	Embedder download - fallback URL (#1056 ) * Embedder download - fallback URL * improve logging for native embedder	2024-04-06 11:49:15 -07:00
Timothy Carambat	94b58249a3	Enable per-workspace provider/model combination (#1042 ) * Enable per-workspace provider/model combination * cleanup * remove resetWorkspaceChatModels and wipeWorkspaceModelPreference to prevent workspace from resetting model * add space --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2024-04-05 10:58:36 -07:00
Sean Hatfield	0634013788	[FEAT] Groq LLM support (#865 ) * Groq LLM support complete * update useGetProvidersModels for groq models * Add definiations update comments and error log reports add example envs --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-03-06 14:48:38 -08:00
Timothy Carambat	b64cb199f9	788 ollama embedder (#814 ) * Add Ollama embedder model support calls * update docs	2024-02-26 16:12:20 -08:00
Sean Hatfield	633f425206	[FEAT] OpenRouter integration (#784 ) * WIP openrouter integration * add OpenRouter options to onboarding flow and data handling * add todo to fix headers for rankings * OpenRouter LLM support complete * Fix hanging response stream with OpenRouter update tagline update comment * update timeout comment * wait for first chunk to start timer * sort OpenRouter models by organization * uppercase first letter of organization * sort grouped models by org --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-02-23 17:18:58 -08:00
Sean Hatfield	80ced5eba4	[FEAT] PerplexityAI Support (#778 ) * add LLM support for perplexity * update README & example env * fix ENV keys in example env files * slight changes for QA of perplexity support * Update Perplexity AI name --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-02-22 12:48:57 -08:00
Timothy Carambat	2bc11d3f1a	Implement support for HuggingFace Inference Endpoints (#680 )	2024-02-06 09:17:51 -08:00
Hakeem Abbas	5614e2ed30	feature: Integrate Astra as vectorDBProvider (#648 ) * feature: Integrate Astra as vectorDBProvider feature: Integrate Astra as vectorDBProvider * Update .env.example * Add env.example to docker example file Update spellcheck fo Astra Update Astra key for vector selection Update order of AstraDB options Resize Astra logo image to 330x330 Update methods of Astra to take in latest vectorDB params like TopN and more Update Astra interface to support default methods and avoid crash errors from 404 collections Update Astra interface to comply to max chunk insertion limitations Update Astra interface to dynamically set dimensionality from chunk 0 size on creation * reset workspaces --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-26 13:07:53 -08:00
