merlyn

Author	SHA1	Message	Date
Marcello Fitton	c4f19cec0e	Refactor `LLMPerformanceMonitor.measureStream()` to Use Options Object Pattern (#4786 ) * Refactor LLMPerformanceMonitor to use options object for measureStream parameters * Refactor invocations of `measureStream` to use options arguments * Change invocation of `measureStream` in anthropic provider to use options argument --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-12-16 13:10:09 -08:00
Timothy Carambat	664f466e3f	4601 log model on response (#4781 ) * add model tag to chatCompletion * add modelTag `model` to async streaming keeps default arguments for prompt token calculation where applied via explict arg * fix HF default arg * render all performance metrics as available for backward compatibility add `timestamp` to both sync/async chat methods * extract metrics string to function	2025-12-14 14:46:55 -08:00
Colin Perry	a8bdc00aba	Feat/cohere agent implementation (#4703 ) * implement cohere agent support * run yarn lint * moderize Cohere add supported langchain method redo streaming since it was not working looping of agent calls was not functioning * change default model to real model tag add case statement for model tag * remove debug * update default * only whitelist known labels --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-12-12 16:25:58 -08:00
Timothy Carambat	018e0cffbd	Lazy load Lancedb (#4764 )	2025-12-11 09:50:52 -08:00
Timothy Carambat	b96988aae9	Dell model capability patch (#4719 )	2025-12-05 13:31:15 -08:00
Timothy Carambat	92752a2a15	Improve MCP functionality (#4709 ) * Improve MCP functionality * apply ANSI strip path * attempt ARM build * improve dockerfile IO build time and migrate to ARM build * fix comment * add ability to disable MCP cooldown feature * update devbuild name * move chromium arm build patch to CDN	2025-12-05 11:01:20 -08:00
方程	90e474abcb	Support Gitee AI(LLM Provider) (#3361 ) * Support Gitee AI(LLM Provider) * refactor(server): 重构 GiteeAI 模型窗口限制功能,暂时将窗口限制硬编码,计划使用外部 API 数据和缓存 * updates for Gitee AI * use legacy lookup since gitee does not enable getting token context windows * add more missing records * reorder imports --------- Co-authored-by: 方程 <fangcheng@oschina.cn> Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 14:19:32 -08:00
Sean Hatfield	66e44f65b4	Add batch embedding support for Ollama embedding provider (#4553 ) * add batch embedding support for ollama embedding provider * lint * simplify ollama embedder input	2025-11-25 13:03:47 -08:00
Colin Perry	157e3e4b38	Feat/add openrouter embedding models (#4682 ) * implemented openrouter embedding model support * ran yarn lint * data handling entry --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-25 11:16:16 -08:00
Marcello Fitton	5716ac5ed5	Custom Default System Prompt (#4487 ) * Add Default System Prompt Management - Introduced a new route for fetching and updating the default system prompt in the backend. - Added a new Admin page for managing the default system prompt, including a form for editing and saving changes. - Updated the SettingsSidebar to include a link to the new Default System Prompt page. - Implemented fetching of available system prompt variables for use in the prompt editor. - Enhanced the ChatSettings and ChatPromptSettings components to support the new default system prompt functionality. This commit lays the groundwork for improved management of system prompts across workspaces. * Remove validation for system prompt in ChatSettings component * Add comment for system prompt in workspaces model * linting, simplify logic for default assumption * dev build --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-24 13:24:10 -08:00
Sean Hatfield	c913a2d68c	Prompt caching for Anthropic LLM and Agent providers (#4488 ) * prompt caching for anthropic llm and agent providers * add UI for control of ENV simplify implementation --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 17:17:03 -08:00
Sean Hatfield	49c29fb968	Z.ai LLM & agent provider (#4573 ) * wip zai llm provider * cleanup + add zai agent provider * lint * change how caching works for failed models --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 15:57:03 -08:00
Chetan Sarva	c169193fc4	feature: Support for AWS Bedrock API Keys (#4651 ) * feat: add AWS Bedrock API Key option to settings panel * feat: Bedrock API key auth method * fix: hide IAM note when using bedrock api key * move to camcelCase identifier for bedrock api key use linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-11-20 15:38:45 -08:00
Alex Barron	2eb5384e27	Add SerpApi web search (#4623 ) Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 15:12:15 -08:00
jonathanortega2023	7a0c149d2e	fix: Use eval_duration for output TPS calculations in Ollama LLM provider (#4568 ) * fix: Use eval_duration for output TPS calculations and add as a metric field * refactor usage of eval_duration from ollama metrics * move eval_duration to usage * overwrite duration in ollama provider wip measureAsyncFunction optional param * allow for overloaded duration in measureAsyncFunction * simplify flow for duration tracking --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com> Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-11-20 13:02:47 -08:00
Timothy Carambat	cf76bad452	Implement full chat and `@agent` chat `user` indentificiation for OpenRouter (#4668 ) Implmenet chat and agentic chat user-id for OpenRouter resolves #4553 closes #4482	2025-11-20 12:38:43 -08:00
Timothy Carambat	22c619586b	Failover invalid vector db identifier to lanceDB (#4661 ) resolves #4640 closes #4626	2025-11-19 13:36:19 -08:00
Timothy Carambat	c2e7ccc00f	Reimplement Cohere models for basic chat (#4489 ) * Reimplement Cohere models - Redo LLM implementation to grab models from endpoint and pre-filter - Migrate embedding models to also grab from remote - Add records for easy context window lookup' * fix comment	2025-10-03 18:28:20 -07:00
Timothy Carambat	8cdadd8cb3	Sync models from remote for FireworksAI (#4475 ) resolves #4474	2025-10-02 12:34:05 -07:00
Sean Hatfield	0b18ac6577	Model context limit auto-detection for LM Studio and Ollama LLM Providers (#4468 ) * auto model context limit detection for ollama llm provider * auto model context limit detection for lmstudio llm provider * Patch Ollama to function and sync context windows like Foundry * normalize how model context windows are cached from endpoint service todo: move this into global utility class with MODEL_MAP eager load models on boot to pre-cache them add performance model improvements into ollama agent as well as apply n_ctx * remove debug log --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-10-02 11:54:19 -07:00
Sean Hatfield	599a3fd8b8	Microsoft Foundry Local LLM provider & agent provider (#4435 ) * add microsoft foundry local llm and agent providers * minor change to fix early stop token + overloading of context window always use user defined window _unless_ it is larger than the models real contenxt window cache the context windows when we can from the API (0.7.)+ Unload model forcefully on model change to prevent resource hogging add back token preference since some models have very large windows and can crash a machine normalize cases --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-10-01 20:04:13 -07:00
Timothy Carambat	c8f13d5f27	Enable custom HTTP response timeout for ollama (#4448 )	2025-09-29 12:32:55 -07:00
Timothy Carambat	95557ee16f	Allow user to specify args for chromium process so they dont need SYS_ADMIN on container. (#4397 ) * allow user to specify args for chromium process so they dont need SYS_ADMIN perms * use arg flag content * update console outputs	2025-09-17 16:31:08 -07:00
Timothy Carambat	cd063af4d1	Add support for `SIMPLE_SSO_NO_LOGIN_REDIRECT` config setting (#4394 ) * Add support for `SIMPLE_SSO_NO_LOGIN_REDIRECT` config setting * reset changes on simple page * redirect at root * remove paths change	2025-09-17 12:51:00 -07:00
TensorNull	5922349bb7	feat: Implement CometAPI integration for chat completions and model m… (#4379 ) * feat: Implement CometAPI integration for chat completions and model management - Added CometApiLLM class for handling chat completions using CometAPI. - Implemented model synchronization and caching mechanisms. - Introduced streaming support for chat responses with timeout handling. - Created CometApiProvider class for agent interactions with CometAPI. - Enhanced error handling and logging throughout the integration. - Established a structure for managing function calls and completions. * linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-09-16 14:38:49 -07:00
Sean Hatfield	e31465a639	Export image support for JSON and JSONL (#4359 ) * export image support for json and jsonl * add tests and cleanup functionality * add test for convertTo prepare function * comment --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-09-03 10:30:57 -07:00
Sean Hatfield	c6e1b9c3e2	Chroma Cloud vector db provider (#4273 ) * add chroma cloud as new vector db provider * update docker example env * extend chroma class to chroma cloud * update readme --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-08-12 16:21:14 -07:00
Neha Prasad	a230a44f5c	feat: Add Exa as a Search Provider (#4258 ) * Added exa-search case to the search provider switch in web-browsing.js * Added ExaSearchOptions component for API key input * update * Patch missing image crashing UI Fix issue where ENV key did not exist or was saved on click Update copy for provider Add Docs for ENV keys for manual placements update systemssettings for returning key saved to UI --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-08-11 09:46:50 -07:00
Timothy Carambat	0fb33736da	Workspace Chat with documents overhaul (#4261 ) * Create parse endpoint in collector (#4212) * create parse endpoint in collector * revert cleanup temp util call * lint * remove unused cleanupTempDocuments function * revert slug change minor change for destinations --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> * Add parsed files table and parse server endpoints (#4222) * add workspace_parsed_files table + parse endpoints/models * remove dev api parse endpoint * remove unneeded imports * iterate over all files + remove unneeded update function + update telemetry debounce * Upload UI/UX context window check + frontend alert (#4230) * prompt user to embed if exceeds prompt window + handle embed + handle cancel * add tokenCountEstimate to workspace_parsed_files + optimizations * use util for path locations + use safeJsonParse * add modal for user decision on overflow of context window * lint * dynamic fetching of provider/model combo + inject parsed documents * remove unneeded comments * popup ui for attaching/removing files + warning to embed + wip fetching states on update * remove prop drilling, fetch files/limits directly in attach files popup * rework ux of FE + BE optimizations * fix ux of FE + BE optimizations * Implement bidirectional sync for parsed file states linting small changes and comments * move parse support to another endpoint file simplify calls and loading of records * button borders * enable default users to upload parsed files but NOT embed * delete cascade on user/workspace/thread deletion to remove parsedFileRecord * enable bgworker with "always" jobs and optional document sync jobs orphan document job: Will find any broken reference files to prevent overpollution of the storage folder. This will run 10s after boot and every 12hr after * change run timeout for orphan job to 1m to allow settling before spawning a worker * linting and cleanup pr --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com> * dev build * fix tooltip hiding during embedding overflow files * prevent crash log from ERRNO on parse files * unused import * update docs link * Migrate parsed-files to GET endpoint patch logic for grabbing models names from utils better handling for undetermined context windows (null instead of Pos_INIFI) UI placeholder for null context windows * patch URL --------- Co-authored-by: Sean Hatfield <seanhatfield5@gmail.com>	2025-08-11 09:26:19 -07:00
Timothy Carambat	a9d9f9cdfd	Add custom JWT TTL (#4234 ) resolves #4224	2025-08-01 10:39:38 -07:00
Timothy Carambat	2c19dd09ed	Native Embedder model selection (incl: Multilingual support) (#3835 ) * WIP on embedder selection TODO: apply splitting and query prefixes (if applicable) * wip on upsert * Support base model support nomic-text-embed-v1 support multilingual-e5-small Add prefixing for both embedding and query for RAG tasks Add chunking prefix to all vector dbs to apply prefix when possible Show dropdown and auto-pull on new selection * norm translations * move supported models to constants handle null seelction or invalid selection on dropdown update comments * dev * patch text splitter maximums for now * normalize translations * add tests for splitter functionality * normalize --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2025-07-22 10:07:20 -07:00
Sean Hatfield	6d6bd14622	Moonshot AI LLM & agent provider (#4178 ) * add moonshot ai LLM & agent provider * fix moonshot agent calling * handle attachments/fix moonshot llm provider * update docs/example env * add moonshot to onboarding privacy * add moonshot to onboarding llm preference * update privacy for moonshot ai * update logo higher res * remove caching and use modelmap	2025-07-22 09:56:51 -07:00
Timothy Carambat	043cb1d085	Add `model` field parameter for generic OpenAI TTS (#4171 ) resolves #4165	2025-07-17 10:28:51 -07:00
Timothy Carambat	ea956f4d77	Add ability to search workspace and threads (#4120 ) * Add ability to search workspace and threads * fix height * styling * update placeholder * Translations PR (#4122)	2025-07-10 16:42:10 -07:00
Sean Hatfield	5d60047dc7	Handle BigInt in message response (#4110 ) * wip handle bigints in message response * extend bigint protoype to handle bigint stringification + add test * unset unrelated file * update tests, simplify implementation; --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-07-10 12:33:34 -07:00
timothycarambat	db2466e179	persistence of the GENERIC_OPENAI_STREAMING_DISABLED setting	2025-07-08 14:29:47 -07:00
Timothy Carambat	2055c8accd	Flag to disable login UI and endpoints for credentialed auth (#3984 ) * Flag to disable login UI and endpoints for credentialed auth * dev build * fix translation key	2025-06-11 12:46:40 -07:00
Timothy Carambat	c0d66e6c19	Enable UI/UX for model swapping in chat window (#3969 ) * Enable UI/UX for model swapping in chat window * forgot component * patch useGetProviders hook to set loading on change of provider * dev build * normalize translations * patch how model default is provided --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2025-06-09 09:59:17 -07:00
Timothy Carambat	378ceaecec	Support Dell Pro AI Studio provider (#3829 )	2025-05-14 15:10:48 -07:00
Timothy Carambat	e1b7f5820c	PGvector vector database support (#3788 ) * PGVector support for vector db storage * forgot files * comments * dev build * Add ENV connection and table schema validations for vector table add .reset call to drop embedding table when changing the AnythingLLM embedder update instrutions Add preCheck error reporting in UpdateENV add timeout to pg connection * update setup * update README * update doc	2025-05-09 12:27:11 -07:00
Danny Steenman	5500fa2bc5	feat: support for iam roles for bedrock client (#2632 ) * feat: implement iam role auth for bedrock * fix: make client refreshes properly when switching between iam_user and iam_role * checkout agent flow * fix aiprovider for bedrock in agent use --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-05-06 13:48:15 -07:00
Tristan Stahnke	b64a77f29f	Refactor AWS Bedrock Provider for Multi-modal Support & Correct Token Limits (#3714 ) * Fixed two primary issues discovered while using AWS Bedrock with Anthropic Claude Sonnet models: - Context Window defaults to 8192 maximum, which isn't correct - Multimodal stopped working when removing langchain, which was transparently handling image_url to a format sonnet expects. * Ran `yarn lint` * Updated .env.example to have aws bedrock examples too * Refactor for readability move utils for AWS specific functionality to subfile add token output max to ENV so setting persits --------- Co-authored-by: Tristan Stahnke <tristan.stahnke+gpsec@guidepointsecurity.com> Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-05-06 12:55:24 -07:00
timothycarambat	ae092b4d4a	openai filter other niche models	2025-05-05 12:07:48 -07:00
Sean Hatfield	78e6117dd8	Filter image models from OpenAI (#3768 ) filter image models from openai llm provider	2025-05-05 12:01:04 -07:00
Sean Hatfield	8912d0f0fc	Add option to control KoboldCPP max response tokens (#3746 ) add option to control koboldcpp max response tokens	2025-05-02 14:12:06 -07:00
timothycarambat	62627fca7f	add preUpdate checks on updateENV	2025-04-29 14:44:23 -07:00
timothycarambat	15acb19a68	Telemetry offset disabled count	2025-04-29 14:10:59 -07:00
Timothy Carambat	1601eb986c	Enable bypass of ip limitations via ENV in collector processing (#3652 ) * Enable bypass of ip limitations via ENV in collector startup resolves #3625 connect #3626 * dev build * bump dockerx build action * enable runtime setting config of collector requests * comments and linting for option passing * unset * unset * update docs link * linting and docs	2025-04-21 11:10:41 -07:00
timothycarambat	f176a294f5	remove hard-coded token limit check resolves #3674	2025-04-18 12:05:45 -07:00
timothycarambat	0b1a0e0a1f	Merge branch 'master' of github.com:Mintplex-Labs/anything-llm	2025-03-04 09:08:21 -08:00

1 2 3 4 5

228 Commits