michael/merlyn - merlyn - Honeywest

michael/merlyn

Author	SHA1	Message	Date
Timothy Carambat	b3944eb50e	Revert "Add automatic chat mode with native tool calling support (#5140 )" - Need to support documents in agents - Need to support images in agent mode This reverts commit `4c69960dca`.	2026-03-04 15:29:41 -08:00
Timothy Carambat	4c69960dca	Add automatic chat mode with native tool calling support (#5140 ) Introduces a new automatic chat mode (now the default) that automatically invokes tools when the provider supports native tool calling. Conditionally shows/hides the @agent command based on whether native tooling is available. - Add supportsNativeToolCalling() to AI providers (OpenAI, Anthropic, Azure always support; others opt-in via ENV) - Update all locale translations with new mode descriptions - Enhance translator to preserve Trans component tags - Remove deprecated ability tags UI	2026-03-04 14:34:30 -08:00
Timothy Carambat	cf76bad452	Implement full chat and `@agent` chat `user` indentificiation for OpenRouter (#4668 ) Implmenet chat and agentic chat user-id for OpenRouter resolves #4553 closes #4482	2025-11-20 12:38:43 -08:00
Timothy Carambat	0fb33736da	Workspace Chat with documents overhaul (#4261 ) * Create parse endpoint in collector (#4212) * create parse endpoint in collector * revert cleanup temp util call * lint * remove unused cleanupTempDocuments function * revert slug change minor change for destinations --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> * Add parsed files table and parse server endpoints (#4222) * add workspace_parsed_files table + parse endpoints/models * remove dev api parse endpoint * remove unneeded imports * iterate over all files + remove unneeded update function + update telemetry debounce * Upload UI/UX context window check + frontend alert (#4230) * prompt user to embed if exceeds prompt window + handle embed + handle cancel * add tokenCountEstimate to workspace_parsed_files + optimizations * use util for path locations + use safeJsonParse * add modal for user decision on overflow of context window * lint * dynamic fetching of provider/model combo + inject parsed documents * remove unneeded comments * popup ui for attaching/removing files + warning to embed + wip fetching states on update * remove prop drilling, fetch files/limits directly in attach files popup * rework ux of FE + BE optimizations * fix ux of FE + BE optimizations * Implement bidirectional sync for parsed file states linting small changes and comments * move parse support to another endpoint file simplify calls and loading of records * button borders * enable default users to upload parsed files but NOT embed * delete cascade on user/workspace/thread deletion to remove parsedFileRecord * enable bgworker with "always" jobs and optional document sync jobs orphan document job: Will find any broken reference files to prevent overpollution of the storage folder. This will run 10s after boot and every 12hr after * change run timeout for orphan job to 1m to allow settling before spawning a worker * linting and cleanup pr --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com> * dev build * fix tooltip hiding during embedding overflow files * prevent crash log from ERRNO on parse files * unused import * update docs link * Migrate parsed-files to GET endpoint patch logic for grabbing models names from utils better handling for undetermined context windows (null instead of Pos_INIFI) UI placeholder for null context windows * patch URL --------- Co-authored-by: Sean Hatfield <seanhatfield5@gmail.com>	2025-08-11 09:26:19 -07:00
Sean Hatfield	f3ea21bcd1	Prompt variables (#3359 ) * wip prompt variables * refactor backend + add popup suggestions menu to frontend * use processString to replace all variables in system prompts * update translations * fix translations * wip highlight variables * revert accidental name change * rename everything, remove translations * Update prompt var UI and backend logic * Update form handler linting * linting * normalize all translation files for prompt variables * prompt vars dev image --------- Co-authored-by: Timothy Carambat <rambat1010@gmail.com>	2025-03-25 12:44:19 -07:00
Sean Hatfield	5785a705cf	Enable use of @agent in slash commands (#3508 ) * allow @agent in slash commands * make prompt input focused on slash command click * lint --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2025-03-24 11:35:48 -07:00
Timothy Carambat	ad01df8790	Reranker option for RAG (#2929 ) * Reranker WIP * add cacheing and singleton loading * Add field to workspaces for vectorSearchMode Add UI for lancedb to change mode update all search endpoints to pass in reranker prop if provider can use it * update hint text * When reranking, swap score to rerank score * update optchain	2025-01-02 14:27:52 -08:00
Timothy Carambat	dd7c4675d3	LLM performance metric tracking (#2825 ) * WIP performance metric tracking * fix: patch UI trying to .toFixed() null metric Anthropic tracking migraiton cleanup logs * Apipie implmentation, not tested * Cleanup Anthropic notes, Add support for AzureOpenAI tracking * bedrock token metric tracking * Cohere support * feat: improve default stream handler to track for provider who are actually OpenAI compliant in usage reporting add deepseek support * feat: Add FireworksAI tracking reporting fix: improve handler when usage:null is reported (why?) * Add token reporting for GenericOpenAI * token reporting for koboldcpp + lmstudio * lint * support Groq token tracking * HF token tracking * token tracking for togetherai * LiteLLM token tracking * linting + Mitral token tracking support * XAI token metric reporting * native provider runner * LocalAI token tracking * Novita token tracking * OpenRouter token tracking * Apipie stream metrics * textwebgenui token tracking * perplexity token reporting * ollama token reporting * lint * put back comment * Rip out LC ollama wrapper and use official library * patch images with new ollama lib * improve ollama offline message * fix image handling in ollama llm provider * lint * NVIDIA NIM token tracking * update openai compatbility responses * UI/UX show/hide metrics on click for user preference * update bedrock client --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2024-12-16 14:31:17 -08:00
Timothy Carambat	38fc181238	Add multimodality support (#2001 ) * Add multimodality support * Add Bedrock, KoboldCpp,LocalAI,and TextWebGenUI multi-modal * temp dev build * patch bad import * noscrolls for windows dnd * noscrolls for windows dnd * update README * update README * add multimodal check	2024-07-31 10:47:49 -07:00
Timothy Carambat	0b845fbb1c	Deprecate `.isSafe` moderation (#1790 ) Add type defs to helpers	2024-06-28 15:32:30 -07:00
Sean Hatfield	c2523a9593	[FEAT] Persist query mode refusal responses as chat history (#1727 ) * log query refusals to workspace chats but hide in ui * linting --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-06-20 15:44:19 -07:00
Timothy Carambat	13fb63930b	Improve RAG responses via source backfilling (#1477 ) * Improve RAG responses via source backfilling * Hide irrelevant citations from UI	2024-05-23 09:56:57 -07:00
Sean Hatfield	d36c3ff8b2	[FEAT] Slash templates (#1314 ) * WIP slash presets * WIP slash command customization CRUD + validations complete * backend slash command support * fix permission setting on new slash commands rework form submit and pattern on frontend * Add field updates for hooks, required=true to field add user<>command constraint to keep them unique enforce uniquness via teritary uid field on table for multi and non-multi user * reset migration --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-05-10 12:35:33 -07:00
Sean Hatfield	d02013fd71	[FIX] Document pinning does not count in query mode (#1250 ) * if document is pinned, do not give queryRefusalResponse message * forgot embed.js patch --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-05-02 10:27:09 -07:00
Timothy Carambat	894f727903	Remove restrictions on pinned documents to use more context (#1248 ) * Remove restrictions on pinned documents to use more contet * update comment	2024-05-01 13:32:52 -07:00
Timothy Carambat	42e1d8e8ce	Customize refusal response for `query` mode (#1243 ) * Customize refusal response for `query` mode * remove border for desktop	2024-04-30 16:14:30 -07:00
Timothy Carambat	9655880cf0	Update all vector dbs to filter duplicate source documents that may be pinned (#1122 ) * Update all vector dbs to filter duplicate parents * cleanup	2024-04-17 18:04:39 -07:00
Timothy Carambat	f9ac27e9a4	Handle Anthropic streamable errors (#1113 )	2024-04-16 16:25:32 -07:00
Timothy Carambat	a5bb77f97a	Agent support for `@agent` default agent inside workspace chat (#1093 ) V1 of agent support via built-in `@agent` that can be invoked alongside normal workspace RAG chat.	2024-04-16 10:50:10 -07:00
Timothy Carambat	94b58249a3	Enable per-workspace provider/model combination (#1042 ) * Enable per-workspace provider/model combination * cleanup * remove resetWorkspaceChatModels and wipeWorkspaceModelPreference to prevent workspace from resetting model * add space --------- Co-authored-by: shatfield4 <seanhatfield5@gmail.com>	2024-04-05 10:58:36 -07:00
Timothy Carambat	791c0ee9dc	Enable ability to do full-text query on documents (#758 ) * Enable ability to do full-text query on documents Show alert modal on first pin for client Add ability to use pins in stream/chat/embed * typo and copy update * simplify spread of context and sources	2024-02-21 13:15:45 -08:00
Timothy Carambat	c59ab9da0a	Refactor LLM chat backend (#717 ) * refactor stream/chat/embed-stram to be a single execution logic path so that it is easier to maintain and build upon * no thread in sync chat since only api uses it adjust import locations	2024-02-14 12:32:07 -08:00
Sean Hatfield	f4b09a8c79	[FEAT] RLHF on response messages (#708 ) * WIP RLHF works on historical messages * refactor Actions component * completed RLHF up and down votes for chats * add defaults for HistoricalMessage params * refactor RLHF implmenation remove forwardRef on history items to prevent rerenders * remove dup id * Add rating to CSV output --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-02-13 11:33:05 -08:00
Timothy Carambat	406732830f	Implement workspace threading that is backwards compatible (#699 ) * Implement workspace thread that is compatible with legacy versions * last touches * comment on chat qty enforcement	2024-02-08 18:37:22 -08:00
Timothy Carambat	aca5940650	Refactor handleStream to LLM Classes (#685 )	2024-02-07 08:15:14 -08:00
Timothy Carambat	2bc11d3f1a	Implement support for HuggingFace Inference Endpoints (#680 )	2024-02-06 09:17:51 -08:00
Sean Hatfield	1846a99b93	[FEAT] Embedded AnythingLLM (#656 ) * WIP embedded app * WIP got response from backend in embedded app * WIP streaming prints to embedded app * implemented streaming and tailwind min for styling into embedded app * WIP embedded app history functional * load params from script tag into embedded app * rough in modularization of embed chat cleanup dev process for easier dev support move all chat to components todo: build process todo: backend support * remove eslint config * Implement models and cleanup embed chat endpoints Improve build process for embed prod minification and bundle size awareness WIP * forgot files * rename to embed folder * introduce chat modal styles * add middleware validations on embed chat * auto open param and default greeting * reset chat history * Admin embed config page * Admin Embed Chats mgmt page * update embed * nonpriv * more style support reopen if chat was last opened * update comments * remove unused imports * allow change of workspace for embedconfig * update failure to lookup message * update reset script * update instructions * Add more styling options Add sponsor text at bottom Support dynamic container height Loading animations * publish new embed script * Add back syntax highlighting and keep bundle small via dynamic script build * add hint * update readme * update copy model for snippet with link to styles --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-02-05 14:21:34 -08:00
Timothy Carambat	8377600211	Patch Azure text completion persistence (#647 )	2024-01-24 13:08:22 -08:00
Sean Hatfield	56fa17caf2	create configurable topN per workspace (#616 ) * create configurable topN per workspace * Update TopN UI text Fix fallbacks for all providers Add SQLite CHECK to TOPN value * merge with master Update zilliz provider for variable TopN --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-18 12:34:20 -08:00
Sean Hatfield	c2c8fe9756	add support for mistral api (#610 ) * add support for mistral api * update docs to show support for Mistral * add default temp to all providers, suggest different results per provider --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 14:42:05 -08:00
Sean Hatfield	90df37582b	Per workspace model selection (#582 ) * WIP model selection per workspace (migrations and openai saves properly * revert OpenAiOption * add support for models per workspace for anthropic, localAi, ollama, openAi, and togetherAi * remove unneeded comments * update logic for when LLMProvider is reset, reset Ai provider files with master * remove frontend/api reset of workspace chat and move logic to updateENV add postUpdate callbacks to envs * set preferred model for chat on class instantiation * remove extra param * linting * remove unused var * refactor chat model selection on workspace * linting * add fallback for base path to localai models --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-17 12:59:25 -08:00
Timothy Carambat	f5bb064dee	Implement streaming for workspace chats via API (#604 )	2024-01-16 10:37:46 -08:00
Timothy Carambat	bd158ce7b1	[Feat] Query mode to return no-result when no context found (#601 ) * Query mode to return no-result when no context found * update default error for sync chat * remove unnecessary type conversion	2024-01-16 09:32:51 -08:00
timothycarambat	dfd03e332c	patch stream response	2024-01-10 15:32:07 -08:00
Sean Hatfield	1d39b8a2ce	add Together AI LLM support (#560 ) * add Together AI LLM support * update readme to support together ai * Patch togetherAI implementation * add model sorting/option labels by organization for model selection * linting + add data handling for TogetherAI * change truthy statement patch validLLMSelection method --------- Co-authored-by: timothycarambat <rambat1010@gmail.com>	2024-01-10 12:35:30 -08:00
Timothy Carambat	e9f7b9b79e	Handle undefined stream chunk for native LLM (#534 )	2024-01-04 18:05:06 -08:00
Timothy Carambat	75dd86967c	Implement AzureOpenAI model chat streaming (#518 ) resolves #492	2024-01-03 16:25:39 -08:00
Timothy Carambat	2a1202de54	Patch Ollama Streaming chunk issues (#500 ) Replace stream/sync chats with Langchain interface for now connect #499 ref: https://github.com/Mintplex-Labs/anything-llm/issues/495#issuecomment-1871476091	2023-12-28 13:59:47 -08:00
Timothy Carambat	e0a0a8976d	Add Ollama as LLM provider option (#494 ) * Add support for Ollama as LLM provider resolves #493	2023-12-27 17:21:47 -08:00
Timothy Carambat	24227e48a7	Add LLM support for Google Gemini-Pro (#492 ) resolves #489	2023-12-27 17:08:03 -08:00
Timothy Carambat	37cdb845a4	patch: implement @lunamidori hotfix for LocalAI streaming chunk overflows (#433 ) * patch: implement @lunamidori hotfix for LocalAI streaming chunk overflows resolves #416 * change log to error log * log trace * lint	2023-12-12 16:20:06 -08:00
Timothy Carambat	655ebd9479	[Feature] AnythingLLM use locally hosted Llama.cpp and GGUF files for inferencing (#413 ) * Implement use of native embedder (all-Mini-L6-v2) stop showing prisma queries during dev * Add native embedder as an available embedder selection * wrap model loader in try/catch * print progress on download * add built-in LLM support (expiermental) * Update to progress output for embedder * move embedder selection options to component * saftey checks for modelfile * update ref * Hide selection when on hosted subdomain * update documentation hide localLlama when on hosted * saftey checks for storage of models * update dockerfile to pre-build Llama.cpp bindings * update lockfile * add langchain doc comment * remove extraneous --no-metal option * Show data handling for private LLM * persist model in memory for N+1 chats * update import update dev comment on token model size * update primary README * chore: more readme updates and remove screenshots - too much to maintain, just use the app! * remove screeshot link	2023-12-07 14:48:27 -08:00
Timothy Carambat	4bb99ab4bf	Support LocalAi as LLM provider by @tlandenberger (#373 ) * feature: add LocalAI as llm provider * update Onboarding/mgmt settings Grab models from models endpoint for localai merge with master * update streaming for complete chunk streaming update localAI LLM to be able to stream * force schema on URL --------- Co-authored-by: timothycarambat <rambat1010@gmail.com> Co-authored-by: tlandenberger <tobiaslandenberger@gmail.com>	2023-11-14 12:31:44 -08:00
Timothy Carambat	c22c50cca8	Enable chat streaming for LLMs (#354 ) * [Draft] Enable chat streaming for LLMs * stream only, move sendChat to deprecated * Update TODO deprecation comments update console output color for streaming disabled	2023-11-13 15:07:30 -08:00