merlyn/collector
Timothy Carambat b6d3a411b1
Add querySelectorAll capability to web-scraping block (#3186)
* Add `querySelectorAll` capability to web-scraping block

* patches and fallbacks

* fix styles of text in web scraping block

---------

Co-authored-by: shatfield4 <seanhatfield5@gmail.com>
2025-02-13 16:11:15 -08:00
extensions chore: rename Github to GitHub (#3199) 2025-02-13 10:45:43 -08:00
hotdir Document Processor v2 (#442) 2023-12-14 15:14:56 -08:00
middleware [BETA] Live document sync (#1719) 2024-06-21 13:38:50 -07:00
processLink Add querySelectorAll capability to web-scraping block (#3186) 2025-02-13 16:11:15 -08:00
processRawText Add tokenizer improvements via Singleton class and estimation (#3072) 2025-01-30 17:55:03 -08:00
processSingleFile Add tokenizer improvements via Singleton class and estimation (#3072) 2025-01-30 17:55:03 -08:00
storage feat: Embed on-instance Whisper model for audio/mp4 transcribing (#449) 2023-12-15 11:20:13 -08:00
utils chore: rename Github to GitHub (#3199) 2025-02-13 10:45:43 -08:00
.env.example devcontainer v1 (#297) 2024-01-08 15:31:06 -08:00
.gitignore Document Processor v2 (#442) 2023-12-14 15:14:56 -08:00
.nvmrc Document Processor v2 (#442) 2023-12-14 15:14:56 -08:00
index.js Add querySelectorAll capability to web-scraping block (#3186) 2025-02-13 16:11:15 -08:00
nodemon.json Document Processor v2 (#442) 2023-12-14 15:14:56 -08:00
package.json Audio file validations (#2902) 2024-12-30 14:48:28 -08:00
yarn.lock Audio file validations (#2902) 2024-12-30 14:48:28 -08:00