Marcello Fitton
8f33203ade
chore: add ESLint to /collector ( #5128 )
...
* add eslint config to /collector
* prettier formatting
* fix unused
* fix undefined
* disable lines
* lockfile
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2026-03-05 16:25:23 -08:00
AbelDuan
df166eb64e
feat: Add multilingual support for ocr module ( #3325 )
...
* Add multilingual support for ocr mudule
* Add OCR langauge as server var that is passed into Collector
Support all valid tesseract language codes
Filter and parse only valid codes with fallbacks'
* persist TARGET_OCR_LANG
* update docker example env
---------
Co-authored-by: Timothy Carambat <rambat1010@gmail.com>
2025-02-27 12:31:17 -08:00
Timothy Carambat
4545ce24cd
Drop Node canvas for manual sharp conversion ( #3221 )
...
* Drop Node `canvas` for manual `sharp` conversion
* bump dev
2025-02-14 17:38:13 -08:00
Timothy Carambat
89bba68219
Add OCR of image support ( #3219 )
...
* OCR PDFs as fallback in spawn thread
* wip
* build our own worker fanout and wrapper
* norm pkgs
* Add image OCR support
2025-02-14 12:07:33 -08:00
Timothy Carambat
2a9066e83a
OCR PDFs as fallback during upload ( #3204 )
...
* OCR PDFs as fallback in spawn thread
* wip
* build our own worker fanout and wrapper
* norm pkgs
* bump dev
2025-02-14 11:57:31 -08:00