- Added new OCR model
48px
- Added 4x-UltraSharp upscaler
- Countless bug fixes and refactor
- Add CRAFT detector, enable by
--detector craft
- Added New inpainting model LaMa MPE by dmMaze and set as default
Project version is now at beta-0.3
- Added English text renderer by dmMaze
- Added new CTC based OCR engine, significant speed improvement
- The new OCR model now support Korean
- Use new font rendering method by pokedexter
- Added manual translation UI by rspreet92
- Added text detection model by dmMaze
- New MST based text region merge algorithm, huge text region merge improvement
- Add baidu translator in demo mode
- Add google translator in demo mode
- Various bugfixes
- Web demo adds translator, detection resolution and target language option
- Slight text color extraction improvement
Major upgrades for all components, now we are on beta!
Note in this version all English texts are detected as capital letters,
You need Python >= 3.8 for cached_property
to work
- Detection model upgrade
- OCR model upgrade, better at text color extraction
- Inpainting model upgrade
- Major text rendering improvement, faster rendering and higher quality text with shadow
- Slight mask generation improvement
- Various bugfixes
- Default detection resolution has been dialed back to 1536 from 2048
- Fix erroneous image rendering when inpainting is not used
- Support manual translation
- Support detection and rendering of angled texts
- Text mask completion is now based on CRF, mask quality is drastically improved
- Improve text rendering
- New text region based text direction detection method
- Support running demo as web service
- Text detection model is now based on DBNet with ResNet34 backbone
- OCR model is now trained with more English sentences
- Inpaint model is now based on AOT which requires far less memory
- Default inpainting resolution is now increased to 2048, thanks to the new inpainting model
- Support merging hyphenated English words
- Add youdao translate and set as default translator
- Text detection model is now based on DBNet with ResNet101 backbone
- OCR model is now deeper
- Default detection resolution has been increased to 2048 from 1536
Note this version is slightly better at handling English texts, other than that it is worse in every other ways
- Added inpainting model
- First version launched