feat: OCR as enrichment for pictures in simple pipeline (docx, pptx, html, etc)#2488
feat: OCR as enrichment for pictures in simple pipeline (docx, pptx, html, etc)#2488dolfim-ibm wants to merge 1 commit intomainfrom
Conversation
Signed-off-by: Michele Dolfi <[email protected]>
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
|
|
✅ DCO Check Passed Thanks @dolfim-ibm, all your commits are properly signed off. 🎉 |
Codecov Report❌ Patch coverage is
📢 Thoughts on this report? Let us know! |
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
|
1 similar comment
Merge ProtectionsYour pull request matches the following merge protections and will not be merged until they are valid. 🟢 Enforce conventional commitWonderful, this rule succeeded.Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/
|
|
Not needed now, and new approaches will do it differently. |
This PR allows to run the OCR step also in the pictures found in the documents converted with the
SimplePipeline, e.g. docx, pptx, html, etc.Unfinished work TODO
Checklist: