Document cleaning with LLM #13
Labels
data cleaning
Related to the data cleaning module
enhancement
New feature or request
help wanted
Extra attention is needed
Developing a component which uses an LLM to clean up the uploaded document, thereby replacing/improving the traditional document pre-processing component.
Acceptance criteria:
UAT
As a developer, I can successfully upload a variety of document formats (PDF, DOCX, TXT) and see the extracted text displayed without errors
The text was updated successfully, but these errors were encountered: