Replies: 1 comment
-
Hello @mophilly. DocumentLoader works separately from the Splitter, i made sure that was done in each one. And works great with docling, the one i would not advice is MarkitDown, because it doesnt allow splitting page out of the box, Docling yes. To understand a documentLoader, just read this: always uses one function, load, that contains an array of pages, each page with content and image (if is vision). If you want to use splitter, just take a look at: PS: sorry for the delay, im finishing another article, this one is a big boy! |
Beta Was this translation helpful? Give feedback.
-
The docling project seems like a great fit for many IDP cases. I have just arrived at the need to split large files for submission to an LLM. It appears that splitting is a fundamental element in the decling examples.
Is splitting, as expressed in ExtractThinker, a complement to docling or would using docling replace it?
Beta Was this translation helpful? Give feedback.
All reactions