chore(deps): update dependency unstructured to >=0.15.14 #7
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
This PR contains the following updates:
>=0.15.13
->>=0.15.14
Release Notes
Unstructured-IO/unstructured (unstructured)
v0.15.14
Compare Source
Enhancements
Features
.filename
,.filetype
and.languages
. This will be installed in a closely following PR to replace the four currently being used for this purpose.Fixes
partition_via_api
. Make a minor syntax change to ensure forward compatibility with the upcoming 0.26.0 Python SDK.date_from_file_object
parameter. As part of simplifying partitioning parameter set, removedate_from_file_object
parameter. A file object does not have a last-modified date attribute so can never give a useful value. When a file-object is used as the document source (such as in Unstructured API) the last-modified date must come from themetadata_last_modified
argument.KeyError
when mapping parent ids to hash ids. Occasionally the input elements intoassign_and_map_hash_ids
can contain duplicated element instances, which lead to error when mapping parent id.@apply_metadata()
decorator and only decorate the principal partitioner (CSV and DOCX in this case); remove decoration from delegating partitioners.@apply_metadata()
decorator and only decorate the principal partitioner; remove decoration from delegating partitioners.@apply_metadata()
decorator and only decorate the principal partitioner (HTML in this case); remove decoration from delegating partitioners.min_partition
andmax_partition
parameters were an initial rough implementation of chunking but now interfere with chunking and are unused. Remove those parameters frompartition_text()
andpartition_email()
.@apply_metadata()
decorator operating on partitioners they delegate to (TXT, HTML, and all others for attachments) and remove direct decoration from EML and MSG.Configuration
📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).
🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.
♻ Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.
🔕 Ignore: Close this PR and you won't be reminded about this update again.
This PR was generated by Mend Renovate. View the repository job log.