Releases: Unstructured-IO/unstructured-api
Releases · Unstructured-IO/unstructured-api
0.0.52
- Bump unstructured to 0.10.21
- Fix an unhandled error when a non pdf file is sent with content-type pdf
- Fix an unhandled error when a non docx file is sent with content-type docx
- Fix an unhandled error when a non-Unstructured json schema is sent
0.0.51
- Bump unstructured to 0.10.19
0.0.50
- Bump unstructured to 0.10.18
0.0.49
- Remove spurious whitespace in
app-start.sh
. This fixes deployments in some envs such as Google Cloud Run.
0.0.48
- Adds
languages
kwarg ocr_languages
will eventually be deprecated and replaced by lanugages
to specify what languages to use for OCR
- Adds a startup log and other minor cleanups
0.0.47
- Adds
chunking_strategy
kwarg and associated params These params allow users to "chunk" elements into larger or smaller CompositeElement
s
- Remove
parent_id
from the element metadata. New metadata fields are causing errors with existing installs. We'll readd this once a fix is widely available.
- Fix some pdfs incorrectly returning a file is encrypted error. The
pypdf.is_encrypted
check caused us to return this error even if the file is readable.
0.0.46
- Bump unstructured to 0.10.16
0.0.45
- Drop
detection_class_prob
from the element metadata. This broke backwards compatibility when library users called partition_via_api
.
- Bump unstructured to 0.10.15
0.0.44
- Bump unstructured to 0.10.14
- Improve parallel mode retry handling
- Improve logging during error handling. We don't need to log stack traces for expected errors.
0.0.43
- Bump unstructured to 0.10.13
- Bump unstructured-inference to 0.5.25
- Remove dependency on unstructured-api-tools
- Add a top level error handler for more consistent response bodies
- Tesseract minor version bump to 5.3.2