Releases: aws-samples/amazon-textract-textractor
Releases · aws-samples/amazon-textract-textractor
Version 1.4.0
What's Changed
Version 1.4.0 introduces text linearization, powered by the new Amazon Textract Layout feature. Layout prediction enables the conversion of documents to text while preserving their reading order. See our layout prediction notebook for an example.
Full Changelog: v1.3.7...v1.4.0
Version 1.3.7
What's Changed
- Fix STRUCTURED and SEMI_STRUCTURED table types not being properly parsed
Full Changelog: v1.3.6...v1.3.7
Version 1.3.6
What's Changed
Full Changelog: v1.3.5...v1.3.6
Version 1.3.5
Version 1.3.4
What's Changed
- Pass
region_name
as argument to Textract client when provided by @alanmohan in #229 - Fix for PDF-filename suffix is hardcoded as .pdf by @Paike in #225
- Fix ExpenseField value.bbox is None causing Error by @syanng in #241
Full Changelog: v.1.3.3...v1.3.4
Version 1.3.3
What's Changed
Full Changelog: v1.3.2...v.1.3.3
Version 1.3.2
What's Changed
- Fix assertion error on empty line item rows by @vinyasmusic in #224
- Add type check before S3 prefix check by @Belval in #227
Full Changelog: v1.3.1...v1.3.2
Version 1.3.1
Version 1.3.0
What's Changed
- Significant improvement to response validation time by @krzim in #203
- Drop support for Python 3.7 as the End-of-Life is in June 2023 and
amazon-textract-response-parser
has already dropped support
Full Changelog: v1.2.0...v1.3.0
Version 1.2.0
What's Changed
- Add support for titles, footers and cell types by @Belval in #197
- Set save_image=False in CLI by @Belval in #189
- Remove line implying that async APIs take List[PIL.Image] as file_source by @Belval in #190
- chore: add tests for Signature by @schadem in #159
- feat: adding pretty-print for signatures by @schadem in #194