Skip to content

Releases: aws-samples/amazon-textract-textractor

Version 1.4.0

19 Oct 19:24

Choose a tag to compare

What's Changed

Version 1.4.0 introduces text linearization, powered by the new Amazon Textract Layout feature. Layout prediction enables the conversion of documents to text while preserving their reading order. See our layout prediction notebook for an example.

Full Changelog: v1.3.7...v1.4.0

Version 1.3.7

04 Oct 16:29
6e7125c

Choose a tag to compare

What's Changed

  • Fix STRUCTURED and SEMI_STRUCTURED table types not being properly parsed

Full Changelog: v1.3.6...v1.3.7

Version 1.3.6

29 Sep 00:37
7e398cc

Choose a tag to compare

What's Changed

  • Fix Levenshtein distance not returning the most similar key by @Belval in #245

Full Changelog: v1.3.5...v1.3.6

Version 1.3.5

15 Sep 13:51
17c7646

Choose a tag to compare

What's Changed

  • Add support for Python 3.11

Full Changelog: v1.3.4...v1.3.5

Version 1.3.4

08 Sep 14:21

Choose a tag to compare

What's Changed

  • Pass region_name as argument to Textract client when provided by @alanmohan in #229
  • Fix for PDF-filename suffix is hardcoded as .pdf by @Paike in #225
  • Fix ExpenseField value.bbox is None causing Error by @syanng in #241

Full Changelog: v.1.3.3...v1.3.4

Version 1.3.3

05 Sep 14:32

Choose a tag to compare

What's Changed

  • Fix bbox denormalization breaking visualizations by @Belval in #242

Full Changelog: v1.3.2...v.1.3.3

Version 1.3.2

05 Jul 19:38
309449b

Choose a tag to compare

What's Changed

  • Fix assertion error on empty line item rows by @vinyasmusic in #224
  • Add type check before S3 prefix check by @Belval in #227

Full Changelog: v1.3.1...v1.3.2

Version 1.3.1

24 May 03:33
2435fdc

Choose a tag to compare

What's Changed

  • fix: ordering of paginated files on S3 OutputConfig by @schadem in #208
  • Fix empty bounding box in QUERY_RESULT block raising exception by @Belval in #211
  • Fix missing bounding box for empty values by @Belval in #212
  • Add Python 3.10 by @Belval in #213

Full Changelog: v1.3.0...v1.3.1

Version 1.3.0

18 Apr 22:35

Choose a tag to compare

What's Changed

  • Significant improvement to response validation time by @krzim in #203
  • Drop support for Python 3.7 as the End-of-Life is in June 2023 and amazon-textract-response-parser has already dropped support

Full Changelog: v1.2.0...v1.3.0

Version 1.2.0

12 Apr 15:36

Choose a tag to compare

What's Changed

  • Add support for titles, footers and cell types by @Belval in #197
  • Set save_image=False in CLI by @Belval in #189
  • Remove line implying that async APIs take List[PIL.Image] as file_source by @Belval in #190
  • chore: add tests for Signature by @schadem in #159
  • feat: adding pretty-print for signatures by @schadem in #194