Skip to content

Releases: delta-io/delta-rs

python-v1.0.1

30 May 19:09
Compare
Choose a tag to compare

Bug Fixes

Other Changes

New Contributors

Full Changelog: python-v1.0.0...python-v1.0.1

python-v1.0.0: Zero to One

28 May 19:31
Compare
Choose a tag to compare

It only took us 5 years, but we made it! You can find the upgrade guide here.

Performance improvements

New features

  • feat: remove optimize operations when building without Apache Datafusion by @rtyler in #3290
  • feat(api): add rustls and native-tls features by @zeevm in #3335
  • feat!: update storage configuration system by @roeap in #3383
  • feat: derive macro for config implementations by @roeap in #3389
  • feat: upgrade to DataFusion 47.0.0 by @alamb in #3378
  • feat: introduce VacuumMode::Full for cleaning up orphaned files by @rtyler in #3368
  • feat: during LakeFS file operations, skip merge when 0 changes by @smeyerre in #3346
  • feat: added a check for gc code to run by @JustinRush80 in #3419
  • feat: spawn io with spawn service by @ion-elgreco in #3426
  • feat: optimize datafusion predicate pushdown and partition pruning by @rtyler in #3436
  • feat: expose kernel Engine on LogStore by @roeap in #3446
  • refactor: remove pyarrow dependency by @ion-elgreco in #3459
  • feat: write checkpoints with kernel by @roeap in #3466
  • feat: add a table description and name to the Delta Table from Python by @fvaleye in #3464
  • refactor!: remove and deprecate some python methods by @roeap in #3488

Bug Fixes

  • fix: use field physical name when resolving partition columns by @zeevm in #3349
  • fix(pandas): retain pyarrow decimal datatype in to_pandas() by adding types_mapper to prevent precision loss by @Abhishek1005 in #3296
  • fix: prevent panics when peek_next_commit() encounters invalid data by @rtyler in #3308
  • fix: serialize empty deletionVector in add actions as absent by @rtyler in #3309
  • fix: stats column binary_column has unsupported type binary by @omkar-foss in #3146
  • fix: check for all known valid delta files in is_deltatable by @umartin in #3318
  • fix: block_in_place to allow nested tasks by @ion-elgreco in #3324
  • fix: parse snapshot by @ion-elgreco in #3355
  • fix: added restored metadata as action to the next committed version by @Nordalf in #3303
  • fix: parse unconventional logs by @roeap in #3373
  • fix: clippy warnings by @alamb in #3390
  • fix: the default target size should be 100MB by @HiromuHota in #3404
  • fix: if field contains space in constraint expression, checks will fail by @Nordalf in #3374
  • fix: build Unity Catalog crate without DataFusion by @linhr in #3420
  • fix: drop column update by @ion-elgreco in #3416
  • fix: ignore temp log entries by @corwinjoy in #3423
  • fix: use more accurate log path parsing by @roeap in #3461
  • fix: correct spelling errors found by CI spell checker by @fvaleye in #3465
  • fix: schema conversion, add conversion test cases by @ion-elgreco in #3468
  • fix: set casting safe param to False by @ion-elgreco in #3481
  • fix: ensure projecting only columns that exist in new files afte sche… by @alexwilcoxson-rel in #3487

Other Changes

  • refactor: drop pyarrow support, restructure python modules by @ion-elgreco in #3285
  • chore: bump python version for release by @rtyler in #3291
  • chore: use flags for apple arm64 by @ion-elgreco in #3213
  • chore: upgrade the kernel version and bump our majorish versions too by @rtyler in #3289
  • chore: upgrade to DataFusion 46.0.0 by @alamb in #3261
  • refactor: add 'cloud' feature to 'core' to enable 'cloud' on 'object_store' only when needed by @zeevm in #3332
  • docs: update dataFusion integration example by @riziles in #3343
  • refactor(python): improve typing, linting by @ion-elgreco in #3344
  • chore: remove pyarrow upper by @ion-elgreco in #3325
  • chore: improve io error msg by @ion-elgreco in #3328
  • docs: update merge-tables.md with "Optimizing Merge Performance" section by @ldacey in #3351
  • docs: add example how to authenticate using Azure CLI for Azure ADSL integration by @DanielBertocci in #3357
  • chore: remove cdf feature by @ion-elgreco in #3365
  • fix: correct Python docs for incremental compaction on OPTIMIZE by @roykim98 in #3301
  • chore: fix some minor build warnings by @rtyler in #3366
  • refactor: move transaction module to kernel by @roeap in #3380
  • chore: clippy by @roeap in #3379
  • chore: move proofs into dedicated folder by @roeap in #3381
  • refactor!: move storage module into logstore by @roeap in #3382
  • chore: put a couple symbols behind the right feature gate by @rtyler in #3393
  • chore: update delta_kernel to 0.10.0 by @zachschuermann in #3403
  • refactor: make "cloud" feature in object_store optional by @zeevm in #3398
  • chore: bump versions of rust crates for another release party by @rtyler in #3406
  • chore: commit the contents of the 0.26.0 release by @rtyler in #3408
  • chore: reduce scope of feature flags and compilation requirements for subcrates by @rtyler in #3409
  • chore(deps): update sqlparser requirement from 0.53.0 to 0.56.0 by @dependabot in #3413
  • chore(deps): update foyer requirement from 0.16.1 to 0.17.0 by @dependabot in #3412
  • chore: bringing dat integration testing in ahead of kernel replay by @rtyler in #3411
  • chore: missed a version bump for core by @rtyler in #3415
  • chore: include license file in deltalake-derive crate by @ankane in #3417
  • chore(deps): bump foyer to v0.17.2 to prevent from wrong result by @MrCroxx in #3428
  • chore: bump crate versions which are due for release by @rtyler in #3430
  • chore: rely on the testing during coverage generation to speed up tests by @rtyler in #3431
  • chore: make codecov more vigorously enforced to help ensure quality by @rtyler in #3434
  • chore: prepare py-1.0 release by @ion-elgreco in #3435
  • chore: experiment with using sccache in GitHub Actions by @rtyler in #3437
  • chore: remove unused code and deps by @roeap in #3441
  • chore: minor table module refactors by @rtyler in #3442
  • docs: add 1.0.0 migration guide by @ion-elgreco in #3443
  • refactor: more specific factory parameter names by @roeap in #3445
  • refactor: use LogStore in Snapshot / LogSegment APIs by @roeap in #3452
  • test: avoid circular dependency with core/test crates by @roeap in #3450
  • chore: ensuring default builds work without datafusion by @rtyler in #3453
  • ci: add spellchecker to pr tests by @roeap in #3457
  • chore: mark more tests which require datafusion by @rtyler in #3458
  • refactor: use full paths in log processing by @roeap in #3456
  • chore: set correct markers by @ion-elgreco in https://github.com/delta-io/...
Read more

python-v0.25.5

04 Apr 07:19
3e13d43
Compare
Choose a tag to compare

What's Changed

  • chore: relax the pyarrow constraints now that 19.0.1 is out by @rtyler in #3361

Full Changelog: python-v0.25.4...python-v0.25.5

python-v0.25.4

02 Mar 16:46
Compare
Choose a tag to compare

What's Changed

  • chore: more manual inline format args by @nyurik in #3251
  • fix: chrono 0.4.40 causes disambiguation syntax when building arrow deps by @Nordalf in #3272
  • fix(rust): on write, have a schema evolution maintain metadata table id by @liamphmurphy in #3275
  • chore: remove unused dev dependency on home by @alamb in #3277
  • refactor: replaced asterisk with constraint name in get_constraints by @Nordalf in #3270
  • refactor: simplify expressions by @ion-elgreco in #3281
  • fix: compare before after schema before creating action by @ion-elgreco in #3282
  • fix: timestamp truncation in stats parsed by @ion-elgreco in #3288

Full Changelog: python-v0.25.2...python-v0.25.4

python-v0.25.2

25 Feb 20:09
Compare
Choose a tag to compare

New features

Bug Fixes

Other Changes

  • chore: inline format args to improve readability by @nyurik in #3154
  • chore: update readme for Generated columns by @ion-elgreco in #3247
  • docs: update CONTRIBUTING.md and Makefile to account for switch to uv by @adamreeve in #3257
  • chore: fix some compiler warnings by @alamb in #3262
  • chore: fix clippy warnings on main by @alamb in #3264
  • chore: use builder API to create FileScanConfig by @alamb in #3266
  • chore: update tests to use Column::new and other expr_fn functions by @alamb in #3265

New Contributors

Full Changelog: python-v0.25.1...python-v0.25.2

python-v0.25.0: Improved memory writer, schema evolution `merge` and UC support

20 Feb 17:24
Compare
Choose a tag to compare

Highlights

  • Streamed execution in write and merge (lower memory pressure)
  • Overall improved memory allocation
  • Schema evolution in MERGE
  • Unity catalog support through uc:// uri-prefix

Performance improvements

New features

  • feat: add schema evolution to merge statement by @JustinRush80 in #3136
  • feat(python): add capability to read unity catalog (uc://) uris by @omkar-foss in #3113
  • feat: support reading tables via Unity Catalog provided credentials by @hntd187 in #3078
  • feat: added workspace oauth source for UC by @hntd187 in #3152
  • feat: update metadata field builder by @Nordalf in #3149
  • feat: set column metadata from python by @umartin in #3189
  • feat(disable conflict checker): start counting concurrent retries at 0 and disable conflict checker with maxRetries=0 by @daphnenhuch-at in #3204
  • feat: make Add:get_stats public by @jkylling in #3216
  • feat: configurable column encoding for parquet checkpoint files by @dmunch in #3214
  • feat: return metrics about the commit by @HawaiianSpork in #3202
  • feat: cdf tableprovider with predicate pushdown support by @ion-elgreco in #3220

Bug Fixes

Other Changes

New Contributors

Full Changelog: python-v0.24.0...python-v0.25.0

python-v0.24.0: LakeFS integration + GeneratedColumns

15 Jan 07:26
Compare
Choose a tag to compare

New features

Bug Fixes

Other Changes

New Contributors

Full Changelog: python-v0.23.2...python-v0.24.0

python-v0.23.2

07 Jan 15:14
Compare
Choose a tag to compare

What's Changed

  • fix: introduce a reproduction case for List casting with polars by @rtyler in #3093
  • feat(python): add except_cols argument to when_matched_update_all and when_not_matched_insert_all for excluding specific columns by @mrjsj in #3098
  • feat(python): add sql() and show() convenience method to QueryBuilder by @DevChrisCross in #3100
  • docs: rust usage documentation by @Abdullahsab3 in #3089
  • feat: allow multiple Python threads to work with a single DeltaTable instance by @rtyler in #3101

New Contributors

Full Changelog: python-v0.23.1...python-v0.23.2

python-v0.23.1

02 Jan 14:47
Compare
Choose a tag to compare

What's Changed

Full Changelog: python-v0.23.0...python-v0.23.1

python-v0.23.0: conditional put for S3 by default

02 Jan 09:02
Compare
Choose a tag to compare

New features

Bug Fixes

  • fix: add explicit type casts while merge by @t1g0rz in #3050
  • fix: add decimal expr parsing by @ion-elgreco in #3090
  • fix: raise during checkpoint creating with without_files by @ion-elgreco in #3059
  • fix: use of deprecated DeltaTable::peek_next_commit by @jkylling in #3080
  • fix: remove println from regular_rename func by @tom1919 in #3071
  • fix: use projected_table_schema for projection in DeltaSchemaAdapter by @jkylling in #3068
  • fix: use correct aws_endpoint key by @ion-elgreco in #3072
  • fix: add test for Parquet column pruning in scans by @jkylling in #3079
  • fix: allow more than 15 concurrent transactions to have been committed by @rtyler in #3067

Other Changes

New Contributors

Full Changelog: python-v0.22.3...python-v0.23.0