Skip to content

feat: distributed index building for ivf index#218

Open
goodqiang wants to merge 3 commits intolance-format:mainfrom
goodqiang:main
Open

feat: distributed index building for ivf index#218
goodqiang wants to merge 3 commits intolance-format:mainfrom
goodqiang:main

Conversation

@goodqiang
Copy link

feat: distributed index building for ivf index support #176
BREAKING CHANGE: AddIndexExec adds "alter table create index index_name using ivf_flat/ivf_pq/ivf_sq"

@github-actions github-actions bot added enhancement New feature or request breaking-change Features that introduces a breaking API change labels Feb 9, 2026
Dataset.open()
.allocator(LanceRuntime.allocator())
.namespace(readOptions.getNamespace)
.readOptions(readOptions.toReadOptions)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this should not be removed

import java.util.*;
import java.util.stream.Collectors;

public class BaseVectorIndexTest extends BaseAddIndexTest {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

this test is not really inheriting anything from BaseAddIndexTest?

pom.xml Outdated
<properties>
<lance-spark.version>0.2.0</lance-spark.version>
<lance.version>2.0.0</lance.version>
<lance.version>3.0.0-beta.2</lance.version>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

do we depend on any feature not in 2.0?

Copy link
Contributor

@jackye1995 jackye1995 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks for the work! Please also add related documentations, and make sure CI is passing 🙏

Copy link
Collaborator

@hamersaw hamersaw left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we add a small integration test in docker/tests/test_lance_spark.py?

@hamersaw
Copy link
Collaborator

Looks like there are a few integration test failures, can you fix those? The logs are extremely verbose making it difficult to discover the root cause, I'm wondering if we can reduce the log-level so they're more easily digested?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

breaking-change Features that introduces a breaking API change enhancement New feature or request

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants