@Subash-Mohan Subash-Mohan commented Aug 24, 2025

Description

Draft PR introducing the indexing abstraction, with initial worker setup.

How Has This Been Tested?

[Describe the tests you ran to verify your changes]

Backporting (check the box to trigger backport action)

Note: You have to check that the action passes, otherwise resolve the conflicts manually and tag the patches.

  • This PR should be backported (make sure to check that the backport attempt succeeds)
  • [Optional] Override Linear Check

Summary by cubic

Introduces an indexing adapter abstraction to separate orchestration from side effects and wires it into the pipeline. Adds a dedicated Celery worker for user file processing with configurable concurrency.

  • Refactors

    • Indexing pipeline now uses an IndexingBatchAdapter (prepare, lock_context, build_metadata_aware_chunks, post_index).
    • Implemented DocumentIndexingBatchAdapter to handle access, document sets, chunk counts, token counts, plaintext storage, boost updates, and CC pair marking.
    • Updated index_doc_batch, index_doc_batch_with_handler, and run_indexing_pipeline to accept an adapter; removed inlined DB/metadata logic.
    • Docprocessing task constructs and passes the adapter.
    • Added BuildMetadataAwareChunksResult and IndexingBatchAdapter protocol.
  • New Features

    • New Celery app for user file processing with thread pool and health checks.
    • Configurable concurrency via CELERY_WORKER_USER_FILE_PROCESSING_CONCURRENCY.
    • New Postgres app name: celery_worker_user_file_processing.
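To make the adapter split concrete, here is a minimal sketch of how orchestration might delegate side effects to an adapter. Only the four method names and the two type names come from the summary above; every signature, the `chunks` field, the `InMemoryAdapter` class, and the `index_batch_sketch` function are assumptions for illustration, not the PR's actual code.

```python
from contextlib import AbstractContextManager, contextmanager
from dataclasses import dataclass, field
from typing import Any, Iterator, Protocol

Doc = dict[str, Any]


@dataclass
class BuildMetadataAwareChunksResult:
    # Hypothetical field; the real result type is not shown in this thread.
    chunks: list[str] = field(default_factory=list)


class IndexingBatchAdapter(Protocol):
    # Method names come from the summary; the signatures are assumptions.
    def prepare(self, documents: list[Doc]) -> list[Doc]: ...
    def lock_context(self, documents: list[Doc]) -> AbstractContextManager[None]: ...
    def build_metadata_aware_chunks(
        self, chunks: list[str]
    ) -> BuildMetadataAwareChunksResult: ...
    def post_index(self, result: BuildMetadataAwareChunksResult) -> None: ...


class InMemoryAdapter:
    """Toy adapter that records calls instead of touching a database."""

    def __init__(self) -> None:
        self.events: list[str] = []

    def prepare(self, documents: list[Doc]) -> list[Doc]:
        self.events.append("prepare")
        return [d for d in documents if d.get("text")]  # drop empty documents

    @contextmanager
    def lock_context(self, documents: list[Doc]) -> Iterator[None]:
        self.events.append("lock")
        try:
            yield
        finally:
            self.events.append("unlock")

    def build_metadata_aware_chunks(
        self, chunks: list[str]
    ) -> BuildMetadataAwareChunksResult:
        self.events.append("build")
        return BuildMetadataAwareChunksResult(chunks=chunks)

    def post_index(self, result: BuildMetadataAwareChunksResult) -> None:
        self.events.append("post_index")


def index_batch_sketch(documents: list[Doc], adapter: IndexingBatchAdapter) -> int:
    """Orchestration only: every side effect goes through the adapter."""
    docs = adapter.prepare(documents)
    chunks = [d["text"] for d in docs]  # stand-in for chunking and embedding
    with adapter.lock_context(docs):
        result = adapter.build_metadata_aware_chunks(chunks)
        adapter.post_index(result)
    return len(result.chunks)
```

With this shape, a second adapter (for example one for user files) could reuse the same orchestration function without any changes to it.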

vercel bot commented Aug 24, 2025

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Preview Comments Updated (UTC)
internal-search Error Error Sep 22, 2025 2:06pm

@Subash-Mohan Subash-Mohan marked this pull request as ready for review September 8, 2025 13:03
@Subash-Mohan Subash-Mohan requested a review from a team as a code owner September 8, 2025 13:03
@Subash-Mohan Subash-Mohan changed the title from "initial indexing abstraction setup" to "feat(projects): add project creation and management" Sep 8, 2025

@cubic-dev-ai cubic-dev-ai bot left a comment

40 issues found across 155 files

Note: This PR contains a large number of files. cubic only reviews up to 150 files per PR, so some files may not have been reviewed.

for user_file_id in user_file_ids:
self.app.send_task(
OnyxCeleryTask.PROCESS_SINGLE_USER_FILE,
kwargs={"user_file_id": user_file_id, "tenant_id": tenant_id},

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Passing uuid.UUID in Celery kwargs will fail with default JSON serializer; convert to str before enqueueing.

Prompt for AI agents
Address the following comment on backend/onyx/background/celery/tasks/user_file_processing/tasks.py at line 80:

<comment>Passing uuid.UUID in Celery kwargs will fail with default JSON serializer; convert to str before enqueueing.</comment>

<file context>
@@ -0,0 +1,248 @@
+            for user_file_id in user_file_ids:
+                self.app.send_task(
+                    OnyxCeleryTask.PROCESS_SINGLE_USER_FILE,
+                    kwargs={"user_file_id": user_file_id, "tenant_id": tenant_id},
+                    queue=OnyxCeleryQueues.USER_FILE_PROCESSING,
+                    priority=OnyxCeleryPriority.HIGH,
</file context>
Suggested change
kwargs={"user_file_id": user_file_id, "tenant_id": tenant_id},
kwargs={"user_file_id": str(user_file_id), "tenant_id": tenant_id},
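The serialization concern can be illustrated with the standard library alone. This sketch uses `json.dumps` as a stand-in for a JSON task serializer; whether Celery's own serializer rejects a raw `uuid.UUID` depends on the installed Celery/kombu version, so treat the failure below as a property of plain `json`, not a confirmed Celery behavior. The helper name `to_json_safe_kwargs` and the tenant value are hypothetical.

```python
import json
import uuid


def to_json_safe_kwargs(user_file_id: uuid.UUID, tenant_id: str) -> dict[str, str]:
    """Convert the UUID to a string so any JSON encoder can handle the kwargs."""
    return {"user_file_id": str(user_file_id), "tenant_id": tenant_id}


user_file_id = uuid.uuid4()
tenant_id = "tenant-a"  # hypothetical value

# Plain json has no encoder for uuid.UUID, so this raises TypeError.
try:
    json.dumps({"user_file_id": user_file_id, "tenant_id": tenant_id})
    raw_uuid_serializable = True
except TypeError:
    raw_uuid_serializable = False

# The string form survives a round trip and can be parsed back on the worker side.
payload = json.dumps(to_json_safe_kwargs(user_file_id, tenant_id))
restored = uuid.UUID(json.loads(payload)["user_file_id"])
```

Passing the string form also keeps the task signature independent of the serializer configured for the queue.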

"id": str(self.file_id),
"type": self.file_type,
"name": self.filename,
"user_file_id": str(self.file_id),

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Assigning user_file_id to str(self.file_id) is incorrect; it should be the UserFile UUID or not set

Prompt for AI agents
Address the following comment on backend/onyx/file_store/models.py at line 61:

<comment>Assigning user_file_id to str(self.file_id) is incorrect; it should be the UserFile UUID or not set</comment>

<file context>
@@ -56,4 +58,5 @@ def to_file_descriptor(self) -> FileDescriptor:
             "id": str(self.file_id),
             "type": self.file_type,
             "name": self.filename,
+            "user_file_id": str(self.file_id),
         }
</file context>

connector_id=cc_pair.connector_id,
credential_id=cc_pair.credential_id,
),
request_id=None,

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

run_indexing_pipeline now requires adapter; adding request_id without supplying adapter will raise a TypeError at runtime. Construct and pass a DocumentIndexingBatchAdapter here.

Prompt for AI agents
Address the following comment on backend/onyx/server/onyx_api/ingestion.py at line 124:

<comment>run_indexing_pipeline now requires adapter; adding request_id without supplying adapter will raise a TypeError at runtime. Construct and pass a DocumentIndexingBatchAdapter here.</comment>

<file context>
@@ -121,10 +121,7 @@ def upsert_ingestion_doc(
-            connector_id=cc_pair.connector_id,
-            credential_id=cc_pair.credential_id,
-        ),
+        request_id=None,
     )
 
</file context>

✅ Addressed in 714a152
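The failure mode described in this comment is ordinary Python behavior: calling a function without an argument that has no default raises `TypeError` at call time. A minimal sketch, where `run_pipeline_sketch` and its parameters are hypothetical stand-ins and not the real `run_indexing_pipeline` signature:

```python
from typing import Callable, Optional


def run_pipeline_sketch(
    documents: list[str],
    adapter: Callable[[list[str]], int],  # required: no default value
    request_id: Optional[str] = None,
) -> int:
    """Hypothetical pipeline entry point that delegates to the adapter."""
    return adapter(documents)


def call_without_adapter() -> str:
    """Call the pipeline the way the flagged call site does and report the error."""
    try:
        run_pipeline_sketch(documents=["doc"], request_id=None)  # type: ignore[call-arg]
    except TypeError as exc:
        return str(exc)
    return "no error"
```

A static type checker would flag the same call before runtime, which is one reason to keep the adapter parameter required instead of defaulting it to `None`.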

try:

# Use our consolidated function that handles indexing properly
categorized_files_result = upload_files_to_user_files_with_indexing(

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Missing ownership validation: project_id is accepted and used to associate files without verifying the project belongs to the authenticated user, enabling cross-project linkage.

Prompt for AI agents
Address the following comment on backend/onyx/server/features/projects/api.py at line 70:

<comment>Missing ownership validation: project_id is accepted and used to associate files without verifying the project belongs to the authenticated user, enabling cross-project linkage.</comment>

<file context>
@@ -0,0 +1,347 @@
+    try:
+
+        # Use our consolidated function that handles indexing properly
+        categorized_files_result = upload_files_to_user_files_with_indexing(
+            files=files, project_id=project_id, user=user, db_session=db_session
+        )
</file context>

✅ Addressed in c78d522
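One way to picture the requested ownership validation is to check the project's owner before any file is linked to it. The sketch below uses an in-memory dictionary in place of the database; `Project`, `ProjectAccessError`, `owns_project`, and `upload_to_project` are all hypothetical names, and the real endpoint would presumably return an HTTP 403 instead of raising a custom exception.

```python
from dataclasses import dataclass
from typing import Optional
from uuid import UUID, uuid4


@dataclass(frozen=True)
class Project:
    id: int
    owner_id: UUID


class ProjectAccessError(Exception):
    """Raised when a user tries to attach files to a project they do not own."""


def owns_project(project_id: int, user_id: UUID, projects: dict[int, Project]) -> bool:
    project = projects.get(project_id)
    return project is not None and project.owner_id == user_id


def upload_to_project(
    filenames: list[str],
    project_id: Optional[int],
    user_id: UUID,
    projects: dict[int, Project],
) -> list[tuple[str, Optional[int]]]:
    # Validate ownership before creating any file-to-project association.
    if project_id is not None and not owns_project(project_id, user_id, projects):
        raise ProjectAccessError("User does not have access to project")
    return [(name, project_id) for name in filenames]


# Small demonstration with two hypothetical users.
alice, bob = uuid4(), uuid4()
demo_projects = {1: Project(id=1, owner_id=alice)}
alice_links = upload_to_project(["a.txt"], 1, alice, demo_projects)
```

Treating an unknown project the same as a project owned by someone else avoids revealing which project ids exist.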

)
project_id = chat_session_creation_request.project_id
if project_id:
if not check_project_ownership(project_id, user.id, db_session):

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Accessing user.id without a None check can raise AttributeError for anonymous access; handle unauthenticated users before calling check_project_ownership.

Prompt for AI agents
Address the following comment on backend/onyx/server/query_and_chat/chat_backend.py at line 280:

<comment>Accessing user.id without a None check can raise AttributeError for anonymous access; handle unauthenticated users before calling check_project_ownership.</comment>

<file context>
@@ -282,6 +272,15 @@ def create_new_chat_session(
+    )
+    project_id = chat_session_creation_request.project_id
+    if project_id:
+        if not check_project_ownership(project_id, user.id, db_session):
+            raise HTTPException(
+                status_code=403, detail="User does not have access to project"
</file context>
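The guard this comment asks for can be a small helper that resolves the user id or fails with a clear error, instead of letting `None.id` surface as an `AttributeError`. A minimal sketch, assuming the user object may be `None` for anonymous access; `User` and `require_user_id` are hypothetical names, and `PermissionError` stands in for whatever HTTP error the endpoint would return.

```python
from dataclasses import dataclass
from typing import Optional
from uuid import UUID, uuid4


@dataclass
class User:
    id: UUID


def require_user_id(user: Optional[User]) -> UUID:
    """Return the user's id, or fail explicitly when the request is anonymous."""
    if user is None:
        raise PermissionError("Authentication required to use a project")
    return user.id


known_user = User(id=uuid4())
resolved_id = require_user_id(known_user)
```

The ownership check would then receive `require_user_id(user)` instead of `user.id`, so the anonymous case is handled before any project lookup happens.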

// Show temporary uploading files immediately
const tempFiles: ProjectFile[] = Array.from(files).map(
(file) => ({
id: file.name,

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Using file.name as id for temporary items risks duplicate React keys when multiple uploads share the same filename; generate a unique temp id.

Prompt for AI agents
Address the following comment on web/src/app/chat/components/projects/ProjectContextPanel.tsx at line 175:

<comment>Using file.name as id for temporary items risks duplicate React keys when multiple uploads share the same filename; generate a unique temp id.</comment>

<file context>
@@ -0,0 +1,301 @@
+              // Show temporary uploading files immediately
+              const tempFiles: ProjectFile[] = Array.from(files).map(
+                (file) => ({
+                  id: file.name,
+                  file_id: file.name,
+                  name: file.name,
</file context>

results.non_accepted.append(upload.filename or "")
else:
results.acceptable.append(upload)
results.acceptable_file_to_token_count[upload.filename or ""] = (

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Using an empty string as the dict key risks collisions for multiple unnamed uploads; use a unique placeholder for missing filenames.

Prompt for AI agents
Address the following comment on backend/onyx/server/features/projects/projects_file_utils.py at line 135:

<comment>Using an empty string as the dict key risks collisions for multiple unnamed uploads; use a unique placeholder for missing filenames.</comment>

<file context>
@@ -0,0 +1,178 @@
+                    results.non_accepted.append(upload.filename or "")
+                else:
+                    results.acceptable.append(upload)
+                    results.acceptable_file_to_token_count[upload.filename or ""] = (
+                        token_count
+                    )
</file context>
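The collision is easy to reproduce: two uploads without a filename both map to the key `""`, so the second token count overwrites the first. A minimal sketch of the problem and of one possible fix; the helper `token_count_key` and the `unnamed-` prefix are assumptions, not the fix adopted in the PR.

```python
import uuid
from typing import Optional


def token_count_key(filename: Optional[str]) -> str:
    """Use the filename when present, otherwise a unique placeholder key."""
    return filename or f"unnamed-{uuid.uuid4().hex}"


# (filename, token_count) pairs; two of the uploads have no usable filename.
uploads = [(None, 120), ("", 80), ("notes.txt", 40)]

naive: dict[str, int] = {}
safe: dict[str, int] = {}
for filename, token_count in uploads:
    naive[filename or ""] = token_count  # unnamed uploads collide on ""
    safe[token_count_key(filename)] = token_count  # one entry per upload
```

Keying by a per-upload identifier instead of the filename would also cover the separate case of two uploads that share the same non-empty name.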

if (!currentProjectId) {
throw new Error("No project selected");
}
console.log("upserting instructions", instructions);

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Avoid logging potentially sensitive user content (instructions) to the console; remove this debug log.

Prompt for AI agents
Address the following comment on web/src/app/chat/projects/ProjectsContext.tsx at line 112:

<comment>Avoid logging potentially sensitive user content (instructions) to the console; remove this debug log.</comment>

<file context>
@@ -0,0 +1,419 @@
+      if (!currentProjectId) {
+        throw new Error("No project selected");
+      }
+      console.log("upserting instructions", instructions);
+      await svcUpsertProjectInstructions(currentProjectId, instructions);
+      await refreshCurrentProjectDetails();
</file context>

db_session.query(UserFile)
.filter(UserFile.user_id == user.id)
.order_by(UserFile.last_accessed_at.desc())
.all()

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Unbounded query for 'recent' files can return excessive rows; add a LIMIT to avoid large memory/latency impact.

Prompt for AI agents
Address the following comment on backend/onyx/server/manage/users.py at line 906:

<comment>Unbounded query for 'recent' files can return excessive rows; add a LIMIT to avoid large memory/latency impact.</comment>

<file context>
@@ -889,3 +891,19 @@ def update_assistant_preferences_for_user_api(
+        db_session.query(UserFile)
+        .filter(UserFile.user_id == user.id)
+        .order_by(UserFile.last_accessed_at.desc())
+        .all()
+    )
+
</file context>
Suggested change
.all()
.limit(50).all()
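The effect of the suggested limit can be shown with a self-contained stand-in. This sketch uses the standard-library `sqlite3` module in place of SQLAlchemy and Postgres; the table and column names mirror the snippet above, the limit of 50 comes from the suggested change, and the `recent_files` helper and sample data are assumptions.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE user_file ("
    "id INTEGER PRIMARY KEY, user_id TEXT, last_accessed_at INTEGER)"
)
# 200 files for one user; last_accessed_at is a plain integer timestamp here.
conn.executemany(
    "INSERT INTO user_file (user_id, last_accessed_at) VALUES (?, ?)",
    [("u1", ts) for ts in range(200)],
)


def recent_files(
    connection: sqlite3.Connection, user_id: str, limit: int = 50
) -> list[tuple[int, int]]:
    """Return at most `limit` rows, most recently accessed first."""
    return connection.execute(
        "SELECT id, last_accessed_at FROM user_file "
        "WHERE user_id = ? ORDER BY last_accessed_at DESC LIMIT ?",
        (user_id, limit),
    ).fetchall()


rows = recent_files(conn, "u1")
```

In the SQLAlchemy query, the equivalent is to call `.limit(50)` before `.all()`, as the suggested change does, so the bound is applied in SQL and not after all rows have been loaded.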

retry_delay = 0.5


def _acquire_user_file_locks(db_session: Session, user_file_ids: list[int]) -> bool:

@cubic-dev-ai cubic-dev-ai bot Sep 8, 2025

Incorrect type annotation for user_file_ids; IDs are UUID/str in this codebase, not int. Update the hint to reflect the actual type.

Prompt for AI agents
Address the following comment on backend/onyx/indexing/adapters/user_file_indexing_adapter.py at line 33:

<comment>Incorrect type annotation for user_file_ids; IDs are UUID/str in this codebase, not int. Update the hint to reflect the actual type.</comment>

<file context>
@@ -0,0 +1,210 @@
+retry_delay = 0.5
+
+
+def _acquire_user_file_locks(db_session: Session, user_file_ids: list[int]) -> bool:
+    """Acquire locks for the specified user files."""
+    stmt = (
</file context>

✅ Addressed in af2fc3d

blacksmith-sh bot commented Sep 11, 2025

50 Jobs Failed:

Run Integration Tests v2 / integration-tests (connector_job_tests/google, connector-google) failed on "Pull Docker images"
[...]
47964e6bbe9b: Retrying in 2 seconds
47964e6bbe9b: Retrying in 1 second
unexpected EOF
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (connector_job_tests/jira, connector-jira) failed on "Pull Docker images"
[...]
3e2a38691e82: Retrying in 3 seconds
3e2a38691e82: Retrying in 2 seconds
3e2a38691e82: Retrying in 1 second
unexpected EOF
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (connector_job_tests/sharepoint, connector-sharepoint) failed on "Pull Docker images"
[...]
3e2a38691e82: Retrying in 2 seconds
3e2a38691e82: Retrying in 1 second
unexpected EOF
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (connector_job_tests/slack, connector-slack) failed on "Pull Docker images"
[...]
unexpected EOF
09770038a452: Pull complete
b5b97af0704b: Pull complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/anonymous_user, tests-anonymous_user) failed on "Pull Docker images"
[...]
3e2a38691e82: Retrying in 1 second
f109a73ed262: Download complete
unexpected EOF
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/api_key, tests-api_key) failed on "Pull Docker images"
[...]
3e2a38691e82: Retrying in 2 seconds
3e2a38691e82: Retrying in 1 second
unexpected EOF
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/auth, tests-auth) failed on "Pull Docker images"
[...]
2b1f5329bca5: Verifying Checksum
2b1f5329bca5: Download complete
unexpected EOF
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/chat, tests-chat) failed on "Pull Docker images"
[...]
47964e6bbe9b: Retrying in 3 seconds
47964e6bbe9b: Retrying in 2 seconds
47964e6bbe9b: Retrying in 1 second
unexpected EOF
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/chat_retention, tests-chat_retention) failed on "Pull Docker images"
[...]
unexpected EOF
09770038a452: Pull complete
b5b97af0704b: Pull complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/connector, tests-connector) failed on "Pull Docker images"
[...]
3e2a38691e82: Retrying in 2 seconds
3e2a38691e82: Retrying in 1 second
unexpected EOF
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/dev_apis, tests-dev_apis) failed on "Pull Docker images"
[...]
14bd20d89e4f: Retrying in 1 second
3e2a38691e82: Retrying in 1 second
unexpected EOF
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/document_set, tests-document_set) failed on "Pull Docker images"
[...]
47964e6bbe9b: Retrying in 2 seconds
47964e6bbe9b: Retrying in 1 second
unexpected EOF
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/image_indexing, tests-image_indexing) failed on "Pull Docker images"
[...]
2b1f5329bca5: Download complete
38903a8783c3: Download complete
unexpected EOF
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/index_attempt, tests-index_attempt) failed on "Pull Docker images"
[...]
47964e6bbe9b: Retrying in 2 seconds
13f1b08d45fb: Retrying in 1 second
47964e6bbe9b: Retrying in 1 second
unexpected EOF
f82acb7dbe45: Download complete
07d8bad72860: Verifying Checksum
07d8bad72860: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.
Run Integration Tests v2 / integration-tests (tests/indexing, tests-indexing) failed on "Pull Docker images"
[...]
47964e6bbe9b: Retrying in 3 seconds
47964e6bbe9b: Retrying in 2 seconds
47964e6bbe9b: Retrying in 1 second
unexpected EOF
07d8bad72860: Download complete
f82acb7dbe45: Verifying Checksum
f82acb7dbe45: Download complete
07d8bad72860: Pull complete
f82acb7dbe45: Pull complete
272fc460a5c8: Pull complete
db83d36ad2b5: Pull complete
febd2a84bb58: Pull complete
ab2b03209e6c: Pull complete
9f33a63fc3d0: Pull complete
Digest: sha256:b71218312c693bc06f119339164bdd0b7e7810c7cba78e180662141f88beadbd
Status: Downloaded newer image for experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
experimental-registry.blacksmith.sh:5000/integration-test-onyx-model-server:test-17947134878
All Docker images pulled successfully
Error response from daemon: No such image: experimental-registry.blacksmith.sh:5000/integration-test-onyx-backend:test-17947134878
Error: Process completed with exit code 1.

25 more jobs failed (See summary below for more details)

10 jobs failed running on non-Blacksmith runners.


Summary: 2 successful workflows, 8 failed workflows

Last updated: 2025-09-23 13:49:12 UTC

… related tables and foreign keys for project integration.
…ch deletions, and ensure foreign key integrity during project integration updates
…IDs to UUID scheme, ensuring document_id_migrated defaults to TRUE, and normalizing S3 object keys for plaintext records. This improves data integrity and prepares for future migrations.
…es, ensuring consistent handling of legacy data and enhancing error reporting during batch deletions. This update streamlines the migration process and reinforces data integrity
…ocument IDs, ensuring better data integrity. Adjust fetchChatData to correctly reference the available tools array, improving chat functionality.