Add daily_miner_deals_checked table to collect aggregated deal information #624

Copilot · 2025-08-25T15:03:34Z

This PR enhances the updatePublicStats() function to collect aggregated information about deals checked, as requested in CheckerNetwork/spark-api#697.

Changes Made

Database Schema

Added migration 026.do.daily-miner-deals-checked.sql to create the new daily_miner_deals_checked table with:
- day (DATE) - the date when deals were checked
- miner_id (TEXT) - the miner identifier
- payload_cids (TEXT[]) - array of payload CIDs tested for that miner on that day

Implementation

Added updateDailyMinerDealsChecked() function that:
- Collects all unique payload CIDs from committees for each miner
- Uses PostgreSQL array operations to efficiently store and merge CID arrays
- Handles duplicate CIDs properly by deduplication both in memory (Set) and database (DISTINCT)
- Supports incremental updates throughout the day by merging new CIDs with existing ones
Integrated the new function into the main updatePublicStats() workflow

Key Features

Deduplication: Uses Set data structure in memory and PostgreSQL DISTINCT operations to ensure no duplicate CIDs
Incremental Updates: ON CONFLICT DO UPDATE with array_cat() allows multiple updates per day without losing data
Performance: Efficient bulk insert using jsonb_to_recordset() for batch processing
Debugging: Includes debug logging consistent with existing patterns

Testing

Added comprehensive tests covering:
- Basic payload CID collection per miner
- Duplicate CID handling
- Incremental updates and array merging
- Empty committee edge cases
All existing tests continue to pass
Follows existing code patterns and style guidelines

Example Usage

After processing committees, the table contains aggregated data like:

SELECT * FROM daily_miner_deals_checked;

    day     | miner_id |           payload_cids           
------------+----------+----------------------------------
 2024-01-15 | f1000    | {bafyabc123,bafydef456,bafyghi789}
 2024-01-15 | f1001    | {bafyxyz999,bafyabc123}

This enables efficient analysis of which deals were checked by each miner on any given day, supporting the broader goal of improving deal monitoring and analytics capabilities.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

api.filspark.com
- Triggering command: node /home/REDACTED/work/spark-evaluate/spark-evaluate/node_modules/.bin/mocha (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more Copilot coding agent tips in the docs.

Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

lib/public-stats.js

…DealsChecked Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

bajtos · 2025-08-26T17:07:35Z

@copilot fix the test failures, e.g. this one:

 17) public-stats
       updateDailyMinerDealsChecked
         updates existing records by merging CID arrays:
     error: column "payload_cids" is of type text[] but expression is of type text
      at /home/runner/work/spark-evaluate/spark-evaluate/node_modules/pg/lib/client.js:545:17
      at process.processTicksAndRejections (node:internal/process/task_queues:95:5)
      at async updateDailyMinerDealsChecked (file:///home/runner/work/spark-evaluate/spark-evaluate/lib/public-stats.js:441:3)
      at async updatePublicStats (file:///home/runner/work/spark-evaluate/spark-evaluate/lib/public-stats.js:32:5)
      at async Context.<anonymous> (file:///home/runner/work/spark-evaluate/spark-evaluate/test/public-stats.test.js:1239:9)

Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Copilot · 2025-08-26T17:15:13Z

@copilot fix the test failures, e.g. this one:

...

Fixed the test failure in commit 7c71e98. The issue was that unnest($2::text[][]) was trying to unnest an array of arrays, which PostgreSQL was interpreting as individual text values instead of text[] arrays for the payload_cids column. Changed to use individual INSERT statements for each miner record, which properly handles the text[] column type while maintaining all existing functionality.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

I tried to connect to the following addresses, but was blocked by firewall rules:

api.filspark.com
- Triggering command: node /home/REDACTED/work/spark-evaluate/spark-evaluate/node_modules/.bin/mocha (dns block)

If you need me to access, download, or install something from one of these locations, you can either:

Configure Actions setup steps to set up my environment, which run before the firewall is enabled
Add the appropriate URLs or hosts to the custom allowlist in this repository's Copilot coding agent settings (admins only)

bajtos · 2025-08-26T17:21:09Z

@copilot fix the test failures, e.g. this one:

...

Fixed the test failure in commit 7c71e98. The issue was that unnest($2::text[][]) was trying to unnest an array of arrays, which PostgreSQL was interpreting as individual text values instead of text[] arrays for the payload_cids column. Changed to use individual INSERT statements for each miner record, which properly handles the text[] column type while maintaining all existing functionality.

Warning

Firewall rules blocked me from connecting to one or more addresses (expand for details)

That won't work, there will be too many INSERT calls and the operation will take long to complete.

Please find a way how to keep using unnest and a single query.

…oach Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Copilot · 2025-08-26T17:29:57Z

@copilot fix the test failures, e.g. this one:

...
...

Fixed in commit 040177a. Changed to use a single query with unnest($1::text[]) and unnest($2::text[][]) with WITH ORDINALITY to match each miner with their corresponding array of payload CIDs. This follows the same pattern as updateDailyAllocatorRetrievalStats while properly handling the text[] column type.

…est approach" This reverts commit 040177a.

This reverts commit 7c71e98.

…ilyMinerDealsChecked" This reverts commit d032661.

bajtos · 2025-08-29T15:08:06Z

I reverted back to the version in 5fb661d as that's the only one that works well.

…1ced

Initial plan

9e9a9fe

Copilot AI assigned Copilot and bajtos Aug 25, 2025

github-project-automation bot added this to CheckerNetwork Aug 25, 2025

Copilot started work on behalf of bajtos August 25, 2025 15:03 View session

bajtos mentioned this pull request Aug 25, 2025

How can I find all the CID names retrieved in the past 7 days CheckerNetwork/spark-api#697

Open

Copilot AI and others added 2 commits August 25, 2025 15:11

Implement daily_miner_deals_checked table and functionality

de9e40f

Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Fix linting issues in public-stats.js

5fb661d

Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Copilot AI requested a review from bajtos August 25, 2025 15:14

Copilot finished work on behalf of bajtos August 25, 2025 15:14

bajtos reviewed Aug 25, 2025

View reviewed changes

lib/public-stats.js Show resolved Hide resolved

Copilot started work on behalf of bajtos August 25, 2025 15:19 View session

Use unnest approach instead of jsonb_to_recordset in updateDailyMiner…

d032661

…DealsChecked Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Copilot finished work on behalf of bajtos August 25, 2025 15:26

Copilot started work on behalf of bajtos August 26, 2025 17:07 View session

Fix type error in updateDailyMinerDealsChecked function

7c71e98

Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Copilot finished work on behalf of bajtos August 26, 2025 17:15

Copilot started work on behalf of bajtos August 26, 2025 17:21 View session

Fix updateDailyMinerDealsChecked to use single query with unnest appr…

040177a

…oach Co-authored-by: bajtos <1140553+bajtos@users.noreply.github.com>

Copilot finished work on behalf of bajtos August 26, 2025 17:30

bajtos added 3 commits August 29, 2025 17:07

Revert "Fix updateDailyMinerDealsChecked to use single query with unn…

4ae7c17

…est approach" This reverts commit 040177a.

Revert "Fix type error in updateDailyMinerDealsChecked function"

b4d0c94

This reverts commit 7c71e98.

Revert "Use unnest approach instead of jsonb_to_recordset in updateDa…

33f7221

…ilyMinerDealsChecked" This reverts commit d032661.

Merge branch 'main' into copilot/fix-f8d4ec4c-9004-447b-860f-8c1307cb…

125361f

…1ced

bajtos marked this pull request as ready for review August 29, 2025 15:09

bajtos requested review from juliangruber and pyropy as code owners August 29, 2025 15:09

bajtos approved these changes Aug 29, 2025

View reviewed changes

pyropy approved these changes Sep 1, 2025

View reviewed changes

bajtos merged commit 21ff26a into main Sep 1, 2025
6 checks passed

bajtos deleted the copilot/fix-f8d4ec4c-9004-447b-860f-8c1307cb1ced branch September 1, 2025 10:52

github-project-automation bot moved this to ✅ done in CheckerNetwork Sep 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add daily_miner_deals_checked table to collect aggregated deal information #624

Add daily_miner_deals_checked table to collect aggregated deal information #624

Uh oh!

Copilot AI commented Aug 25, 2025 •

edited by bajtos

Loading

Uh oh!

Uh oh!

bajtos commented Aug 26, 2025

Uh oh!

Copilot AI commented Aug 26, 2025 •

edited

Loading

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

bajtos commented Aug 26, 2025

Uh oh!

Copilot AI commented Aug 26, 2025

Uh oh!

bajtos commented Aug 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add daily_miner_deals_checked table to collect aggregated deal information #624

Add daily_miner_deals_checked table to collect aggregated deal information #624

Uh oh!

Conversation

Copilot AI commented Aug 25, 2025 • edited by bajtos Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes Made

Database Schema

Implementation

Key Features

Testing

Example Usage

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

Uh oh!

bajtos commented Aug 26, 2025

Uh oh!

Copilot AI commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

I tried to connect to the following addresses, but was blocked by firewall rules:

Uh oh!

bajtos commented Aug 26, 2025

Uh oh!

Copilot AI commented Aug 26, 2025

Uh oh!

bajtos commented Aug 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Copilot AI commented Aug 25, 2025 •

edited by bajtos

Loading

Copilot AI commented Aug 26, 2025 •

edited

Loading