feat: Implement Supabase storage for conversation summaries #105
Conversation
Walkthrough
The code implements Supabase integration for storing conversation summaries. It loads Supabase credentials from environment variables, initializes a client, and fully implements the summary storage.

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant AgentState
    participant SummarizationNode
    participant Supabase
    AgentState->>SummarizationNode: Trigger store_summary_to_database(state)
    SummarizationNode->>Supabase: Upsert conversation summary (user/session info, summary, topics, etc.)
    Supabase-->>SummarizationNode: Response (success or error)
    SummarizationNode->>SummarizationNode: Log result
```
Actionable comments posted: 4
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
backend/app/agents/devrel/nodes/summarization.py (2 hunks)
🧰 Additional context used
🧠 Learnings (2)
📓 Common learnings
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#75
File: backend/app/agents/devrel/agent.py:34-35
Timestamp: 2025-06-13T21:56:19.183Z
Learning: In the Devr.AI backend, the DevRelAgent follows a singleton pattern where only one instance exists for the entire application lifetime, using InMemorySaver with thread-based conversation management to persist user conversations across sessions.
backend/app/agents/devrel/nodes/summarization.py (2)
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/devrel/nodes/handle_technical_support_node.py:6-17
Timestamp: 2025-06-08T13:15:40.536Z
Learning: The handle_technical_support_node function in backend/app/agents/devrel/nodes/handle_technical_support_node.py is intentionally minimal and will be extended after database configuration is completed.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#75
File: backend/app/agents/devrel/agent.py:34-35
Timestamp: 2025-06-13T21:56:19.183Z
Learning: In the Devr.AI backend, the DevRelAgent follows a singleton pattern where only one instance exists for the entire application lifetime, using InMemorySaver with thread-based conversation management to persist user conversations across sessions.
🧬 Code Graph Analysis (1)
backend/app/agents/devrel/nodes/summarization.py (1)
backend/app/agents/state.py (1)
AgentState(18-73)
🔇 Additional comments (3)
backend/app/agents/devrel/nodes/summarization.py (3)
17-18: LGTM! Clean utility function.
The helper function handles the datetime-to-ISO-format conversion safely with null checking.
145-175: Data Type Compatibility for key_topics
supabase-py automatically serializes Python `list` objects into JSON arrays when inserting into `json`/`jsonb` columns. No manual `json.dumps` is required for the `key_topics` field.
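As a small runnable illustration of this point (the record values here are hypothetical, not taken from the PR), a plain Python list in the payload round-trips to a JSON array with no manual serialization step:

```python
import json

# Hypothetical record resembling the upsert payload; key_topics is a plain list.
record = {
    "user_id": "006ccfe9-7662-4427-bfa3-2266009f6fe9",
    "key_topics": ["onboarding", "supabase"],
}

# supabase-py JSON-encodes the request body internally; the list becomes a JSON array.
encoded = json.dumps(record)
decoded = json.loads(encoded)
print(decoded["key_topics"])  # ['onboarding', 'supabase']
```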
164-166: Ignore null-check for memory_thread_id – it's always populated
The `memory_thread_id` used in the upsert is driven by the orchestrator, which falls back to `user_id` when not explicitly provided, and the database schema enforces it as NOT NULL UNIQUE. In practice:
- In `agent_coordinator.py`, `memory_thread_id = message_data.get("memory_thread_id") or message_data.get("user_id", "")` guarantees a non-empty string.
- The LangGraph workflow maps that value into `state.thread_id` before the summarization node runs.
- The `conversation_context` schema declares `memory_thread_id TEXT NOT NULL UNIQUE`.

Since `memory_thread_id` can never be `None` at this point, the upsert conflict strategy on `"memory_thread_id"` is safe and idempotent.
Likely an incorrect or invalid review comment.
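The fallback described in the review can be sketched in isolation (the function name and payloads below are illustrative, not the project's exact code):

```python
def resolve_memory_thread_id(message_data: dict) -> str:
    # Mirrors the described fallback: explicit thread id, else the user id, else "".
    return message_data.get("memory_thread_id") or message_data.get("user_id", "")

print(resolve_memory_thread_id({"memory_thread_id": "t-1", "user_id": "u-9"}))  # t-1
print(resolve_memory_thread_id({"user_id": "u-9"}))  # u-9
print(resolve_memory_thread_id({}))  # empty string
```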
I'm facing this error, could you please check once? @DhruvK278, could you please let me know how you were using this? (some dtype mismatch) PS: Really sorry, I mistakenly clicked on "close with comment". Rest, the implementation seems very clean and well aligned as per your attached references.
Hi @smokeyScraper, this error is likely because of an invalid UUID; a UUID should look like this: 006ccfe9-7662-4427-bfa3-2266009f6fe9
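A quick way to check whether a string is a canonical UUID (a general Python sketch using the standard library, not project code):

```python
import uuid

def is_valid_uuid(value: str) -> bool:
    """Return True only for canonical hyphenated UUID strings."""
    try:
        return str(uuid.UUID(value)) == value.lower()
    except ValueError:
        return False

print(is_valid_uuid("006ccfe9-7662-4427-bfa3-2266009f6fe9"))  # True
print(is_valid_uuid("1234567890"))  # False — a raw Discord ID is not a UUID
```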
hey @DhruvK278, I see. Tested a bit and realized the discord ID
I'll try to resolve this by tomorrow. We probably have to change the IDs that are being stored and used in the DB. Also, your PR upserts these based on
Instead, we could use the user_id in conversation_context to directly reference users(id); this way we can simply get the Discord ID from the users table (a bit of a tradeoff on an extra call/storage for a field). Could you please look at this once? I, too, will try aligning this throughout and will probably fix it by EOD. But yeah, I'm up for any suggestions.
I see; as of now the user_id UUID is not UNIQUE in the DB. For this to work we will need to update the DB as you mentioned. After you update it, I will change the upsert logic to
Yes, you've got it. Rest, for the use case of
hey @DhruvK278, I have fixed the schema. Now we first look up the ID of a user in the users table via the platform-specific ID for that user, which will be
Actionable comments posted: 0
🧹 Nitpick comments (2)
backend/app/agents/devrel/nodes/summarization.py (2)
2-2: Remove unused import.
The `os` module is imported but not used in the code.

```diff
-import os
```
8-10: Consider using centralized Supabase client initialization.
The code directly calls `get_supabase_client()` at module level. While this works, it creates a dependency on the Supabase client being available at import time.
Consider moving the client initialization inside the function that uses it, or add proper error handling if the client initialization fails:

```diff
-supabase = get_supabase_client()
+
+def _get_supabase_client():
+    """Get Supabase client with error handling"""
+    try:
+        return get_supabase_client()
+    except Exception as e:
+        logger.error(f"Failed to initialize Supabase client: {e}")
+        raise
```

Then use `supabase = _get_supabase_client()` inside the function.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
backend/app/agents/devrel/nodes/summarization.py (2 hunks)
🧰 Additional context used
🧠 Learnings (1)
backend/app/agents/devrel/nodes/summarization.py (5)
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/devrel/nodes/handle_technical_support_node.py:6-17
Timestamp: 2025-06-08T13:15:40.536Z
Learning: The handle_technical_support_node function in backend/app/agents/devrel/nodes/handle_technical_support_node.py is intentionally minimal and will be extended after database configuration is completed.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#87
File: backend/app/api/v1/health.py:1-12
Timestamp: 2025-06-28T23:14:21.089Z
Learning: In the Devr.AI project, the application is designed to run from the backend directory, making import paths like `from main import DevRAIApplication` correct for the current setup. Import path adjustments for deployment will be configured later during the deployment process.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/devrel/nodes/handle_web_search_node.py:31-42
Timestamp: 2025-06-08T13:31:11.572Z
Learning: In backend/app/agents/devrel/tools/search_tool.py, the TavilySearchTool.search() method has partial error handling for missing API key, AttributeError, ConnectionError, and TimeoutError, but lacks a comprehensive Exception catch-all block, so calling functions may still need additional error handling for other potential exceptions.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/shared/classification_router.py:0-0
Timestamp: 2025-06-08T13:08:48.469Z
Learning: The user plans to migrate the JSON parsing in backend/app/agents/shared/classification_router.py from manual JSON extraction to using Pydantic parser for better validation and type safety.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#85
File: backend/app/services/auth/management.py:83-83
Timestamp: 2025-06-28T14:44:34.399Z
Learning: In the backend/app/services/auth/management.py file, the team prefers to use datetime.now() (local timezone/IST) during development and testing for easier debugging, with plans to change to UTC timezone-aware datetime (datetime.now(timezone.utc)) during deployment.
🧬 Code Graph Analysis (1)
backend/app/agents/devrel/nodes/summarization.py (2)
backend/app/agents/state.py (1)
AgentState (18-73)
backend/app/database/supabase/client.py (1)
get_supabase_client(9-13)
🪛 Ruff (0.12.2)
backend/app/agents/devrel/nodes/summarization.py
2-2: os imported but unused
Remove unused import: os
(F401)
🔇 Additional comments (7)
backend/app/agents/devrel/nodes/summarization.py (7)
12-13: Helper function implementation looks good.
The `isoformat_or_none` helper function correctly handles optional datetime objects and provides consistent datetime formatting.
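One plausible shape of that helper, consistent with the description above (a sketch — the project's exact signature may differ):

```python
from datetime import datetime
from typing import Optional

def isoformat_or_none(dt: Optional[datetime]) -> Optional[str]:
    """Return the ISO-8601 string for dt, or None when dt is None."""
    return dt.isoformat() if dt is not None else None

print(isoformat_or_none(None))  # None
print(isoformat_or_none(datetime(2025, 6, 28, 12, 0)))  # 2025-06-28T12:00:00
```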
143-148: Validation logic is appropriate.
The validation checks ensure required fields are present before proceeding with database operations.

149-161: User lookup implementation aligns with PR discussion.
The code correctly implements the user lookup approach discussed in the PR comments, where it queries the users table using the platform-specific ID to get the internal UUID. This addresses the database schema alignment issue mentioned in the PR discussion.
The implementation properly:
- Constructs the platform-specific column name (`discord_id`, `slack_id`, etc.)
- Queries the users table to get the internal UUID
- Validates that the user exists before proceeding
- Logs the found UUID for debugging
162-171: Record structure follows new schema approach.
The record structure correctly uses the `user_id` field with the UUID from the users table, implementing the 1:1 relationship approach discussed in the PR comments. The removal of `memory_thread_id` from the record aligns with the updated database schema.

172-174: Upsert operation uses correct conflict resolution.
The upsert operation correctly uses `on_conflict="user_id"`, which implements the 1:1 relationship between users and conversation contexts as discussed in the PR comments.

175-180: Response validation is improved.
The response validation checks both the existence of `response.data` and that it contains at least one record, which is more robust than the previous implementation.

181-182: Error handling is comprehensive.
The exception handling properly catches and logs any unexpected errors during the storage operation.
@smokeyScraper Done!!
To maintain a clean commit history, could you please undo a few of your recent commits, make the necessary changes, and then commit with a force push? This keeps the commit history clean. I prefer "'n' commits for edits" + "1 commit for CodeRabbit fix". (Easiest is to use the editor's (VS Code/Cursor) GitHub section for this.)
bd378ca to 007f905
Actionable comments posted: 1
🧹 Nitpick comments (1)
backend/app/agents/devrel/nodes/summarization.py (1)
2-2: Remove unused import.
The `os` import is not used in the file since Supabase client initialization was moved to a separate module.

```diff
-import os
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
backend/app/agents/devrel/nodes/summarization.py (2 hunks)
🧰 Additional context used
🧠 Learnings (2)
📓 Common learnings
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#85
File: tests/test_supabase.py:1-3
Timestamp: 2025-06-28T14:45:55.244Z
Learning: In the Devr.AI project, smokeyScraper prefers to defer comprehensive test refactoring to separate PRs/efforts when doing major backend restructuring, rather than expanding the scope of the current refactoring PR to include test updates.
backend/app/agents/devrel/nodes/summarization.py (5)
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/devrel/nodes/handle_technical_support_node.py:6-17
Timestamp: 2025-06-08T13:15:40.536Z
Learning: The handle_technical_support_node function in backend/app/agents/devrel/nodes/handle_technical_support_node.py is intentionally minimal and will be extended after database configuration is completed.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#87
File: backend/app/api/v1/health.py:1-12
Timestamp: 2025-06-28T23:14:21.089Z
Learning: In the Devr.AI project, the application is designed to run from the backend directory, making import paths like `from main import DevRAIApplication` correct for the current setup. Import path adjustments for deployment will be configured later during the deployment process.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/devrel/nodes/handle_web_search_node.py:31-42
Timestamp: 2025-06-08T13:31:11.572Z
Learning: In backend/app/agents/devrel/tools/search_tool.py, the TavilySearchTool.search() method has partial error handling for missing API key, AttributeError, ConnectionError, and TimeoutError, but lacks a comprehensive Exception catch-all block, so calling functions may still need additional error handling for other potential exceptions.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#72
File: backend/app/agents/shared/classification_router.py:0-0
Timestamp: 2025-06-08T13:08:48.469Z
Learning: The user plans to migrate the JSON parsing in backend/app/agents/shared/classification_router.py from manual JSON extraction to using Pydantic parser for better validation and type safety.
Learnt from: smokeyScraper
PR: AOSSIE-Org/Devr.AI#85
File: backend/app/services/auth/management.py:83-83
Timestamp: 2025-06-28T14:44:34.399Z
Learning: In the backend/app/services/auth/management.py file, the team prefers to use datetime.now() (local timezone/IST) during development and testing for easier debugging, with plans to change to UTC timezone-aware datetime (datetime.now(timezone.utc)) during deployment.
🧬 Code Graph Analysis (1)
backend/app/agents/devrel/nodes/summarization.py (2)
backend/app/agents/state.py (1)
AgentState (18-73)
backend/app/database/supabase/client.py (1)
get_supabase_client(9-13)
🪛 Ruff (0.12.2)
backend/app/agents/devrel/nodes/summarization.py
2-2: os imported but unused
Remove unused import: os
(F401)
🔇 Additional comments (2)
backend/app/agents/devrel/nodes/summarization.py (2)
8-10: LGTM: Clean separation of concerns with centralized Supabase client.
The import and initialization of the Supabase client through the centralized `get_supabase_client()` function is a good architectural decision that promotes reusability and consistent configuration management.

12-13: LGTM: Useful helper function for datetime serialization.
The `isoformat_or_none` helper function properly handles optional datetime objects, providing consistent serialization for database storage while gracefully handling None values.
```diff
 async def store_summary_to_database(state: AgentState) -> None:
-    # TODO: Implement database storage to user_interactions
-    """Store the summary in PostgreSQL database"""
-    logger.info(f"[PLACEHOLDER] Storing summary for session {state.session_id}")
-    pass
+    """Store the summary in Supabase database"""
+    logger.info(f"Storing summary for session {state.session_id} into conversation_context")
+
+    try:
+        # Validation
+        if not state.user_id or not state.platform:
+            logger.error(f"Missing required fields: user_id={state.user_id}, platform={state.platform}")
+            return
+
+        platform_id = state.user_id
+        platform_column = f"{state.platform}_id"
+
+        # UUID fetch
+        user_response = await supabase.table("users").select("id").eq(platform_column, platform_id).limit(1).execute()
+
+        if not user_response.data:
+            logger.error(f"User with {platform_column} '{platform_id}' not found in users table.")
+            return
+
+        user_uuid = user_response.data[0]['id']
+        logger.info(f"Found user UUID: {user_uuid} for {platform_column}: {platform_id}")
+
+        # Records
+        record = {
+            "user_id": user_uuid,
+            "conversation_summary": state.conversation_summary,
+            "key_topics": state.key_topics,
+            "total_interactions": state.interaction_count,
+            "session_start_time": isoformat_or_none(state.session_start_time),
+            "session_end_time": isoformat_or_none(datetime.now()),
+        }
+
+        # Upsert
+        response = await supabase.table("conversation_context").upsert(record, on_conflict="user_id").execute()
+
+        # Checks
+        if response.data and len(response.data) > 0:
+            logger.info(f"✅ Summary stored successfully for session {state.session_id}")
+        else:
+            logger.error(f"❌ Supabase upsert failed for session {state.session_id}: {response}")
+
+    except Exception as e:
+        logger.error(f"Unexpected error while storing summary: {str(e)}")
```
💡 Verification agent
🧩 Analysis chain
Verify database schema alignment and consider additional error handling.
The implementation correctly follows the updated approach discussed in PR comments - looking up user UUID from the users table using platform-specific ID and upserting by user_id. The validation, logging, and error handling look solid.
However, there are a few areas to verify:
- Database schema alignment: Ensure the `conversation_context` table has the expected structure, with `user_id` as a unique constraint for the upsert operation.
- Platform ID validation: The code assumes `state.user_id` contains the platform-specific ID (e.g., Discord ID), but there's no validation that this ID format is appropriate for the platform.
Run the following script to verify the database schema and usage patterns:
🏁 Script executed:

```shell
#!/bin/bash
# Description: Verify database schema alignment and potential usage patterns
# Expected: Confirm conversation_context table structure and user_id uniqueness

# Check if there are any references to the old memory_thread_id approach
echo "=== Checking for old memory_thread_id usage ==="
rg -A 3 -B 3 "memory_thread_id" --type py

# Check for other usages of store_summary_to_database to understand call patterns
echo "=== Checking store_summary_to_database usage patterns ==="
rg -A 5 -B 5 "store_summary_to_database" --type py

# Check for platform ID validation patterns in the codebase
echo "=== Checking platform ID validation patterns ==="
rg -A 3 -B 3 "discord_id|slack_id|github_id" --type py
```

Length of output: 32486
I’ve reviewed the Supabase models and found that the ConversationContext model still defines a required memory_thread_id and platform field, but our upsert payload only includes user_id, conversation_summary, key_topics, total_interactions, session_start_time, and session_end_time. To align the code with the schema, we should:
- Include `platform` and `memory_thread_id` in the upsert record (to satisfy non-null constraints).
- Confirm that `user_id` has a unique constraint on the `conversation_context` table for on_conflict upserts.
- Add a simple format check for `state.user_id` (e.g., ensure Discord IDs are numeric, GitHub IDs are numeric, Slack IDs match `^U[A-Z0-9]+$`, etc.) before querying.
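The upsert-on-conflict behavior that the unique constraint enables can be demonstrated locally with SQLite (a sketch only — the real table lives in Supabase/Postgres; names mirror the PR's schema):

```python
import sqlite3

# In-memory table mirroring the relevant columns of conversation_context.
conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE conversation_context (
        user_id TEXT NOT NULL UNIQUE,
        conversation_summary TEXT
    )
""")

def upsert_summary(user_id: str, summary: str) -> None:
    # ON CONFLICT on the unique user_id column updates in place instead of duplicating.
    conn.execute(
        """
        INSERT INTO conversation_context (user_id, conversation_summary)
        VALUES (?, ?)
        ON CONFLICT(user_id) DO UPDATE SET conversation_summary = excluded.conversation_summary
        """,
        (user_id, summary),
    )

upsert_summary("u-1", "first summary")
upsert_summary("u-1", "updated summary")  # replaces the row, does not duplicate it
rows = conn.execute("SELECT user_id, conversation_summary FROM conversation_context").fetchall()
print(rows)  # [('u-1', 'updated summary')]
```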
File to update:
• backend/app/agents/devrel/nodes/summarization.py (in store_summary_to_database)
Suggested diff:

```diff
--- a/backend/app/agents/devrel/nodes/summarization.py
+++ b/backend/app/agents/devrel/nodes/summarization.py
@@ async def store_summary_to_database(state: AgentState) -> None:
         record = {
             "user_id": user_uuid,
+            "platform": state.platform,
+            "memory_thread_id": state.session_id,
             "conversation_summary": state.conversation_summary,
             "key_topics": state.key_topics,
             "total_interactions": state.interaction_count,
             "session_start_time": isoformat_or_none(state.session_start_time),
             "session_end_time": isoformat_or_none(datetime.now()),
         }
```

Also add just before the upsert:
```python
# Quick platform ID format validation
if state.platform == "discord" and not state.user_id.isdigit():
    logger.error(f"Invalid Discord ID format: {state.user_id}")
    return
# Add similar checks for slack_id and github_id as needed...
```

With these adjustments, the upsert will satisfy the table schema and guard against malformed platform IDs.
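A table-driven version of those per-platform checks could look like this (a hypothetical helper, using the ID formats the review suggests; the real project may validate differently):

```python
import re

# Suggested formats: Discord/GitHub IDs are numeric, Slack IDs match ^U[A-Z0-9]+$.
PLATFORM_ID_PATTERNS = {
    "discord": re.compile(r"^\d+$"),
    "github": re.compile(r"^\d+$"),
    "slack": re.compile(r"^U[A-Z0-9]+$"),
}

def is_valid_platform_id(platform: str, user_id: str) -> bool:
    pattern = PLATFORM_ID_PATTERNS.get(platform)
    if pattern is None:
        return True  # Unknown platform: skip format validation.
    return bool(pattern.fullmatch(user_id))

print(is_valid_platform_id("discord", "123456789012345678"))  # True
print(is_valid_platform_id("discord", "006ccfe9-7662"))       # False — looks like a UUID fragment
print(is_valid_platform_id("slack", "U02ABCD9XYZ"))           # True
```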
🤖 Prompt for AI Agents
In backend/app/agents/devrel/nodes/summarization.py from lines 139 to 182, the
upsert record is missing the required non-null fields platform and
memory_thread_id according to the ConversationContext model. Add these two
fields to the record dictionary before the upsert call. Also, before querying
the users table, add validation checks on state.user_id format based on
state.platform (e.g., check if Discord and GitHub IDs are numeric, Slack IDs
match the regex ^U[A-Z0-9]+$) and log an error and return early if the format is
invalid. Finally, verify that user_id has a unique constraint in the
conversation_context table to safely use it in the on_conflict parameter of the
upsert.
Replaces the placeholder in store_summary_to_database with logic to upsert conversation summaries into the Supabase 'conversation_context' table. Fetches user UUID, validates required fields, and ensures one summary per user using upsert. Adds error handling and logging for database operations.
007f905 to 172401d
@coderabbitai review please
Tip: For best results, initiate chat on the files or code changes.

🧩 Analysis chain
I'll review the current state of the implementation after the database schema fixes and changes you and DhruvK278 have made.

@smokeyScraper The implementation looks much better now! The database schema issues have been properly addressed. Here's what I can see:

✅ Issues Resolved:
✅ Implementation Quality:
Minor Suggestions:

The core database integration issues you and @DhruvK278 discussed have been well addressed. The implementation should now work correctly with the updated schema and avoid the UUID format errors you encountered earlier.

🏁 Script executed:
Length of output: 6931

@smokeyScraper The implementation looks excellent now! All the database schema issues have been properly resolved. Here's my review:

✅ Issues Successfully Resolved:
✅ Implementation Quality:
Code Quality Highlights:

The implementation now correctly handles the Discord ID → UUID lookup flow and maintains the 1:1 user-to-conversation relationship as intended. The database integration should work smoothly with your updated schema. Great work on resolving the schema alignment issues! 🎉

🧠 Learnings used
```python
supabase_url = os.getenv("SUPABASE_URL")
supabase_key = os.getenv("SUPABASE_KEY")
supabase: Client = create_client(supabase_url, supabase_key)
```
This leads to sync calls, btw; refer to backend/app/database/supabase/client.py and use it directly.
merged!

Replaces the placeholder database storage in store_summary_to_database with actual Supabase upsert logic. Adds environment variable loading, Supabase client initialization, and improved logging for summary storage operations.
Closes #78
📝 Description
This pull request implements persistent storage of conversation summaries by replacing the placeholder store_summary_to_database function with working Supabase logic. It ensures the DevRel agent can remember previous user interactions by upserting conversation state data into the conversation_context table on Supabase.
This enables long-term memory across sessions, improves personalization, and allows memory to persist even after restarts. The code uses environment variables to securely initialize the Supabase client and includes logging for both success and failure cases.
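The environment-based initialization described here can be sketched as follows; the helper name is illustrative, and in the project this logic lives behind `get_supabase_client()` (the env values below are placeholders):

```python
import os

def load_supabase_credentials() -> tuple:
    """Read Supabase credentials from the environment, failing fast when missing."""
    url = os.getenv("SUPABASE_URL")
    key = os.getenv("SUPABASE_KEY")
    if not url or not key:
        raise RuntimeError("SUPABASE_URL and SUPABASE_KEY must be set")
    return url, key

# Placeholder values standing in for a real .env configuration.
os.environ["SUPABASE_URL"] = "https://example.supabase.co"
os.environ["SUPABASE_KEY"] = "placeholder-key"
print(load_supabase_credentials())  # ('https://example.supabase.co', 'placeholder-key')
```

Failing fast on missing credentials surfaces misconfiguration at startup instead of as an opaque client error later.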
🔧 Changes Made
- Replaced the placeholder `store_summary_to_database` with a functional Supabase upsert.
- Upserts into the `conversation_context` table using `memory_thread_id` to ensure idempotency.
- Loads Supabase credentials from `.env`.

📷 Screenshots or Visual Changes (if applicable)
✅ Checklist
Summary by CodeRabbit
New Features
Chores