Fix: SQL export reports success on failure #63#64
Merged
Conversation
The --sql-export command was displaying a misleading success message even when data conversion errors occurred. This was because the `format_sql_value` function would log a warning and return NULL on conversion failure, but it did not propagate the error. This commit makes the data conversion stricter by raising an `SQLExportError` when a conversion for a NOT NULL column fails. The `run_sql_export` function now handles this exception, counts the errors, and returns `False` if any errors occurred. It also deletes the incomplete SQL file to avoid leaving invalid artifacts. This ensures that the SQL export process provides accurate feedback and only reports success when the export is actually successful.
4 tasks
Tests now expect SQLExportError when None is provided for NOT NULL columns. Updated assertions to match new SQL statement formatting with quoted column names. This improves test accuracy and enforces stricter validation in SQL export logic.
monozoide
added a commit
that referenced
this pull request
Oct 17, 2025
* workflows/update-gitignore-and-create-mls-ci #34 (#38) * workflows/update-gitignore #34 Updated .gitignore to allow .log files in the repository for test data. * workflow/add-gh-actions-workflow #34 Introduces a CI workflow that runs linting and tests on code changes to the main and dev branch, while skipping these steps for documentation-only changes. This setup uses flake8 for linting and pytest for testing, and optimizes CI runs by detecting code vs. docs changes. * docs: Add sample email report and log files to dataset #31 (#39) Added sample_email_report_output.txt, sample_mail.log, and sample_sasl.log to docs/dataset for documentation and testing purposes. These files provide example outputs and logs for MailLogSentinel. * chore: add standardized PR templates for all contribution types (#35) (#42) * Add PR templates Introduces standardized pull request templates for bugfixes, code changes, documentation, CI/CD, and features in the .github/PR_TEMPLATES directory. These templates help ensure consistent and thorough PR descriptions, validation steps, and project checklists across different types of contributions. * Delete PULL_REQUEST_TEMPLATE.md #35 Deleted the .github/PULL_REQUEST_TEMPLATE.md file. This change removes the default template for new pull requests. * Revamp README with clearer setup and feature guide #32 (#43) * Revamp README with clearer setup and feature guide #32 The README has been rewritten for clarity and conciseness, featuring a new quick start guide, clearer prerequisites, simplified command references, and improved documentation links. The overview, installation, and usage instructions are now more accessible, and advanced features are summarized with direct links to the Wiki. The new format is more user-friendly for first-time users and contributors. * Update README links and formatting #32 Corrected documentation and sample output links, updated the contributing guide URL, and improved formatting for the closing quote in the README. * Fix relative link to sample email report in README #32 Updated the link to the daily email report example to use the correct relative path, ensuring the documentation points to the right file location. * ci: fix path filter for docs-only changes #44 (#45) Enhanced the GitHub Actions workflow to better distinguish between code and documentation changes using separate path filters for pull requests and pushes. Updated the lint job to use Python 3.11 and ruff instead of flake8, and improved dependency installation for both lint and test jobs. The workflow now supports a fast path for documentation-only changes, skipping unnecessary jobs. * Revise and expand contributing guidelines #33 (#46) * Revise and expand contributing guidelines #33 Updated CONTRIBUTING.md with clearer, more structured quick-start instructions and recommendations. Added a new CONTRIBUTING_DETAILED.md file providing comprehensive workflow, commit signing, quality standards, and contribution requirements to help contributors follow best practices. * Fix relative links in contributing docs #33 Updated relative paths in CONTRIBUTING.md and CONTRIBUTING_DETAILED.md to ensure links to detailed guidelines and discussions work correctly. Closes #33 * Revise and condense maillogsentinel man page #47 (#49) The man page for maillogsentinel was rewritten for clarity, brevity, and improved structure. Redundant and verbose sections were condensed, option descriptions were clarified, and auxiliary tool documentation was streamlined. The new version emphasizes practical usage, configuration, diagnostics, and security best practices, while removing excessive detail and outdated formatting. * Add manpages for ipinfo and log_anonymizer #48 (#50) Introduces manual pages for the ipinfo and log_anonymizer command-line tools, providing usage instructions, options, examples, and related information for system administrators. * Add comprehensive FAQ documentation (#51) Introduces a detailed FAQ (docs/wiki/FAQ.md) covering installation, configuration, usage, maintenance, integrations, troubleshooting, data analysis, security, and development for MailLogSentinel. This resource aims to assist users and contributors with common questions and operational guidance. * Update documentation links and add manual pages #52 (#53) Adjusted wiki links to use correct relative paths, added FAQ link, and included references to manual pages for maillogsentinel, ipinfo, and log_anonymizer in the README. * Create readable markdown versions of manpages #54 (#55) Introduces manual pages in Markdown format for the ipinfo, log_anonymizer, and maillogsentinel utilities. These documents provide usage instructions, options, configuration details, examples, and security considerations for each tool as part of the MailLogSentinel project. * Add Debian install guide for MailLogSentinel #21 (#56) Introduces a comprehensive installation and configuration guide for MailLogSentinel on Debian 12/13. The guide covers prerequisites, system preparation, installation steps, configuration, verification, service and timer setup, advanced options, troubleshooting, security, and additional resources. * Fix linting errors in CI workflow #58 (#59) * Fix linting errors in CI workflow #58 This commit fixes a number of linting errors that were causing the CI workflow to fail. The errors were primarily related to unused imports, f-strings without placeholders, and unused variables. * Refactor and clean up test code #58 Removed unused imports and variables in test_maillogsentinel_setup.py and test_sql_exporter.py. Updated test_run_sql_export_basic_flow to use context manager for patching datetime and simplified the mocking of the logger. These changes improve test clarity and maintainability. * Remove duplicate unittest.mock import #58 Consolidated the import of patch and MagicMock from unittest.mock to avoid redundancy in the test file. * Remove unused MagicMock import #58 Cleaned up the import statements by removing MagicMock, which was not used in the test file. * Fix Python 3.13 compatibility with pathlib #61 (#62) Refactored the SQL import/export functionality to use `importlib.resources.as_file` instead of the deprecated `pathlib.Path` context manager. This resolves a crash on Python 3.13, where `pathlib.Path` objects no longer support the context manager protocol. close #61 * Fix: SQL export reports success on failure #63 (#64) * Fix: SQL export reports success on failure #63 The --sql-export command was displaying a misleading success message even when data conversion errors occurred. This was because the `format_sql_value` function would log a warning and return NULL on conversion failure, but it did not propagate the error. This commit makes the data conversion stricter by raising an `SQLExportError` when a conversion for a NOT NULL column fails. The `run_sql_export` function now handles this exception, counts the errors, and returns `False` if any errors occurred. It also deletes the incomplete SQL file to avoid leaving invalid artifacts. This ensures that the SQL export process provides accurate feedback and only reports success when the export is actually successful. * Update SQL export tests #63 Tests now expect SQLExportError when None is provided for NOT NULL columns. Updated assertions to match new SQL statement formatting with quoted column names. This improves test accuracy and enforces stricter validation in SQL export logic. Closes #63 * Improve NULL handling for NOT NULL integer columns Fix #65 (#66) Refines the logic in format_sql_value to treat columns as nullable only if 'NOT NULL' is absent from the SQL type definition. Adds stricter validation to prevent empty strings from being converted to integers for NOT NULL columns, and introduces a new test case to verify data conversion failure for NOT NULL integer fields.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
🐞 Bug Summary
The
--sql-exportcommand incorrectly reports success even when data conversion errors occur, resulting in an invalid or empty SQL file being created. When values like 'N/A' cannot be converted to integers for NOT NULL columns, the current code logs a warning and proceeds silently, giving users a false sense that their export succeeded.Severity: high
🔁 Reproduction
--sql-exporton a dataset containing null-like values ('N/A', 'null', etc.) in NOT NULL integer columnsActual result:
The command returns a success message and zero exit code, but creates an invalid SQL file with missing or improperly converted data. Conversion errors are silently suppressed.
Expected result:
The command should detect conversion errors for NOT NULL columns, fail immediately, provide appropriate error feedback, and clean up any incomplete SQL files before returning a failure status.
🌍 Runtime Context
🕵️ Analysis & Root Cause
The root cause stems from overly permissive error handling in the data conversion pipeline:
format_sql_valuefunction silently returns NULL when conversion fails, even for NOT NULL columns, without raising any error signalgenerate_insert_statementfunction does not validate conversion success or propagate errorsrun_sql_exportfunction has no error aggregation mechanism, so it cannot detect when conversion failures have occurredThis silent error suppression allows the process to report success while producing unusable data.
format_sql_valuecombined with lack of error tracking inrun_sql_export✅ Applied Fix
The fix implements strict error handling and validation throughout the conversion pipeline:
Changes to
format_sql_value:SQLExportError(new custom exception) when a null-like value ('N/A', 'null', etc.) is provided for a NOT NULL columnSQLExportErrorwhen type conversion (e.g., integer conversion) fails for a NOT NULL columnChanges to
generate_insert_statement:format_sql_valuecalls to catchSQLExportErrorChanges to
run_sql_export:conversion_errorscounter initialized to 0SQLExportErrorconversion_errors > 0: logs a final error message, deletes the incomplete SQL file, and returnsFalseThis ensures strict validation of data conversions while providing comprehensive error reporting and cleanup.
sql_exporter.py🧪 Tests
Manual validation procedure:
--sql-exporton this dataset--sql-exporton the valid dataset🔗 Links
Closes #63
✅ Project checklist
lintpasses locally and in CI