@raunakab raunakab commented Jul 8, 2025

Description

This PR adds support for parsing Confluence macros and removes their unnecessary styling.

Addresses: https://linear.app/danswer/issue/DAN-2164/add-support-for-confluence-karma-parsing.

How Has This Been Tested?

Manually tested.

@raunakab raunakab requested a review from a team as a code owner July 8, 2025 19:11
vercel bot commented Jul 8, 2025

Deployment status: internal-search ✅ Ready (updated Jul 8, 2025 7:43pm UTC)

@greptile-apps greptile-apps bot left a comment

PR Summary

Adds cleanup of Confluence macro stylings in `backend/onyx/connectors/confluence/onyx_confluence.py`, so that parsed documents no longer carry unnecessary style parameters.

  • Adds a new `_remove_macro_stylings` function that strips unnecessary style parameters from Confluence HTML content
  • Removes legacy commented-out code (`_wrap_methods` and `_ensure_token_valid`)
  • Specifically targets `ac:parameter` elements with the attribute `ac:name="page"` for removal
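
To make the summary above concrete, here is a minimal sketch of the kind of cleanup it describes: dropping `ac:parameter` elements with `ac:name="page"` that sit inside an `ac:structured-macro`. This is not the code from the PR. The names `MacroStylingStripper` and `remove_macro_stylings` are hypothetical, and the sketch assumes the standard-library `html.parser` rather than whatever HTML library the connector actually uses. Comments and CDATA sections are not preserved by this sketch.

```python
from html.parser import HTMLParser


class MacroStylingStripper(HTMLParser):
    """Re-emit markup, dropping <ac:parameter ac:name="page"> inside macros.

    Hypothetical helper for illustration; not the PR's implementation.
    """

    def __init__(self) -> None:
        # Leave entities such as &amp; unconverted so output mirrors input.
        super().__init__(convert_charrefs=False)
        self.out: list[str] = []
        self.macro_depth = 0  # nesting level of ac:structured-macro
        self.skip_depth = 0   # > 0 while inside a dropped ac:parameter

    def _is_page_param(self, tag, attrs) -> bool:
        return (
            tag == "ac:parameter"
            and self.macro_depth > 0
            and dict(attrs).get("ac:name") == "page"
        )

    def handle_starttag(self, tag, attrs):
        if self.skip_depth:
            if tag == "ac:parameter":
                self.skip_depth += 1
            return
        if tag == "ac:structured-macro":
            self.macro_depth += 1
        elif self._is_page_param(tag, attrs):
            self.skip_depth = 1
            return
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags: emit as-is unless they are being dropped.
        if self.skip_depth or self._is_page_param(tag, attrs):
            return
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_depth:
            if tag == "ac:parameter":
                self.skip_depth -= 1
            return
        if tag == "ac:structured-macro" and self.macro_depth:
            self.macro_depth -= 1
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        if not self.skip_depth:
            self.out.append(data)

    def handle_entityref(self, name):
        if not self.skip_depth:
            self.out.append(f"&{name};")

    def handle_charref(self, name):
        if not self.skip_depth:
            self.out.append(f"&#{name};")


def remove_macro_stylings(markup: str) -> str:
    """Return markup with page-styling parameters removed from every macro."""
    parser = MacroStylingStripper()
    parser.feed(markup)
    parser.close()
    return "".join(parser.out)


if __name__ == "__main__":
    sample = (
        '<p>Intro</p><ac:structured-macro ac:name="include">'
        '<ac:parameter ac:name="page">Style Guide</ac:parameter>'
        '<ac:parameter ac:name="title">Kept</ac:parameter>'
        "</ac:structured-macro>"
    )
    print(remove_macro_stylings(sample))
```

The sketch walks every macro in the input instead of stopping at the first match, so a page with several `ac:structured-macro` elements is cleaned in one pass. Parameters outside a macro, or with a different `ac:name`, are left untouched.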

1 file reviewed, 1 comment

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>
@Orbital-Web Orbital-Web left a comment

lgtm

@raunakab raunakab added this pull request to the merge queue Jul 11, 2025
Merged via the queue into main with commit 3f31340 Jul 11, 2025
14 of 15 checks passed
@raunakab raunakab deleted the feat/confluence-karma branch July 11, 2025 02:56
AnkitTukatek pushed a commit to TukaTek/onyx that referenced this pull request Sep 23, 2025
* Remove macro stylings from HTML tree

* Add params

* Handle multiple cases of `ac:structured-macro` being found.

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>

---------

Co-authored-by: greptile-apps[bot] <165735046+greptile-apps[bot]@users.noreply.github.com>