Unable to read table which contains sub/mini table inside parent table #4632
Replies: 4 comments 16 replies
-
Stacked tables are not supported, and will not be in the foreseeable future. ['Protocol Title', 'Abc', None]
['Brief Title', 'Def', None]
['Study Intervention', 'ghi', None]
['Background and\nRationale', 'jkl', None]
['Objectives and\nEndpoints', 'Objectives', 'Endpoints']
['Primary', 'mno', 'pqr']
['Secondary', '• stu\n• vwx', '• yz\n• 123']
['Overall Design', '456\n789: 101112\n131415\n161718:192021', None] Or this one using pandas:
|
Beta Was this translation helpful? Give feedback.
-
![]() @JorjMcKie I do have one more table as shown in above image however that table is not getting identified by find_tables(). can you please help me with this code and explain why find_tables() function is not able to read this table. in attached pdf there are few more tables which are not getting identified by find_tables() format. help me understand as my pdf contains many customized table formats. |
Beta Was this translation helpful? Give feedback.
-
I used below code to identify whether the page contains only text or table or image or any of them or all of them. however, even my page contains complete image it shows table as present. please look at the below code and let me know if i did anything wrong. def analyze_page_content(pdf_path):
df = analyze_page_content(pdf_path) |
Beta Was this translation helpful? Give feedback.
-
@JorjMcKie i am searching to connect you in linkedin, can u please share the linkedin id if its ok with you |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
test.pdf
Hello guys,
I am trying to read different formats of tables from pdf using Pymupdf ==1.26.3 version however as in the attached pdf if i try to read it is not reading as expected. I have other table formats as well which are not properly read by find_tables and then converting to data frame.
@JorjMcKie Please help me on how to read the pdf
Thank you in advance
Beta Was this translation helpful? Give feedback.
All reactions