paddleocr-vl 开启use_chart_recognition图表检测,如何保留chart截图? #16781
              
                Unanswered
              
          
                  
                    
                      zhongguogu
                    
                  
                
                  asked this question in
                Q&A
              
            Replies: 2 comments
-
| save_to_img() | 
Beta Was this translation helpful? Give feedback.
                  
                    0 replies
                  
                
            -
| 您好,一般图表解析都是图像,解析table二选一,paddleocr-vl 并没有相关参数,需要修改源码。可以pip show paddlex, 查看paddlex源码的本地路径,然后将这行代码https://github.com/PaddlePaddle/PaddleX/blob/b2ebed2ae7cb53904c5c4763d5fce3b89a55e4e7/paddlex/inference/pipelines/paddleocr_vl/pipeline.py#L215 改成 image_labels = (
            IMAGE_LABELS + ["chart"]
        ) | 
Beta Was this translation helpful? Give feedback.
                  
                    0 replies
                  
                
            
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
-
pipeline = PaddleOCRVL(
vl_rec_backend="vllm-server",
vl_rec_server_url="http://127.0.0.1:8118/v1",
use_chart_recognition=True
)
pipeline.predict( pdf_path)
解析pdf文件,如果未开启use_chart_recognition,则images/目录下会保存chart图表,没有普通表格截图。
如果开启了图表识别后,文件中的图表被解析成table格式,但是imgaes/目录下不会保存chart图表截图,这个是否有参数配置,保留chart截图
Beta Was this translation helpful? Give feedback.
All reactions