Replies: 1 comment 1 reply
-
| 
         I think it is platform-dependent, the current x86_64 code in particular performing worse than, say, arm64. But if I knew why exactly and how to fix it, we wouldn't be having this discussion  | 
  
Beta Was this translation helpful? Give feedback.
                  
                    1 reply
                  
                
            
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @martin-frbg
I noticed that the gemv function in OpenBLAS performs equally or even better in a single thread compared to multiple threads. Are there specific factors like memory access patterns, workload distribution, or threading overhead affecting this behavior?
Beta Was this translation helpful? Give feedback.
All reactions