
Conversation

@xaviliz (Contributor) commented Sep 1, 2025

New feature: OnnxPredict algorithm

Feature

This PR makes additional changes to the Essentia library to build the ONNX Runtime inference library from source and implements a new algorithm, OnnxPredict, for running ONNX models (.onnx) with multiple inputs and outputs.

Implementation

  • Provide a new build script for the ONNX Runtime inference library.
  • Modify the Essentia build scripts to link against the onnxruntime dynamic library.
  • Implement the new algorithm OnnxPredict to run ONNX models in Essentia.
  • Implement unit tests in test_onnxpredict.py.

Prerequisites

  • python >= 3.10
  • cmake >= 3.28

Testing

  • Builds successfully with ONNX Runtime v1.22.1 on macOS
    • ARM64
    • x86_64
  • Builds successfully with ONNX Runtime v1.22.1 on Linux
  • Multiple-input inference
  • Multiple-output inference
  • No runtime errors or compatibility issues

How to Test

Tested with onnxruntime v1.22.1 on:

  • macOS (ARM64) with Python 3.13.4 and CMake 4.0.2
  • Linux (Docker) with Python 3.10.18 and CMake 4.1.0

How to build ONNX Runtime

After installing the Essentia dependencies in a virtual environment, install CMake:

python3 -m pip install cmake
which cmake

Then run the build script:

cd packaging/debian_3rdparty
bash build_onnx.sh

How to build OnnxPredict

On macOS:

source .env/bin/activate
python3 waf configure --fft=KISS --include-algos=OnnxPredict,Windowing,Spectrum,MelBands,UnaryOperator,TriangularBands,FFT,Magnitude,NoiseAdder,RealAccumulator,FileOutputProxy,FrameCutter --static-dependencies --pkg-config-path=/packaging/debian_3rdparty/lib/pkgconfig --with-onnx --lightweight= --with-python --pythondir=.env/lib/python3.13/site-packages
python3 waf -v && python3 waf install

On Linux:

python3 waf configure --fft=KISS --include-algos=OnnxPredict,Windowing,Spectrum,MelBands,UnaryOperator,TriangularBands,FFT,Magnitude,NoiseAdder,RealAccumulator,FileOutputProxy,FrameCutter --static-dependencies --with-onnx --lightweight= --with-python --pkg-config-path /usr/share/pkgconfig --std=c++14
python3 waf -v && python3 waf install

How to run the unit tests

# prepare essentia audio repo
git clone https://github.com/MTG/essentia-audio.git test/essentia-audio
rm -rf test/audio && mv test/essentia-audio test/audio

# download effnet.onnx model for testing
curl https://essentia.upf.edu/models/feature-extractors/discogs-effnet/discogs-effnet-bsdynamic-1.onnx --output test/models/discogs-effnet-bsdynamic-1.onnx
python3 test/src/unittests/all_tests.py onnxpredict
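
For a quick sanity check from C++, a minimal usage sketch could look like the following. The parameter and port names ("graphFilename", "poolIn", "poolOut") are inferred from the reviewed source and may differ from the final interface.

// Minimal sketch of calling OnnxPredict through Essentia's standard factory.
// Parameter/port names here are assumptions based on the PR source.
#include <essentia/algorithmfactory.h>
#include <essentia/pool.h>

int main() {
  essentia::init();

  essentia::Pool poolIn, poolOut;
  // Fill poolIn with the model's input tensor(s) before computing.

  essentia::standard::Algorithm* predict =
      essentia::standard::AlgorithmFactory::create(
          "OnnxPredict",
          "graphFilename", "test/models/discogs-effnet-bsdynamic-1.onnx");

  predict->input("poolIn").set(poolIn);
  predict->output("poolOut").set(poolOut);
  predict->compute();

  delete predict;
  essentia::shutdown();
  return 0;
}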

@palonso (Contributor) left a comment

Great work @xaviliz!! I left some comments; some are questions about things I didn't understand.

OS=$(uname -s)
CONFIG=Release

if [ "$OS" = "Darwin" ]; then
@palonso (Contributor) commented:
@xaviliz, since we are inside debian_3rdparty, should we remove the macOS support or move it somewhere else?

@xaviliz (Author) replied:
Yes, that's true. I kept it for testing purposes. Let me clean it up a bit.

@xaviliz (Author) replied:
It has been tested on Linux.

const char* OnnxPredict::name = "OnnxPredict";
const char* OnnxPredict::category = "Machine Learning";

const char* OnnxPredict::description = DOC("This algorithm runs a Onnx graph and stores the desired output tensors in a pool.\n"
@palonso (Contributor) commented:
an ONNX graph?

@xaviliz (Author) replied:
It should be an ONNX model; there is no access to graphs in onnxruntime. It is fixed now.


// Do not do anything if we did not get a non-empty model name.
if (_graphFilename.empty()) return;
cout << "after return" << endl;
@palonso (Contributor) commented:
Clean debug output

_env = Ort::Env(ORT_LOGGING_LEVEL_WARNING, "multi_io_inference"); // {"default", "test", "multi_io_inference"}

// Set graph optimization level - check https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html
_sessionOptions.SetGraphOptimizationLevel(GraphOptimizationLevel::ORT_ENABLE_EXTENDED);
@palonso (Contributor) commented:
Since there are different optimization options, I'm wondering if there is a chance that extended optimization doesn't work or affects model performance in some cases. I think this should be turned into a parameter that defaults to extended.

https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html#graph-optimization-levels

@xaviliz (Author) replied:
That's a good point; I am not sure how the optimizations could affect performance. Adding a new parameter sounds good to me. So, do you propose adding a boolean parameter for each optimization, or just a string to select one of them?
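
For the string option, something along these lines could work (a rough sketch; the parameter name "graphOptimizationLevel" and the accepted values are just placeholders):

// Sketch: map a string parameter to an ONNX Runtime graph optimization level,
// defaulting to "extended" as in the current code.
#include <onnxruntime_cxx_api.h>
#include <stdexcept>
#include <string>

static GraphOptimizationLevel parseOptimizationLevel(const std::string& level) {
  if (level == "disabled") return GraphOptimizationLevel::ORT_DISABLE_ALL;
  if (level == "basic")    return GraphOptimizationLevel::ORT_ENABLE_BASIC;
  if (level == "extended") return GraphOptimizationLevel::ORT_ENABLE_EXTENDED;
  if (level == "all")      return GraphOptimizationLevel::ORT_ENABLE_ALL;
  throw std::invalid_argument("unknown graphOptimizationLevel: " + level);
}

// In configure(), something like:
//   _sessionOptions.SetGraphOptimizationLevel(
//       parseOptimizationLevel(parameter("graphOptimizationLevel").toString()));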

// Set graph optimization level - check https://onnxruntime.ai/docs/performance/model-optimizations/graph-optimizations.html
_sessionOptions.SetGraphOptimizationLevel(GraphOptimizationLevel::ORT_ENABLE_EXTENDED);
// To enable model serialization after graph optimization set this
_sessionOptions.SetOptimizedModelFilePath("optimized_file_path");
@palonso (Contributor) commented:
I think this is mainly intended for debugging purposes. Can we skip saving the optimized graph for efficiency?

https://onnxruntime.ai/docs/api/c/struct_ort_api.html#ad238e424200c0f1682947a1f342c39ca

@xaviliz (Author) replied:
Yes, we don't need to store the optimized graph in a model file.

return out;
}

void OnnxPredict::reset() {
@palonso (Contributor) commented:
Shouldn't we reset the session and env too?

@xaviliz (Author) replied:
That's a good point. I couldn't find a reset method for the session and env in the C++ API like in TensorFlow, but let me try it with std::unique_ptr; maybe that could work. However, I am not sure whether we should do that after compute(), because if we reset the session at the end of configure(), session.Run() will fail.
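
Something like this is what I have in mind (a rough sketch; member names are illustrative, not the current code):

// Sketch: keep the session behind a std::unique_ptr so reset() can release it
// and configure() can rebuild it before the next Run().
#include <onnxruntime_cxx_api.h>
#include <memory>
#include <string>

struct SessionHolder {
  std::unique_ptr<Ort::Session> session;

  void configure(Ort::Env& env, const std::string& modelPath,
                 const Ort::SessionOptions& options) {
    // Rebuild the session whenever the model is (re)configured.
    session = std::make_unique<Ort::Session>(env, modelPath.c_str(), options);
  }

  void reset() {
    // Releases the ONNX Runtime session; a later configure() creates a fresh
    // one, so Run() is never called on a released session.
    session.reset();
  }
};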

const Pool& poolIn = _poolIn.get();
Pool& poolOut = _poolOut.get();

std::vector<std::vector<float>> input_datas; // <-- keeps inputs alive
@palonso (Contributor) commented:
input_datas -> input_data?
I think data is already plural

// Step 2: Convert data to float32
input_datas.emplace_back(inputData.size());
for (size_t j = 0; j < inputData.size(); ++j) {
input_datas.back()[j] = static_cast<float>(inputData.data()[j]);
@palonso (Contributor) commented:
Instead of force-casting the data to float, shouldn't we keep it in Real format (which is actually float32 by default) and make sure that the model runs with whatever type Real maps to?
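
Something along these lines could avoid the copy whenever Real is float (just a sketch; the helper name is illustrative, and the tensor does not own the buffer, so the pool data must stay alive until Run() returns):

// Sketch: when essentia::Real is float (the default), wrap the existing buffer
// directly instead of copying element by element.
#include <essentia/types.h>
#include <onnxruntime_cxx_api.h>
#include <cstdint>
#include <type_traits>
#include <vector>

static_assert(std::is_same<essentia::Real, float>::value,
              "this shortcut assumes Real is float");

Ort::Value makeInputTensor(Ort::MemoryInfo& memoryInfo,
                           std::vector<essentia::Real>& inputData,
                           const std::vector<int64_t>& shape) {
  return Ort::Value::CreateTensor<float>(
      memoryInfo, inputData.data(), inputData.size(),
      shape.data(), shape.size());
}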

}

// Step 3: Create ONNX Runtime tensor
_memoryInfo = Ort::MemoryInfo::CreateCpu(OrtArenaAllocator, OrtMemTypeDefault);
@palonso (Contributor) commented:
Would it be possible to run the models on GPU if available?
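
For example, with a GPU-enabled onnxruntime build the CUDA execution provider could be appended before creating the session, roughly like this (untested sketch; it falls back to CPU when the provider is unavailable):

// Sketch: opt-in CUDA execution provider. Requires an onnxruntime build with
// CUDA support; otherwise the default CPU provider is kept.
#include <onnxruntime_cxx_api.h>
#include <iostream>

void enableCudaIfAvailable(Ort::SessionOptions& sessionOptions, int deviceId = 0) {
  try {
    OrtCUDAProviderOptions cudaOptions{};
    cudaOptions.device_id = deviceId;
    sessionOptions.AppendExecutionProvider_CUDA(cudaOptions);
  }
  catch (const Ort::Exception& e) {
    std::cerr << "CUDA provider unavailable, using CPU: " << e.what() << std::endl;
  }
}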

def _create_essentia_class(name, moduleName = __name__):
essentia.log.debug(essentia.EPython, 'Creating essentia.standard class: %s' % name)

# print(f"name: {name}")
@palonso (Contributor) commented:
remove debug print
