You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[Nebius] Add Docker support for Nebius cloud (#5334)
* Add Docker support for Nebius cloud
This commit introduces Docker compatibility for Nebius, enabling users to specify Docker images and runtime options. It includes updates to templates, resource management, and Docker daemon initialization to ensure smooth GPU integration and proper runtime setup. Relevant tests have been adjusted to reflect the new functionality.
* Enable Nebius support for H100 GPUs and CUDA >= 12.0
Updated templates and tests to add Nebius compatibility for H100 GPUs and CUDA versions 12.0 and above. Adjusted smoke test parameters and logic to incorporate these changes while maintaining backward compatibility.
* fix typo
* fix typo
* Update formatting and clarify Docker installation logic
Adjusted spacing in templates for consistency. Added inline documentation to explain the Docker installation process for specific OS images, optimizing deployment speed by selectively installing Docker only when required.
* Fix GPU runtime option and refine Nebius template logic
Corrected the comment in `nebius.py` regarding GPU runtime options and adjusted YAML logic to handle docker installation conditionally. Improved formatting and removed redundant lines to enhance clarity and maintainability.
* Apply suggestions from code review
Co-authored-by: Tian Xia <cblmemo@gmail.com>
---------
Co-authored-by: Seung Jin <seungjin219@gmail.com>
Co-authored-by: Tian Xia <cblmemo@gmail.com>
0 commit comments