feat(code_executors): Add GkeCodeExecutor for sandboxed code execution on GKE #1629

syangx39 · 2025-06-24T19:45:26Z

Summary

This PR introduces GkeCodeExecutor, a new code executor that runs LLM-generated code securely and at scale on GKE Sandbox, which uses gVisor for workload isolation. It is intended as a more strongly isolated alternative to local or standard containerized executors.

For each code execution request, it dynamically creates an ephemeral Kubernetes Job with a hardened Pod configuration, so that each code execution runs in a clean, isolated environment.

Key Features of GkeCodeExecutor

Dynamic Job Creation: Uses the Kubernetes batch/v1 API to create a new Job for each code snippet.
Secure Code Mounting: Injects code into the Pod via a temporary ConfigMap, which is mounted as a read-only file.
gVisor Sandboxing: Enforces execution within a gVisor runtime for kernel-level isolation.
Hardened Security Context: Pods run as non-root with all Linux capabilities dropped and a read-only root filesystem.
Resource Management: Applies configurable CPU and memory limits to prevent abuse.
Automatic Cleanup: Sets ttl_seconds_after_finished on each Job so that Kubernetes automatically garbage-collects completed Jobs and their Pods.
Node Scheduling: The executor uses Kubernetes tolerations in its Pod specification. This allows the k8s scheduler to place the execution Pod onto a pre-configured gVisor-enabled node.
Module Integration: The GkeCodeExecutor is registered in code_executors/__init__.py, making it available for use by agents. The ImportError handling there checks for the required kubernetes SDK.
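To make the features above concrete, here is a minimal sketch of the two manifests such an executor might submit for one snippet, written as plain Python dicts so it runs without the kubernetes SDK. It is not the code from this PR: the function name build_manifests, the adk-exec- naming scheme, the UID 1001, and the default limits are assumptions for illustration. The toleration key follows the taint GKE Sandbox documents for its gVisor node pools, and the manifest field ttlSecondsAfterFinished corresponds to ttl_seconds_after_finished in the Python client.

```python
import uuid


def build_manifests(code: str, namespace: str = "default",
                    image: str = "python:3.11-slim",
                    cpu_limit: str = "500m", mem_limit: str = "512Mi",
                    ttl_seconds: int = 600) -> tuple[dict, dict]:
    """Return (configmap, job) manifests for one code snippet (illustrative)."""
    # Hypothetical naming scheme; any unique DNS-compatible name would do.
    name = f"adk-exec-{uuid.uuid4().hex[:10]}"

    # The code travels in a ConfigMap under the key "code.py".
    configmap = {
        "apiVersion": "v1",
        "kind": "ConfigMap",
        "metadata": {"name": name, "namespace": namespace},
        "data": {"code.py": code},
    }

    container = {
        "name": "code-runner",
        "image": image,
        "command": ["python3", "/app/code.py"],
        # Mounting the ConfigMap at /app exposes its key as /app/code.py.
        "volumeMounts": [{"name": "code", "mountPath": "/app", "readOnly": True}],
        # Hardened context: non-root, no capabilities, read-only root filesystem.
        "securityContext": {
            "runAsNonRoot": True,
            "runAsUser": 1001,  # assumed UID, not taken from the PR
            "allowPrivilegeEscalation": False,
            "readOnlyRootFilesystem": True,
            "capabilities": {"drop": ["ALL"]},
        },
        "resources": {"limits": {"cpu": cpu_limit, "memory": mem_limit}},
    }

    job = {
        "apiVersion": "batch/v1",
        "kind": "Job",
        "metadata": {"name": name, "namespace": namespace},
        "spec": {
            "backoffLimit": 0,  # do not let Kubernetes re-run failed code
            "ttlSecondsAfterFinished": ttl_seconds,  # automatic cleanup
            "template": {"spec": {
                "restartPolicy": "Never",
                "runtimeClassName": "gvisor",
                # Lets the scheduler place the Pod on tainted gVisor nodes.
                "tolerations": [{
                    "key": "sandbox.gke.io/runtime",
                    "operator": "Equal",
                    "value": "gvisor",
                    "effect": "NoSchedule",
                }],
                "containers": [container],
                "volumes": [{"name": "code", "configMap": {"name": name}}],
            }},
        },
    }
    return configmap, job
```

Building the manifests in one pure function keeps the security-relevant settings in a single place that can be unit-tested without a cluster.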

Execution Flow:

The agent invokes GkeCodeExecutor with the LLM-generated code.
In execute_code, the GkeCodeExecutor creates a temporary ConfigMap containing the code, then creates a k8s Job to run it.
This Job runs a standard python:3.11-slim container. The image is pulled once to the node and cached. The Job mounts the ConfigMap as /app/code.py.
The GkeCodeExecutor monitors the Job to completion, fetches stdout/stderr logs from the container, returns a CodeExecutionResult to the LlmAgent, and ensures all temporary resources are deleted.
The calling agent formats the result and provides a final response to the user. If the result contains an error, it retries up to error_retry_attempts times.
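The executor's part of this flow can be sketched as follows. This is a simplified illustration, not the PR's implementation: the Cluster protocol and FakeCluster are hypothetical stand-ins for the Kubernetes API calls, CodeExecutionResult is a reduced version of the ADK type, and deleting the ConfigMap in a finally block (while leaving the Job to its TTL) is an assumption about how cleanup could be arranged.

```python
from dataclasses import dataclass, field
from typing import Protocol


@dataclass
class CodeExecutionResult:
    """Reduced stand-in for the ADK result type."""
    stdout: str = ""
    stderr: str = ""


class Cluster(Protocol):
    """Hypothetical wrapper over the Kubernetes calls the executor needs."""
    def create_configmap(self, name: str, code: str) -> None: ...
    def create_job(self, name: str) -> None: ...
    def wait_for_job(self, name: str, timeout_s: int) -> bool: ...
    def read_logs(self, name: str) -> str: ...
    def delete_configmap(self, name: str) -> None: ...


def execute_code(cluster: Cluster, name: str, code: str,
                 timeout_s: int = 300) -> CodeExecutionResult:
    """Run one snippet: ConfigMap, Job, wait, logs, cleanup."""
    cluster.create_configmap(name, code)
    try:
        cluster.create_job(name)
        succeeded = cluster.wait_for_job(name, timeout_s)
        logs = cluster.read_logs(name)
        # Route logs to stdout on success and to stderr on failure.
        if succeeded:
            return CodeExecutionResult(stdout=logs)
        return CodeExecutionResult(stderr=logs)
    finally:
        # Runs even if Job creation or log retrieval raises.
        cluster.delete_configmap(name)


@dataclass
class FakeCluster:
    """In-memory stand-in so the sketch runs without a cluster."""
    succeed: bool = True
    calls: list = field(default_factory=list)

    def create_configmap(self, name: str, code: str) -> None:
        self.calls.append("create_configmap")

    def create_job(self, name: str) -> None:
        self.calls.append("create_job")

    def wait_for_job(self, name: str, timeout_s: int) -> bool:
        self.calls.append("wait_for_job")
        return self.succeed

    def read_logs(self, name: str) -> str:
        self.calls.append("read_logs")
        return "fake logs"

    def delete_configmap(self, name: str) -> None:
        self.calls.append("delete_configmap")
```

For example, execute_code(FakeCluster(), "job-1", "print(1)") walks through the five calls in order and returns the fake logs as stdout; with FakeCluster(succeed=False) the same logs come back as stderr, and the ConfigMap is still deleted.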

syangx39 added 4 commits June 24, 2025 17:25

[06/24] Add gke_code_executor.py

7ddd66d

[06/24] Add gke_code_executor.py

e8635b9

[06/24] Add gke_code_executor.py

3dd917b

[06/24] Add gke_code_executor.py

bdbda05
