@@ -12,7 +12,7 @@
 - 🎮 **GPU Temperature Support**: Optional GPU temperature monitoring with NVIDIA Container Toolkit
 - ⚙️ **Environment Variable Control**: Complete configuration through `.env` files
 - 🐳 **Docker Deployment**: One-click deployment with no manual environment setup
-- 🔥 **Decision Temperature Algorithm**: `max(disk_temp, gpu_temp - 20°C)` ensures optimal cooling
+- 🔥 **Decision Temperature Algorithm**: `max(disk_temp, gpu_temp - GPU_TEMP_OFFSET)` ensures optimal cooling
 - 📊 **Configurable Thresholds**: Customize temperature and fan speed settings
 - 🔄 **Auto/Manual Modes**: Flexible operation modes for different use cases
 
@@ -105,22 +105,24 @@ CHECK_INTERVAL=60 # Check interval in seconds (auto mode only)
 When GPU temperature monitoring is enabled, the system uses this algorithm to calculate the decision temperature:
 
 ```text
-Decision Temperature = max(Disk Temperature, GPU Temperature - 20°C)
+Decision Temperature = max(Disk Temperature, GPU Temperature - GPU_TEMP_OFFSET)
 ```
 
 This algorithm ensures:
 
 - **Proactive GPU cooling**: High GPU temperatures trigger increased fan speeds
-- **Temperature offset compensation**: Accounts for thermal differences between GPU and system
+- **Configurable offset compensation**: Adjustable offset (`GPU_TEMP_OFFSET`) accounts for thermal differences between GPU and system
 - **Disk temperature baseline**: Disk temperature always serves as the minimum baseline
 
 ### Algorithm Examples
 
-| Disk Temp | GPU Temp | GPU-20 | Decision Temp | Reasoning |
+With default `GPU_TEMP_OFFSET=15°C`:
+
+| Disk Temp | GPU Temp | GPU-15 | Decision Temp | Reasoning |
 |-----------|----------|--------|---------------|-----------|
-| 65°C | 70°C | 50°C | **65°C** | Disk temperature is higher |
-| 65°C | 90°C | 70°C | **70°C** | GPU-20 is higher, use adjusted GPU temp |
-| 75°C | 80°C | 60°C | **75°C** | Disk temperature remains baseline |
+| 65°C | 70°C | 55°C | **65°C** | Disk temperature is higher |
+| 65°C | 85°C | 70°C | **70°C** | GPU-15 is higher, use adjusted GPU temp |
+| 75°C | 80°C | 65°C | **75°C** | Disk temperature remains baseline |
 
 ## 🔧 Troubleshooting
 
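
The decision-temperature rule changed in this hunk can be sketched in a few lines of Python. This is only an illustration of the formula `max(disk_temp, gpu_temp - GPU_TEMP_OFFSET)` as the README states it, not the project's actual implementation. The function names `decision_temperature` and `offset_from_env` are hypothetical, and falling back to the disk temperature when no GPU reading is available is an assumption.

```python
import os

# Default taken from the configuration table in this diff (GPU_TEMP_OFFSET = 15).
DEFAULT_GPU_TEMP_OFFSET = 15.0


def decision_temperature(disk_temp, gpu_temp=None, offset=DEFAULT_GPU_TEMP_OFFSET):
    """Return max(disk_temp, gpu_temp - offset).

    Assumption: with no GPU reading (monitoring disabled), the disk
    temperature alone is used.
    """
    if gpu_temp is None:
        return disk_temp
    return max(disk_temp, gpu_temp - offset)


def offset_from_env(environ=os.environ):
    """Read GPU_TEMP_OFFSET from the environment, or use the default if unset."""
    return float(environ.get("GPU_TEMP_OFFSET", DEFAULT_GPU_TEMP_OFFSET))


if __name__ == "__main__":
    # Rows from the "Algorithm Examples" table (disk temp, GPU temp), in °C.
    for disk, gpu in [(65, 70), (65, 85), (75, 80)]:
        print(disk, gpu, decision_temperature(disk, gpu, offset_from_env()))
```

Because the disk temperature is always one argument of `max`, the result can never drop below it, which matches the "disk temperature baseline" bullet above.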
@@ -133,7 +135,7 @@ This algorithm ensures:
 nvidia-smi
 
 # Verify Docker GPU support
-docker run --rm --gpus all nvidia/cuda:11.0-base nvidia-smi
+docker run --rm --gpus all nvidia/cuda:12.9.0-runtime-ubuntu24.04 nvidia-smi
 ```
 
 #### Cannot Connect to ESXi Host
@@ -184,6 +186,7 @@ docker exec idrac-fan-control tail -f /var/log/fan-control/fan_control.log
 | `OPERATION_MODE` | auto | Operation mode (auto/manual) |
 | `CHECK_INTERVAL` | 60 | Check interval in seconds |
 | `WITH_GPU_TEMP` | false | Enable GPU temperature monitoring |
+| `GPU_TEMP_OFFSET` | 15 | GPU temperature offset for decision algorithm (°C) |
 
 ## 🐳 Docker Compose Examples
 