The complete solution guide for hardware and driver issues
#Overview
GPU and VRAM errors are the #2 most common problem in ComfyUI and Stable Diffusion (after installation issues). This guide solves every major GPU-related error reported on Reddit.
Hardware Partner
Running these workflows? ComputeAtlas.ai helps you find the right GPU
Optimization is only half the battle. Get precise VRAM benchmarks and hardware recommendations tailored for ComfyUI.
Check GPU Prices →#🔴 Error: "CUDA not available" or "Device: cpu"
Symptoms
- →Console shows
Device: cpuinstead ofDevice: cuda - →Generation is extremely slow
- →Error message: "CUDA not available"
Root Cause
PyTorch installed without CUDA support, or incorrect CUDA version
Solution
Step 1: Check Your GPU
Windows:
Expected output:
Note your CUDA Version (e.g., 12.1)
Step 2: Reinstall PyTorch with CUDA
Uninstall current PyTorch:
Install correct CUDA version:
For CUDA 12.1:
For CUDA 11.8:
Step 3: Verify
Restart ComfyUI. Console should now show:
#🔴 Error: "CUDA out of memory" (OOM)
Symptoms
- →Error:
RuntimeError: CUDA out of memory - →Generation crashes mid-process
- →Can't complete workflows that previously worked
Understanding VRAM
| GPU | VRAM | What You Can Run |
|---|---|---|
| GTX 1660 Ti | 6GB | SD 1.5 only, 512×512 max |
| RTX 3060 | 12GB | SD 1.5 + SDXL, 768×768 |
| RTX 3080 | 10GB | SDXL, 1024×1024 |
| RTX 4090 | 24GB | Everything, high-res, video |
Solution 1: Reduce Image Resolution
Current workflow using 1024×1024?
Try:
- →768×768 (SDXL)
- →512×512 (SD 1.5)
How to change:
Find the "Empty Latent Image" node → Change width and height
Solution 2: Enable Low VRAM Mode
For 4-8GB GPUs:
Run ComfyUI with:
For very low VRAM (< 4GB):
Portable version:
Edit run_nvidia_gpu.bat, add --lowvram:
Solution 3: Use Tiled VAE
Install ComfyUI-TiledKSampler custom node:
- →Go to
ComfyUI/custom_nodes - →Clone:
comfyui-workflow.json
- →Restart ComfyUI
- →Replace VAE Decode node with Tiled VAE Decode
Result: Reduces VRAM usage by ~40-60%
Solution 4: Reduce Batch Size
Find any node with batch_size → Set to 1
Example:
Solution 5: Use Lower Precision Models
SD 1.5 models:
- →Use
fp16versions instead offp32 - →Half the VRAM usage
Where to find: Model filename contains:
- →
fp16← Use this - →
fp32← Avoid on low VRAM
#🔴 Mac GPU Not Being Used (M1/M2/M3)
Symptoms
- →Generation extremely slow on Mac
- →Console shows
Device: cpu - →Activity Monitor shows low GPU usage
Solution
Force MPS (Metal Performance Shaders):
For maximum performance:
Verify: Console should show:
#🔴 Error: "RuntimeError: No CUDA GPUs are available"
Cause
- →GPU driver not installed
- →Driver outdated
- →GPU not detected by system
Solution
Step 1: Update GPU Drivers
NVIDIA:
- →Go to NVIDIA Driver Downloads
- →Select your GPU model
- →Download and install latest Game Ready or Studio driver
AMD (Linux):
Step 2: Verify Driver
Windows/Linux:
Should show your GPU and driver version
Step 3: Restart
Fully restart your computer after driver installation
#🔴 Error: "Torch not compiled with CUDA enabled"
Cause
PyTorch CPU-only version installed
Solution
Complete reinstall:
Verify CUDA is available:
Expected output: True
#🔴 Black Images / Empty Output
Cause
Often related to VAE or precision issues on certain GPUs
Solution 1: Change VAE
- →Download
vae-ft-mse-840000-ema-pruned.safetensors - →Place in
ComfyUI/models/vae/ - →In workflow, add "VAE Loader" node
- →Connect to "VAE Decode"
Solution 2: Force FP32 Precision
For Mac:
#🔴 Very Slow Generation (Multiple Minutes Per Image)
Diagnostic Checklist
- →
Is GPU being used?
- →Check console: Should say
Device: cudaorDevice: mps - →If says
Device: cpu→ See "CUDA not available" above
- →Check console: Should say
- →
Is resolution too high for your GPU?
- →See VRAM table above
- →Reduce resolution
- →
Are you using too many steps?
- →20-30 steps is usually enough
- →Reduce from 50+ → 25
- →
Is xformers enabled?
comfyui-workflow.json - →
Too many upscale passes?
- →Remove or reduce upscaling nodes
#🔴 Error: "CUDA error: device-side assert triggered"
Cause
Usually model/LoRA incompatibility or corrupted model file
Solution
- →Remove all LoRAs from workflow
- →Try different checkpoint
- →Re-download suspected corrupt models
- →Check model compatibility:
- →SD 1.5 LoRA → SD 1.5 checkpoint only
- →SDXL LoRA → SDXL checkpoint only
#Performance Optimization Quick Reference
For 4-6GB VRAM (GTX 1660, RTX 3050)
- →Use SD 1.5 only
- →Max 512×512 resolution
- →Install Tiled VAE
- →Batch size: 1
For 8-12GB VRAM (RTX 3060, RTX 4060)
- →SDXL supported at 768×768
- →SD 1.5 at 768×768
- →Batch size: 1-2
For 16-24GB VRAM (RTX 4080, RTX 4090)
- →SDXL at 1024×1024+
- →High-resolution workflows
- →Batch size: 2-4
- →Video generation supported
#🆘 Emergency Reset
If nothing works:
- →
Completely uninstall PyTorch:
comfyui-workflow.json - →
Update GPU drivers (see above)
- →
Restart computer
- →
Fresh PyTorch install:
comfyui-workflow.json - →
Verify:
comfyui-workflow.json
#🎯 Related Guides
- →Guide 7: Performance Optimization
- →Guide 8: Fix Missing Dependencies
- →Guide 9: Troubleshooting Assistant
Still having GPU issues? Check the Troubleshooting Assistant for interactive diagnosis.
Hardware Partner
Running these workflows? ComputeAtlas.ai helps you find the right GPU
Optimization is only half the battle. Get precise VRAM benchmarks and hardware recommendations tailored for ComfyUI.
Check GPU Prices →