Skip to content

Kaggle Dual T4 Overview

llcuda v2.2.0 is optimized for Kaggle's dual Tesla T4 GPU environment.

Hardware Specs

  • 2× NVIDIA Tesla T4
  • 30GB total VRAM (15GB each)
  • SM 7.5 (Turing architecture)
  • FlashAttention support

What You Can Run

Model Size Quantization Strategy
1-13B Q4_K_M Single T4
32-34B Q4_K_M Dual T4 tensor-split
70B IQ3_XS Dual T4 tensor-split

See: Multi-GPU Inference