AMD RX 7900 XTX vs NVIDIA RTX 4080 SUPER

How these GPUs compare for running local LLMs — VRAM, bandwidth, price, and per-model fit across popular open-weights models.

54 models compared · 24 GB vs 16 GB VRAM · 960 GB/s vs 736 GB/s · $879 vs $1,295
GPU A (runs more models)

AMD RX 7900 XTX

VRAM: 24 GB
Bandwidth: 960 GB/s
Street price: $879
Vendor: AMD
GPU B

NVIDIA RTX 4080 SUPER

VRAM: 16 GB
Bandwidth: 736 GB/s
Street price: $1,295
Vendor: NVIDIA
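The spec figures above can be turned into two quick ratios: price per GB of VRAM and relative memory bandwidth. The following is a minimal sketch that only uses the numbers listed on this page; the dictionary layout and the function names `price_per_gb` and `bandwidth_ratio` are illustrative choices, not part of any tool.

```python
# Spec-card figures copied from the comparison above.
GPUS = {
    "AMD RX 7900 XTX": {"vram_gb": 24, "bandwidth_gbs": 960, "price_usd": 879},
    "NVIDIA RTX 4080 SUPER": {"vram_gb": 16, "bandwidth_gbs": 736, "price_usd": 1295},
}


def price_per_gb(spec):
    """Street price divided by VRAM capacity, in USD per GB."""
    return spec["price_usd"] / spec["vram_gb"]


def bandwidth_ratio(a, b):
    """How many times more memory bandwidth GPU a has than GPU b."""
    return a["bandwidth_gbs"] / b["bandwidth_gbs"]


a = GPUS["AMD RX 7900 XTX"]
b = GPUS["NVIDIA RTX 4080 SUPER"]
print(f"USD per GB of VRAM: A {price_per_gb(a):.1f}, B {price_per_gb(b):.1f}")
print(f"Bandwidth ratio A/B: {bandwidth_ratio(a, b):.2f}")
```

With these figures the RX 7900 XTX comes out at roughly $37 per GB of VRAM against roughly $81 for the RTX 4080 SUPER, with about 1.3x the memory bandwidth. Rows in the table where both cards run FP16 show a similar speed ratio, for example 54 vs 42 tok/s for Mistral 7B v0.1.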

The short answer

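AMD RX 7900 XTX can run 10 models that NVIDIA RTX 4080 SUPER can't fit in VRAM, all in the 27B to 36B range. For the 23 models both can run, results are mixed: RTX 4080 SUPER is 20%+ faster on 11, RX 7900 XTX is 20%+ faster on 7, and 5 are within 20% of each other. Each speed is quoted at the quantization listed for that GPU, and all 11 RTX 4080 SUPER wins compare its Q8_0 speed against FP16 on the RX 7900 XTX. If you want headroom for bigger models, AMD RX 7900 XTX is the clear choice.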

10 models: A runs, B can't
7 models: A is 20%+ faster
11 models: B is 20%+ faster
5 models: roughly equal (both run, <20% difference)
21 models: too large for either

Model-by-model fit

The Winner column shows a tie percentage when both GPUs run the model and their speeds are within 20% of each other.
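The Winner labels can be reproduced from the two tok/s figures in each row. The sketch below is one way to do it, assuming the lead is measured relative to the slower GPU (this matches the +6%, +9%, and +10% ties in the table) and that a lead of exactly 20% counts as "faster". The function name `classify`, the "B (only)" label, and the use of `None` for "Too large" are illustrative assumptions.

```python
TIE_THRESHOLD = 0.20  # "within 20%" per the page; boundary handling is an assumption


def classify(a_tps, b_tps):
    """Return a Winner label for one row; None means the model is too large."""
    if a_tps is None and b_tps is None:
        return "too large for either"
    if b_tps is None:
        return "A (only)"
    if a_tps is None:
        return "B (only)"  # hypothetical label: no such row appears on this page
    if a_tps == b_tps:
        return "≈tie"
    fast, slow = max(a_tps, b_tps), min(a_tps, b_tps)
    lead = fast / slow - 1.0  # lead of the faster GPU over the slower one
    side = "A" if a_tps > b_tps else "B"
    if lead < TIE_THRESHOLD:
        return f"≈tie {side} +{round(lead * 100)}%"
    return f"{side} (faster)"


# A few rows from the table: (model, GPU A tok/s, GPU B tok/s)
rows = [
    ("c4ai-command-r-v01 35B", 40, None),
    ("Command-R+ 104B", None, None),
    ("DeepSeek R1 Distill Llama 8B", 49, 74),
    ("DeepSeek R1 Distill Qwen 14B", 53, 50),
    ("Codestral 22B", 43, 47),
    ("Mistral 7B v0.1", 54, 42),
]
for name, a_tps, b_tps in rows:
    print(f"{name}: {classify(a_tps, b_tps)}")
```

Note that the comparison ignores quantization: a row such as 49 tok/s at FP16 against 74 tok/s at Q8_0 is labeled "B (faster)" even though the two GPUs are running different builds of the model.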

Model | Params · family | AMD RX 7900 XTX | NVIDIA RTX 4080 SUPER | Winner
c4ai-command-r-v01 35B | 35B · command | 40 tok/s · Q4_K_M | Too large | A (only)
Command-R+ 104B | 104B · command | Too large | Too large | -
DeepSeek R1 Distill Llama 8B | 8B · deepseek | 49 tok/s · FP16 | 74 tok/s · Q8_0 | B (faster)
DeepSeek R1 Distill Qwen 14B | 14.8B · deepseek | 53 tok/s · Q8_0 | 50 tok/s · Q6_K | ≈tie A +6%
DeepSeek R1 Distill Llama 70B | 70.6B · deepseek | Too large | Too large | -
DeepSeek R1 671B | 671B · deepseek | Too large | Too large | -
DeepSeek-V3 685B | 685B · deepseek | Too large | Too large | -
DeepSeek-V3.2 685.4B | 685.4B · deepseek | Too large | Too large | -
gemma-2-9b | 9.2B · gemma | 43 tok/s · FP16 | 65 tok/s · Q8_0 | B (faster)
gemma-2-27b | 27.2B · gemma | 36 tok/s · Q6_K | Too large | A (only)
Llama 3.1 8B Compact | 8B · llama | 44 tok/s · FP16 | 68 tok/s · Q8_0 | B (faster)
CodeLlama 34B | 34B · llama | 42 tok/s · Q4_K_M | Too large | A (only)
CodeLlama 34B | 34B · llama | 42 tok/s · Q4_K_M | Too large | A (only)
Llama 3.3 70B | 70.6B · llama | Too large | Too large | -
Llama 3.1 70B | 70.6B · llama | Too large | Too large | -
Llama 4 Scout 17B | 109B · llama | Too large | Too large | -
Llama-4-Maverick-17B-128E | 400B · llama | Too large | Too large | -
Llama 3.1 405B | 405B · llama | Too large | Too large | -
Mistral 7B v0.1 | 7.25B · mistral | 54 tok/s · FP16 | 42 tok/s · FP16 | A (faster)
Codestral 22B | 22.2B · mistral | 43 tok/s · Q6_K | 47 tok/s · Q4_K_M | ≈tie B +9%
Mixtral 8x7B Instruct v0.1 | 47B · mixtral | Too large | Too large | -
Mistral Large 2 123B | 123B · mistral | Too large | Too large | -
Phi-4-mini 3.8B | 3.8B · phi | 101 tok/s · FP16 | 77 tok/s · FP16 | A (faster)
Phi-4 14B | 14B · phi | 51 tok/s · Q8_0 | 48 tok/s · Q6_K | ≈tie A +6%
Qwen 2.5 1.5B | 1.5B · qwen | 234 tok/s · FP16 | 179 tok/s · FP16 | A (faster)
Qwen 2.5 3B | 3.1B · qwen | 122 tok/s · FP16 | 94 tok/s · FP16 | A (faster)
Qwen3.5-4B | 4.7B · qwen | 83 tok/s · FP16 | 63 tok/s · FP16 | A (faster)
Qwen 2.5 7B | 7.6B · qwen | 52 tok/s · FP16 | 77 tok/s · Q8_0 | B (faster)
Qwen 2.5 7B | 7.6B · qwen | 52 tok/s · FP16 | 77 tok/s · Q8_0 | B (faster)
Qwen 3 8B | 8B · qwen | 44 tok/s · FP16 | 68 tok/s · Q8_0 | B (faster)
Qwen3.5-9B | 9.7B · qwen | 41 tok/s · FP16 | 61 tok/s · Q8_0 | B (faster)
Qwen 3 32B | 32B · qwen | 39 tok/s · Q4_K_M | Too large | A (only)
Qwen3.5-35B-A3B | 36B · qwen | 39 tok/s · Q4_K_M | Too large | A (only)
Qwen 2.5 72B | 72.7B · qwen | Too large | Too large | -
Qwen 2.5 72B | 72.7B · qwen | Too large | Too large | -
Llama 3.2 1B | 1.24B · llama | 286 tok/s · FP16 | 219 tok/s · FP16 | A (faster)
Llama 4 Scout 17B | 109B · llama | Too large | Too large | -
DeepSeek R1 671B | 671B · deepseek | Too large | Too large | -
Gemma 3 27B | 27B · gemma | 38 tok/s · Q5_K_M | Too large | A (only)
Qwen 3 8B | 8B · qwen | 44 tok/s · FP16 | 68 tok/s · Q8_0 | B (faster)
Qwen 3 32B | 32B · qwen | 39 tok/s · Q4_K_M | Too large | A (only)
Llama 3.1 8B Compact | 8B · llama | 44 tok/s · FP16 | 68 tok/s · Q8_0 | B (faster)
Mixtral 8x7B Instruct v0.1 | 47B · mixtral | Too large | Too large | -
Mistral Small 3.2 24B | 24B · mistral | 43 tok/s · Q6_K | 47 tok/s · Q4_K_M | ≈tie B +9%
Command A 111B | 111B · command | Too large | Too large | -
DeepSeek R1 0528 | 685B · deepseek | Too large | Too large | -
DeepSeek-V3-0324 | 684.5B · deepseek | Too large | Too large | -
DeepSeek-R1-0528-Qwen3-8B | 8.2B · qwen | 51 tok/s · FP16 | 78 tok/s · Q8_0 | B (faster)
Qwen3-235B-A22B-Instruct-2507 | 235B · qwen | Too large | Too large | -
Qwen3-30B-A3B-Instruct-2507 | 30B · qwen | 50 tok/s · Q4_K_M | Too large | A (only)
Qwen3-4B-Instruct-2507 | 4B · qwen | 104 tok/s · FP16 | 80 tok/s · FP16 | A (faster)
gemma-4-E4B-it | 8B · gemma | 52 tok/s · FP16 | 80 tok/s · Q8_0 | B (faster)
gemma-4-26B-A4B-it | 26.5B · gemma | 39 tok/s · Q6_K | 43 tok/s · Q4_K_M | ≈tie B +10%
gemma-4-31B-it | 32.7B · gemma | 45 tok/s · Q4_K_M | Too large | A (only)
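
The table lists a different quantization per GPU for many models, which suggests each card is shown at the highest-precision quant that fits its VRAM. This page does not state its fit formula, so the following is only a rough sketch under stated assumptions: approximate bits-per-weight figures for common GGUF quant levels, and a flat 1.5 GB allowance for KV cache and runtime overhead. The names `weights_gb` and `best_quant` and the overhead value are hypothetical.

```python
# Approximate bits per weight for common GGUF quantization levels.
# Assumed round figures, not values published on this page.
# Ordered from highest to lowest precision.
BITS_PER_WEIGHT = {
    "FP16": 16.0,
    "Q8_0": 8.5,
    "Q6_K": 6.6,
    "Q5_K_M": 5.7,
    "Q4_K_M": 4.85,
}


def weights_gb(params_billion, quant):
    """Approximate size of the weights alone, in GB (1 GB = 1e9 bytes)."""
    return params_billion * BITS_PER_WEIGHT[quant] / 8


def best_quant(params_billion, vram_gb, overhead_gb=1.5):
    """Highest-precision quant whose weights plus overhead fit, else None."""
    for quant in BITS_PER_WEIGHT:  # dicts preserve insertion order
        if weights_gb(params_billion, quant) + overhead_gb <= vram_gb:
            return quant
    return None  # corresponds to "Too large" in the table


for params in (8, 14.8, 70.6):
    print(params, best_quant(params, 24), best_quant(params, 16))
```

Under these assumptions an 8B model fits at FP16 in 24 GB but drops to Q8_0 in 16 GB, and a 14.8B model lands on Q8_0 and Q6_K respectively, which matches the corresponding rows above. The estimate does not reproduce every row (Gemma 3 27B and gemma-4-26B-A4B-it differ, for example), so treat it as an illustration of the idea rather than the page's actual method.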
