Last week, AMD claimed that its Radeon RX 7900 XTX could outperform Nvidia’s GeForce RTX 4090 in a DeepSeek benchmark. However, the test did not include Nvidia’s latest Blackwell-based GeForce RTX 5090, relying instead on older cards such as the RTX 4080 Super. In response, Nvidia has released its own benchmarks, which, as expected, show its products in a much more favorable light.
Proper Labeling Matters
Unlike AMD, Nvidia labeled its Y-axis (tokens per second). It ran its tests with the Llama-bench tool using int4 quantization. In the first test, on a 7-billion-parameter model, the Radeon RX 7900 XTX reached just over 100 tokens per second. The RTX 4090 beat it by 46% at around 150 tokens per second, while the RTX 5090 led by 103% at roughly 200 tokens per second.
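The percentage leads quoted above are relative uplifts over the RX 7900 XTX's throughput. As a minimal sketch of that arithmetic, the snippet below uses the rounded readings from the text (100, 150 and 200 tokens per second); these inputs and the helper name `percent_uplift` are assumptions for illustration, not Nvidia's exact chart values.

```python
def percent_uplift(candidate_tps: float, baseline_tps: float) -> float:
    """Return how much faster the candidate is than the baseline, in percent."""
    if baseline_tps <= 0:
        raise ValueError("baseline throughput must be positive")
    return (candidate_tps / baseline_tps - 1.0) * 100.0


# Rounded readings taken from the article text (assumed, not exact chart values).
rx_7900_xtx_tps = 100.0
rtx_4090_tps = 150.0
rtx_5090_tps = 200.0

print(f"RTX 4090 vs RX 7900 XTX: {percent_uplift(rtx_4090_tps, rx_7900_xtx_tps):.0f}%")
print(f"RTX 5090 vs RX 7900 XTX: {percent_uplift(rtx_5090_tps, rx_7900_xtx_tps):.0f}%")
```

With these rounded inputs the uplifts come out at 50% and 100%. The quoted 46% and 103% are consistent with a baseline slightly above 100 tokens per second and an RTX 5090 result slightly above 200.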
Consistent Results Across Models
The results are largely the same with an 8-billion-parameter model. With a 32-billion-parameter model, the RTX 5090’s lead grows to 124%, at about 50 tokens per second. These benchmarks come directly from the vendors and should be viewed with a degree of skepticism, as both companies appear to have designed their test methods to favor their own products. Still, it is no surprise that the RTX 5090 outpaces the two-year-old RX 7900 XTX, particularly in a segment where Nvidia has a stronghold.
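One way to sanity-check vendor numbers like these is to work backwards from a quoted lead to the baseline it implies. The sketch below assumes the "about 50 tokens per second" figure belongs to the RTX 5090 and that the 124% lead is measured against the RX 7900 XTX; the helper name `implied_baseline` is hypothetical.

```python
def implied_baseline(candidate_tps: float, uplift_percent: float) -> float:
    """Return the baseline throughput implied by a candidate result and its percent lead."""
    if uplift_percent <= -100.0:
        raise ValueError("uplift must be greater than -100%")
    return candidate_tps / (1.0 + uplift_percent / 100.0)


# Assumed reading of the 32-billion-parameter result in the article.
rtx_5090_tps = 50.0
lead_percent = 124.0

baseline = implied_baseline(rtx_5090_tps, lead_percent)
print(f"Implied RX 7900 XTX throughput: {baseline:.1f} tokens/s")
```

Under those assumptions the RX 7900 XTX would be running at roughly 22 tokens per second on the larger model.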
