• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA RTX 4070 Ti SUPER with 26 Gbps G6X Mod Beats RTX 4080

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
46,587 (7.66/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
The graphics card modders from Brazil, TecLab and Paulo Gomes are back with yet another audacious, competitive VGA mod. The duo has already built a reputation for video memory replacement mods that have significantly altered the performance profile of graphics cards, and their latest feat sees an NVIDIA GeForce RTX 4070 Ti SUPER graphics card get a bit of a memory upgrade. Out of the box, the RTX 4070 Ti SUPER comes with 16 GB of 21 Gbps GDDR6X memory across the 256-bit wide memory interface of the "AD103" GPU. The two tech-tubers each performed memory chip replacement mods, which in combination with GPU overclocking, resulted in memory speeds ranging between 24 Gbps and 26 Gbps. The big story here is that with faster memory, the RTX 4070 Ti SUPER beats the RTX 4080, despite 13% fewer shaders and other key components.

A stock GeForce RTX 4080 graphics card scores 8525 points in the Unigine Superposition 8K benchmark, and a stock-speed Manli RTX 4070 Ti SUPER does 7212 points, on account of fewer shaders (66 SM vs. 76 SM) and slower memory (21 Gbps vs. 22.4 Gbps). With a 24 Gbps memory speed mod, and GPU overclocking, Paulo Gomes achieved 8870 points in the test for the RTX 4070 Ti SUPER, beating the RTX 4080, and overcoming the shader deficit. Meanwhile, over at TecLab, their Galax-branded RTX 4070 Ti SUPER yields 7028 points at stock speeds; and the team pulled off an epic 26 Gbps memory speed (chip replacement + overclocking), which when combined with some GPU overclocking, yielded a staggering 9133 points, which would inch close to what an RTX 4080 SUPER could produce with its 23 Gbps memory and maxed out "AD103" ASIC with all 80 SM on deck.



View at TechPowerUp Main Site | Source
 
Joined
Oct 31, 2022
Messages
148 (0.25/day)
Error in Title.
The in Article it is said that the 4070Ti Super beats the 4080, but not the 4080 Super.

I hope Nvidia will use the faster 32gbps GDDR7 memory for the 5090. :D
 
Joined
Dec 12, 2016
Messages
1,346 (0.49/day)
Am I correct in saying that Nvidia GPUs are bandwidth starved but not AMD GPUs due to infinity cache?
 
Joined
Jan 8, 2017
Messages
9,119 (3.37/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
Am I correct in saying that Nvidia GPUs are bandwidth starved but not AMD GPUs due to infinity cache?
Every GPU benefits from more memory bandwidth.
 
Joined
Jan 18, 2020
Messages
701 (0.44/day)
8k benchmark ?

Now there's a realistic use case if I've ever seen one...
 
Joined
Jan 7, 2022
Messages
102 (0.12/day)
Processor Intel i5 9400f
Motherboard MSI Z390 Gaming Plus
Cooling SilentiumPC Fera 3
Memory 2x 8GB Corsair Vengeance LPX 3200 16-18-18-36
Video Card(s) MSI GTX 1660 Super Ventus XS OC
Storage 500 GB Kingston A2000; 1 TB Kingston A2000; 1 TB HGST Travelstar
Display(s) AOC 24G2U
Case SilentiumPC Signum SG1
Audio Device(s) Creative Pebble Plus; Logitech G533
Power Supply beQuiet Systempower 9 500W
Mouse Logitech G403 Hero
Keyboard HyperX Alloy Origins (Red)
Software Windows 10 Pro
One thing to note: looking at hwBot, an overclocked 4080 can get over 10000 points, which means most of the preformance increase came from the core oc and not from the memory replacement.
 
Joined
Aug 4, 2020
Messages
1,583 (1.13/day)
Location
::1
i mean you can overclock the memory asw
just ymmv since siliconlottery™ will dictate how much you can oc the memory, if you replace those chips w/ better bins ofc you can also oc them more
 
Joined
Dec 12, 2016
Messages
1,346 (0.49/day)
Every GPU benefits from more memory bandwidth.
That's not how I understand things unless you mean compared within a certain architecture. For instance, the Radeon VII had a bandwidth of 1024 GBps with 3870 cores. The Radeon 7800xt has a bandwidth of 624 GBps with 3870 cores. The 7800xt is much much faster than the Radeon VII. Giving the Radeon VII even more bandwidth probably would not have affected performance.
 
Joined
Nov 11, 2016
Messages
3,142 (1.14/day)
System Name The de-ploughminator Mk-II
Processor i7 13700KF
Motherboard MSI Z790 Carbon
Cooling ID-Cooling SE-226-XT + Phanteks T30
Memory 2x16GB G.Skill DDR5 7200Cas34
Video Card(s) Asus RTX4090 TUF
Storage Kingston KC3000 2TB NVME
Display(s) LG OLED CX48"
Case Corsair 5000D Air
Audio Device(s) KEF LSX II LT speakers + KEF KC62 Subwoofer
Power Supply Corsair HX850
Mouse Razor Viper Ultimate
Keyboard Corsair K75
Software win11
would be interesting to see 4090 fitted with 26gbps GDDR6X
 
Joined
Jan 8, 2017
Messages
9,119 (3.37/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
That's not how I understand things unless you mean compared within a certain architecture.
No, I mean literally all GPU architectures, memory bandwidth is always a constraint on performance, some architectures are more efficient at utilizing available memory bandwidth than others.

Giving the Radeon VII even more bandwidth probably would not have affected performance.
It probably would have, think how much more faster caches are than VRAM, that means there is always more performance to be had if you improve memory bandwidth. You get diminishing returns but the % improvement would be non zero.
 
Joined
Sep 17, 2014
Messages
21,235 (5.99/day)
Location
The Washing Machine
Processor i7 8700k 4.6Ghz @ 1.24V
Motherboard AsRock Fatal1ty K6 Z370
Cooling beQuiet! Dark Rock Pro 3
Memory 16GB Corsair Vengeance LPX 3200/C16
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Samsung 850 EVO 1TB + Samsung 830 256GB + Crucial BX100 250GB + Toshiba 1TB HDD
Display(s) Gigabyte G34QWC (3440x1440)
Case Fractal Design Define R5
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse XTRFY M42
Keyboard Lenovo Thinkpad Trackpoint II
Software W10 x64
One thing to note: looking at hwBot, an overclocked 4080 can get over 10000 points, which means most of the preformance increase came from the core oc and not from the memory replacement.
Yeah memory OC is always shaky. At some point you do get the frequency but the error count kills performance regardless.

No, I mean literally all GPU architectures, memory bandwidth is always a constraint on performance, some architectures are more efficient at utilizing available memory bandwidth than others.
That statement feels way to general to be true. It depends entirely on what you're feeding it.
 
Joined
Dec 16, 2021
Messages
148 (0.16/day)
Location
Denmark
Processor AMD Ryzen 7 3800X
Motherboard ASUS Prime X470-Pro
Cooling bequiet! Dark Rock Slim
Memory 64 GB ECC DDR4 2666 MHz (Samsung M391A2K43BB1-CTD)
Video Card(s) eVGA GTX 1080 SC Gaming, 8 GB
Storage 1 TB Samsung 970 EVO Plus, 1 TB Samsung 850 EVO, 4 TB Lexar NM790, 12 TB WD HDDs
Display(s) Acer Predator XB271HU
Case Corsair Obsidian 550D
Audio Device(s) Creative X-Fi Fatal1ty
Power Supply Seasonic X-Series 560W
Mouse Logitech G502
Keyboard Glorious GMMK
Considering both graphics cards use the same processor, when one gets faster memory and is overclocked (yet uses stock cooling) I would expect it to be faster. Like someone wrote on videocardz.com, where I first read about this, if anything it proves nVidia deliberately held performance back.
 

Solaris17

Super Dainty Moderator
Staff member
Joined
Aug 16, 2005
Messages
26,069 (3.80/day)
Location
Alabama
System Name Rocinante
Processor I9 14900KS
Motherboard MSI MPG Z790I Edge WiFi Gaming
Cooling be quiet! Pure Loop 240mm
Memory 64GB Gskill Trident Z5 DDR5 6000 @6400
Video Card(s) MSI SUPRIM Liquid X 4090
Storage 1x 500GB 980 Pro | 1x 1TB 980 Pro | 1x 8TB Corsair MP400
Display(s) Odyssey OLED G9 (G95SC)
Case LANCOOL 205M MESH Snow
Audio Device(s) Moondrop S8's on schitt Modi+ & Valhalla 2
Power Supply ASUS ROG Loki SFX-L 1000W
Mouse Lamzu Atlantis mini (White)
Keyboard Monsgeek M3 Lavender, Akko Crystal Blues
VR HMD Quest 3
Software openSUSE Tumbleweed
Benchmark Scores I dont have time for that.
That statement feels way to general to be true. It depends entirely on what you're feeding it.
To a point but I agree with him. Is it going to matter to legos adventures? Probably not. But if you are benchmarking or playing anything demanding it really does. Especially if you start using lots of vram.
 
Joined
Feb 3, 2017
Messages
3,544 (1.32/day)
Processor R5 5600X
Motherboard ASUS ROG STRIX B550-I GAMING
Cooling Alpenföhn Black Ridge
Memory 2*16GB DDR4-2666 VLP @3800
Video Card(s) EVGA Geforce RTX 3080 XC3
Storage 1TB Samsung 970 Pro, 2TB Intel 660p
Display(s) ASUS PG279Q, Eizo EV2736W
Case Dan Cases A4-SFX
Power Supply Corsair SF600
Mouse Corsair Ironclaw Wireless RGB
Keyboard Corsair K60
VR HMD HTC Vive
The big story here is that with faster memory, the RTX 4070 Ti SUPER beats the RTX 4080, despite 13% fewer shaders and other key components.
Is it? The 3GHz on the screenshot should be ~10% faster clock than a 4080 which would make up a very large part of the shader deficit. This at the same time with the main headline of memory bandwidth increase of 15-25% and I would be rather disappointed if it did not beat a stock 4080 :)

That's not how I understand things unless you mean compared within a certain architecture. For instance, the Radeon VII had a bandwidth of 1024 GBps with 3870 cores. The Radeon 7800xt has a bandwidth of 624 GBps with 3870 cores. The 7800xt is much much faster than the Radeon VII. Giving the Radeon VII even more bandwidth probably would not have affected performance.
That is not a fair comparison and will not always play out like this in real world. 7800XT has the relatively huge 64MB of cache that heavily augments the lacking memory bandwidth. If cards run out of cache - which they tend to do at one point - the difference in memory bandwidth comes back to play its role.

This was the latest big change in the paradigm - AMD started with RDNA2 and Nvidia followed suit with Ada. Crappy for consumer but from technical point of view a nice huge efficiency boost.
 
Joined
Dec 26, 2006
Messages
3,581 (0.56/day)
Location
Northern Ontario Canada
Processor Ryzen 5700x
Motherboard Gigabyte X570S Aero G R1.1 BiosF5g
Cooling Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s) AMD RX 6800 - Asus Tuf
Storage Kingston KC3000 1TB & 2TB & 4TB Corsair LPX
Display(s) LG 27UL550-W (27" 4k)
Case Be Quiet Pure Base 600 (no window)
Audio Device(s) Realtek ALC1220-VB
Power Supply SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse Mionix Naos Pro
Keyboard Corsair Strafe with browns
Software W10 22H2 Pro x64
Wish they would have posted the cost of the ram (labour excluded)

x$ for x%perf increase
 
Joined
Jun 11, 2019
Messages
497 (0.27/day)
Location
Moscow, Russia
Processor Intel 12600K
Motherboard Gigabyte Z690 Gaming X
Cooling CPU: Noctua NH-D15S; Case: 2xNoctua NF-A14, 1xNF-S12A.
Memory Ballistix Sport LT DDR4 @3600CL16 2*16GB
Video Card(s) Palit RTX 4080
Storage Samsung 970 Pro 512GB + Crucial MX500 500gb + WD Red 6TB
Display(s) Dell S2721qs
Case Phanteks P300A Mesh
Audio Device(s) Behringer UMC204HD
Power Supply Fractal Design Ion+ 560W
Mouse Glorious Model D-
Card beats the next tier with good OC, this has never happened in the history! /s
 
Joined
Sep 17, 2014
Messages
21,235 (5.99/day)
Location
The Washing Machine
Processor i7 8700k 4.6Ghz @ 1.24V
Motherboard AsRock Fatal1ty K6 Z370
Cooling beQuiet! Dark Rock Pro 3
Memory 16GB Corsair Vengeance LPX 3200/C16
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Samsung 850 EVO 1TB + Samsung 830 256GB + Crucial BX100 250GB + Toshiba 1TB HDD
Display(s) Gigabyte G34QWC (3440x1440)
Case Fractal Design Define R5
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse XTRFY M42
Keyboard Lenovo Thinkpad Trackpoint II
Software W10 x64
To a point but I agree with him. Is it going to matter to legos adventures? Probably not. But if you are benchmarking or playing anything demanding it really does. Especially if you start using lots of vram.
Benchmarking and playing something demanding are two different worlds. A lot of gpu benches are not very memory focused. They also work with a pretty limited set of assets. Heaven, Valley... Superposition. Non DX12 3DMark...
 
Joined
Aug 2, 2012
Messages
1,805 (0.42/day)
Location
Netherlands
System Name TheDeeGee's PC
Processor Intel Core i7-11700
Motherboard ASRock Z590 Steel Legend
Cooling Noctua NH-D15
Memory Crucial Ballistix 3200/C16 32GB
Video Card(s) Nvidia RTX 4070 Ti 12GB
Storage Crucial P5 Plus 2TB / Crucial P3 Plus 2TB / Crucial P3 Plus 4TB
Display(s) EIZO CX240
Case Lian-Li O11 Dynamic Evo XL
Audio Device(s) Creative Sound Blaster ZxR / AKG K601 Headphones
Power Supply Seasonic PRIME Fanless TX-700
Mouse Logitech G500s
Keyboard Keychron Q6
Software Windows 10 Pro 64-Bit
Benchmark Scores None, as long as my games runs smooth.
Considering both graphics cards use the same processor, when one gets faster memory and is overclocked (yet uses stock cooling) I would expect it to be faster. Like someone wrote on videocardz.com, where I first read about this, if anything it proves nVidia deliberately held performance back.
Or they didn't fancy adding another $300 on top of cards due to memory costs.
 

Solaris17

Super Dainty Moderator
Staff member
Joined
Aug 16, 2005
Messages
26,069 (3.80/day)
Location
Alabama
System Name Rocinante
Processor I9 14900KS
Motherboard MSI MPG Z790I Edge WiFi Gaming
Cooling be quiet! Pure Loop 240mm
Memory 64GB Gskill Trident Z5 DDR5 6000 @6400
Video Card(s) MSI SUPRIM Liquid X 4090
Storage 1x 500GB 980 Pro | 1x 1TB 980 Pro | 1x 8TB Corsair MP400
Display(s) Odyssey OLED G9 (G95SC)
Case LANCOOL 205M MESH Snow
Audio Device(s) Moondrop S8's on schitt Modi+ & Valhalla 2
Power Supply ASUS ROG Loki SFX-L 1000W
Mouse Lamzu Atlantis mini (White)
Keyboard Monsgeek M3 Lavender, Akko Crystal Blues
VR HMD Quest 3
Software openSUSE Tumbleweed
Benchmark Scores I dont have time for that.
Benchmarking and playing something demanding are two different worlds. A lot of gpu benches are not very memory focused. They also work with a pretty limited set of assets. Heaven, Valley... Superposition. Non DX12 3DMark...
Benchmarks also like games aren’t created equal but I mean they clearly scored higher so I will respectfully disagree I guess
 
Joined
Jan 8, 2017
Messages
9,119 (3.37/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
That statement feels way to general to be true
It has to be true. Why do you think GPUs have caches ? It's because the memory is always a bottleneck, if it wasn't there would be no need for caches.
 

Keullo-e

S.T.A.R.S.
Joined
Dec 16, 2012
Messages
11,316 (2.70/day)
Location
Finland
System Name 4K-gaming
Processor AMD Ryzen 7 5800X
Motherboard Gigabyte B550M Aorus Elite
Cooling Custom loop, Corsair ML/LL fans
Memory 48GB Kingston Fury DDR4-3200
Video Card(s) Asus GeForce RTX 3080 TUF 10GB @ +150/700
Storage ~4TB SSD + 6TB HDD
Display(s) Acer XV273K 4K120 + Lenovo L32p-30 4K60
Case Corsair 4000D Airflow White
Audio Device(s) Asus TUF H3 Wireless
Power Supply EVGA Supernova G2 750W
Mouse Logitech MX518 + Asus TUF P1 mousepad
Keyboard Roccat Vulcan 121 AIMO
VR HMD Oculus Rift CV1
Software Windows 11 Pro
Benchmark Scores It runs Crysis remastered at 4K
Good ol' HW-modding is always interesting. Would be interesting to see could a 1080 Ti/2080 Ti be boosted to a 12GB/384-bit card by adding the missing memory chip etc.
 
Joined
Feb 3, 2012
Messages
112 (0.02/day)
Location
Medina, Ohio
System Name Daily driver
Processor i9 13900k
Motherboard Z690 Aorus Master
Cooling Custom loop
Memory 2x16 GB GSkill DDR5 @ 6000
Video Card(s) RTX4090 FE
Storage 2x 2TB 990 Pro SSD 1x 2TB 970 evo SSD, 1x 4TB HDD
Display(s) LG 32" 2560x1440
Case Fractal Design Meshify 2 XL
Audio Device(s) onboard
Power Supply beQuiet Dark Power 12 1000W
Mouse Razer Death adder
Keyboard Razer blackwidow v3
VR HMD n/a
Software Windows 11 pro
Benchmark Scores Heaven 4.0 @ 2560x1440 270.5 FPS
Fun fact - RTX4080 was the first time in nvidia history that a next gen product had a memory bandwidth DECREASE from its predecessor. (except the GTX780 > GTX980)
I guess they claim that the extra cache makes up for it, similar to how they claimed the memory compression stuff made up for it with maxwell.
 
Joined
Feb 3, 2017
Messages
3,544 (1.32/day)
Processor R5 5600X
Motherboard ASUS ROG STRIX B550-I GAMING
Cooling Alpenföhn Black Ridge
Memory 2*16GB DDR4-2666 VLP @3800
Video Card(s) EVGA Geforce RTX 3080 XC3
Storage 1TB Samsung 970 Pro, 2TB Intel 660p
Display(s) ASUS PG279Q, Eizo EV2736W
Case Dan Cases A4-SFX
Power Supply Corsair SF600
Mouse Corsair Ironclaw Wireless RGB
Keyboard Corsair K60
VR HMD HTC Vive
Fun fact - RTX4080 was the first time in nvidia history that a next gen product had a memory bandwidth DECREASE from its predecessor. (except the GTX780 > GTX980)
I guess they claim that the extra cache makes up for it, similar to how they claimed the memory compression stuff made up for it with maxwell.
Another fun fact - AMD did exactly the same thing a generation or two earlier with 6000 series.
 
Joined
Sep 17, 2014
Messages
21,235 (5.99/day)
Location
The Washing Machine
Processor i7 8700k 4.6Ghz @ 1.24V
Motherboard AsRock Fatal1ty K6 Z370
Cooling beQuiet! Dark Rock Pro 3
Memory 16GB Corsair Vengeance LPX 3200/C16
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Samsung 850 EVO 1TB + Samsung 830 256GB + Crucial BX100 250GB + Toshiba 1TB HDD
Display(s) Gigabyte G34QWC (3440x1440)
Case Fractal Design Define R5
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse XTRFY M42
Keyboard Lenovo Thinkpad Trackpoint II
Software W10 x64
Fun fact - RTX4080 was the first time in nvidia history that a next gen product had a memory bandwidth DECREASE from its predecessor. (except the GTX780 > GTX980)
I guess they claim that the extra cache makes up for it, similar to how they claimed the memory compression stuff made up for it with maxwell.
Which it did, and which it does, to a pretty reasonable extent.

I do agree Ada's bandwidth leaves a lot to be desired, but the cards seem balanced, if you don't look at the 12GB siblings apart from the x70.
 
Joined
Mar 28, 2024
Messages
115 (1.72/day)
Processor AMD 7800X3D
Motherboard MSI B650 Tomahawk
Cooling Noctua NHU12S
Memory 2x16 GB GSKILL 6000MHZ CL28
Video Card(s) Powercolor 7900 GRE
Storage 1TB Samsung 980 PRO
Display(s) LG 32GP750 31.5" 2K QHD (2560 x 1440) 165Hz Gaming Monitor
Case Coolermater HAF 650
Audio Device(s) BeyerDynamic Amiron Home
Power Supply Seasonic 850W Gold
a memory swapped heavily overclocked 4070ti super barely beat a 4080. In other news water is still wet.
 
Top