NVIDIA GeForce RTX 4060 Mobile

NVIDIA GeForce RTX 4060 Mobile

NVIDIA GeForce RTX 4060 Mobile: Power and Efficiency in a Mobile Format

April 2025

Modern mobile graphics cards have long ceased to be a compromise between performance and portability. The NVIDIA GeForce RTX 4060 Mobile is a striking example of how engineers have managed to package cutting-edge technology into a compact form factor. In this article, we will explore what makes this model noteworthy, how it performs in gaming and professional tasks, and who should pay attention to it.


1. Architecture and Key Features: Ada Lovelace and AI Acceleration

The RTX 4060 Mobile graphics card is built on the Ada Lovelace architecture, which debuted in 2022. The chips are manufactured using 4nm TSMC technology, providing high transistor density and energy efficiency. Key features include:

- DLSS 3.5 — AI upscaling technology that boosts FPS even at 4K by "reconstructing" frames with neural networks.

- Third-generation ray tracing — improved RT cores that are 50% more efficient than those in the RTX 30 series.

- Support for DirectX 12 Ultimate and Vulkan RT — full compatibility with modern APIs.

- NVIDIA Reflex — reduced latency in esports titles.

Despite its mobile format, the RTX 4060 Mobile retains all the "features" of desktop counterparts, including hardware support for AV1 video encoding.


2. Memory: GDDR6 and Bandwidth

The card is equipped with 8 GB of GDDR6 memory with a 128-bit bus. The bandwidth reaches 256 GB/s thanks to a memory speed of 16 Gbps. This is sufficient for comfortable gaming at resolutions up to 1440p, but limitations may arise at 4K due to VRAM capacity. For instance, in games like Starfield or Cyberpunk 2077: Phantom Liberty, at ultra settings, the memory usage can reach 90-95%, occasionally leading to micro-stutters.

For most tasks, 8 GB is adequate, but creative professionals working with heavy 3D scenes should consider models with 12 GB.


3. Gaming Performance: FPS, Resolutions, and RTX

The RTX 4060 Mobile is positioned as a solution for 1080p and 1440p. Here are examples of average performance in current games of 2025 (Ultra settings, without DLSS):

- Cyberpunk 2077: 58 FPS (1080p), 42 FPS (1440p), 28 FPS (4K).

- Call of Duty: Black Ops 6: 92 FPS (1080p), 68 FPS (1440p).

- Horizon Forbidden West PC Edition: 75 FPS (1080p), 54 FPS (1440p).

With DLSS 3.5 in "Quality" mode, the performance boost ranges from 30-50%. For example, in Cyberpunk 2077 with ray tracing and DLSS, the frame rate rises to 65 FPS at 1440p. For 4K gaming, the card is only suitable with DLSS or FSR 3.0 (AMD technology compatibility exists, but with lower efficiency).


4. Professional Tasks: CUDA, Rendering, and Editing

Thanks to 3072 CUDA cores and support for NVIDIA Studio Drivers, the RTX 4060 Mobile excels not only in gaming:

- Video Editing: In Adobe Premiere Pro, 4K video rendering in H.265 is accelerated by 40% compared to the RTX 3060 Mobile.

- 3D Modeling: In Blender, rendering a scene in Cycles completes in 12 minutes (versus 18 minutes for the previous generation).

- Machine Learning: Support for CUDA and TensorFlow libraries makes the card suitable for training small neural networks.

However, for complex simulations (e.g., in Ansys), it’s better to opt for models with larger memory capacity.


5. Power Consumption and Thermal Output: TDP and Cooling

The TDP of the RTX 4060 Mobile is 85-100 W, depending on the operating mode (Dynamic Boost 2.0 temporarily increases power to 115 W). This requires an efficient cooling system:

- Laptops with 2-3 heat pipes and anti-vibration fan mounts are recommended (e.g., ASUS ROG Zephyrus or Lenovo Legion Pro).

- Under prolonged loads, the core temperature stays around 75-82°C — a safe but noisy range.

To reduce heat, you can use FPS limiting through the NVIDIA Control Panel or third-party utilities.


6. Comparison with Competitors: AMD and Intel

The main competitors in 2025 are:

- AMD Radeon RX 7600M XT (8 GB GDDR6): $100 cheaper, but lags by 15-20% in ray tracing. FSR 3.0 does not match DLSS 3.5 in image quality.

- Intel Arc A770M (12 GB GDDR6): Performs better in 4K, but drivers are still less stable, especially in older games.

The RTX 4060 Mobile wins due to its ecosystem: DLSS, Reflex, and Broadcast utilities for streamers.


7. Practical Tips: Selection and Configuration

- Laptop Power Supply: Minimum 180 W for models with Intel Core i7 or Ryzen 7 processors.

- Compatibility: The card operates on PCIe 4.0 x8 — check if your laptop supports this version.

- Drivers: Regularly update through GeForce Experience. For professional tasks, switch to Studio Drivers.

Avoid thin ultrabooks with passive cooling — they throttle quickly under load.


8. Pros and Cons

Pros:

- High FPS in 1080p/1440p with DLSS.

- Support for all current AI technologies.

- Energy efficiency for a mobile GPU.

Cons:

- 8 GB of VRAM limits 4K gaming.

- Laptop prices start from $1200 (the card itself is not available in OEM supply).


9. Final Conclusion: Who is the RTX 4060 Mobile Suitable For?

This graphics card is an ideal choice for:

- Gamers who want to play AAA titles on high settings without being tethered to a wall socket.

- Content creators who need accelerated rendering and work with AI tools.

- Students and professionals looking for a balance between performance and portability.

If your budget is limited and 4K is not a priority, the RTX 4060 Mobile will be a reliable companion for the next 3-4 years. However, for future projects focused on RTX and 8K, consider models with 12-16 GB of memory.

Basic

Label Name
NVIDIA
Platform
Mobile
Launch Date
January 2023
Model Name
GeForce RTX 4060 Mobile
Generation
GeForce 40 Mobile
Base Clock
1545MHz
Boost Clock
1890MHz
Bus Interface
PCIe 4.0 x16
Transistors
Unknown
RT Cores
24
Tensor Cores
?
Tensor Cores are specialized processing units designed specifically for deep learning, providing higher training and inference performance compared to FP32 training. They enable rapid computations in areas such as computer vision, natural language processing, speech recognition, text-to-speech conversion, and personalized recommendations. The two most notable applications of Tensor Cores are DLSS (Deep Learning Super Sampling) and AI Denoiser for noise reduction.
96
TMUs
?
Texture Mapping Units (TMUs) serve as components of the GPU, which are capable of rotating, scaling, and distorting binary images, and then placing them as textures onto any plane of a given 3D model. This process is called texture mapping.
96
Foundry
TSMC
Process Size
5 nm
Architecture
Ada Lovelace

Memory Specifications

Memory Size
8GB
Memory Type
GDDR6
Memory Bus
?
The memory bus width refers to the number of bits of data that the video memory can transfer within a single clock cycle. The larger the bus width, the greater the amount of data that can be transmitted instantaneously, making it one of the crucial parameters of video memory. The memory bandwidth is calculated as: Memory Bandwidth = Memory Frequency x Memory Bus Width / 8. Therefore, when the memory frequencies are similar, the memory bus width will determine the size of the memory bandwidth.
128bit
Memory Clock
2000MHz
Bandwidth
?
Memory bandwidth refers to the data transfer rate between the graphics chip and the video memory. It is measured in bytes per second, and the formula to calculate it is: memory bandwidth = working frequency × memory bus width / 8 bits.
256.0 GB/s

Theoretical Performance

Pixel Rate
?
Pixel fill rate refers to the number of pixels a graphics processing unit (GPU) can render per second, measured in MPixels/s (million pixels per second) or GPixels/s (billion pixels per second). It is the most commonly used metric to evaluate the pixel processing performance of a graphics card.
90.72 GPixel/s
Texture Rate
?
Texture fill rate refers to the number of texture map elements (texels) that a GPU can map to pixels in a single second.
181.4 GTexel/s
FP16 (half)
?
An important metric for measuring GPU performance is floating-point computing capability. Half-precision floating-point numbers (16-bit) are used for applications like machine learning, where lower precision is acceptable. Single-precision floating-point numbers (32-bit) are used for common multimedia and graphics processing tasks, while double-precision floating-point numbers (64-bit) are required for scientific computing that demands a wide numeric range and high accuracy.
11.61 TFLOPS
FP64 (double)
?
An important metric for measuring GPU performance is floating-point computing capability. Double-precision floating-point numbers (64-bit) are required for scientific computing that demands a wide numeric range and high accuracy, while single-precision floating-point numbers (32-bit) are used for common multimedia and graphics processing tasks. Half-precision floating-point numbers (16-bit) are used for applications like machine learning, where lower precision is acceptable.
181.4 GFLOPS
FP32 (float)
?
An important metric for measuring GPU performance is floating-point computing capability. Single-precision floating-point numbers (32-bit) are used for common multimedia and graphics processing tasks, while double-precision floating-point numbers (64-bit) are required for scientific computing that demands a wide numeric range and high accuracy. Half-precision floating-point numbers (16-bit) are used for applications like machine learning, where lower precision is acceptable.
11.842 TFLOPS

Miscellaneous

SM Count
?
Multiple Streaming Processors (SPs), along with other resources, form a Streaming Multiprocessor (SM), which is also referred to as a GPU's major core. These additional resources include components such as warp schedulers, registers, and shared memory. The SM can be considered the heart of the GPU, similar to a CPU core, with registers and shared memory being scarce resources within the SM.
24
Shading Units
?
The most fundamental processing unit is the Streaming Processor (SP), where specific instructions and tasks are executed. GPUs perform parallel computing, which means multiple SPs work simultaneously to process tasks.
3072
L1 Cache
128 KB (per SM)
L2 Cache
32MB
TDP
115W
Vulkan Version
?
Vulkan is a cross-platform graphics and compute API by Khronos Group, offering high performance and low CPU overhead. It lets developers control the GPU directly, reduces rendering overhead, and supports multi-threading and multi-core processors.
1.3
OpenCL Version
3.0
OpenGL
4.6
DirectX
12 Ultimate (12_2)
CUDA
8.9
Power Connectors
None
Shader Model
6.7
ROPs
?
The Raster Operations Pipeline (ROPs) is primarily responsible for handling lighting and reflection calculations in games, as well as managing effects like anti-aliasing (AA), high resolution, smoke, and fire. The more demanding the anti-aliasing and lighting effects in a game, the higher the performance requirements for the ROPs; otherwise, it may result in a sharp drop in frame rate.
48

Benchmarks

Shadow of the Tomb Raider 2160p
Score
41 fps
Shadow of the Tomb Raider 1440p
Score
94 fps
Shadow of the Tomb Raider 1080p
Score
151 fps
Cyberpunk 2077 1440p
Score
27 fps
GTA 5 2160p
Score
76 fps
GTA 5 1440p
Score
76 fps
GTA 5 1080p
Score
167 fps
FP32 (float)
Score
11.842 TFLOPS
3DMark Time Spy
Score
10189
Blender
Score
3569
OctaneBench
Score
350

Compared to Other GPU

Shadow of the Tomb Raider 2160p / fps
94 +129.3%
54 +31.7%
18 -56.1%
Shadow of the Tomb Raider 1440p / fps
292 +210.6%
128 +36.2%
67 -28.7%
49 -47.9%
Shadow of the Tomb Raider 1080p / fps
310 +105.3%
101 -33.1%
72 -52.3%
Cyberpunk 2077 1440p / fps
79 +192.6%
35 +29.6%
GTA 5 2160p / fps
173 +127.6%
98 +28.9%
59 -22.4%
31 -59.2%
GTA 5 1440p / fps
153 +101.3%
103 +35.5%
82 +7.9%
29 -61.8%
GTA 5 1080p / fps
231 +38.3%
176 +5.4%
141 -15.6%
86 -48.5%
FP32 (float) / TFLOPS
12.536 +5.9%
11.201 -5.4%
3DMark Time Spy
20661 +102.8%
8009 -21.4%
5933 -41.8%
Blender
15026.3 +321%
2014 -43.6%
1055.6 -70.4%
552 -84.5%
OctaneBench
1328 +279.4%
163 -53.4%
89 -74.6%
47 -86.6%