AMD Radeon Vega 6 Embedded

AMD Radeon Vega 6 Embedded

About GPU

The AMD Radeon Vega 6 Embedded GPU is a solid integrated graphics solution for low-power devices. With a base clock speed of 300MHz and a boost clock of 1280MHz, this GPU offers enough performance to handle everyday computing tasks with ease. The 384 shading units provide decent graphics processing power, making it suitable for light gaming and multimedia applications. One of the key features of the AMD Radeon Vega 6 is its low TDP of 15W, making it an energy-efficient option for embedded systems and compact devices. The system shared memory type and clock speed allow for efficient utilization of available system memory, ensuring smooth and responsive performance. With a theoretical performance of 0.983 TFLOPS, the Radeon Vega 6 delivers sufficient graphics horsepower for tasks such as video playback, photo editing, and even some entry-level gaming. While it may not be suitable for more demanding applications and modern AAA games, it excels in handling everyday computing needs without requiring a dedicated graphics card. Overall, the AMD Radeon Vega 6 Embedded GPU is an excellent choice for small form factor devices, thin and light laptops, and other low-power systems. Its balance of performance, energy efficiency, and integration make it a compelling option for users who prioritize portability and battery life without sacrificing decent graphics capabilities.

Basic

Label Name
AMD
Platform
Integrated
Launch Date
May 2018
Model Name
Radeon Vega 6 Embedded
Generation
Raven Ridge
Base Clock
300MHz
Boost Clock
1280MHz
Bus Interface
IGP
Transistors
4,940 million
Compute Units
6
TMUs
?
Texture Mapping Units (TMUs) serve as components of the GPU, which are capable of rotating, scaling, and distorting binary images, and then placing them as textures onto any plane of a given 3D model. This process is called texture mapping.
24
Foundry
GlobalFoundries
Process Size
14 nm
Architecture
GCN 5.0

Memory Specifications

Memory Size
System Shared
Memory Type
System Shared
Memory Bus
?
The memory bus width refers to the number of bits of data that the video memory can transfer within a single clock cycle. The larger the bus width, the greater the amount of data that can be transmitted instantaneously, making it one of the crucial parameters of video memory. The memory bandwidth is calculated as: Memory Bandwidth = Memory Frequency x Memory Bus Width / 8. Therefore, when the memory frequencies are similar, the memory bus width will determine the size of the memory bandwidth.
System Shared
Memory Clock
SystemShared
Bandwidth
?
Memory bandwidth refers to the data transfer rate between the graphics chip and the video memory. It is measured in bytes per second, and the formula to calculate it is: memory bandwidth = working frequency × memory bus width / 8 bits.
System Dependent

Theoretical Performance

Pixel Rate
?
Pixel fill rate refers to the number of pixels a graphics processing unit (GPU) can render per second, measured in MPixels/s (million pixels per second) or GPixels/s (billion pixels per second). It is the most commonly used metric to evaluate the pixel processing performance of a graphics card.
10.24 GPixel/s
Texture Rate
?
Texture fill rate refers to the number of texture map elements (texels) that a GPU can map to pixels in a single second.
30.72 GTexel/s
FP16 (half)
?
An important metric for measuring GPU performance is floating-point computing capability. Half-precision floating-point numbers (16-bit) are used for applications like machine learning, where lower precision is acceptable. Single-precision floating-point numbers (32-bit) are used for common multimedia and graphics processing tasks, while double-precision floating-point numbers (64-bit) are required for scientific computing that demands a wide numeric range and high accuracy.
1.966 TFLOPS
FP64 (double)
?
An important metric for measuring GPU performance is floating-point computing capability. Double-precision floating-point numbers (64-bit) are required for scientific computing that demands a wide numeric range and high accuracy, while single-precision floating-point numbers (32-bit) are used for common multimedia and graphics processing tasks. Half-precision floating-point numbers (16-bit) are used for applications like machine learning, where lower precision is acceptable.
61.44 GFLOPS
FP32 (float)
?
An important metric for measuring GPU performance is floating-point computing capability. Single-precision floating-point numbers (32-bit) are used for common multimedia and graphics processing tasks, while double-precision floating-point numbers (64-bit) are required for scientific computing that demands a wide numeric range and high accuracy. Half-precision floating-point numbers (16-bit) are used for applications like machine learning, where lower precision is acceptable.
1.003 TFLOPS

Miscellaneous

Shading Units
?
The most fundamental processing unit is the Streaming Processor (SP), where specific instructions and tasks are executed. GPUs perform parallel computing, which means multiple SPs work simultaneously to process tasks.
384
TDP
15W
Vulkan Version
?
Vulkan is a cross-platform graphics and compute API by Khronos Group, offering high performance and low CPU overhead. It lets developers control the GPU directly, reduces rendering overhead, and supports multi-threading and multi-core processors.
1.2
OpenCL Version
2.1
OpenGL
4.6
DirectX
12 (12_1)
Shader Model
6.4
ROPs
?
The Raster Operations Pipeline (ROPs) is primarily responsible for handling lighting and reflection calculations in games, as well as managing effects like anti-aliasing (AA), high resolution, smoke, and fire. The more demanding the anti-aliasing and lighting effects in a game, the higher the performance requirements for the ROPs; otherwise, it may result in a sharp drop in frame rate.
8

Benchmarks

FP32 (float)
Score
1.003 TFLOPS

Compared to Other GPU

FP32 (float) / TFLOPS
1.072 +6.9%
1.037 +3.4%
1.007 +0.4%
0.941 -6.2%