Computers - CUDA - Features and specifications of different versions
13 June 2011
Contents: 1. CUDA 2. Software architecture 3. Hardware 4. Limitations 5. Features and specifications of different versions 6. Example 7. CUDA as a subject in universities
Feature support by compute capability (unlisted features are supported for all compute capabilities)

| Feature | 1.0 | 1.1 | 1.2 | 1.3 | 2.x |
|---|---|---|---|---|---|
| 3D grid of thread blocks | No | No | No | No | Yes |
| Integer atomic functions operating on 32-bit words in global memory | No | Yes | Yes | Yes | Yes |
| Integer atomic functions operating on 64-bit words in global memory | No | No | Yes | Yes | Yes |
| Integer atomic functions operating on 32-bit words in shared memory | No | No | Yes | Yes | Yes |
| Warp vote functions | No | No | Yes | Yes | Yes |
| Double-precision floating-point operations | No | No | No | Yes | Yes |
| Floating-point atomic addition operating on 32-bit words in global and shared memory | No | No | No | No | Yes |
| __ballot() | No | No | No | No | Yes |
| __threadfence_system() | No | No | No | No | Yes |
| __syncthreads_count(), __syncthreads_and(), __syncthreads_or() | No | No | No | No | Yes |
| Surface functions | No | No | No | No | Yes |
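
Each row of the feature table reduces to a single minimum compute capability, so a support check is just a tuple comparison. The following is a minimal sketch of such a lookup, written in Python for brevity. The thresholds follow NVIDIA's published feature table for compute capabilities 1.0 through 2.x; the dictionary keys and function names are illustrative assumptions, not CUDA API names.

```python
# Minimum compute capability (major, minor) required for each feature.
# Keys are illustrative labels chosen for this sketch, not CUDA identifiers.
MIN_CAPABILITY = {
    "3d_grid": (2, 0),
    "int_atomics_32bit_global": (1, 1),
    "int_atomics_64bit_global": (1, 2),
    "int_atomics_32bit_shared": (1, 2),
    "warp_vote": (1, 2),
    "double_precision": (1, 3),
    "float_atomic_add_32bit": (2, 0),
    "ballot": (2, 0),
    "threadfence_system": (2, 0),
    "syncthreads_count_and_or": (2, 0),
    "surface_functions": (2, 0),
}


def supports(feature, major, minor):
    """Return True if a device of capability major.minor has the feature."""
    # Tuples compare lexicographically, so (1, 3) >= (1, 2) and (2, 0) >= (1, 3).
    return (major, minor) >= MIN_CAPABILITY[feature]


def supported_features(major, minor):
    """Return the sorted list of table features available at major.minor."""
    return sorted(f for f in MIN_CAPABILITY if supports(f, major, minor))
```

In a real program the `major` and `minor` values would come from the device itself, for example from the `major` and `minor` fields of `cudaDeviceProp` returned by `cudaGetDeviceProperties`.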
Technical specifications by compute capability

| Technical specification | 1.0 | 1.1 | 1.2 | 1.3 | 2.x |
|---|---|---|---|---|---|
| Maximum dimensionality of grid of thread blocks | 2 | 2 | 2 | 2 | 3 |
| Maximum x- or y-dimension of a grid of thread blocks | 65535 | 65535 | 65535 | 65535 | 65535 |
| Maximum dimensionality of thread block | 3 | 3 | 3 | 3 | 3 |
| Maximum x- or y-dimension of a block | 512 | 512 | 512 | 512 | 1024 |
| Maximum z-dimension of a block | 64 | 64 | 64 | 64 | 64 |
| Maximum number of threads per block | 512 | 512 | 512 | 512 | 1024 |
| Warp size | 32 | 32 | 32 | 32 | 32 |
| Maximum number of resident blocks per multiprocessor | 8 | 8 | 8 | 8 | 8 |
| Maximum number of resident warps per multiprocessor | 24 | 24 | 32 | 32 | 48 |
| Maximum number of resident threads per multiprocessor | 768 | 768 | 1024 | 1024 | 1536 |
| Number of 32-bit registers per multiprocessor | 8 K | 8 K | 16 K | 16 K | 32 K |
| Maximum amount of shared memory per multiprocessor | 16 KB | 16 KB | 16 KB | 16 KB | 48 KB |
| Number of shared memory banks | 16 | 16 | 16 | 16 | 32 |
| Amount of local memory per thread | 16 KB | 16 KB | 16 KB | 16 KB | 512 KB |
| Constant memory size | 64 KB | 64 KB | 64 KB | 64 KB | 64 KB |
| Cache working set per multiprocessor for constant memory | 8 KB | 8 KB | 8 KB | 8 KB | 8 KB |
| Cache working set per multiprocessor for texture memory | Device dependent, between 6 KB and 8 KB (all compute capabilities) |||||
| Maximum width for a 1D texture reference bound to a CUDA array | 8192 | 8192 | 8192 | 8192 | 32768 |
| Maximum width for a 1D texture reference bound to linear memory | 2^27 | 2^27 | 2^27 | 2^27 | 2^27 |
| Maximum width and number of layers for a 1D layered texture reference | 8192 x 512 | 8192 x 512 | 8192 x 512 | 8192 x 512 | 16384 x 2048 |
| Maximum width and height for a 2D texture reference bound to linear memory or a CUDA array | 65536 x 32768 | 65536 x 32768 | 65536 x 32768 | 65536 x 32768 | 65536 x 65535 |
| Maximum width, height, and number of layers for a 2D layered texture reference | 8192 x 8192 x 512 | 8192 x 8192 x 512 | 8192 x 8192 x 512 | 8192 x 8192 x 512 | 16384 x 16384 x 2048 |
| Maximum width, height, and depth for a 3D texture reference bound to linear memory or a CUDA array | 2048 x 2048 x 2048 | 2048 x 2048 x 2048 | 2048 x 2048 x 2048 | 2048 x 2048 x 2048 | 2048 x 2048 x 2048 |
| Maximum number of textures that can be bound to a kernel | 128 | 128 | 128 | 128 | 128 |
| Maximum width for a 1D surface reference bound to a CUDA array | Not supported | Not supported | Not supported | Not supported | 8192 |
| Maximum width and height for a 2D surface reference bound to a CUDA array | Not supported | Not supported | Not supported | Not supported | 8192 x 8192 |
| Maximum number of surfaces that can be bound to a kernel | Not supported | Not supported | Not supported | Not supported | 8 |
| Maximum number of instructions per kernel | 2 million | 2 million | 2 million | 2 million | 2 million |
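
The resident-block and resident-warp limits interact: a block of N threads occupies ceil(N / 32) warps, so the number of blocks that can be resident on one multiprocessor is capped by both limits at once. The sketch below works through that arithmetic in Python. It assumes the column assignment from NVIDIA's published table (24 resident warps for 1.0 and 1.1, 32 for 1.2 and 1.3, 48 for 2.x) and deliberately ignores register and shared-memory pressure, which can lower the result further. The names `LIMITS` and `resident_blocks` are hypothetical.

```python
WARP_SIZE = 32  # identical for every compute capability in the table

# Per-multiprocessor limits, keyed by compute capability.
LIMITS = {
    "1.0": {"max_threads_per_block": 512, "max_warps": 24, "max_blocks": 8},
    "1.1": {"max_threads_per_block": 512, "max_warps": 24, "max_blocks": 8},
    "1.2": {"max_threads_per_block": 512, "max_warps": 32, "max_blocks": 8},
    "1.3": {"max_threads_per_block": 512, "max_warps": 32, "max_blocks": 8},
    "2.x": {"max_threads_per_block": 1024, "max_warps": 48, "max_blocks": 8},
}


def resident_blocks(capability, threads_per_block):
    """Upper bound on resident blocks per multiprocessor.

    Considers only the warp and block limits; registers and shared
    memory are not modelled in this sketch.
    """
    limits = LIMITS[capability]
    if not 1 <= threads_per_block <= limits["max_threads_per_block"]:
        raise ValueError("block size outside the limit for this capability")
    # A partially filled warp still occupies a whole warp slot.
    warps_per_block = (threads_per_block + WARP_SIZE - 1) // WARP_SIZE
    return min(limits["max_blocks"], limits["max_warps"] // warps_per_block)
```

For example, 256-thread blocks use 8 warps each, so at most 3 fit on a compute capability 1.0 multiprocessor and 6 on a 2.x one, while 64-thread blocks on 2.x are capped at 8 by the block limit rather than the warp limit.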