October 2020 – rabbit hole 101

Memory access patterns to shared memory in CUDA

Posted on 2020-10-212021-02-16 by hofmannu

CUDA has a small amount of memory available for its threads called shared memory. As the name already suggests is that this memory is available to all threads within a block simultaneously. We want to use this property to make threads read memory from global memory to shared memory in a block, use the memory together, and afterwards write the result back into global memory to avoid multiple accesses to global memory. Nevertheless, there are some rules one need to respect for high performance.

Continue reading “Memory access patterns to shared memory in CUDA”

Setting up an ArchLinux based workstation

Posted on 2020-10-202022-06-29 by hofmannu

How to setup and configure an ArchLinux based workstation for scientific computing in a mulit-user environment.

Continue reading “Setting up an ArchLinux based workstation”