NPRG058: Advanced Programming in Parallel Environment

Assignment 2 - CUDA Histogram II

We continue with the Histogram saga (assuming you have completed the first assignment).

In this episode, we will try to improve the performance of the kernel with atomic updates using three optimizations:

Experiment! Implement all approaches (or their combinations) and empirically determine which approaches are better and when (i.e., test them for different inputs and different histogram sizes).