As part of a collaborative effort with a team of PhDs led by Dr. Dmitry Mikushin, I worked on improving the hash rate of open-source Ethereum PoW (ETHW) cryptocurrency miner, implemented in CUDA. Our objective was to push the performance of the miner closer to that of top-tier commercial solutions.
My primary role involved profiling and optimizing the existing CUDA code using NVIDIA's Nsight Systems and Nsight Compute tools. Through detailed performance analysis, I identified key bottlenecks in kernel execution and memory access patterns. I then applied a range of low-level optimizations at kernel level to enhance GPU efficiency.
These efforts led to a mining hash rate speedup of approximately 0.5%, placing our miner’s performance within 0.5% of the T-Rex miner, one of the best commercial Ethereum mining solutions available at the time.