Matrix chain multiplication is an an optimization problem that can be solved using dynamic programming. Given a sequence of matrices, the goal is to find the most efficient way to multiply these ...
CUDA kernels optimized with 2D thread blocks and grid dimensions. Simple interface to demonstrate performance on large matrix sizes (e.g., 1024x1024). Easily customizable for different matrix sizes or ...