I have implementations of the k-means algorithm in C and in CUDA.
Both work, but the CUDA version takes longer to run than the C version.
It should be the other way around: the CUDA code should be faster than the C program.
I will provide my code and the datasets.
I need someone to debug the code, fix it, and explain what was wrong.