Too large input size |
![]() |
| log in |
Questions and Answers : GPU applications : Too large input size
| Author | Message |
|---|---|
|
I’m working on a CUDA program, but I have some trouble when I partition the input data. cudaMemcpy(d_input, &input[index], partsize * sizeof(12byte_struct), cudaMemcpyHostToDevice); Kernel<<<nrofblocks, blocksize>>>(partsize, d_input, d_output); cudaMemcpy(output + index, d_output, partize * sizeof(float), cudaMemcpyDeviceToHost); The weird thing is that this works for small input (which would fit completely in GPU memory) when I ‘split’ it in multiple parts, but not for large input. Also, it does not seem to matter for a large input what operations are performed in the kernel; the output is the same. So, I think it has something to do with the memory. I don’t get any memory allocation or copy errors. Does anyone has an idea how I can solve this problem? Would freeing and reallocating d_input or d_output do any good? | |
| ID: 1181007 · | |
|
Hi Barry, | |
| ID: 1181013 · | |
|
Oke, thanks for the info. | |
| ID: 1181020 · | |
Questions and Answers : GPU applications : Too large input size
| Copyright © 2013 University of California |