Hello Johannes,
I tested nvJPEG library.
The result is after decoding on the GPU. (50 % jpeg compression)
** Test Result : 2048 x 2048 = 150fps , 2048 x 1024 = 295fps
For the information,
The nvJPEG image buffer consists of each RGB channel.
So we need to merge each channel in CPU side again for 24bit image.
I don't have enough experience with GPU processing and multi core processing .
Step 1 : FPGA(encoding) Got Full speed (2048 x 2048 = 150fps)
Step 2: GPU(decoding) Got Full speed (2048 x 2048 = 150fps)
** Without copying memory from GPU to host
Step 3: Copy image data of each channel from GPU to HOST memory. ( decreased frame rate)
** Just copy Red channel only ( 2048 x2048 = 140fps) copy Green&Blue channel (decrease frame rate more)
Step 4: RGB Color merge
This is the test result so far.
I need more analysis of Step 3 & 4 for increase speed.
Thanks.