Add JpegCompressionDistortion CPU and GPU operators #2823

jantonguirao · 2021-03-29T16:04:51Z

Signed-off-by: Joaquin Anton [email protected]

Why we need this PR?

It adds a new operator needed to generate JPEG-like distortion.

What happened in this PR?

- What solution was applied:
  Added a JpegCompressionDistortion operator with both GPU and CPU implementations.
  The CPU version uses OpenCV imencode/imdecode.
  The GPU version uses a custom CUDA kernel.
- Affected modules and functionalities:
  New operator
- Key points relevant for the review:
  All
- Validation and testing:
  Python tests added
- Documentation (including examples):
  Docstring

JIRA TASK: [DALI-1932] [DALI-1941]

JanuszL · 2021-03-29T16:09:12Z

dali/operators/image/distortion/jpeg_compression_distortion_op_cpu.cc

+    .AddOptionalArg("quality",
+        R"code(JPEG compression quality from 1 (lowest quality) to 95 (highest quality).
+
+Any values outside the range 1-99 will be clamped.)code",


Suggested change

-Any values outside the range 1-99 will be clamped.)code",
+Any values outside the range 1-95 will be clamped.)code",

?

It was actually the other way around. I fixed it now.

I think that you can set quality 100 in many graphics editors. Also, I predict that having this parameter as float would be more convenient.

Both libjpeg and OpenCV use an integer for the quality factor. I will check if 100 makes sense in the current formula.
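For reference, a minimal sketch of the quality scaling that libjpeg applies (`jpeg_quality_scaling` followed by the baseline table scaling in `jpeg_add_quant_table`). The function names `QualityToScale` and `ScaleQuantEntry` are hypothetical, and whether the GPU kernel in this PR uses exactly this formula is an assumption. With this formula, quality 100 gives a scale of 0, so every quantization entry is clamped to 1.

```cpp
#include <algorithm>
#include <cassert>

// Maps a quality factor to a percentage scale, following libjpeg's
// jpeg_quality_scaling(): out-of-range values are clamped to 1..100 first.
int QualityToScale(int quality) {
  quality = std::max(1, std::min(quality, 100));
  return quality < 50 ? 5000 / quality : 200 - 2 * quality;
}

// Scales one entry of a base quantization table (e.g. 16 for the first
// luminance entry) and clamps it to the baseline JPEG range 1..255.
int ScaleQuantEntry(int base, int quality) {
  assert(base >= 1 && base <= 255);
  int scaled = (base * QualityToScale(quality) + 50) / 100;
  return std::max(1, std::min(scaled, 255));
}
```

Quality 50 leaves the base table unchanged (scale 100), while values below 50 grow the quantization steps quickly, which is why low qualities produce strong artifacts.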

mzient · 2021-03-29T18:29:09Z

dali/operators/image/distortion/jpeg_compression_distortion_op_cpu.cc

+        cv::Mat in_mat(sh[0], sh[1], CV_8UC3, (void*) in_view[sample_idx].data);  // NOLINT
+        cv::Mat out_mat(sh[0], sh[1], CV_8UC3, (void*) out_view[sample_idx].data);  // NOLINT


Wouldn't that work?

Suggested change

-cv::Mat in_mat(sh[0], sh[1], CV_8UC3, (void*) in_view[sample_idx].data); // NOLINT
-cv::Mat out_mat(sh[0], sh[1], CV_8UC3, (void*) out_view[sample_idx].data); // NOLINT
+cv::Mat in_mat(sh[0], sh[1], CV_8UC3, in_view[sample_idx].data);
+cv::Mat out_mat(sh[0], sh[1], CV_8UC3, out_view[sample_idx].data);

It doesn't work (there are other overloads).

Isn't there a problem with the fact that our shape has int64 extents? What about this?

Suggested change

-cv::Mat in_mat(sh[0], sh[1], CV_8UC3, (void*) in_view[sample_idx].data); // NOLINT
-cv::Mat out_mat(sh[0], sh[1], CV_8UC3, (void*) out_view[sample_idx].data); // NOLINT
+int h = sh[0], w = sh[1];
+cv::Mat in_mat(h, w, CV_8UC3, in_view[sample_idx].data);
+cv::Mat out_mat(h, w, CV_8UC3, out_view[sample_idx].data);

The main problem seems to be:

jpeg_compression_distortion_op_cpu.cc:69:59: error: invalid conversion from ‘const void*’ to ‘void*’

The problem is const - use const_cast<void*> then (or const_cast<uint8_t*> or whatever our format is).
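A minimal sketch of the two points raised in this thread: narrowing the int64 extents explicitly and dropping constness with `const_cast` rather than a C-style cast. `ImageHeader`, `ToIntExtent` and `WrapReadOnly` are hypothetical names; `ImageHeader` only stands in for a non-owning wrapper such as `cv::Mat`, whose user-data constructor takes a non-const `void*`, and the element type `uint8_t` is assumed from `CV_8UC3`.

```cpp
#include <cassert>
#include <cstdint>
#include <limits>

// Hypothetical stand-in for a non-owning image header (like cv::Mat),
// which stores a non-const void* even when used for read-only access.
struct ImageHeader {
  int rows;
  int cols;
  void *data;
};

// Narrows a 64-bit extent to int, checking that the value fits.
int ToIntExtent(int64_t extent) {
  assert(extent >= 0 && extent <= std::numeric_limits<int>::max());
  return static_cast<int>(extent);
}

// Wraps read-only input data. const_cast only removes constness so the
// pointer matches the void* member; the caller must not write through it.
ImageHeader WrapReadOnly(const uint8_t *data, int64_t height, int64_t width) {
  return ImageHeader{ToIntExtent(height), ToIntExtent(width),
                     const_cast<uint8_t *>(data)};
}
```

Unlike `(void*)`, `const_cast` states the intent and cannot silently change the pointee type, which may also make the `// NOLINT` marker unnecessary.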

Signed-off-by: Joaquin Anton <[email protected]>

szalpal · 2021-04-12T12:45:34Z

dali/operators/image/distortion/jpeg_compression_distortion_op_cpu.cc

+    .DocStr(R"code(Produces JPEG-like distortion in RGB images.
+
+The level of degradation of the image can be controlled with the ``quality`` argument,
+)code")


IMHO, there should be a clear explanation of what a "JPEG-like distortion" is.

Suggested change

-.DocStr(R"code(Produces JPEG-like distortion in RGB images.
-
-The level of degradation of the image can be controlled with the ``quality`` argument,
-)code")
+.DocStr(R"code(Introduces JPEG compression artifacts to RGB images.
+
+JPEG is a lossy compression format which exploits characteristics of natural images and the human visual system to achieve high compression ratios. The information loss originates from sampling the color information at a lower spatial resolution than the brightness and from representing high frequency components of the image with a lower effective bit depth. The conversion to frequency domain and quantization is applied independently to 8x8 pixel blocks, which introduces additional artifacts at block boundaries.
+
+This operation produces images by subjecting the input to a transformation that mimics JPEG compression with given ``quality`` factor followed by decompression.
+)code")

?
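To make the "lower effective bit depth" part of the proposed docstring concrete, here is a minimal sketch of the quantization round trip applied to a single frequency coefficient. `QuantizeRoundTrip` is a hypothetical helper, not code from this PR; the step `q` would come from the quality-scaled quantization table.

```cpp
#include <cassert>
#include <cmath>

// Quantizes a frequency coefficient with step q and reconstructs it.
// Everything between multiples of q is lost, which is the source of the
// distortion; larger steps (lower quality) lose more.
float QuantizeRoundTrip(float coeff, int q) {
  assert(q >= 1);
  return std::round(coeff / q) * q;
}
```

For example, with a step of 10 a coefficient of 37 is reconstructed as 40 and a coefficient of 4 is reconstructed as 0, while a step of 1 preserves integer coefficients.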

Signed-off-by: Joaquin Anton <[email protected]>

jantonguirao · 2021-04-12T17:03:31Z

!build

dali-automaton · 2021-04-12T17:06:16Z

CI MESSAGE: [2260239]: BUILD STARTED

dali-automaton · 2021-04-12T18:39:23Z

CI MESSAGE: [2260239]: BUILD PASSED

mzient · 2021-04-12T20:01:33Z

dali/operators/image/distortion/jpeg_compression_distortion_op_cpu.cc

+        R"code(JPEG compression quality from 1 (lowest quality) to 100 (highest quality).
+
+Any values outside the range 1-100 will be clamped.)code",
+                    95, true);


It's a nitpick, really, but I think that a more realistic value would be 90. 80 is usually considered quite poor, 95 typically requires considerable magnification to see any distortion. Perhaps there shouldn't be any default value?

It's the default quality used in OpenCV and libjpeg-turbo, I believe.

Signed-off-by: Joaquin Anton <[email protected]>

jantonguirao · 2021-04-13T11:29:00Z

!build

dali-automaton · 2021-04-13T11:32:07Z

CI MESSAGE: [2263323]: BUILD STARTED

dali-automaton · 2021-04-13T12:45:07Z

CI MESSAGE: [2263323]: BUILD PASSED

JanuszL reviewed Mar 29, 2021

jantonguirao changed the title ~~[WIP] Add JpegCompressionDistortion operator~~ Add JpegCompressionDistortion operator Mar 29, 2021

mzient reviewed Mar 29, 2021

dali/operators/image/distortion/jpeg_compression_distortion_op_gpu.cu

mzient reviewed Mar 29, 2021

dali/operators/image/distortion/jpeg_compression_distortion_op_gpu.cu

klecki marked this pull request as draft March 30, 2021 09:42

jantonguirao changed the title ~~Add JpegCompressionDistortion operator~~ Add JpegCompressionDistortion CPU and GPU operators Mar 30, 2021

jantonguirao force-pushed the jpeg_distortion_op_gpu branch 2 times, most recently from 4b1e530 to 32b82b0 April 2, 2021 12:44

jantonguirao assigned szalpal and awolant Apr 6, 2021

jantonguirao force-pushed the jpeg_distortion_op_gpu branch from 32b82b0 to 9ac4f2c April 6, 2021 09:58

jantonguirao marked this pull request as ready for review April 6, 2021 10:00