[luci] Introduce Compress weights pass #13521

SlavikMIPT · 2024-07-25T11:22:16Z

This commit introduces CopressWeightsPass for Conv2D

ONE-DCO-1.0-Signed-off-by: Vyacheslav Bazhenov [email protected]

SlavikMIPT · 2024-07-25T11:28:28Z

Current encoded array format:
8b - tree size in bits, 8b - data size in bits, encoded Huffman tree and data bitstream

Using example:

Download mobilenet_v1_1.0_224_quant.tflite

tflite2circle mobilenet_v1_1.0_224_quant.tflite mobilenet_v1_1.0_224_quant.circle
circle2circle --compress_weights_huffman mobilenet_v1_1.0_224_quant.circle mobilenet_v1_1.0_224_quant.compressed.circle

Compression results for mobilenet_v1_1.0_224_quant.circle:
4,275,832 bytes -> 2,971,688 bytes (35% compression)

seanshpark · 2024-07-28T21:46:43Z

compiler/luci/export/src/CircleExporterUtils.cpp

+    case luci::WeightCompression::NONE:
+      return circle::WeightCompressionType_NONE;
+    case luci::WeightCompression::HUFFMAN:
+      return circle::WeightCompressionType_Huffman;


I assume you are adding weight compression with Huffman coding.
As you adding this define here, I think compress_weights_huffman instead of general compress_weights, would be better name.

@seanshpark fixed, please take a look

compiler/luci/export/src/CircleTensorExporter.cpp

compiler/luci/export/src/CircleExporterUtils.cpp

compiler/luci/export/src/CircleExporterUtils.h

compiler/luci/pass/include/luci/CircleOptimizer.h

compiler/luci/pass/src/CompressWeightsPass.cpp

seanshpark · 2024-07-31T21:55:18Z

compiler/luci/pass/src/CompressWeightsPass.cpp

+    auto conv2d = dynamic_cast<luci::CircleConv2D *>(node);
+    if (not conv2d)
+      continue;
+    loco::DataType weights_dtype = loco::must_cast<luci::CircleConst *>(conv2d->filter())->dtype();


please split lines. I'd like to have easy readable codes.

Suggested change

loco::DataType weights_dtype = loco::must_cast<luci::CircleConst *>(conv2d->filter())->dtype();

auto filter = loco::must_cast<luci::CircleConst *>(conv2d->filter());

auto weights_dtype = filter->dtype();

seanshpark · 2024-07-31T21:57:15Z

compiler/luci/pass/src/helpers/HuffmanEncoder.h

+      arr.push_back(
+        *(static_cast<const uint8_t *>(static_cast<const void *>(&kTreeSizeInBits)) + i));


can you plz split lines?

@seanshpark Maybe think about better place for HuffmanEncoder.h/HuffmanDecoder.h library? It will be used in onert-micro and other inferences - so in current configuration luci dependency is required, can we place this on top level as separate dependency?

can we place this on top level as separate dependency?

Need to think about this. this maybe the first one, ... and I don't think I will like this.

Let's go with duplicate codes. Sharing code each other will complicate dependency.

https://github.com/Samsung/ONE/pull/13521/files#r1699149426

seanshpark · 2024-07-31T22:01:21Z

res/CircleSchema/0.8/circle_schema.fbs

@@ -831,6 +837,7 @@ table Conv2DOptions {
  dilation_h_factor:int = 1;
  // Parameters for Conv2D version 8 or above.
  // When set, quantized_bias_type defines the dtype for both bias and accumulator.
+  weight_compression_type:WeightCompressionType = NONE;


we need to upgrade to 0.9 and this requires lots of other changes.

CC @hseok-oh

res/CircleSchema/0.8/circle_schema.fbs

seanshpark · 2024-07-31T22:13:15Z

compiler/luci/pass/src/CompressWeightsPass.cpp

+    }
+    else
+    {
+      throw std::runtime_error("Huffman weights compression supports s8 and u8");


plz do not throw, just debug info is OK.
we do not want to stop circle2circle with this reason.
this throw should be in the import module.

seanshpark · 2024-07-31T22:15:35Z

compiler/luci/lang/include/luci/IR/Nodes/CircleConv2D.h

@@ -34,7 +34,8 @@ namespace luci
 */
 class CircleConv2D final : public FixedArityNode<3, CircleNodeImpl<CircleOpcode::CONV_2D>>,
                           public CircleNodeMixin<CircleNodeTrait::FusedActFunc>,
-                           public CircleNodeMixin<CircleNodeTrait::Bias>
+                           public CircleNodeMixin<CircleNodeTrait::Bias>,
+                           public CircleNodeMixin<CircleNodeTrait::WeightCompression>


Q) why does Conv2D have this attribute? why not the filter Constant?

Q) why does Conv2D have this attribute? why not the filter Constant?

CircleConst is virtual node - and we need somehow export and import circle - so I suggest to set this attribute when importing op from circle and also set this in CircleConst

I don't understand. Please give more explanation.

seanshpark · 2024-07-31T22:17:07Z

Compression results for mobilenet_v1_1.0_224_quant.circle:
4,275,952 bytes -> 2,966,952 bytes (1.44 compression rate)

@SlavikMIPT , is the purpose of compression to reduce file size? or is there any other reasons?

seanshpark · 2024-07-31T22:17:48Z

I don't see any code changes in luci/import. is it OK?

seanshpark · 2024-07-31T22:19:41Z

I recommend to introduce luci-pass-value-py-test to validate compressed file is OK.
which will require luci-interpreter changes too.

compiler/luci/pass/src/CompressWeightsPass.cpp

SlavikMIPT · 2024-08-02T10:33:36Z

Compression results for mobilenet_v1_1.0_224_quant.circle:
4,275,952 bytes -> 2,966,952 bytes (1.44 compression rate)

@SlavikMIPT , is the purpose of compression to reduce file size? or is there any other reasons?

There are two purposes: reducing file size(for microcontrollers - this can allow to use models which don't fit in flash memory without compression) and reducing memory bandwidth requirements (which can be a bottleneck for hardware accelerators), Huffman and RLE encodings are relatively computationally cheap. Combining this with compression-aware training or quantization we potentially can achieve higher compression rates

compiler/luci/pass/src/helpers/HuffmanDecoder.h

compiler/luci/pass/src/helpers/HuffmanEncoder.h

seanshpark · 2024-09-01T22:54:13Z

compiler/luci/pass/src/helpers/HuffmanDecoder.h

@@ -0,0 +1,355 @@
+/*


1/ this is header only file. is there any particular reason to make so? why not split implementations to .cpp file?
2/ copy right contains only Samsung. is this file made by you from scratch?
3/ there is no .test.cpp file for this. can you add some?

ok

yes

ok

hseok-oh · 2024-09-04T05:46:35Z

@hseok-oh , please share your thoughts about modification of circle schema and compressed data.

Looks good.

SlavikMIPT · 2024-09-30T15:24:51Z

Refactored, but I am not sure that code duplication of Decoder/Encoder is good idea - I would think about extracting it into separate component

This commit introduces CopressWeightsPass for Conv2D ONE-DCO-1.0-Signed-off-by: Vyacheslav Bazhenov <[email protected]>

jinevening · 2024-10-15T07:57:59Z

compiler/luci/export/src/CircleTensorExporter.cpp

+      if (lhs->size<loco::DataType::FLOAT32>() != rhs->size<loco::DataType::FLOAT32>())
+        return false;


This code looks very ugly. Isn't it possible to just check lhs->compression() == rhs->compression()?

jinevening · 2024-10-15T08:01:29Z

compiler/luci/pass/include/luci/Pass/CompressWeightsPass.h

+ *
+ * To see the target Op pattern, please visit implementation.
+ */
+struct CompressWeightsPass final : public logo::Pass


The name looks too general. Can you rename it to something like CompressWeightsHuffmanPass?

jinevening · 2024-10-15T08:05:11Z

It seems that test codes are missing as @seanshpark pointed out.

I left some comments because @SlavikMIPT requested review. But I don't know details about the algorithm. It would be better to add another reviewer, e.g., @hseok-oh.

seanshpark · 2024-10-15T21:30:34Z

plz split each compiler/* module changes to separate PR.
and this PR change has too many files to review.

SlavikMIPT marked this pull request as draft July 25, 2024 11:22

seanshpark reviewed Jul 28, 2024

View reviewed changes

SlavikMIPT force-pushed the luci-compress-weights-pass branch 3 times, most recently from 98fc07b to 685f2b9 Compare July 31, 2024 14:48

SlavikMIPT marked this pull request as ready for review July 31, 2024 15:05

SlavikMIPT changed the title ~~[DRAFT][WIP][luci] Introduce Compress weights pass~~ [DRAFT[luci] Introduce Compress weights pass Jul 31, 2024

SlavikMIPT changed the title ~~[DRAFT[luci] Introduce Compress weights pass~~ [DRAFT][luci] Introduce Compress weights pass Jul 31, 2024

SlavikMIPT force-pushed the luci-compress-weights-pass branch 2 times, most recently from 950a0c0 to 2de0564 Compare July 31, 2024 16:17