fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

VTrngNghia · 2024-07-16T02:55:35Z

What does this PR do?

Adds a keyword argument to allow passing extra_options to ORTQuantizer.quantize()

To fix RuntimeError:

Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0'. shape_inference failed to return a type probably this node is from a different domain or using an input produced by such an operator. This may happen if you quantize a model already quantized. You may use extra_options DefaultTensorType to indicate the default weight type, usually onnx.TensorProto.FLOAT.

Maybe it can be added to AutoQuantizationConfig, but there any many @staticmethod for that, so maybe this quick fix is simpler.

Who can review?

It's very simple. Anyone can review.

ONNX / ONNX Runtime : @fxmarty, @echarlaix, @JingyaHuang, @michaelbenayoun
ONNX Runtime Training: @JingyaHuang
BetterTransformer: @fxmarty
GPTQ, quantization: @fxmarty, @SunMarc
TFLite export: @michaelbenayoun

severinsimmler · 2024-10-30T14:56:22Z

+1

Thanks for fixing this @VTrngNghia

Whadup · 2024-12-04T13:51:56Z

Is there any harm in just hard-coding "DefaultTensorType": onnx.TensorProto.FLOAT in the extra_options dict?

home15c6 · 2024-12-04T14:02:00Z

Do you mean hard-coding in your package code? Then no "harm" except every time you setup your environment (say, in Docker container), you'll have to apply that change again.
There are tools to automatically apply local changes (and I'm using them until they merge this PR). They're a bit hassle to setup.

Whadup · 2024-12-04T14:33:00Z

I meant to change the code in this PR to

 "extra_options": {
      "WeightSymmetric": quantization_config.weights_symmetric,
      "ActivationSymmetric": quantization_config.activations_symmetric,
      "EnableSubgraph": has_subgraphs,
      "ForceSymmetric": quantization_config.activations_symmetric and quantization_config.weights_symmetric,
      "AddQDQPairToWeight": quantization_config.qdq_add_pair_to_weight,
      "DedicatedQDQPair": quantization_config.qdq_dedicated_pair,
      "QDQOpTypePerChannelSupportToAxis": quantization_config.qdq_op_type_per_channel_support_to_axis,
      "DefaultTensorType": onnx.TensorProto.FLOAT
  },
and not pipe through the extra_options keyword argument

michaelbenayoun · 2024-12-12T13:54:14Z

optimum/onnxruntime/quantization.py

@@ -286,6 +287,7 @@ def quantize(
        calibration_tensors_range: Optional[Dict[str, Tuple[float, float]]] = None,
        use_external_data_format: bool = False,
        preprocessor: Optional[QuantizationPreprocessor] = None,
+        extra_options: Optional[Dict[str, Any]] = {}


Can you set the default to None and then create an empty dict in the method please?
Dict as default value is not good because they are mutable.

michaelbenayoun · 2024-12-12T13:54:32Z

cc @echarlaix

feat(quantization): add extra_options

4f9a167

VTrngNghia changed the title ~~feat(quantization): add extra_options~~ fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' Jul 16, 2024

michaelbenayoun reviewed Dec 12, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

VTrngNghia commented Jul 16, 2024 •

edited

Loading

severinsimmler commented Oct 30, 2024

Whadup commented Dec 4, 2024

home15c6 commented Dec 4, 2024

Whadup commented Dec 4, 2024

michaelbenayoun Dec 12, 2024

michaelbenayoun commented Dec 12, 2024

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

Are you sure you want to change the base?

fix Unable to find data type for weight_name='/encoder/layer.0/attention/output/dense/MatMul_output_0' #1959

Conversation

VTrngNghia commented Jul 16, 2024 • edited Loading

What does this PR do?

Who can review?

severinsimmler commented Oct 30, 2024

Whadup commented Dec 4, 2024

home15c6 commented Dec 4, 2024

Whadup commented Dec 4, 2024

michaelbenayoun Dec 12, 2024

Choose a reason for hiding this comment

michaelbenayoun commented Dec 12, 2024

VTrngNghia commented Jul 16, 2024 •

edited

Loading