Skip to content

1.21.6

Compare
Choose a tag to compare
@PINTO0309 PINTO0309 released this 17 May 14:17
· 133 commits to main since this release
  • MatMulInteger
    Currently, MatMulInteger is implemented as tf matmul with int32 inputs/outputs, which leads to generation of Flex(Batch)MatMul ops.

    When -rtpo MatMulInteger is specified, inputs of MatMulInteger are casted to float32 instead, allowing the node to be converted to the builtin FullyConnected or BatchMatMul ops.

    ONNX input:
    image

    Before:
    Screenshot_20240517_202911

    After:
    image

What's Changed

Full Changelog: 1.21.5...1.21.6