Update rvv-intrinsic-generator to define new RVV C intrinsic API for bf16 type #229

Closed
wants to merge 14 commits

Conversation


@joshua-arch1 joshua-arch1 commented May 4, 2023

As discussed in #223, the BF16 extension has recently been proposed, and we need to define new intrinsics for the conversion instructions (bf16-to-fp32/fp32-to-bf16) and the reinterpret functions, as well as for vfwmaccbf16 in the Zvfbfwma extension.
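
For orientation, the generator wires each intrinsic group up through a function_group(...) call, as the diff hunks further down show. Below is a minimal, self-contained sketch of that registration shape, using a stand-in stub and placeholder template names, section titles, and anchors; it is not the actual patch.

```python
# Minimal sketch of the generator's registration shape, not project code.
# The stub below only mimics the function_group(...) signature seen in the
# diff hunks; templates, titles, and anchors are placeholders.
def function_group(template, title, link, ops, types, sews, lmuls, decorator):
    print(f"[{title}] ops={ops} types={types}")

BFTYPES = ["bfloat16"]                      # placeholder type list
WFSEWS = ["16"]                             # placeholder SEW list
WLMULS = ["mf4", "mf2", "m1", "m2", "m4"]   # placeholder LMUL list

# bf16 <-> fp32 conversions and reinterpret casts (Zvfbfmin)
function_group("cvt_template", "Vector BFloat16 Convert Functions (draft)",
               "#placeholder", ["wcvt", "ncvt"], BFTYPES, WFSEWS, WLMULS, None)
function_group("reint_template", "Vector BFloat16 Reinterpret Functions (draft)",
               "#placeholder", ["reinterpret"], BFTYPES, WFSEWS, WLMULS, None)
# widening multiply-add (Zvfbfwma)
function_group("mac_template", "Vector BFloat16 Widening Multiply-Add Functions (draft)",
               "#placeholder", ["wmacc"], BFTYPES, WFSEWS, WLMULS, None)
```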

@joshua-arch1 joshua-arch1 reopened this May 4, 2023
@joshua-arch1 joshua-arch1 changed the title from "Update reint_op_template.py to define reinterpret intrinsics for new bf16 type" to "Update rvv-intrinsic-generator to define new RVV C intrinsic API for bf16 type" May 4, 2023
@@ -331,7 +331,7 @@ def gen(g):
       "Vector Widening Floating-Point Fused Multiply-Add Functions",
       REF_DOC_URL +
       "#147-vector-widening-floating-point-fused-multiply-add-operations",
-      ["wmacc", "wnmacc", "wmsac", "wnmsac"], FTYPES, WFSEWS, WLMULS,
+      ["wmacc", "wnmacc", "wmsac", "wnmsac"], BFTYPES, WFSEWS, WLMULS,
       decorators.has_masking_no_maskedoff_policy)
Collaborator

I would like bfloat16 to have a separate section, and to mention that it is still in draft status.


circYuan commented May 8, 2023

Hi! It looks like the commits you provided generate intrinsics like vfncvtbf16_rtz_bf16_f_w_f16. However, this may not be correct, because the spec does not define an rtz form for the vector bf16 convert instructions. I think some condition checking may be needed around line 115 of cvt_op_template.py.
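
A minimal, runnable sketch of the kind of guard being suggested, using illustrative variant tuples rather than the real data structures in cvt_op_template.py:

```python
# Sketch only, not the actual cvt_op_template.py code: drop the static
# round-towards-zero ("rtz") variants when the destination element type is
# bf16, since the spec defines no rtz form of the bf16 narrowing convert.
variants = [
    ("vfncvtbf16_f_f_w", "bf16", None),       # valid narrowing convert -> keep
    ("vfncvtbf16_rtz_f_f_w", "bf16", "rtz"),  # no such instruction -> skip
    ("vfncvt_rtz_x_f_w", "i16", "rtz"),       # unrelated float-to-int rtz -> keep
]

for name, dst_type, frm in variants:
    if dst_type == "bf16" and frm == "rtz":
        continue  # guard: nothing to emit for a bf16 rtz convert
    print("emit", name)
```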

@joshua-arch1 joshua-arch1 (Author)

The PR for the __bf16 ABI has been approved and is going to be merged (riscv-non-isa/riscv-elf-psabi-doc#367). Maybe we can now accelerate finalizing our RVV bf16 intrinsics.

@eopXD eopXD (Collaborator) left a comment

Thank you for pushing this forward, and sorry that I haven't been able to review it in the past two weeks.

Please put vfwmaccbf16, vfwcvtbf16.f.f.v, and vfncvtbf16.f.f.w into a separate section. Please also merge your commits from #224 into this PR.

Regarding the type descriptions in the document, I think you should make it explicit that the bfloat16 types and the intrinsics will not be available when Zvfbfmin and Zvfbfwma are not specified in the architecture.

On the other hand, I agree with Rich's latest comment that it would be convenient to have a bfloat16 load/store that represents a (load + fncvt) and (fwcvt + store).

I have replied with my thoughts regarding this and the planned v1.0 release on the mailing list [0].

[0] https://lists.riscv.org/g/tech-rvv-intrinsics/message/57

@@ -334,6 +334,13 @@ def gen(g):
       ["wmacc", "wnmacc", "wmsac", "wnmsac"], FTYPES, WFSEWS, WLMULS,
       decorators.has_masking_no_maskedoff_policy)

+  g.function_group(
+      mac_template,
+      "Vector BFloat16 Widening Multiply-Add Functions (draft)",
Collaborator

Please add Zvfbfwma in the subtitle.

@joshua-arch1 joshua-arch1 requested a review from eopXD May 17, 2023 08:33
@joshua-arch1 joshua-arch1 (Author) commented May 17, 2023

> On the other hand, I agree with Rich's latest comment that it would be convenient to have a bfloat16 load/store that represents a (load + fncvt) and (fwcvt + store).

I have updated my PR according to your comments, except for the load/store intrinsics. Maybe you have a different understanding from Rich's: he meant an int16 load/store followed by a reinterpret cast, rather than fncvt/fwcvt. Which definition is better? @eopXD
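
For what it is worth, the difference between the two readings can be shown with plain Python (an illustration only, not generator or intrinsic code): a reinterpret keeps the 16 loaded bits unchanged, while a fwcvt-style load widens the bf16 value to the fp32 value it encodes.

```python
# Illustration of the two proposed bf16 load definitions (not project code).
# A bf16 value is the top 16 bits of an fp32 encoding, so the two choices
# produce different element types and different register contents.
import struct

bf16_bits = 0x3FC0  # bf16 encoding of 1.5

# Reading 1 (Rich's): int16 load + reinterpret cast -> bits are left as-is.
reinterpreted = bf16_bits  # still 0x3FC0, now viewed as a bf16 element

# Reading 2 (load + fwcvt): widen to the fp32 value those bf16 bits encode.
widened = struct.unpack("<f", struct.pack("<I", bf16_bits << 16))[0]

print(hex(reinterpreted))  # 0x3fc0
print(widened)             # 1.5
```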

@eopXD eopXD (Collaborator) commented Sep 19, 2023

Hi,

Could you rebase onto the latest main? Thank you.

@eopXD eopXD (Collaborator) commented Sep 19, 2023

On top of rebasing, if possible, I think we can also put in the effort of adding the intrinsics for Zvfbfwma and Zvfbfmin here. I imagine we will also need load/store intrinsics for the bfloat16 type. The narrowing conversion and multiply-add intrinsics will need a rounding-mode variant.

@kito-cheng kito-cheng (Collaborator)

I believe this should be covered by this and #293, which have been merged into the main branch, so I am closing this :)

@kito-cheng kito-cheng closed this Jul 15, 2024
4 participants