This PR introduces a new op that computes the reciprocal of `vector` types using `nvvm.rcp` ops. Currently, only f32 is supported.

---------

Co-authored-by: jingzec <jingzec@nvidia.com>