/home/docs/checkouts/readthedocs.org/user_builds/advanced-micro-devices-composable-kernel/checkouts/develop/include/ck_tile/ops/norm_reduce/block/block_norm_reduce.hpp File Reference

Composable Kernel: /home/docs/checkouts/readthedocs.org/user_builds/advanced-micro-devices-composable-kernel/checkouts/develop/include/ck_tile/ops/norm_reduce/block/block_norm_reduce.hpp File Reference

Classes

struct  ck_tile::BlockNormReduce< Problem_, Policy_ >
 
struct  ck_tile::BlockNormReduceSync< Problem_, Policy_ >
 
struct  ck_tile::BlockNormReduceCrossWarpSync< Problem_, Policy_ >
 

Namespaces

 ck_tile
 

Functions

template<typename BlockShape >
constexpr CK_TILE_DEVICE index_t ck_tile::block_tile_welford_calculate_max_count (int row_size)
 
template<typename VarDistributedTensor_ , bool FastFdiv_ = false>
constexpr CK_TILE_DEVICE void ck_tile::block_tile_welford_post_scale_var (VarDistributedTensor_ &var_tensor, int count, bool_constant< FastFdiv_ >={})
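 
The function names above refer to Welford's algorithm for computing mean and variance in a single pass. The listing does not show the implementations, so the following is only a minimal host-side sketch of the general technique, not the ck_tile code. It assumes that "post scale" means the usual final step of dividing the accumulated sum of squared deviations by the sample count, and that partial results from different threads or warps are combined with the standard pairwise merge formula. `WelfordState`, `welford_update`, `welford_merge`, and `welford_post_scale_var` are hypothetical names introduced for illustration; they are not part of ck_tile.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical host-side accumulator for illustration; not a ck_tile type.
struct WelfordState
{
    double mean  = 0.0; // running mean of the samples seen so far
    double m2    = 0.0; // running sum of squared deviations from the mean
    int    count = 0;   // number of samples seen so far
};

// One Welford update step: fold a single sample x into the state.
inline void welford_update(WelfordState& s, double x)
{
    s.count += 1;
    const double delta = x - s.mean; // deviation from the old mean
    s.mean += delta / s.count;       // new mean
    s.m2 += delta * (x - s.mean);    // uses old and new deviation
}

// Combine two partial states, as a parallel reduction would do when
// merging per-thread or per-warp partial results (assumed usage).
inline WelfordState welford_merge(const WelfordState& a, const WelfordState& b)
{
    if(a.count == 0)
        return b;
    if(b.count == 0)
        return a;

    WelfordState out;
    out.count          = a.count + b.count;
    const double delta = b.mean - a.mean;
    out.mean           = a.mean + delta * b.count / out.count;
    out.m2             = a.m2 + b.m2 + delta * delta * a.count * b.count / out.count;
    return out;
}

// Post-scale step: turn the accumulated m2 into a population variance.
// Returns 0 for an empty state to avoid dividing by zero.
inline double welford_post_scale_var(const WelfordState& s)
{
    return s.count > 0 ? s.m2 / s.count : 0.0;
}
```

In this sketch, feeding the samples 1, 2, 3, 4 through `welford_update` and then calling `welford_post_scale_var` gives the population variance of those four values. Accumulating 1, 2 and 3, 4 in two separate states and combining them with `welford_merge` gives the same mean and variance. Deferring the division to a single final step keeps the per-sample update and the merge free of a second normalization.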