include/ck_tile/ops/epilogue/chainer/cshuffle_epilogue_schedule.hpp Source File
cshuffle_epilogue_schedule.hpp
Reusable simple epilogue operations that can be composed into more complex ones.
Definition: cluster_descriptor.hpp:13
constexpr auto make_node(Args... args)
Helper function for creating epilogue nodes.
Definition: epilogue_chainer.hpp:201
Definition: cshuffle_epilogue_chainer_ops.hpp:181
static constexpr index_t NumMXdlPerWavePerShuffle
Definition: cshuffle_epilogue_chainer_ops.hpp:326
static constexpr index_t MPerIterationShuffle
Definition: cshuffle_epilogue_chainer_ops.hpp:338
typename WG::CWarpDstr CWarpDstr
Definition: cshuffle_epilogue_chainer_ops.hpp:349
static constexpr index_t NumNXdlPerWavePerShuffle
Definition: cshuffle_epilogue_chainer_ops.hpp:327
static constexpr index_t NPerIterationShuffle
Definition: cshuffle_epilogue_chainer_ops.hpp:339
Scale working tile using tensor windows (CShuffle-specific)
Definition: cshuffle_epilogue_chainer_ops.hpp:79
Slice accumulator tile for CShuffle epilogue.
Definition: cshuffle_epilogue_chainer_ops.hpp:39
Cast working tile and store to LDS.
Definition: common_epilogue_ops.hpp:58
CShuffle epilogue scheduler providing pre-built schedules.
Definition: cshuffle_epilogue_schedule.hpp:42
static constexpr index_t NumAccess
Definition: cshuffle_epilogue_schedule.hpp:46
static CK_TILE_DEVICE auto create_context(OutWindow &out_window, const AccTile &acc_tile, const AuxWindows &aux_windows, void *p_smem)
Create context for epilogue operations.
Definition: cshuffle_epilogue_schedule.hpp:50
static CK_TILE_DEVICE auto make_schedule(Args &&... args)
Make schedule based on compile-time tag selection.
Definition: cshuffle_epilogue_schedule.hpp:60
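Compile-time tag selection of this kind is commonly written with `if constexpr`. The following is a minimal sketch of that pattern only, not the code at cshuffle_epilogue_schedule.hpp:60. All names in it (`RowColQuantTag`, `TensorQuantTag`, `RowColQuantSchedule`, `TensorQuantSchedule`, `make_schedule_sketch`) are hypothetical, and it assumes C++17.

```cpp
#include <string_view>
#include <type_traits>

// Hypothetical tag types; the real schedule tags live in
// cshuffle_epilogue_schedule.hpp and may be named differently.
struct RowColQuantTag {};
struct TensorQuantTag {};

// Hypothetical schedule types standing in for the composed op chains.
struct RowColQuantSchedule
{
    static constexpr std::string_view scale_step = "ScaleWindow";
};
struct TensorQuantSchedule
{
    static constexpr std::string_view scale_step = "ScaleScalar";
};

// Makes the static_assert below depend on the template parameter, so it
// only fires when the final else-branch is actually instantiated.
template <typename>
inline constexpr bool always_false_v = false;

// Pick a schedule type from a tag at compile time. Only the matching
// branch is instantiated, so the return type can differ per tag.
template <typename Tag>
constexpr auto make_schedule_sketch()
{
    if constexpr(std::is_same_v<Tag, RowColQuantTag>)
    {
        return RowColQuantSchedule{};
    }
    else if constexpr(std::is_same_v<Tag, TensorQuantTag>)
    {
        return TensorQuantSchedule{};
    }
    else
    {
        static_assert(always_false_v<Tag>, "unknown schedule tag");
    }
}

// Compile-time check that each tag maps to the expected schedule type.
static_assert(std::is_same_v<decltype(make_schedule_sketch<RowColQuantTag>()),
                             RowColQuantSchedule>);
static_assert(std::is_same_v<decltype(make_schedule_sketch<TensorQuantTag>()),
                             TensorQuantSchedule>);
```

Because the choice is made at compile time, an unsupported tag is rejected by the compiler rather than at kernel launch.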
Problem ProblemType
Definition: cshuffle_epilogue_schedule.hpp:43
Schedule type tags for epilogue selection.
Definition: cshuffle_epilogue_schedule.hpp:18
Apply elementwise operation with auxiliary tensors.
Definition: common_epilogue_ops.hpp:114
Load output tile from LDS with synchronization.
Definition: common_epilogue_ops.hpp:85
Move output and auxiliary windows by step from space-filling curve.
Definition: common_epilogue_ops.hpp:180
RowCol quantization schedule: Slice → ScaleWindow → CastStore → Load → ApplyD → Store → Move.
Definition: cshuffle_epilogue_schedule.hpp:23
Tensor quantization schedule: Slice → ScaleScalar → CastStore → Load → ApplyD → Store → Move.
Definition: cshuffle_epilogue_schedule.hpp:28
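The two schedules differ only in their second step (ScaleWindow versus ScaleScalar). A schedule of this shape can be modelled as an ordered tuple of ops run against a shared context. The sketch below illustrates that idea under stated assumptions: `TraceContext`, `Schedule`, `trace_of` and the empty op structs are hypothetical stand-ins that only record their names, and they are not the ck_tile types.

```cpp
#include <string>
#include <tuple>
#include <vector>

// Hypothetical context; the real one comes from create_context().
// Here it only records the order in which ops ran.
struct TraceContext
{
    std::vector<std::string> trace;
};

// Hypothetical stand-ins for the epilogue ops. Each one just logs its name.
struct Slice       { void operator()(TraceContext& c) const { c.trace.push_back("Slice"); } };
struct ScaleWindow { void operator()(TraceContext& c) const { c.trace.push_back("ScaleWindow"); } };
struct ScaleScalar { void operator()(TraceContext& c) const { c.trace.push_back("ScaleScalar"); } };
struct CastStore   { void operator()(TraceContext& c) const { c.trace.push_back("CastStore"); } };
struct Load        { void operator()(TraceContext& c) const { c.trace.push_back("Load"); } };
struct ApplyD      { void operator()(TraceContext& c) const { c.trace.push_back("ApplyD"); } };
struct Store       { void operator()(TraceContext& c) const { c.trace.push_back("Store"); } };
struct Move        { void operator()(TraceContext& c) const { c.trace.push_back("Move"); } };

// A schedule is an ordered list of ops, executed front to back.
template <typename... Ops>
struct Schedule
{
    std::tuple<Ops...> ops;

    void run(TraceContext& ctx) const
    {
        // The comma fold evaluates the ops strictly left to right.
        std::apply([&ctx](const auto&... op) { (op(ctx), ...); }, ops);
    }
};

using RowColQuantSketch = Schedule<Slice, ScaleWindow, CastStore, Load, ApplyD, Store, Move>;
using TensorQuantSketch = Schedule<Slice, ScaleScalar, CastStore, Load, ApplyD, Store, Move>;

// Run a schedule once and return the recorded op order.
template <typename S>
std::vector<std::string> trace_of()
{
    TraceContext ctx;
    S{}.run(ctx);
    return ctx.trace;
}
```

Calling `trace_of<RowColQuantSketch>()` returns the seven step names in the order listed above; swapping in `TensorQuantSketch` changes only the second entry.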
Definition: space_filling_curve.hpp:20
static constexpr CK_TILE_HOST_DEVICE index_t get_num_of_access()
Definition: space_filling_curve.hpp:46
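To illustrate how an access count and a per-step window move can follow from a space-filling curve, the sketch below covers an M x N tile with equal-sized access blocks and visits them in snake order. It is an assumption-laden illustration, not the code in space_filling_curve.hpp. `TileCurveSketch`, `get_block`, `ExampleCurve` and the example tile sizes are hypothetical, the snake order is an assumed traversal, and `index_t` is assumed to be a 32-bit signed integer.

```cpp
#include <cstdint>
#include <utility>

using index_t = std::int32_t; // assumption: matches ck_tile's index_t

// Hypothetical 2D curve over an M x N tile, visited in blocks of
// MPerAccess x NPerAccess elements.
template <index_t M, index_t N, index_t MPerAccess, index_t NPerAccess>
struct TileCurveSketch
{
    static_assert(M % MPerAccess == 0 && N % NPerAccess == 0,
                  "tile must divide evenly into access blocks");

    static constexpr index_t rows = M / MPerAccess;
    static constexpr index_t cols = N / NPerAccess;

    // Number of block accesses needed to cover the whole tile.
    static constexpr index_t get_num_of_access() { return rows * cols; }

    // Block coordinate (row, col) visited at step i. Odd rows run
    // right to left, so consecutive steps are always neighbours.
    static constexpr std::pair<index_t, index_t> get_block(index_t i)
    {
        const index_t r = i / cols;
        const index_t c = i % cols;
        return {r, (r % 2 == 0) ? c : cols - 1 - c};
    }
};

// Example: a 64 x 128 tile shuffled in 32 x 32 blocks gives 2 x 4 = 8 accesses.
using ExampleCurve = TileCurveSketch<64, 128, 32, 32>;
static_assert(ExampleCurve::get_num_of_access() == 8);
```

In this model, the window step between access i and access i + 1 is the difference of the two block coordinates scaled by the block size, which is the kind of step a move operation would apply to the output and auxiliary windows.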