CUTLASS
CUDA Templates for Linear Algebra Subroutines and Solvers

threadblock → layout Relation

File in include/cutlass/epilogue/threadblockIncludes file in include/cutlass/layout
default_thread_map_tensor_op.hpitch_linear.h
default_thread_map_wmma_tensor_op.hpitch_linear.h
epilogue.htensor.h
epilogue.hvector.h
epilogue/threadblock/predicated_tile_iterator.hlayout/matrix.h
epilogue_base.htensor.h
epilogue_base.hvector.h
interleaved_epilogue.htensor.h
interleaved_epilogue.hvector.h
output_tile_thread_map.hlayout/matrix.h
shared_load_iterator.hlayout/matrix.h