In the per-group quantization video, the instructor explains this line of code:
assert t_shape[1] % group_size == 0
as: asserting that “each row is divisible by group size”.
However, t_shape[1] returns the number of columns, so if I understand correctly, this code asserts that the number of columns is divisible by group size.