Grouped Query Attention: Reducing Inference Costs | Hackobar