Training dynamics study on 4.26M-param Llama model under 20M token budget | HACKOBAR_