What is speculative decoding? Speculative decoding is an inference optimization that uses a fast, small "draft" model to quickly propose several fu… | HACKOBAR_