(max_query_len=1), following the same pattern used by vLLM's EAGLE speculative decoding proposer. Registered via the ``vllm.general_plugins`` entry point group so it is auto-discovered by all vLLM processes ...