[Bugfix] ngram spec decode attention error and repeat add sampled token ids error#2972
Open
wxsIcey wants to merge 1 commit intovllm-project:mainfrom
Open
[Bugfix] ngram spec decode attention error and repeat add sampled token ids error#2972wxsIcey wants to merge 1 commit intovllm-project:mainfrom
wxsIcey wants to merge 1 commit intovllm-project:mainfrom