-
-
Notifications
You must be signed in to change notification settings - Fork 8.4k
[Model][Jamba] Mamba cache single buffer #6739
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
tlrmchlsmth
merged 22 commits into
vllm-project:main
from
mzusman:mamba_cache_single_buffer_upstream
Aug 9, 2024
Merged
Changes from all commits
Commits
Show all changes
22 commits
Select commit
Hold shift + click to select a range
9de1f12
Mamba cache single buffer (#42)
mzusman b9ef930
Format
mzusman f9d311d
Change example
mzusman 69c0da8
Change tested model (trained), now the tests are more reliable
mzusman 9a3a1be
Bugfix, the dest index didn't run on the seq ids
mzusman c705ed2
Clean up
mzusman 7fd4e22
Revert "Clean up"
mzusman 7f97c4e
Revert "Bugfix, the dest index didn't run on the seq ids"
mzusman 52239d0
Revert "Change tested model (trained), now the tests are more reliable"
mzusman 27a15e4
Bugfix, the dest index didn't run on the seq ids
mzusman d7d07fb
Cleanup
mzusman 12d8648
Prettier version
mzusman 4fc3dce
Half instead of bf16
mzusman f2c7723
Formattin
mzusman 44788c4
Change test to float
mzusman e598d96
bf16 for the test
mzusman 7d553c9
Remove n > 1 test for now, need to check why it fails on L4
mzusman df269e5
Format
mzusman 60857a3
Factor out moving out the occupied index
mzusman 9e583d6
Add comment
mzusman c2e9a1d
Format
mzusman 3eeeeb7
Jamba model
mzusman File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.