
attention_sinks/README.md at main - GitHub
benchmark demo .gitignore .pre-commit-config.yaml CHANGELOG.md LICENSE README.md
Support AutoGPTQ by Minami-su · Pull Request #42 - GitHub
Browse the repository at this point in the history Update __init__.py Minami-su committed Jan 11, 2024 Copy the full SHA c74a15f View commit details Browse the repository at this point in the …
attention_sinks/demo/streaming.py at main - GitHub
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining - tomaarsen/attention_sinks
attention_sinks/demo/streaming_logs/attention_sinks/mistralai/Mistral ...
attention_sinks / demo / streaming_logs / attention_sinks / mistralai / Mistral-7B-Instruct-v0.1.txt Cannot retrieve latest commit at this time.
attention_sinks/demo/endless_logs/attention_sinks/meta …
setup.py attention_sinks / demo / endless_logs / attention_sinks / meta-llama / Llama-2-7b-hf.txt Cannot retrieve latest commit at this time.
attention_sinks/benchmark/perplexity.py at main - GitHub
Latest commit History History 170 lines (143 loc) · 6.17 KB main Breadcrumbs attention_sinks / benchmark /
attention_sinks/demo/endless_generation.py at main - GitHub
attention_sinks / demo / endless_generation.py Cannot retrieve latest commit at this time.
[WIP] add QWen model + benchmark results #15 - GitHub
Oct 11, 2023 · Open Sanster wants to merge 1 commit into tomaarsen: main base:main Choose a base branch Could not load branches Branch not found: { { refName }} { { refName }} Could …
attention_sinks/benchmark/outputs_falcon_7b/windowed.csv at …
Latest commit History History 8192 lines (8192 loc) · 675 KB main Breadcrumbs attention_sinks / benchmark / outputs_falcon_7b /
Support AutoGPTQ by Minami-su · Pull Request #42 - GitHub
Support AutoGPTQ #42 Show file tree Hide file tree Changes from all commits Commits Show all changes 5 commits Select commit Hold shift + click to select a range