About 20 results
Open links in new tab
  1. attention_sinks/README.md at main - GitHub

    benchmark demo .gitignore .pre-commit-config.yaml CHANGELOG.md LICENSE README.md

  2. Support AutoGPTQ by Minami-su · Pull Request #42 - GitHub

    Browse the repository at this point in the history Update __init__.py Minami-su committed Jan 11, 2024 Copy the full SHA c74a15f View commit details Browse the repository at this point in the …

  3. attention_sinks/demo/streaming.py at main - GitHub

    Extend existing LLMs way beyond the original training length with constant memory usage, without retraining - tomaarsen/attention_sinks

  4. attention_sinks/demo/streaming_logs/attention_sinks/mistralai/Mistral ...

    attention_sinks / demo / streaming_logs / attention_sinks / mistralai / Mistral-7B-Instruct-v0.1.txt Cannot retrieve latest commit at this time.

  5. attention_sinks/demo/endless_logs/attention_sinks/meta …

    setup.py attention_sinks / demo / endless_logs / attention_sinks / meta-llama / Llama-2-7b-hf.txt Cannot retrieve latest commit at this time.

  6. attention_sinks/benchmark/perplexity.py at main - GitHub

    Latest commit History History 170 lines (143 loc) · 6.17 KB main Breadcrumbs attention_sinks / benchmark /

  7. attention_sinks/demo/endless_generation.py at main - GitHub

    attention_sinks / demo / endless_generation.py Cannot retrieve latest commit at this time.

  8. [WIP] add QWen model + benchmark results #15 - GitHub

    Oct 11, 2023 · Open Sanster wants to merge 1 commit into tomaarsen: main base:main Choose a base branch Could not load branches Branch not found: { { refName }} { { refName }} Could …

  9. attention_sinks/benchmark/outputs_falcon_7b/windowed.csv at …

    Latest commit History History 8192 lines (8192 loc) · 675 KB main Breadcrumbs attention_sinks / benchmark / outputs_falcon_7b /

  10. Support AutoGPTQ by Minami-su · Pull Request #42 - GitHub

    Support AutoGPTQ #42 Show file tree Hide file tree Changes from all commits Commits Show all changes 5 commits Select commit Hold shift + click to select a range