Skip to content
@Infini-AI-Lab

Infini-AI-Lab

Next Generation AI algorithms and systems

Popular repositories Loading

  1. Sequoia Sequoia Public

    scalable and robust tree-based speculative decoding algorithm

    Python 334 38

  2. TriForce TriForce Public

    [COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

    Python 239 17

  3. MagicPIG MagicPIG Public

    [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

    Python 192 14

  4. MagicDec MagicDec Public

    [ICLR2025] Breaking Throughput-Latency Trade-off for Long Sequences with Speculative Decoding

    Python 109 7

  5. UMbreLLa UMbreLLa Public

    LLM Inference on consumer devices

    Python 96 13

  6. gsm_infinite gsm_infinite Public

    Python 31 1

Repositories

Showing 10 of 18 repositories
  • UMbreLLa Public

    LLM Inference on consumer devices

    Infini-AI-Lab/UMbreLLa’s past year of commit activity
    Python 96 Apache-2.0 13 11 (6 issues need help) 8 Updated Mar 6, 2025
  • gsm_infinite Public
    Infini-AI-Lab/gsm_infinite’s past year of commit activity
    Python 31 1 0 0 Updated Feb 26, 2025
  • APE-Page Public
    Infini-AI-Lab/APE-Page’s past year of commit activity
    JavaScript 0 0 0 0 Updated Feb 12, 2025
  • APE Public
    Infini-AI-Lab/APE’s past year of commit activity
    Python 10 0 1 0 Updated Feb 12, 2025
  • RULER Public Forked from NVIDIA/RULER

    This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

    Infini-AI-Lab/RULER’s past year of commit activity
    Python 0 Apache-2.0 68 0 0 Updated Jan 30, 2025
  • Sequoia Public

    scalable and robust tree-based speculative decoding algorithm

    Infini-AI-Lab/Sequoia’s past year of commit activity
    Python 334 38 7 3 Updated Jan 28, 2025
  • lm-evaluation-harness Public Forked from EleutherAI/lm-evaluation-harness

    A framework for few-shot evaluation of language models.

    Infini-AI-Lab/lm-evaluation-harness’s past year of commit activity
    Python 0 MIT 2,191 0 0 Updated Jan 11, 2025
  • S2FT Public
    Infini-AI-Lab/S2FT’s past year of commit activity
    Python 16 2 0 0 Updated Jan 3, 2025
  • S2FT-Page Public
    Infini-AI-Lab/S2FT-Page’s past year of commit activity
    JavaScript 0 0 0 0 Updated Dec 30, 2024
  • MagicPIG Public

    [ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

    Infini-AI-Lab/MagicPIG’s past year of commit activity
    Python 192 Apache-2.0 14 10 1 Updated Dec 16, 2024

Top languages

Loading…

Most used topics

Loading…