[NEW Model] Add mamba #8513

JunnYu · 2024-05-30T10:13:00Z

PR types

New features

PR changes

Models

Description

Mamba: Linear-Time Sequence Modeling with Selective State Spaces
Author's repository
Hugging Face repository

Requires the develop version of PaddlePaddle.
How to use:
1. (Optional) Compile the custom CUDA operators: https://github.com/JunnYu/mamba/tree/paddle-v2.2.2
2. Run the code below.

import paddle
from paddlenlp.transformers import MambaForCausalLM, MambaTokenizer
from paddlenlp.generation import GenerationConfig

# Pretrained checkpoint to load.
name = "state-spaces/mamba-2.8b-hf"

tokenizer = MambaTokenizer.from_pretrained(name)
model = MambaForCausalLM.from_pretrained(name, dtype="float16", low_cpu_mem_usage=True)
model.eval()  # inference mode

# Top-k sampling, four continuations per prompt.
generation_config = GenerationConfig(
    decode_strategy="sampling",
    top_k=50,
    num_return_sequences=4,
    use_cache=True,
)

with paddle.no_grad():
    prompt = "Hello, it is"
    inputs = tokenizer.encode(prompt, return_tensors="pd")
    outputs = model.generate(**inputs, generation_config=generation_config, max_length=256)
    # outputs[0] holds the generated token ids, one row per returned sequence.
    for e in tokenizer.batch_decode(outputs[0], skip_special_tokens=True):
        print(prompt + e)
        print('-'*100)
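
For readers unfamiliar with the `decode_strategy="sampling"` and `top_k` settings above, the following is a minimal pure-Python sketch of top-k sampling for a single decoding step. It is an illustration only, not PaddleNLP's implementation; the function name `top_k_sample` and the toy logits are hypothetical.

```python
import math
import random


def top_k_sample(logits, k, rng):
    """Sample one token index from the k highest-scoring logits.

    Illustrative helper (hypothetical name); not part of PaddleNLP.
    """
    if not 1 <= k <= len(logits):
        raise ValueError("k must be between 1 and len(logits)")
    # Keep only the indices of the k largest logits.
    top = sorted(range(len(logits)), key=lambda i: logits[i], reverse=True)[:k]
    # Softmax weights over the kept logits; subtract the max for stability.
    m = max(logits[i] for i in top)
    weights = [math.exp(logits[i] - m) for i in top]
    # Draw one index in proportion to its weight.
    return rng.choices(top, weights=weights, k=1)[0]


rng = random.Random(0)  # fixed seed so the run is reproducible
logits = [2.0, 0.5, -1.0, 3.0, 0.0]  # toy scores for a 5-token vocabulary
samples = [top_k_sample(logits, k=2, rng=rng) for _ in range(1000)]
print(sorted(set(samples)))  # only the two best-scoring indices can appear
```

With `k=1` this reduces to greedy decoding; a larger `k` trades determinism for diversity, which is why the example above can return four different continuations.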

paddle-bot · 2024-05-30T10:13:06Z

Thanks for your contribution!

JunnYu · 2024-07-25T04:17:10Z

Updated.

codecov · 2024-08-16T06:06:55Z

Codecov Report

Attention: Patch coverage is 85.42056% with 78 lines in your changes missing coverage. Please review.

Project coverage is 55.00%. Comparing base (e0d2809) to head (1244675).
Report is 225 commits behind head on develop.

Files with missing lines	Patch %	Lines
paddlenlp/transformers/mamba/modeling.py	79.87%	65 Missing ⚠️
paddlenlp/transformers/mamba/tokenizer.py	92.35%	13 Missing ⚠️

Additional details and impacted files

@@             Coverage Diff             @@
##           develop    #8513      +/-   ##
===========================================
+ Coverage    54.79%   55.00%   +0.20%     
===========================================
  Files          636      640       +4     
  Lines        99876   100543     +667     
===========================================
+ Hits         54732    55307     +575     
- Misses       45144    45236      +92
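
The percentages in the report can be re-derived from the hit and line counts in the diff above. The sketch below does that arithmetic. It assumes percentages are truncated rather than rounded, which is consistent with these numbers but is not confirmed Codecov behavior; the patch size of 535 lines is inferred from the stated 85.42056% with 78 lines missing and does not appear in the report itself.

```python
from fractions import Fraction
from math import floor


def truncate_pct(ratio, digits=2):
    """Convert a ratio to a percentage truncated (not rounded) to `digits` decimals."""
    scale = 10 ** digits
    return floor(ratio * 100 * scale) / scale


# Hits and Lines copied from the coverage diff.
base = Fraction(54732, 99876)    # develop
head = Fraction(55307, 100543)   # this PR

print(truncate_pct(base))         # project coverage before
print(truncate_pct(head))         # project coverage after
print(truncate_pct(head - base))  # change, computed before truncation

# Assumption: 535 patch lines, inferred from the percentage and the 78 missing lines.
patch = Fraction(535 - 78, 535)
print(truncate_pct(patch, digits=5))
```

Computing the change from the untruncated ratios matters here: subtracting the two displayed percentages would give 0.21 rather than the reported +0.20.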


* add mamba * update * do not shift * update * push * mamba unit tests pass * mamba * update_model_kwargs_for_generation * fix tests * test * test * add tokenizer test

JunnYu added 2 commits May 29, 2024 13:40

add mamba

7cbdb35

update

2652154

JunnYu added 5 commits June 4, 2024 11:54

Merge branch 'develop' into add_mamba

14c6108

Merge branch 'develop' into add_mamba

98187ec

do not shift

c9a1ea4

update

c12a16b

push

2093f45

JunnYu and others added 10 commits July 26, 2024 12:06

mamba unit tests pass

b81d3d5

Merge branch 'PaddlePaddle:develop' into add_mamba

741b818

Merge branch 'develop' into add_mamba

de01e92

mamba

10ada2c

update_model_kwargs_for_generation

e726d38

fix tests

05a0bcc

Merge branch 'PaddlePaddle:develop' into add_mamba

9fe92ee

test

e545840

Merge branch 'PaddlePaddle:develop' into add_mamba

5a06b2f

test

ad121ea

DrownFish19 previously approved these changes Aug 16, 2024


add tokenizer test

1244675

JunnYu dismissed DrownFish19’s stale review via 1244675 August 16, 2024 08:14

DrownFish19 approved these changes Aug 19, 2024


JunnYu merged commit b08e445 into PaddlePaddle:develop Aug 19, 2024
10 of 12 checks passed
