Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weโ€™ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: benchmark. #181

Merged
merged 2 commits into from
Mar 4, 2025
Merged

feat: benchmark. #181

merged 2 commits into from
Mar 4, 2025

Conversation

b4rtaz
Copy link
Owner

@b4rtaz b4rtaz commented Mar 4, 2025

This PR extends the metrics in inference mode.

...
๐Ÿ’ฟ Weights loaded
Tensor parallelism is all you need. Run LLMs on weak devices or make powerful devices even more powerful by distributing
๐Ÿ”ท๏ธ Eval  534 ms Sync  100 ms | Sent  6912 kB Recv 12540 kB | (24 tokens)
๐Ÿ”ถ Pred   68 ms Sync   25 ms | Sent   288 kB Recv   522 kB |  them
๐Ÿ”ถ Pred   58 ms Sync   15 ms | Sent   288 kB Recv   522 kB |  with
๐Ÿ”ถ Pred   57 ms Sync   11 ms | Sent   288 kB Recv   522 kB |  TP
๐Ÿ”ถ Pred   43 ms Sync   18 ms | Sent   288 kB Recv   522 kB | .
...
๐Ÿ”ถ Pred   47 ms Sync   15 ms | Sent   288 kB Recv   522 kB |  used
๐Ÿ”ถ Pred   52 ms Sync   32 ms | Sent   288 kB Recv   522 kB |  in
๐Ÿ”ถ Pred   42 ms Sync   11 ms | Sent   288 kB Recv   522 kB |  deep
๐Ÿ”ถ Pred   44 ms Sync   10 ms | Sent   288 kB Recv   522 kB |  learning

Evaluation
   nBatches: 32
    nTokens: 24
   tokens/s: 37.83 (26.43 ms/tok)
Prediction
    nTokens: 40
   tokens/s: 16.10 (62.10 ms/tok)

@b4rtaz b4rtaz merged commit a91745d into main Mar 4, 2025
3 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant