-
Notifications
You must be signed in to change notification settings - Fork 5.3k
Issues: hiyouga/LLaMA-Factory
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
deepseek r1 微调后我应该怎么加载lora参数推理呢
bug
Something isn't working
pending
This problem is yet to be addressed
#7185
opened Mar 6, 2025 by
joyyyhuang
1 task done
使用unsloth加速报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7177
opened Mar 6, 2025 by
GEK1
1 task done
MiniCPM-o-2_6的sft、lora训练报错:Some weights of the model checkpoint at /app123/model/MiniCPM-o-2_6 were not used when initializing MiniCPMO:
bug
Something isn't working
pending
This problem is yet to be addressed
#7169
opened Mar 5, 2025 by
winni0
1 task done
deepseek-moe-16B预训练问题
bug
Something isn't working
pending
This problem is yet to be addressed
#7165
opened Mar 5, 2025 by
zyp-byte
1 task done
跑open_r1_math数据集,qwen7b-instruct每次跑到53个step报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7163
opened Mar 5, 2025 by
fsq77
1 task done
Qwen/Qwen2.5-VL-7B-Instruct PPO 训练报错
bug
Something isn't working
pending
This problem is yet to be addressed
#7159
opened Mar 5, 2025 by
ulovecode
1 task done
单机多卡 使用web ui 进行Loar微调,报显卡mapping异常
bug
Something isn't working
pending
This problem is yet to be addressed
#7158
opened Mar 5, 2025 by
charlist8324
1 task done
qwen2.5vl 開啟unsloth時,使用lora检查点繼續訓練時出錯。
bug
Something isn't working
pending
This problem is yet to be addressed
#7156
opened Mar 4, 2025 by
mpeilun
1 task done
Errors when directly calling the "run_exp()" function under the "train" command
bug
Something isn't working
pending
This problem is yet to be addressed
#7155
opened Mar 4, 2025 by
Soever
1 task done
单机单卡SFT比单机多卡deepspeed Zero3效果好???
bug
Something isn't working
pending
This problem is yet to be addressed
#7153
opened Mar 4, 2025 by
Essence9999
1 task done
webui上选择的是bf16, 跑的时候报错并提示只支持bf16
bug
Something isn't working
pending
This problem is yet to be addressed
#7151
opened Mar 4, 2025 by
xudong2019
1 task done
After updating the version, I attempted to train qwen2_vl but encountered issues with slower training speed and decreased accuracy. I have not been able to identify the cause.
enhancement
New feature or request
pending
This problem is yet to be addressed
#7150
opened Mar 4, 2025 by
xueaa
1 task done
OSError: [Errno 7] Argument list too long
bug
Something isn't working
pending
This problem is yet to be addressed
#7144
opened Mar 3, 2025 by
leoozy
1 task done
reward model推理速度非常慢
bug
Something isn't working
pending
This problem is yet to be addressed
#7140
opened Mar 3, 2025 by
yingzhao27
1 task done
为什么会越跑越慢。。
bug
Something isn't working
pending
This problem is yet to be addressed
#7139
opened Mar 3, 2025 by
xudong2019
1 task done
SFT training works fine, but pretraining throws an error
bug
Something isn't working
pending
This problem is yet to be addressed
#7138
opened Mar 3, 2025 by
huangjf11
1 task done
Fine-tuning eating memory(Almost Default Setup)
bug
Something isn't working
pending
This problem is yet to be addressed
#7137
opened Mar 3, 2025 by
TahaSener
1 task done
运行webui的时候遭遇:argument of type 'bool' is not iterable
bug
Something isn't working
pending
This problem is yet to be addressed
#7132
opened Mar 3, 2025 by
unicornboat
1 task done
第一个epoch后loss剧增,eval未正确生效
bug
Something isn't working
pending
This problem is yet to be addressed
#7129
opened Mar 3, 2025 by
Moon-404
1 task done
RuntimeError: use_libuv was requested but PyTorch was build without libuv support
bug
Something isn't working
pending
This problem is yet to be addressed
#7124
opened Mar 1, 2025 by
jiangxinufo
1 task done
Minicpm带图微调dpo_lora时遇到问题RuntimeError: torch.cat(): expected a non-empty list of Tensors
bug
Something isn't working
pending
This problem is yet to be addressed
#7122
opened Mar 1, 2025 by
RocksyWhite
1 task done
<think>标签训练后,模型预测结果无<think>标签
bug
Something isn't working
pending
This problem is yet to be addressed
#7119
opened Feb 28, 2025 by
qubingxin
1 task done
模型微调参数优化
duplicate
This issue or pull request already exists
#7111
opened Feb 28, 2025 by
hackerhaiJu
1 task done
微调deepseek R1-7b保存后,推理乱码
bug
Something isn't working
pending
This problem is yet to be addressed
#7109
opened Feb 28, 2025 by
JYaooo
1 task done
Previous Next
ProTip!
no:milestone will show everything without a milestone.