PaddlePaddle · wawltor · Oct 16, 2024 · Oct 14, 2024
diff --git a/llm/README.md b/llm/README.md
@@ -115,15 +115,15 @@ PaddleNLP 支持多个主流大模型的 SFT、LoRA、Prefix Tuning 等精调策
 Sample data:
 
 ```text
-{"src": "类型#裙*颜色#蓝色*风格#清新*图案#蝴蝶结", "tgt": "裙身处采用立体蝴蝶结装饰辅以蓝色条带点缀，令衣身造型饱满富有层次的同时为其注入一丝甜美气息。将女孩清新娇俏的一面衬托而出。"}
+{"src": "Give three tips for staying healthy.", "tgt": "1.Eat a balanced diet and make sure to include plenty of fruits and vegetables. \n2. Exercise regularly to keep your body active and strong. \n3. Get enough sleep and maintain a consistent sleep schedule."}
 ...
 ```
 
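-For convenient testing, we also provide an advertisement-generation dataset that can be used directly:
+For convenient testing, we also provide a demo dataset based on [tatsu-lab/alpaca](https://huggingface.co/datasets/tatsu-lab/alpaca) that can be used directly: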
 
 ```shell
-wget https://bj.bcebos.com/paddlenlp/datasets/examples/AdvertiseGen.tar.gz
-tar -zxvf AdvertiseGen.tar.gz
+wget https://bj.bcebos.com/paddlenlp/datasets/examples/alpaca_demo.gz
+tar -xvf alpaca_demo.gz
 ```
 
 #### 2.2 Full-Parameter Fine-Tuning: SFT
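
Note on the README hunk: the sample record is one JSON object per line with `src` and `tgt` keys. Below is a minimal sketch of a format check that could be run on a downloaded file before training. The helper name `validate_records` is hypothetical and not part of PaddleNLP, and accepting either a string or a list of strings for each field is an assumption about the expected format.

```python
import json


def _is_text(value):
    # Assumption: a field may be a single string or a list of strings.
    if isinstance(value, str):
        return True
    return isinstance(value, list) and all(isinstance(v, str) for v in value)


def validate_records(lines):
    """Return (line_index, reason) pairs for lines that are not src/tgt records."""
    problems = []
    for i, line in enumerate(lines):
        line = line.strip()
        if not line:
            continue  # skip blank lines
        try:
            record = json.loads(line)
        except json.JSONDecodeError as exc:
            problems.append((i, f"invalid JSON: {exc.msg}"))
            continue
        if not isinstance(record, dict):
            problems.append((i, "not a JSON object"))
            continue
        for key in ("src", "tgt"):
            if not _is_text(record.get(key)):
                problems.append((i, f"missing or non-text '{key}'"))
    return problems


sample = [
    '{"src": "Give three tips for staying healthy.", "tgt": "1.Eat a balanced diet."}',
    '{"src": "no target here"}',
]
print(validate_records(sample))
```

In practice the `lines` argument could be an open file handle, since iterating over a text file yields one line at a time.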

diff --git a/llm/config/llama/lora_argument.json b/llm/config/llama/lora_argument.json
@@ -6,7 +6,7 @@
     "gradient_accumulation_steps": 4,
     "per_device_eval_batch_size": 8,
     "eval_accumulation_steps":16,
-    "num_train_epochs": 3,
+    "num_train_epochs": 1,
     "learning_rate": 3e-04,
     "warmup_steps": 30,
     "logging_steps": 1,

diff --git a/llm/config/llama/sft_argument.json b/llm/config/llama/sft_argument.json
@@ -6,7 +6,7 @@
     "gradient_accumulation_steps": 2,
     "per_device_eval_batch_size": 8,
     "eval_accumulation_steps":16,
-    "num_train_epochs": 3,
+    "num_train_epochs": 1,
     "learning_rate": 3e-05,
     "warmup_steps": 30,
     "logging_steps": 1,
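
Note on the two config hunks: dropping `num_train_epochs` from 3 to 1 cuts the number of optimizer updates to roughly a third, while `warmup_steps` stays at 30, so warmup takes up a larger share of the run. The sketch below illustrates that arithmetic. The dataset size, per-device batch size, and device count are hypothetical values chosen for illustration, and the step formula follows a common Trainer convention that has not been checked against PaddleNLP's implementation. Only the `gradient_accumulation_steps` values (4 for LoRA, 2 for SFT) and `warmup_steps` come from the diff.

```python
import math


def optimizer_steps(num_examples, per_device_batch, num_devices, grad_accum, epochs):
    """Approximate number of optimizer updates in a run (last partial batch kept)."""
    batches_per_epoch = math.ceil(num_examples / (per_device_batch * num_devices))
    updates_per_epoch = max(batches_per_epoch // grad_accum, 1)
    return math.ceil(epochs * updates_per_epoch)


WARMUP_STEPS = 30  # from the diff

# Hypothetical run: 10,000 examples, per-device batch size 4, 1 device.
for name, grad_accum in (("lora", 4), ("sft", 2)):
    before = optimizer_steps(10_000, 4, 1, grad_accum, epochs=3)
    after = optimizer_steps(10_000, 4, 1, grad_accum, epochs=1)
    print(
        f"{name}: {before} -> {after} updates, "
        f"warmup share {WARMUP_STEPS / before:.1%} -> {WARMUP_STEPS / after:.1%}"
    )
```

On a much smaller demo dataset, the 30 warmup steps could cover most or all of a single-epoch run, which may be worth checking when reusing these configs.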