You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardexpand all lines: examples/software_dev/configs/SOP.json
+19-4
Original file line number
Diff line number
Diff line change
@@ -16,7 +16,12 @@
16
16
"begin_role": "Boss",
17
17
"node_description": "Imagine a scenario where the boss has presented a requirement. The architect is tasked with proposing a framework based on this requirement. The leader's role is to provide feedback on the architect's proposal, and another architect will finalize the framework based on the leader's comments.",
18
18
"node_prompt_templates": {
19
-
"target": "The target program is: <target>{prompt}</target>"
19
+
"target": "The target program is: <target>{prompt}</target>",
20
+
"style": "You need to follow the output style: {style}.\n",
21
+
"task": "The task you need to execute is: {task}.\n",
22
+
"rule": "The rule you need to follow is: {rule}.\n",
23
+
"demonstrations": "Here are the demonstrations you can refer to:\n{demonstrations}.\n",
24
+
"last": "{last}"
20
25
},
21
26
"node_prompt_paddings": {
22
27
"Boss": {
@@ -151,7 +156,12 @@
151
156
"begin_role": "Boss",
152
157
"node_description": "In this scenario, the boss has provided a requirement. The developer's task is to write code based on the architecture proposed by the architect. The leader evaluates the written code for elegance, readability, and functionality, providing feedback. Another developer makes necessary modifications to the code.",
153
158
"node_prompt_templates": {
154
-
"target": "The target program is: <target>{prompt}</target>"
159
+
"target": "The target program is: <target>{prompt}</target>",
160
+
"style": "You need to follow the output style: {style}.\n",
161
+
"task": "The task you need to execute is: {task}.\n",
162
+
"rule": "The rule you need to follow is: {rule}.\n",
163
+
"demonstrations": "Here are the demonstrations you can refer to:\n{demonstrations}.\n",
164
+
"last": "{last}"
155
165
},
156
166
"node_prompt_paddings": {
157
167
"Boss": {
@@ -305,7 +315,12 @@
305
315
"node_description": "In this scenario, the boss has provided a requirement. The debugger simulates a compiler to determine whether the code is runnable and provides feedback. The developer writes code based on the debugger's feedback. The leader evaluates whether the final code meets the boss's requirements and provides feedback for further modifications. The coder writes the final code to a file.",
306
316
"node_prompt_templates": {
307
317
"target": "The target program is: <target>{prompt}</target>",
308
-
"finalize code": "Please consider all the messages above and integrate the final project code. Ensures the code is clean, well-formatted. {extract}"
318
+
"finalize code": "Please consider all the messages above and integrate the final project code. Ensures the code is clean, well-formatted. {extract}",
319
+
"style": "You need to follow the output style: {style}.\n",
320
+
"task": "The task you need to execute is: {task}.\n",
321
+
"rule": "The rule you need to follow is: {rule}.\n",
322
+
"demonstrations": "Here are the demonstrations you can refer to:\n{demonstrations}.\n",
323
+
"last": "{last}"
309
324
},
310
325
"node_prompt_paddings": {
311
326
"Boss": {
@@ -480,7 +495,7 @@
480
495
"value": {
481
496
"extract": "Please extract the final project code and put them between <result> and </result>."
Copy file name to clipboardexpand all lines: examples/software_dev/configs/optimizer_config.json
+6-6
Original file line number
Diff line number
Diff line change
@@ -22,26 +22,26 @@
22
22
"log_path": "logs/trainer_god"
23
23
},
24
24
"meta_prompt": {
25
-
"loss_without_score": {
25
+
"loss": {
26
26
"order": [
27
-
"loss_with_ground_truth"
27
+
"loss_with_ground_truth_and_score"
28
28
],
29
29
"extract_key": [
30
30
"score",
31
31
"requirement_for_previous"
32
32
],
33
-
"loss_with_ground_truth": "You are a fine-tuner of a large model. I will provide you with some output results from the model and the expected correct results. You need to evaluate these data and provide a score out of 10, please wrap the score using <score></score>. Additionally, please provide some suggestions for modifying the model's output, using <requirement_for_previous></requirement_for_previous> to wrap your suggestions.\n\nHere is the model's output:\n<result>{result}</result>;\n\nThe expected result is:\n<ground_truth>{ground_truth}</ground_truth>\n\nPlease note:\n\nEnsure that the output is wrapped with <score></score> and <requirement_for_previous></requirement_for_previous> respectively.\nThe output should be as consistent as possible with the expected result while being correct. For example, if the expected result is “BUST”, and the model's output is “The women's lifestyle magazine is 'BUST' magazine.”, even though the answer is correct, you should advise the model to be more concise.\nThe standard for a score of 10 is that the model's output is exactly the same as the expected result in a case-insensitive manner, and without any unnecessary content. Even if the model's output is semantically correct, if it includes superfluous content, points should be deducted.",
33
+
"loss_with_ground_truth_and_score": "You are a fine-tuner of a large model. I will provide you with some output results from the model and the expected correct results. You need to evaluate these data and provide a score out of 10, please wrap the score using <score></score>. Additionally, please provide some suggestions for modifying the model's output, using <requirement_for_previous></requirement_for_previous> to wrap your suggestions.\n\nHere is the model's output:\n<result>{result}</result>;\n\nThe expected result is:\n<ground_truth>{ground_truth}</ground_truth>\n\nPlease note:\n\nEnsure that the output is wrapped with <score></score> and <requirement_for_previous></requirement_for_previous> respectively.\nThe output should be as consistent as possible with the expected result while being correct. For example, if the expected result is “BUST”, and the model's output is “The women's lifestyle magazine is 'BUST' magazine.”, even though the answer is correct, you should advise the model to be more concise.\nThe standard for a score of 10 is that the model's output is exactly the same as the expected result in a case-insensitive manner, and without any unnecessary content. Even if the model's output is semantically correct, if it includes superfluous content, points should be deducted.",
"loss_with_ground_truth_and_score": "You are a large language model fine-tuner. I will provide you with a model's output and the expected correct result. You need to evaluate it and suggest modifications to the model's output. Please use `<requirement_for_previous></requirement_for_previous>` to enclose your feedback.\n\nBelow is the model's output:\n<result>{result}</result>\n\nThe expected result is:\n<ground_truth>{ground_truth}</ground_truth>\n\nHere is the evaluation score for the model. Your goal is to optimize this score:\n<score>{score}</score>\n\nThe relevant information about this score is as follows:\n<evaluation_info>{score_info}</evaluation_info>\n\nNote:\n1. Ensure that `<requirement_for_previous></requirement_for_previous>` exists and appears once.\n2. If the model's output is satisfactory, you can output <requirement_for_previous>The output is satisfactory, no additional requirements</requirement_for_previous>.\n3. The output should be as close to the expected result as possible while ensuring correctness. For example, if the expected result is \"BUST\" and the model's output is \"The women's lifestyle magazine is 'BUST' magazine.\", even though this answer is correct, you should remind the model to be concise."
44
+
"loss_with_ground_truth": "You are a large language model fine-tuner. I will provide you with a model's output and the expected correct result. You need to evaluate it and suggest modifications to the model's output. Please use `<requirement_for_previous></requirement_for_previous>` to enclose your feedback.\n\nBelow is the model's output:\n<result>{result}</result>\n\nThe expected result is:\n<ground_truth>{ground_truth}</ground_truth>\n\nHere is the evaluation score for the model. Your goal is to optimize this score:\n<score>{score}</score>\n\nThe relevant information about this score is as follows:\n<evaluation_info>{score_info}</evaluation_info>\n\nNote:\n1. Ensure that `<requirement_for_previous></requirement_for_previous>` exists and appears once.\n2. If the model's output is satisfactory, you can output <requirement_for_previous>The output is satisfactory, no additional requirements</requirement_for_previous>.\n3. The output should be as close to the expected result as possible while ensuring correctness. For example, if the expected result is \"BUST\" and the model's output is \"The women's lifestyle magazine is 'BUST' magazine.\", even though this answer is correct, you should remind the model to be concise."
0 commit comments