You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
没jieba之前pred, 0: are a helpful assistant.
n:I are given helpfulless. and I are talking a conversation with a user. The seeker isates the conversation by and they are to respond with the seeker based The seeker should should as ['m tostrategy} to to { following. and the response is {reply}.Available is few of 1 conversations: and each strategy are their descriptions are as follows:
1-focusedWhat the clarification or to the topic or better the seekereeeker clarify their problem they they are.
-ended questions are preferred for as::bing:00igh: questions are be used to confirm specific information.
Validationate: Paraphrasing: Rest technique rest clear direct versionphrasing of the help-seeker's words. can help them clarify the situation from clearly. Strategy Validation: Feelings: Ackiculating the validate the help-seeker’s feelings, This Validation-disclosure: Shareulge personal experiences or you have had, that that you have with the help-seeker. help empathy understanding.
Validationirmation: Validationassurance: Ackirm the help-seer’s feelings and abilities, and abilities. provide reassurance that hope.
S Information: Offer specific and how to deal the improve avoid careful not avoid bestep and provide the what to do.
Problemal: Provide information that help help-seeker, such example, a, statistics, or, or, etc examples asking their.
Problem: Any ofasantries, small small strategies strategies that are not fit into the above categories.Question used with you, I's important of the strategies too I
user
I used Aff strategy the conversation, and my reply is I do you fears feel about this situation how reply
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
没jieba之前label, I used Question for the conversation, and my reply is How do your children feel about this? And his?
Reminder
System Info
正常运行lora下的sft
Reproduction
正常运行lora下的sft
Expected behavior
微调的模型是Qwen2-7B,问题同样出现在ChatGLM3-6B
因为想要在评估的过程中看到一些自定义指标所以改动了metric.py,改动如下:
metric.py下ComputeMetrics类,在__call__下补充
输出
发现维度不一致,因此把preds的最后一个维度挑选最大的下标值(也就是选择了概率最大的word)
至此可以正常输出,但是preds输出有乱码,而labels输出没问题说明不是tokenizer的问题
测试段的代码
输出
完整metric.py的代码如下
Others
No response
The text was updated successfully, but these errors were encountered: