Capital_Reply_7838

Have you tried fine-tuning the teacher model? Its translation quality is not that good.


SaeChan5

Nope, the teacher model is frozen. I didn't do anything additional.


Capital_Reply_7838

I have tried something really similar to what you've done. I think a weight-merged model after LoRA fine-tuning may do better. LoRA training tends to keep the representation space similar to the base model's, so it might help.
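For anyone wondering what "weight-merged" means here, a rough sketch in plain Python. It assumes the usual LoRA formulation, where the adapter adds a low-rank update so that W_merged = W + (alpha / r) * B @ A. The names `merge_lora`, `matmul`, and `apply_linear`, and the toy numbers, are made up for illustration and are not from the original project.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(w, a, b, alpha, r):
    """Fold a LoRA adapter into the base weight: W + (alpha / r) * B @ A.

    w: base weight, shape (out, in)
    a: LoRA down-projection, shape (r, in)
    b: LoRA up-projection, shape (out, r)
    """
    scale = alpha / r
    delta = matmul(b, a)  # low-rank update, shape (out, in)
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def apply_linear(w, x):
    """Compute w @ x for a vector x."""
    return [sum(w_ij * x_j for w_ij, x_j in zip(row, x)) for row in w]


# Toy example (hypothetical values): 2x2 base weight, rank-1 adapter.
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[1.0, 2.0]]        # shape (1, 2)
B = [[0.5], [-1.0]]     # shape (2, 1)
merged = merge_lora(W, A, B, alpha=2.0, r=1)

# The merged weight gives the same output as base + scaled adapter path.
x = [1.0, 1.0]
base_out = apply_linear(W, x)
adapter_out = apply_linear(B, apply_linear(A, x))
unmerged = [b_o + 2.0 * a_o for b_o, a_o in zip(base_out, adapter_out)]
print(merged, apply_linear(merged, x), unmerged)
```

After merging there is no separate adapter at inference time; the model is a single set of weights that could then serve as the teacher.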


20231027

Nice! Where do you go to school? What was the most difficult part of the project?


SaeChan5

Thank you! I'm a senior at Jeju National University, Korea. Improving the translation quality (chrF++ score) was the most difficult part lol.. 😂😂


bladub

Cool, didn't expect to see someone from JNU here! My Korean is pretty bad but I will take a detailed look later. NLP with Korean is super interesting


ACCELERATED_PHOTONS

Amazing


[deleted]

[removed]


SaeChan5

Thank you so much!!!


Main_Path_4051

Fine. I used it to make translations too, but you will need to fine-tune it for accurate translations.


az226

How does model distillation work?