TianduoWang/DPO-ST

About

[ACL 2024] Self-Training with Direct Preference Optimization Improves Chain-of-Thought Reasoning
