Tianjian Li

about
blog(current)
publications
cv

Blogpost_alignment

December 6, 2024

New blog post on why does the chosen and the rejected log-probs is decreased during DPO and why it is to some extent beneficial for alignment.

© Copyright 2026 Tianjian Li. Powered by Jekyll with al-folio theme. Hosted by GitHub Pages. Photos from Unsplash.