History of Open Alignment

The first main topic is "the race to reproduce open ChatGPT" and the likes of alpaca, koala, dolly, vicuna

which coincides with llama 1

then there's the reproducing core results phase, but people mostly using LoRA which I think hurt then

DPO came out as a paper in like July I think, was added places soon after, but didnt catch on for a while https://github.com/huggingface/trl/issues/405

Talks:

Papers / methods:

Evaluation Tools: