GeistHaus

Reinforcing Recursive Language Models | alphaXiv

alphaxiv.org

RL fine-tuning small models to behave as recursive language models.
