zaim's Highlights on 'Self-Rewarding Language Models' | Glasp