"Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models."

Leonardo Ranaldi, André Freitas (2024)

Details and statistics

DOI: 10.48550/ARXIV.2405.00402

access: open

type: Informal or Other Publication

metadata version: 2024-06-09