LLaMA-2 7B Model checkpoints for Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models

LLaMA-2 7B model checkpoints for the paper "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models".

Identifier
Source https://tudatalib.ulb.tu-darmstadt.de/handle/tudatalib/4268
Metadata Access https://tudatalib.ulb.tu-darmstadt.de/oai/openairedata?verb=GetRecord&metadataPrefix=oai_datacite&identifier=oai:tudatalib.ulb.tu-darmstadt.de:tudatalib/4268
Provenance
Creator Puerto, Haritz; Chubakov, Tilek; Zhu, Xiaodan; Tayyar Madabushi, Harish; Gurevych, Iryna
Publisher TU Darmstadt
Contributor TU Darmstadt
Publication Year 2024
Rights CC BY-SA 3.0; info:eu-repo/semantics/openAccess
OpenAccess true
Contact https://tudatalib.ulb.tu-darmstadt.de/page/contact
Representation
Language English
Resource Type Other
Format application/zip
Version 1.0
Discipline Other