Search Results for author: Shivanshu Verma

Found 1 papers, 0 papers with code

Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks

no code implementations23 Apr 2024 Amir Saeidi, Shivanshu Verma, Chitta Baral

Key observations reveal that alignment methods achieve optimal performance with smaller training data subsets, exhibit limited effectiveness in reasoning tasks yet significantly impact mathematical problem-solving, and employing an instruction-tuned model notably influences truthfulness.

Question Answering

Cannot find the paper you are looking for? You can Submit a new open access paper.