1 code implementation • 24 Oct 2023 • Zayne Sprague, Xi Ye, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett
We evaluate a range of LLMs and prompting techniques on this dataset and characterize the gaps that remain for techniques like chain-of-thought to perform robust reasoning.
1 code implementation • 5 Jul 2023 • Zayne Sprague, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett
Specifically, we evaluate whether embedding spaces exhibit a property we call deductive additivity: the sum of premise statement embeddings should be close to embeddings of conclusions based on those premises.
1 code implementation • 15 Jun 2023 • Rohan Chandra, Rahul Menon, Zayne Sprague, Arya Anantula, Joydeep Biswas
This paper presents a fully decentralized approach for realtime non-cooperative multi-robot navigation in social mini-games, such as navigating through a narrow doorway or negotiating right of way at a corridor intersection.
1 code implementation • 9 Mar 2023 • Zayne Sprague, Rohan Chandra, Jarrett Holtz, Joydeep Biswas
We present SocialGym 2, a multi-agent navigation simulator for social robot research.
2 code implementations • 1 Nov 2022 • Zayne Sprague, Kaj Bostrom, Swarat Chaudhuri, Greg Durrett
A growing body of work studies how to answer a question or verify a claim by generating a natural language "proof": a chain of deductive inferences yielding the answer based on a set of premises.
no code implementations • 16 Jan 2022 • Kaj Bostrom, Zayne Sprague, Swarat Chaudhuri, Greg Durrett
In settings from fact-checking to question answering, we frequently want to know whether a collection of evidence (premises) entails a hypothesis.