no code implementations • 23 Apr 2024 • Shashank Sonkar, Naiming Liu, Richard G. Baraniuk
This paper presents a novel exploration of the regressive side effects of training Large Language Models (LLMs) to mimic student misconceptions for personalized education.
1 code implementation • 22 Apr 2024 • Shashank Sonkar, Kangqi Ni, Lesa Tran Lu, Kristi Kincaid, John S. Hutchinson, Richard G. Baraniuk
With this work, we offer a fresh perspective on grading long, fact-based answers and introduce a new dataset to stimulate further research in this important area.
no code implementations • 22 Apr 2024 • Shashank Sonkar, Naiming Liu, Debshila Basu Mallick, Richard G. Baraniuk
We subsequently train language models to identify entailment, contradiction, and neutrality in student responses, akin to NLI, with the added dimension of identifying omissions relative to the gold answers.
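A minimal sketch of this NLI-style grading setup, assuming a Hugging Face sequence-pair classifier; the base model, label names, and example are illustrative placeholders, and the classification head would need fine-tuning on graded student responses before its outputs are meaningful:

```python
# Hedged sketch: grading a student response against a gold answer as 4-way
# NLI-style classification (entailment / contradiction / neutral / omission).
# Base model and label set are illustrative; the head is untrained here.
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

LABELS = ["entailment", "contradiction", "neutral", "omission"]

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=len(LABELS)
)

def grade(gold_answer: str, student_response: str) -> str:
    # Encode the (gold answer, student response) pair like an NLI
    # (premise, hypothesis) pair and pick the highest-scoring label.
    inputs = tokenizer(gold_answer, student_response,
                       truncation=True, return_tensors="pt")
    with torch.no_grad():
        logits = model(**inputs).logits
    return LABELS[logits.argmax(dim=-1).item()]

print(grade("Mitochondria produce ATP via cellular respiration.",
            "The cell makes energy in the mitochondria."))
```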
1 code implementation • 7 Feb 2024 • Shashank Sonkar, Kangqi Ni, Sapana Chaudhary, Richard G. Baraniuk
Building on this perspective, we propose a novel approach for constructing a reward dataset specifically designed for the pedagogical alignment of LLMs.
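As a hedged sketch of what such a reward dataset could look like (field names and the example are illustrative, not the paper's released format): preference pairs in which a scaffolding tutor reply is "chosen" over one that reveals the answer outright, ready for reward-model or DPO-style training:

```python
# Hedged sketch: preference records where a scaffolding tutor reply is
# "chosen" over one that gives the answer away. Field names and the
# example are illustrative, not the paper's released format.
import json

def make_preference_record(student_query, scaffolding_reply, direct_reply):
    return {
        "prompt": student_query,
        "chosen": scaffolding_reply,   # pedagogically aligned: guides, withholds answer
        "rejected": direct_reply,      # misaligned: reveals the final answer
    }

records = [
    make_preference_record(
        "What is the derivative of x^2?",
        "Recall the power rule: d/dx x^n = n*x^(n-1). "
        "What do you get when you apply it here?",
        "The derivative of x^2 is 2x.",
    ),
]

with open("pedagogical_rewards.jsonl", "w") as f:
    for r in records:
        f.write(json.dumps(r) + "\n")
```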
no code implementations • 3 Oct 2023 • Naiming Liu, Shashank Sonkar, Zichao Wang, Simon Woodhead, Richard G. Baraniuk
We propose novel evaluations for mathematical reasoning capabilities of Large Language Models (LLMs) based on mathematical misconceptions.
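A hedged sketch of what a misconception-based evaluation item might look like; the misconception, question, and prompt template are illustrative, not drawn from the paper's dataset:

```python
# Illustrative eval item: the model is asked to answer as a student holding
# a specific misconception would; matching the misconception-consistent
# answer (not the correct one) is what the probe checks.
item = {
    "misconception": "Always subtract the smaller digit from the larger, "
                     "column by column (so 52 - 38 -> 26).",
    "question": "Compute 41 - 19.",
    "correct_answer": "22",
    "misconception_answer": "38",  # tens: 4-1=3, ones: 9-1=8
}

def build_prompt(item):
    return (f"A student holds this misconception: {item['misconception']} "
            f"Answer the following exactly as that student would. "
            f"{item['question']}")

print(build_prompt(item))
```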
1 code implementation • 21 Sep 2023 • Shashank Sonkar, MyCo Le, Xinghe Chen, Naiming Liu, Debshila Basu Mallick, Richard G. Baraniuk
Our approach notably enhances the quality of synthetic conversation datasets, especially for subjects that are calculation-intensive.
no code implementations • 23 May 2023 • Shashank Sonkar, Richard G. Baraniuk
We explore whether Large Language Models (LLMs) are capable of logical reasoning with distorted facts, which we call Deduction under Perturbed Evidence (DUPE).
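A hedged sketch of how a DUPE-style probe could be constructed (all premises and the question are illustrative): distort one premise and check whether the model deduces from the stated evidence rather than from its memorized world knowledge:

```python
# Hedged sketch of a DUPE-style probe: replace a true premise with a
# distorted one and test whether the model reasons from the *stated*
# evidence instead of memorized facts. All text is illustrative.
def build_prompt(premises, question):
    return " ".join(premises) + f" Question: {question}"

true_premises = ["Paris is the capital of France.",
                 "The Louvre is located in the capital city of France."]
# Perturbed evidence: deliberately distort the first premise.
perturbed_premises = ["Lyon is the capital of France."] + true_premises[1:]

question = "Is the Louvre located in Paris?"
print(build_prompt(true_premises, question))       # expected answer: Yes
print(build_prompt(perturbed_premises, question))  # faithful answer: No
```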
no code implementations • 22 May 2023 • Shashank Sonkar, Richard G. Baraniuk
This paper investigates the key role of Feed-Forward Networks (FFNs) in transformer models by utilizing the Parallel Attention and Feed-Forward Net Design (PAF) architecture and comparing it to its Series Attention and Feed-Forward Net Design (SAF) counterpart.
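The two layouts differ in where the FFN reads its input: SAF runs attention and then feeds its output to the FFN, while PAF feeds the same input to attention and the FFN in parallel and sums the results. A minimal PyTorch sketch of the contrast, with illustrative sizes (pre-LayerNorm placement and hyperparameters are assumptions, not the paper's exact configuration):

```python
# Minimal contrast of the two block layouts (pre-LayerNorm placement and
# sizes are illustrative assumptions, not the paper's exact configuration).
import torch
import torch.nn as nn

class SAFBlock(nn.Module):
    """Series: attention first, then the FFN reads attention's output."""
    def __init__(self, d_model=256, n_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))
        self.ln1, self.ln2 = nn.LayerNorm(d_model), nn.LayerNorm(d_model)

    def forward(self, x):
        h = self.ln1(x)
        x = x + self.attn(h, h, h)[0]
        return x + self.ffn(self.ln2(x))

class PAFBlock(nn.Module):
    """Parallel: attention and the FFN both read the same block input."""
    def __init__(self, d_model=256, n_heads=4):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(),
                                 nn.Linear(4 * d_model, d_model))
        self.ln = nn.LayerNorm(d_model)

    def forward(self, x):
        h = self.ln(x)
        return x + self.attn(h, h, h)[0] + self.ffn(h)

x = torch.randn(2, 16, 256)  # (batch, sequence length, d_model)
print(SAFBlock()(x).shape, PAFBlock()(x).shape)  # both (2, 16, 256)
```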
1 code implementation • 22 May 2023 • Shashank Sonkar, Naiming Liu, Debshila Basu Mallick, Richard G. Baraniuk
We present a design framework called Conversational Learning with Analytical Step-by-Step Strategies (CLASS) for building advanced Intelligent Tutoring Systems (ITS) powered by high-performance Large Language Models (LLMs).
no code implementations • 19 Dec 2022 • Shashank Sonkar, Zichao Wang, Richard G. Baraniuk
MANER repurposes the <mask> token of pretrained masked language models for NER prediction.
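A hedged sketch of that idea (the encoder, tag set, and example are illustrative, and the tag head below is untrained): insert the pretrained <mask> token next to a word and classify the encoder's hidden state at the mask position into entity tags:

```python
# Hedged sketch of the mask-augmentation idea: put the pretrained <mask>
# token directly before a word and classify the encoder state at that
# position into entity tags. Model, tags, and example are illustrative;
# the tag head is untrained here and would be learned on NER data.
import torch
import torch.nn as nn
from transformers import AutoTokenizer, AutoModel

TAGS = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC"]

tokenizer = AutoTokenizer.from_pretrained("xlm-roberta-base")
encoder = AutoModel.from_pretrained("xlm-roberta-base")
tag_head = nn.Linear(encoder.config.hidden_size, len(TAGS))

def tag_word(words, i):
    # Insert <mask> right before the i-th word, then encode the sentence.
    augmented = words[:i] + [tokenizer.mask_token] + words[i:]
    enc = tokenizer(" ".join(augmented), return_tensors="pt")
    with torch.no_grad():
        hidden = encoder(**enc).last_hidden_state
    # Locate the single <mask> position and classify its hidden state.
    mask_pos = (enc["input_ids"][0] == tokenizer.mask_token_id).nonzero().item()
    return TAGS[tag_head(hidden[0, mask_pos]).argmax().item()]

print(tag_word(["Ada", "Lovelace", "visited", "London"], i=0))
```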
no code implementations • 22 Oct 2022 • Shashank Sonkar, Naiming Liu, Richard G. Baraniuk
Transformer models trained on massive text corpora have become the de facto models for a wide range of natural language processing tasks.
no code implementations • 29 Sep 2021 • Aditya Desai, Shashank Sonkar, Anshumali Shrivastava, Richard Baraniuk
Grounded in this framework, we show that algorithms from a wide range of domains are, in fact, searching for continuous stable-coloring solutions of an underlying graph corresponding to the domain.
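For intuition, here is a sketch of the classical discrete stable coloring computed by color refinement (1-WL); the paper concerns continuous relaxations of this notion, so the code below only illustrates what a stable coloring is, not the paper's algorithm:

```python
# Sketch of discrete stable coloring via color refinement (1-WL), purely to
# illustrate the notion of a stable coloring; the paper's subject is its
# continuous relaxation, and this is not the paper's algorithm.
def stable_coloring(adj):
    colors = {v: 0 for v in adj}  # start with every vertex the same color
    while True:
        # A vertex's signature: its color plus the multiset of neighbor colors.
        sigs = {v: (colors[v], tuple(sorted(colors[u] for u in adj[v])))
                for v in adj}
        palette = {s: i for i, s in enumerate(sorted(set(sigs.values())))}
        new_colors = {v: palette[sigs[v]] for v in adj}
        if new_colors == colors:   # fixed point: the coloring is stable
            return colors
        colors = new_colors

# A 4-cycle stays monochrome; a 3-vertex path splits ends from the middle.
print(stable_coloring({0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [2, 0]}))
print(stable_coloring({0: [1], 1: [0, 2], 2: [1]}))
```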
1 code implementation • 15 Apr 2021 • Shashank Sonkar, Arzoo Katiyar, Richard G. Baraniuk
Knowledge graphs link entities through relations to provide a structured representation of real-world facts.
Ranked #11 on Link Prediction on FB15k-237
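As a generic illustration of the link prediction task (this is textbook TransE, not necessarily the model this paper proposes): facts are (head, relation, tail) triples, relations act as translations in embedding space, and a candidate triple is scored by how closely head + relation lands on tail:

```python
# Generic TransE-style scorer, shown only to illustrate the task (not
# necessarily this paper's model): a relation is a translation in
# embedding space, so plausible triples satisfy head + relation ≈ tail.
import torch
import torch.nn as nn

class TransE(nn.Module):
    def __init__(self, n_entities, n_relations, dim=64):
        super().__init__()
        self.ent = nn.Embedding(n_entities, dim)
        self.rel = nn.Embedding(n_relations, dim)

    def score(self, h, r, t):
        # Smaller distance = more plausible (head, relation, tail) fact.
        return (self.ent(h) + self.rel(r) - self.ent(t)).norm(dim=-1)

model = TransE(n_entities=14541, n_relations=237)  # FB15k-237 sizes
h, r, t = torch.tensor([0]), torch.tensor([5]), torch.tensor([42])
print(model.score(h, r, t))  # untrained, so the score is arbitrary here
```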
no code implementations • COLING 2020 • Shashank Sonkar, Andrew E. Waters, Richard G. Baraniuk
Word embedding models learn semantically rich vector representations of words and are widely used to initialize natural language processing (NLP) models.
no code implementations • 25 May 2020 • Shashank Sonkar, Andrew E. Waters, Andrew S. Lan, Phillip J. Grimaldi, Richard G. Baraniuk
Knowledge tracing (KT) models, e.g., the deep knowledge tracing (DKT) model, track an individual learner's acquisition of skills over time by examining the learner's performance on questions related to those skills.
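A minimal sketch of the DKT idea referenced above (hyperparameters and the toy input are illustrative): an RNN reads one-hot (skill, correctness) interactions and emits, at each step, a per-skill probability of answering correctly next:

```python
# Minimal DKT-style sketch (hyperparameters and the toy input are
# illustrative): an LSTM reads one-hot (skill, correctness) interactions
# and predicts, per step, the probability of answering each skill correctly.
import torch
import torch.nn as nn

class DKT(nn.Module):
    def __init__(self, n_skills, hidden=64):
        super().__init__()
        # Input dimension 2 * n_skills: skill id crossed with correct/incorrect.
        self.lstm = nn.LSTM(2 * n_skills, hidden, batch_first=True)
        self.out = nn.Linear(hidden, n_skills)

    def forward(self, interactions):
        h, _ = self.lstm(interactions)
        return torch.sigmoid(self.out(h))  # P(correct next) for every skill

n_skills, steps = 10, 5
x = torch.zeros(1, steps, 2 * n_skills)
x[0, 0, 3] = 1.0  # toy event: skill 3 answered correctly at step 0
print(DKT(n_skills)(x).shape)  # (1, 5, 10)
```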