Search Results for author: Taeyoun Kim

Found 4 papers, 0 papers with code

Predicting the Performance of Foundation Models via Agreement-on-the-Line

no code implementations • 2 Apr 2024 • Aman Mehra, Rahul Saxena, Taeyoun Kim, Christina Baek, Zico Kolter, Aditi Raghunathan

Recently, it was shown that ensembles of neural networks observe the phenomenon of "agreement-on-the-line", which can be leveraged to reliably predict OOD performance without labels.

Heuristic Algorithm-based Action Masking Reinforcement Learning (HAAM-RL) with Ensemble Inference Method

no code implementations • 21 Mar 2024 • Kyuwon Choi, Cheolkyun Rho, Taeyoun Kim, Daewoo Choi

This paper presents a novel reinforcement learning (RL) approach called HAAM-RL (Heuristic Algorithm-based Action Masking Reinforcement Learning) for optimizing the color batching re-sequencing problem in automobile painting processes.

Reinforcement Learning (RL)

Jailbreaking is Best Solved by Definition

no code implementations • 20 Mar 2024 • Taeyoun Kim, Suhas Kotha, Aditi Raghunathan

The rise of "jailbreak" attacks on language models has led to a flurry of defenses aimed at preventing the output of undesirable responses.
