Search Results for author: Quan Nguyen

Found 23 papers, 6 papers with code

Near-optimal Per-Action Regret Bounds for Sleeping Bandits

no code implementations2 Mar 2024 Quan Nguyen, Nishant A. Mehta

In a setting with $K$ total arms and at most $A$ available arms in each round over $T$ rounds, the best known upper bound is $O(K\sqrt{TA\ln{K}})$, obtained indirectly via minimizing internal sleeping regrets.

VinaLLaMA: LLaMA-based Vietnamese Foundation Model

no code implementations18 Dec 2023 Quan Nguyen, Huy Pham, Dung Dao

In this technical report, we present VinaLLaMA, an open-weight, state-of-the-art (SOTA) Large Language Model for the Vietnamese language, built upon LLaMA-2 with an additional 800 billion trained tokens.

Language Modelling Large Language Model +1

Generalized Animal Imitator: Agile Locomotion with Versatile Motion Prior

no code implementations2 Oct 2023 Ruihan Yang, Zhuoqun Chen, Jianhan Ma, Chongyi Zheng, Yiyu Chen, Quan Nguyen, Xiaolong Wang

This paper introduces the Versatile Instructable Motion prior (VIM) - a Reinforcement Learning framework designed to incorporate a range of agile locomotion tasks suitable for advanced robotic applications.

Segmenting mechanically heterogeneous domains via unsupervised learning

no code implementations30 Aug 2023 Quan Nguyen, Emma Lejeune

These highly deformable materials can have heterogeneous material properties, and can experience heterogeneous deformations with or without underlying material heterogeneity.

Cross-Institutional Transfer Learning for Educational Models: Implications for Model Performance, Fairness, and Equity

1 code implementation1 May 2023 Josh Gardner, Renzhe Yu, Quan Nguyen, Christopher Brooks, Rene Kizilcec

We also find that stacked ensembling provides no additional benefits to overall performance or fairness compared to either a local model or the zero-shot transfer procedure we tested.

Fairness Transfer Learning

Adversarial Online Multi-Task Reinforcement Learning

1 code implementation11 Jan 2023 Quan Nguyen, Nishant A. Mehta

We prove a minimax lower bound of $\Omega(K\sqrt{DSAH})$ on the regret of any learning algorithm and an instance-specific lower bound of $\Omega(\frac{K}{\lambda^2})$ in sample complexity for a class of uniformly-good cluster-then-learn algorithms.

reinforcement-learning Reinforcement Learning (RL)

Challenges and perspectives in computational deconvolution of genomics data

no code implementations21 Nov 2022 Lana X. Garmire, Yijun Li, Qianhui Huang, Chuan Xu, Sarah Teichmann, Naftali Kaminski, Matteo Pellegrini, Quan Nguyen, Andrew E. Teschendorff

Deciphering cell type heterogeneity is crucial for systematically understanding tissue homeostasis and its dysregulation in diseases.

Benchmarking

Local Bayesian optimization via maximizing probability of descent

1 code implementation21 Oct 2022 Quan Nguyen, Kaiwen Wu, Jacob R. Gardner, Roman Garnett

Local optimization presents a promising approach to expensive, high-dimensional black-box optimization by sidestepping the need to globally explore the search space.

Bayesian Optimization Navigate

Nonmyopic Multiclass Active Search with Diminishing Returns for Diverse Discovery

no code implementations8 Feb 2022 Quan Nguyen, Roman Garnett

Active search is a setting in adaptive experimental design where we aim to uncover members of rare, valuable class(es) subject to a budget constraint.

Drug Discovery Experimental Design

Nonmyopic Multifidelity Active Search

1 code implementation11 Jun 2021 Quan Nguyen, Arghavan Modiri, Roman Garnett

Active search is a learning paradigm where we seek to identify as many members of a rare, valuable class as possible given a labeling budget.

Control and Simulation of a Grid-Forming Inverter for Hybrid PV-Battery Plants in Power System Black Start

no code implementations20 Mar 2021 Quan Nguyen, Mallikarjuna R. Vallem, Bharat Vyakaranam, Ahmad Tbaileh, Xinda Ke, Nader Samaan

This paper proposes the modeling, control, and simulation of a grid-forming inverter-based PV-battery power plant that can be used as a black start unit.

Robust High-speed Running for Quadruped Robots via Deep Reinforcement Learning

no code implementations11 Mar 2021 Guillaume Bellegarda, Yiyu Chen, Zhuochen Liu, Quan Nguyen

Policies can be learned in only a few million time steps, even for challenging tasks of running over rough terrain with loads of over 100% of the nominal quadruped mass.

reinforcement-learning Reinforcement Learning (RL) +1

Robust Quadruped Jumping via Deep Reinforcement Learning

no code implementations13 Nov 2020 Guillaume Bellegarda, Chuong Nguyen, Quan Nguyen

In this paper, we consider a general task of jumping varying distances and heights for a quadrupedal robot in noisy environments, such as off of uneven terrain and with variable robot dynamics parameters.

reinforcement-learning Reinforcement Learning (RL)

Robust Safety-Critical Control for Dynamic Robotics

no code implementations14 May 2020 Quan Nguyen, Koushil Sreenath

We present a novel method of optimal robust control through quadratic programs that offers tracking stability while subject to input and state-based constraints as well as safety-critical constraints for nonlinear dynamical robotic systems in the presence of model uncertainty.

OV: Validity-based Optimistic Smart Contracts

1 code implementation9 Apr 2020 Quan Nguyen, Andre Cronje, Michael Kong

Reasoning about the validity of the object states is challenging in concurrent smart contracts.

Distributed, Parallel, and Cluster Computing Programming Languages

Fast Stochastic Peer Selection in Proof-of-Stake Protocols

no code implementations12 Nov 2019 Quan Nguyen, Andre Cronje, Michael Kong

The problem of peer selection, which randomly selects a peer from a set, is commonplace in Proof-of-Stake (PoS) protocols.

Distributed, Parallel, and Cluster Computing Data Structures and Algorithms

StairDag: Cross-DAG Validation For Scalable BFT Consensus

no code implementations29 Aug 2019 Quan Nguyen, Andre Cronje, Michael Kong, Alex Kampa, George Samman

Unlike StakeDag's DAG, x-DAG ensures that each new block has to have parent blocks from both Users and Validators to achieve more safety and liveness.

Cryptography and Security Distributed, Parallel, and Cluster Computing

StakeDag: Stake-based Consensus For Scalable Trustless Systems

no code implementations5 Jul 2019 Quan Nguyen, Andre Cronje, Michael Kong, Alex Kampa, George Samman

We address a general model of trustless system in which participants are distinguished by their stake or trust: users and validators.

Distributed, Parallel, and Cluster Computing Cryptography and Security

ONLAY: Online Layering for scalable asynchronous BFT system

no code implementations13 May 2019 Quan Nguyen, Andre Cronje

This paper presents a new framework, namely \emph{\onlay}, for scalable asynchronous distributed systems.

Distributed, Parallel, and Cluster Computing

Fantom: A scalable framework for asynchronous distributed systems

no code implementations22 Oct 2018 Sang-Min Choi, Jiho Park, Quan Nguyen, Andre Cronje

We describe \emph{Fantom}, a framework for asynchronous distributed systems.

Distributed, Parallel, and Cluster Computing

OPERA: Reasoning about continuous common knowledge in asynchronous distributed systems

no code implementations4 Oct 2018 Sang-Min Choi, Jiho Park, Quan Nguyen, Andre Cronje, Kiyoung Jang, Hyunjoon Cheon, Yo-Sub Han, Byung-Ik Ahn

Each event block is signed by the hashes of the creating node and its $k$ peers.

Distributed, Parallel, and Cluster Computing

The Dialog State Tracking Challenge with Bayesian Approach

no code implementations20 Feb 2017 Quan Nguyen

Generative model has been one of the most common approaches for solving the Dialog State Tracking Problem with the capabilities to model the dialog hypotheses in an explicit manner.

dialog state tracking

Cannot find the paper you are looking for? You can Submit a new open access paper.