Search Results for author: Nahum Shimkin

Found 11 papers, 1 papers with code

Altitude-Loss Optimal Glides in Engine Failure Emergencies -- Accounting for Ground Obstacles and Wind

no code implementations • 13 Apr 2023 • Daniel Segal, Aharon Bar-Gill, Nahum Shimkin

Engine failure is a recurring emergency in General Aviation and fixed-wing UAVs, often requiring the pilot or remote operator to carry out carefully planned glides to safely reach a candidate landing strip.

Trajectory Planning

Paper
Add Code

Cooperative Multi-Agent Path Finding: Beyond Path Planning and Collision Avoidance

no code implementations • 23 May 2021 • Nir Greshler, Ofir Gordon, Oren Salzman, Nahum Shimkin

We introduce the Cooperative Multi-Agent Path Finding (Co-MAPF) problem, an extension to the classical MAPF problem, where cooperative behavior is incorporated.

Collision Avoidance Multi-Agent Path Finding

Paper
Add Code

Deep Randomized Least Squares Value Iteration

no code implementations • ICLR 2020 • Guy Adam, Tom Zahavy, Oron Anschel, Nahum Shimkin

Rather than using hand-design state representation, we use a state representation that is being learned directly from the data by a DQN agent.

reinforcement-learning Reinforcement Learning (RL)

Paper
Add Code

ILS-SUMM: Iterated Local Search for Unsupervised Video Summarization

1 code implementation • 8 Dec 2019 • Yair Shemer, Daniel Rotman, Nahum Shimkin

We consider shot-based video summarization where the summary consists of a subset of the video shots which can be of various lengths.

Metaheuristic Optimization Unsupervised Video Summarization

Paper
Code

Learning Control for Air Hockey Striking using Deep Reinforcement Learning

no code implementations • 26 Feb 2017 • Ayal Taitler, Nahum Shimkin

We consider the task of learning control policies for a robotic mechanism striking a puck in an air hockey game.

Q-Learning reinforcement-learning +1

Paper
Add Code

Averaged-DQN: Variance Reduction and Stabilization for Deep Reinforcement Learning

no code implementations • ICML 2017 • Oron Anschel, Nir Baram, Nahum Shimkin

Instability and variability of Deep Reinforcement Learning (DRL) algorithms tend to adversely affect their performance.

Atari Games reinforcement-learning +1

Paper
Add Code

The Max $K$-Armed Bandit: PAC Lower Bounds and Efficient Algorithms

no code implementations • 23 Dec 2015 • Yahel David, Nahum Shimkin

Under the PAC framework, we provide a lower bound on the sample complexity of any $(\epsilon,\delta)$-correct algorithm, and propose an algorithm that attains this bound up to logarithmic factors.

Paper
Add Code

The Max $K$-Armed Bandit: A PAC Lower Bound and tighter Algorithms

no code implementations • 23 Aug 2015 • Yahel David, Nahum Shimkin

We consider the Max $K$-Armed Bandit problem, where a learning agent is faced with several sources (arms) of items (rewards), and interested in finding the best item overall.

Paper
Add Code

An Online Convex Optimization Approach to Blackwell's Approachability

no code implementations • 1 Mar 2015 • Nahum Shimkin

The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing {\em steering directions} as projections from the current average payoff vector to the (convex) target set.

Paper
Add Code

Response-Based Approachability and its Application to Generalized No-Regret Algorithms

no code implementations • 30 Dec 2013 • Andrey Bernstein, Nahum Shimkin

The first (primary) condition is a geometric separation condition, while the second (dual) condition requires that the set be {\em non-excludable}, namely that for every mixed action of the opponent there exists a mixed action of the agent (a {\em response}) such that the resulting payoff vector belongs to $S$.

Paper
Add Code

Online Classification with Specificity Constraints

no code implementations • NeurIPS 2010 • Andrey Bernstein, Shie Mannor, Nahum Shimkin

To our best knowledge, this is the first algorithm that addresses the problem of the average tp-rate maximization under average fp-rate constraints in the online setting.

Binary Classification Classification +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.