Search Results for author: Nahum Shimkin

Found 11 papers, 1 papers with code

Altitude-Loss Optimal Glides in Engine Failure Emergencies -- Accounting for Ground Obstacles and Wind

no code implementations13 Apr 2023 Daniel Segal, Aharon Bar-Gill, Nahum Shimkin

Engine failure is a recurring emergency in General Aviation and fixed-wing UAVs, often requiring the pilot or remote operator to carry out carefully planned glides to safely reach a candidate landing strip.

Trajectory Planning

Cooperative Multi-Agent Path Finding: Beyond Path Planning and Collision Avoidance

no code implementations23 May 2021 Nir Greshler, Ofir Gordon, Oren Salzman, Nahum Shimkin

We introduce the Cooperative Multi-Agent Path Finding (Co-MAPF) problem, an extension to the classical MAPF problem, where cooperative behavior is incorporated.

Collision Avoidance Multi-Agent Path Finding

Deep Randomized Least Squares Value Iteration

no code implementations ICLR 2020 Guy Adam, Tom Zahavy, Oron Anschel, Nahum Shimkin

Rather than using hand-design state representation, we use a state representation that is being learned directly from the data by a DQN agent.

reinforcement-learning Reinforcement Learning (RL)

ILS-SUMM: Iterated Local Search for Unsupervised Video Summarization

1 code implementation8 Dec 2019 Yair Shemer, Daniel Rotman, Nahum Shimkin

We consider shot-based video summarization where the summary consists of a subset of the video shots which can be of various lengths.

Metaheuristic Optimization Unsupervised Video Summarization

Learning Control for Air Hockey Striking using Deep Reinforcement Learning

no code implementations26 Feb 2017 Ayal Taitler, Nahum Shimkin

We consider the task of learning control policies for a robotic mechanism striking a puck in an air hockey game.

Q-Learning reinforcement-learning +1

The Max $K$-Armed Bandit: PAC Lower Bounds and Efficient Algorithms

no code implementations23 Dec 2015 Yahel David, Nahum Shimkin

Under the PAC framework, we provide a lower bound on the sample complexity of any $(\epsilon,\delta)$-correct algorithm, and propose an algorithm that attains this bound up to logarithmic factors.

The Max $K$-Armed Bandit: A PAC Lower Bound and tighter Algorithms

no code implementations23 Aug 2015 Yahel David, Nahum Shimkin

We consider the Max $K$-Armed Bandit problem, where a learning agent is faced with several sources (arms) of items (rewards), and interested in finding the best item overall.

An Online Convex Optimization Approach to Blackwell's Approachability

no code implementations1 Mar 2015 Nahum Shimkin

The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing {\em steering directions} as projections from the current average payoff vector to the (convex) target set.

Response-Based Approachability and its Application to Generalized No-Regret Algorithms

no code implementations30 Dec 2013 Andrey Bernstein, Nahum Shimkin

The first (primary) condition is a geometric separation condition, while the second (dual) condition requires that the set be {\em non-excludable}, namely that for every mixed action of the opponent there exists a mixed action of the agent (a {\em response}) such that the resulting payoff vector belongs to $S$.

Online Classification with Specificity Constraints

no code implementations NeurIPS 2010 Andrey Bernstein, Shie Mannor, Nahum Shimkin

To our best knowledge, this is the first algorithm that addresses the problem of the average tp-rate maximization under average fp-rate constraints in the online setting.

Binary Classification Classification +2

Cannot find the paper you are looking for? You can Submit a new open access paper.