4.7 Article

Zermelo's problem: Optimal point-to-point navigation in 2D turbulent flows using reinforcement learning

Journal

CHAOS
Volume 29, Issue 10, Pages -

Publisher

AIP Publishing
DOI: 10.1063/1.5120370

Keywords

-

Funding

  1. European Union [339032]
  2. Knut and Alice Wallenberg Foundation [Dnar. KAW2014.0048]
  3. Vetenskapsradet [2018-03974]
  4. Swedish Research Council [2018-03974] Funding Source: Swedish Research Council

Ask authors/readers for more resources

To find the path that minimizes the time to navigate between two given points in a fluid flow is known as Zermelo's problem. Here, we investigate it by using a Reinforcement Learning (RL) approach for the case of a vessel that has a slip velocity with fixed intensity, Vs, but variable direction and navigating in a 2D turbulent sea. We show that an Actor-Critic RL algorithm is able to find quasioptimal solutions for both time-independent and chaotically evolving flow configurations. For the frozen case, we also compared the results with strategies obtained analytically from continuous Optimal Navigation (ON) protocols. We show that for our application, ON solutions are unstable for the typical duration of the navigation process and are, therefore, not useful in practice. On the other hand, RL solutions are much more robust with respect to small changes in the initial conditions and to external noise, even when V-s is much smaller than the maximum flow velocity. Furthermore, we show how the RL approach is able to take advantage of the flow properties in order to reach the target, especially when the steering speed is small. Published under license by AIP Publishing.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available