4.8 Article

Reinforcement learning with artificial microswimmers

Journal

SCIENCE ROBOTICS
Volume 6, Issue 52, Pages -

Publisher

AMER ASSOC ADVANCEMENT SCIENCE
DOI: 10.1126/scirobotics.abd9285

Keywords

-

Categories

Funding

  1. DFG Priority Program 1726 Microswimmers [237143019]
  2. DFG [432421051]
  3. Humboldt grant of the Alexander von Humboldt Foundation
  4. Czech Science Foundation [20-02955J]

Ask authors/readers for more resources

Artificial microswimmers, combined with machine learning algorithms, are studied for their adaptive behavior in noisy environments. The study shows that collective learning is possible with external control, and noise affects learning speed and decision-making strength. Time delays in feedback loops can lead to optimal behaviors, similar to bacteria's run-and-tumble times, in systems responding to noise.
Artificial microswimmers that can replicate the complex behavior of active matter are often designed to mimic the self-propulsion of microscopic living organisms. However, compared with their living counterparts, artificial microswimmers have a limited ability to adapt to environmental signals or to retain a physical memory to yield optimized emergent behavior. Different from macroscopic living systems and robots, both microscopic living organisms and artificial microswimmers are subject to Brownian motion, which randomizes their position and propulsion direction. Here, we combine real-world artificial active particles with machine learning algorithms to explore their adaptive behavior in a noisy environment with reinforcement learning. We use a real-time control of self-thermophoretic active particles to demonstrate the solution of a simple standard navigation problem under the inevitable influence of Brownian motion at these length scales. We show that, with external control, collective learning is possible. Concerning the learning under noise, we find that noise decreases the learning speed, modifies the optimal behavior, and also increases the strength of the decisions made. As a consequence of time delay in the feedback loop controlling the particles, an optimum velocity, reminiscent of optimal run-and-tumble times of bacteria, is found for the system, which is conjectured to be a universal property of systems exhibiting delayed response in a noisy environment.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.8
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available