Abstract
Background: Quadruped crawling robots will be faced with stability problems when walking on a raised slope. The stability of robot is affected by gait planning and selection of its foothold in this terrain. The slope reaction force on anterior and posterior legs is uneven. The selection strategy of its foothold should achieve good performance for the stability of the quadruped crawling robot.
Objective: Aimed at the uneven problem of slope reaction force on the anterior and posterior legs of the quadruped crawling robot when walking on the raised slope, a patent method for foothold optimization using reinforcement learning based on strategy search is proposed.
Methods: The kinematic model of the quadruped crawling robot is created in D-H coordinate method. According to the gait timing sequence method, the frame description of the quadruped crawling robot's gait on the slope is proposed. The fitting polynomial coefficients and fitting curves of all joints of the leg can be obtained by using the polynomial fitting calculation method. The reinforcement learning method based on Q-learning algorithm is proposed to find the optimal foothold by interacting with the slope environment. Comparative simulation and test of other gait and climbing slope gait, the climbing slope gait with and without the Q-learning algorithm is carried out by MATLAB platform.
Results: When the quadruped crawling robot adopts the reinforcement learning method based on Qlearning algorithm to select foothold, the robot posture curves are compared without optimization strategy. The result proves that the selection strategy of its foothold is valid.
Conclusion: The selection strategy of its foothold with reinforcement learning based on Q-learning algorithm can improve the stability of the quadruped crawling robot on the raised sloped.
[http://dx.doi.org/10.1007/s11370-019-00309-3]
[http://dx.doi.org/10.1109/TMECH.2016.2616284]
[http://dx.doi.org/10.1080/01691864.2017.1378591]
[http://dx.doi.org/10.1177/0278364917694244]
[http://dx.doi.org/10.3390/app9183911]
[http://dx.doi.org/10.1007/s42235-019-0021-8]
[http://dx.doi.org/10.2316/J.2020.206-0318]
[http://dx.doi.org/10.1109/ACCESS.2020.3016312]
[http://dx.doi.org/10.1007/s42235-020-0091-7]
[http://dx.doi.org/10.1155/2019/5491298]
[http://dx.doi.org/10.3390/electronics8121414]
[http://dx.doi.org/10.3390/app9091778]
[http://dx.doi.org/10.1177/1729881419890713]
[http://dx.doi.org/10.1007/s42235-019-0050-3]
[http://dx.doi.org/10.1109/LRA.2018.2800124]
[http://dx.doi.org/10.1108/IR-05-2019-0115]
[http://dx.doi.org/10.2174/2212797614666210413145741]
[http://dx.doi.org/10.3233/JIFS-169443]