Arşiv ve Dokümantasyon Merkezi
Dijital Arşivi

A reinforcement learning based controller to minimize forces on the crutches of a lower-limb exoskeleton

Basit öğe kaydını göster

dc.contributor Graduate Program in Systems and Control Engineering.
dc.contributor.advisor Samur, Evren.
dc.contributor.advisor Öncü, Sinan.
dc.contributor.author Utku, Aydın Emre.
dc.date.accessioned 2024-03-12T15:00:21Z
dc.date.available 2024-03-12T15:00:21Z
dc.date.issued 2022
dc.identifier.other SCO 2022 U85
dc.identifier.uri http://digitalarchive.boun.edu.tr/handle/123456789/21486
dc.description.abstract The majority of the metabolic energy consumption of a lower-limb exoskeleton user comes from the upper body effort, since the lower body can be considered to be passive. However, the upper body effort of lower limb exoskeleton users is ignored during motion controller development process in the literature. In this thesis study, deep reinforcement learning is used to develop a locomotion controller that minimizes the ground reaction forces (GRF) on crutches. The rationale for minimizing the ground reaction forces is to minimize the upper body effort of the user. A model of the human-exoskeleton system with crutches is created in URDF and XML formats. Reward functions that encourage the forward displacement of the center of mass of the exoskeleton-human system without falling and extreme joint torques are shaped. The state-of-the-art methods, Twin Delayed Deep Deterministic Policy Gradient (TD3) and Proximal Policy Optimization (PPO), are employed with the RaiSim and MuJoCo physics simulators and with different algorithm specific parameters in multiple training trials. The employed networks generate the joint torques based on the joint angle and velocities along with the ground reaction forces on feet and crutch tips. These generated joint torques are directly sent to the exoskeleton model and a new state is observed after implementing the action that the deep RL framework provides. Policies trained by the TD3 and PPO methods on RaiSim are observed to fail to generate proper control commands for a stable and natural looking gait. In general, it is observed that the PPO method generated higher rewards than the TD3 method on RaiSim. After failing to develop a desired policy with RaiSim, MuJoCo is employed as the simulator. Eventually, a policy that can generate a reasonable gait with a desired crutch usage and with 35% minimization in GRFs with respect to the baseline policy is developed.
dc.format.extent 111:001:PDF:b2795654:038389:0:0:0:0:0:0tFull text electronic versionvn
dc.publisher Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2022.
dc.subject.lcsh Robotic exoskeletons.
dc.subject.lcsh Reinforcement learning.
dc.title A reinforcement learning based controller to minimize forces on the crutches of a lower-limb exoskeleton
dc.format.pages xv, 71 leaves


Bu öğenin dosyaları

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster

Dijital Arşivde Ara


Göz at

Hesabım