Practical reinforcement learning for adaptive robotic manipulation: Sample efficiency, sim-to-real transfer, and context inference

Enayati, Amir Mehdi Soufi

Practical reinforcement learning for adaptive robotic manipulation: Sample efficiency, sim-to-real transfer, and context inference

dc.contributor.author	Enayati, Amir Mehdi Soufi
dc.contributor.supervisor	Najjaran, Homayoun
dc.date.accessioned	2025-09-17T20:16:50Z
dc.date.available	2025-09-17T20:16:50Z
dc.date.issued	2025
dc.degree.department	Department of Mechanical Engineering
dc.degree.level	Doctor of Philosophy PhD
dc.description.abstract	Modern robotic systems seek the ability to adapt to novel tasks and environments in a sample-efficient and robust manner. A framework is proposed to enable such adaptability through three interrelated and complementary contributions in the field of reinforcement learning (RL) for robot manipulation. The central challenges compromising the practical use of RL are addressed, including data efficiency, sim-to-real transfer, and context-aware generalization to unseen tasks. The first contribution addresses the sample-efficiency challenge. Demonstration Exploitation by Abstract Symmetry of Environments (Demo-EASE), introduces a sample-efficient RL framework with limited demonstrations that can be augmented by exploiting the symmetry. By identifying and leveraging symmetry in manipulation environments, abstract demonstrations are reused across multiple sub-regions of the task space. The implemented masked behavior cloning allows online adaptive balance between pure RL and imitation learning. Demo-EASE shows effective knowledge transfer, improved learning efficiency, and fewer interactions required while generalizing on the workspace of the expert policy. The second contribution focuses on improving the reliability of sim-to-real transfer. A novel concept, Real-Time Intrinsic Stochasticity (RT-IS), is introduced, demonstrating that inherent noise of real-time simulations can be beneficial when approximating real-world uncertainty. Experimental validation on simulated and physical robot tasks confirms that RT-IS improves deployability, requiring less explicit tuning than domain randomization and relaxing the threshold on modeling precision. The third contribution addresses the challenge of inferring task representation in a meta-RL setting. A transformer-based belief model, Context Representation via Action-Free Transformer encoder-decoder (CRAFT), is developed to infer variational latent task belief from sequences of states and rewards, without access to the agent's actions. This action-agnostic approach improves adaptability in partially observable environments and supports effective zero-shot learning. Tested on the MetaWorld benchmark, CRAFT outperforms existing baselines in generalization with meaningful task inference quality. Together, these contributions complete the roadmap to create a framework for adaptive reinforcement learning in robotics. The results demonstrate how structured data, intentional noise, and agent-agnostic long-horizon attention can create efficient, robust, and adaptable learning systems. This work lays a theoretical and practical foundation for future developments in adaptive robotics, enabling robots to operate across a broad spectrum of environments and objectives while cutting down on retraining.
dc.description.scholarlevel	Graduate
dc.identifier.bibliographicCitation	Amir M. Soufi Enayati, Zengjie Zhang, and Homayoun Najjaran. A methodical interpretation of adaptive robotics: Study and reformulation. Neurocomputing, 512:381–397, 2022.
dc.identifier.bibliographicCitation	Amir M. Soufi Enayati, Ram Dershan, Zengjie Zhang, Dean Richert, and Homayoun Najjaran. Facilitating sim-to-real by intrinsic stochasticity of real-time simulation in reinforcement learning for robot manipulation. IEEE Transactions on Artificial Intelligence, 5(4):1791–1804, 2023.
dc.identifier.uri	https://hdl.handle.net/1828/22768
dc.language	English	eng
dc.language.iso	en
dc.rights	Available to the World Wide Web
dc.subject	Reinforcement learning
dc.subject	Adaptive robotics
dc.subject	Robot manipulation
dc.subject	Task inference
dc.subject	Sim-to-real transfer
dc.subject	Behavior cloning
dc.subject	Bayes-adaptive MDP
dc.subject	General abstract symmetry
dc.title	Practical reinforcement learning for adaptive robotic manipulation: Sample efficiency, sim-to-real transfer, and context inference
dc.type	Thesis

Files

Original bundle

Now showing 1 - 1 of 1

Name:: Enayati_Amir_Mehdi_Soufi_PhD_2025.pdf
Size:: 32.32 MB
Format:: Adobe Portable Document Format

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.62 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Electronic Theses and Dissertations (ETD)