Learning-from-Observation 2.0
Synthesis Lectures on Computer Vision

Automatic Acquisition of Robot Behavior from Human Demonstration

Katsushi Ikeuchi and others
    • US$34.99

Publisher Description

This book presents recent breakthroughs in Learning-from-Observation (LfO) driven by advances in large language models (LLMs) and reinforcement learning (RL), and positions them in the context of historical developments in the area. LfO involves observing human behaviors and generating robot actions that mimic these behaviors. On the surface, LfO may appear similar to Imitation Learning (IL) in the machine learning community and Programming-by-Demonstration (PbD) in the robotics community. A significant difference is that those methods directly imitate human hand movements, whereas LfO encodes human behaviors into abstract representations and then maps these representations onto the robot's currently available hardware (its individual body), thus mimicking the behaviors indirectly. This indirect imitation absorbs changes in the surrounding environment and differences in robot hardware. Passing through the abstract representation also acts as a filter that distinguishes important from less important aspects of human behavior, enabling imitation from fewer and less demanding demonstrations. The authors have been researching the LfO paradigm for the past decade or so. Previously, the focus was primarily on designing necessary and sufficient task representations for specific task domains such as assembly of machine parts, knot-tying, and human dance movements. Recent advances in Generative Pre-trained Transformers (GPT) and RL have led to groundbreaking developments in methods for obtaining and mapping these abstract representations. Using GPT, the authors can automatically generate abstract representations from videos, and RL-trained agent libraries make implementing robot actions more feasible.
In addition, this book:

    • Explains task encoders that use GPT and agent libraries trained via RL to produce executable programs for robots
    • Examines the selection and design of agent libraries that satisfy necessary and sufficient conditions for task domains
    • Discusses LfO in relation to Piaget's theory of child development and offers a historical retrospective of LfO research

Genre: Computers & Internet
Published: October 31, 2025
Language: English (EN)
Pages: 220
Publisher: Springer Nature Switzerland
Seller: Springer Nature B.V.
Size: 33.4 MB
Active Lighting and Its Application for Computer Vision (2020)
Digitally Archiving Cultural Objects (2008)
Machine Vision Beyond Visible Spectrum (2011)
Structured Representation Learning (2025)
Video Object Segmentation (2023)
Video Object Tracking (2023)
A Unifying Framework for Formal Theories of Novelty (2023)
Advances in Face Presentation Attack Detection (2023)
Fine-Grained Image Analysis: Modern Approaches (2023)