Learning-from-Observation 2.0 Learning-from-Observation 2.0
Synthesis Lectures on Computer Vision

Learning-from-Observation 2.0

Automatic Acquisition of Robot Behavior from Human Demonstration

    • USD 34.99
    • USD 34.99

Descripción editorial

This book presents recent breakthroughs in the field of Learning-from-Observation (LfO) resulting from advancement in large language models (LLM) and reinforcement learning (RL) and positions it in the context of historical developments in the area. LfO involves observing human behaviors and generating robot actions that mimic these behaviors. While LfO may appear similar, on the surface, to Imitation Learning (IL) in the machine learning community and Programing-by-Demonstration (PbD) in the robotics community, a significant difference lies in the fact that these methods directly imitate human hand movements, whereas LfO encodes human behaviors into the abstract representations and then maps these representations onto the currently available hardware (individual body) of the robot, thus indirectly mimicking them. This indirect imitation allows for absorbing changes in the surrounding environment and differences in robot hardware. Additionally, by passing through this abstract representation, filtering can occur, distinguishing between important and less important aspects of human behavior, enabling imitation with fewer demonstrations and less demanding demonstrations. The authors have been researching the LfO paradigm for the past decade or so.  Previously, the focus was primarily on designing necessary and sufficient task representations to define specific task domains such as assembly of machine parts, knot-tying, and human dance movements. Recent advancements in Generative Pre-trained Transformers (GPT) and RL have led to groundbreaking developments in methods to obtain and map these abstract representations. By utilizing GPT, the authors can automatically generate abstract representations from videos, and by employing RL-trained agent libraries, implementing robot actions becomes more feasible.
In addition, this book:

Provides explanations of task encoders utilizing GPT and agent libraries via RL for executable programs for robots
Examines the selection and design of agent libraries that satisfy necessary and sufficient conditions for task domains
Discusses LfO with Piaget's child development theory and offers a historical retrospective of LfO research

GÉNERO
Informática e Internet
PUBLICADO
2025
31 de octubre
IDIOMA
EN
Inglés
EXTENSIÓN
220
Páginas
EDITORIAL
Springer Nature Switzerland
VENDEDOR
Springer Nature B.V.
TAMAÑO
33.4
MB
Active Lighting and Its Application for Computer Vision Active Lighting and Its Application for Computer Vision
2020
Digitally Archiving Cultural Objects Digitally Archiving Cultural Objects
2008
Machine Vision Beyond Visible Spectrum Machine Vision Beyond Visible Spectrum
2011
Structured Representation Learning Structured Representation Learning
2025
Video Object Segmentation Video Object Segmentation
2023
Video Object Tracking Video Object Tracking
2023
A Unifying Framework for Formal Theories of Novelty A Unifying Framework for Formal Theories of Novelty
2023
Advances in Face Presentation Attack Detection Advances in Face Presentation Attack Detection
2023
Fine-Grained Image Analysis: Modern Approaches Fine-Grained Image Analysis: Modern Approaches
2023