ロボットによる巧みな物体操作を実現するための自己教師あり学習法の提案

村田, 真悟

Home »» Listing item »» Detail

Detail

Item Type

Article

ID

2020000008-20200199 　

Preview

Image
Caption

Full text

2020000008-20200199.pdf

Type	:application/pdf	Download
Size	:127.3 KB
Last updated	:Feb 16, 2024
Downloads	: 164

Total downloads since Feb 16, 2024 : 164
　

Release Date

Title

Title	ロボットによる巧みな物体操作を実現するための自己教師あり学習法の提案
Kana	ロボットニヨルタクミナブッタイソウサオジツゲンスルタメノジコキョウシアリガクシュウホウノテイアン
Romanization	Robotto ni yoru takumina buttai sōsa o jitsugensuru tame no jiko kyōshi ari gakushūhō no teian

Other Title

Title	Self-supervised learning method to realize skillful object manipulation by a robot
Kana
Romanization

Creator

Name	村田, 真悟
Kana	ムラタ, シンゴ
Romanization	Murata, Shingo
Affiliation	慶應義塾大学理工学部専任講師
Affiliation (Translated)
Role	Research team head
Link

Edition

Place

Publisher

Name	慶應義塾大学
Kana	ケイオウギジュクダイガク
Romanization	Keiō gijuku daigaku

Date

Issued (from:yyyy)	2021
Issued (to:yyyy)
Created (yyyy-mm-dd)
Updated (yyyy-mm-dd)
Captured (yyyy-mm-dd)

Physical description

1 pdf 　

Source Title

Name	学事振興資金研究成果実績報告書
Name (Translated)
Volume
Issue
Year	2020
Month
Start page
End page

ISSN

ISBN

DOI

URI

JaLCDOI

NII Article ID

Ichushi ID

Other ID

Doctoral dissertation

Dissertation Number
Date of granted
Degree name
Degree grantor

Abstract

本研究では，ロボットによる巧みな物体操作を実現するため，プレイデータを用いた自己教師あり学習法を提案し，その実装を行った．具体的には，深層学習をロボティクス分野に対して有効に適用するための，（1）新たな自己教師データ取得フレームワークと（2）取得したデータを学習するための深層生成モデルの構築を行った．
（1）では人がマスタースレーブ方式によって，その人の「興味」に基づいてロボットを操作することでロボット（自己）の多様な視覚・運動情報データを取得した．具体的には，ロボットが操作可能な物体が配置された環境において，興味に基づき自由にマスター側のロボットを操縦することでスレーブ側のロボットを1時間操作した．その際に，スレーブ側のロボットの関節角度とカメラ画像の時系列データを取得した．本研究ではこのデータを，人が自由に「遊びながら」取得したという意味でプレイデータと呼ぶ．
（2）では変分自己符号化器（VAE: Variational Autoencoder）と再帰型ニューラルネットワーク（RNN: Recurrent Neural Network）を統合した変分再帰型自己符号化器（VRAE: Variational Recurrent Autoencoder）を構築し，取得したプレイデータの学習を行った．VRAEは，取得した時系列データの全体から一部を切り取り，その最終状態に到達することを目指し，視覚・運動状態の予測学習を行った．学習後のモデルをロボットに搭載し，学習済・未学習の環境における物体操作能力の検証を行った．検証の結果，従来の正例のみを用いていた教師あり学習では困難であったエラーリカバリが，本研究で提案する自己教師あり学習法で実現可能であることが示された．また，従来の強化学習においては環境をランダムに探索するため無駄なデータが含まれるという問題があるが，提案手法は人がロボットを興味に基づいて操作するため，取得されたデータは多様性を有する意味のあるものとなり，データの効率的な取得も実現することができた．
In this study, we proposed and implemented a self-supervised learning method using play data to realize skillful object manipulation by a robot. Specifically, we developed (1) a new framework for acquiring self-supervised data and (2) a deep generative model for learning the acquired data in order to effectively apply deep learning to the field of robotics.
In (1), a human operator freely controls a robot based on his or her "interests" in a master-slave manner to acquire various visual and proprioceptive data from a robot. In particular, the operator controlled a slave robot in an environment with manipulatable objects by directly guiding a master robot based on his or her interests during an hour. Time series data of joint angles and camera images of the slave robot were recorded. In this study, we call these data "play data" in the sense that they were acquired while he or she was playing freely.
In (2), we developed a variational recurrent autoencoder (VRAE) that integrates a variational autoencoder (VAE) and a recurrent neural network (RNN). Given a final state of a part of whole time series data as a goal, the VRAE learned to generate a predictive sequence of the visual and proprioceptive states to reach the final state. We implemented the trained model in the robot and tested its ability to manipulate objects in both trained and untrained situations. The experimental results demonstrated that our proposed self-supervised learning method can achieve error recovery that is not difficult to be achieved in the conventional supervised learning method using only positive examples. In addition, while conventional reinforcement learning has the problem of including useless data due to the random exploration of an environment, our proposed method which asks human operators to manipulate a robot based on their interests realizes the meaningful and efficient data collection with diversity.

Table of contents

Keyword

NDC

Note

Language

日本語　

英語　

Type of resource

text 　

Genre

Research Paper 　

Text version

publisher 　

Related DOI

Access conditions

Last modified date

Feb 16, 2024 13:35:19 　

Creation date

Feb 16, 2024 13:35:19 　

Registerd by

mediacenter

History

Feb 16, 2024

インデックスを変更

Index

/ Public / Internal Research Fund / Keio Gijuku Academic Development Funds Report / Academic year 2020

Related to

慶應義塾大学学術情報リポジトリ（KOARA）

Language Select

Search

Indextree

Detail

Ranking

LINK