Hierarchical Relative Entropy Policy Search: An Information Theoretic Learning Algorithm in Multimodal Solution Spaces for Real Robots - Softcover

 
Synopsis

Many real-world problems are inherently hierarchically structured. The use of this structure in an agent's policy may well be the key to improved scalability and higher performance on motor skill tasks. However, such hierarchical structures cannot be exploited by current policy search algorithms. We concentrate on a basic but highly relevant hierarchy: the 'mixed option' policy. Here, a gating network first decides which of the options to execute and, subsequently, the option-policy determines the action. Using a hierarchical setup for our learning method allows us to learn not only one solution to a problem but many. We base our algorithm on a recently proposed information theoretic policy search method, which addresses the exploitation-exploration trade-off by limiting the loss of information between policy updates.
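
The two-stage decision described in the synopsis (a gating network picks an option, then that option's policy picks the action) can be illustrated with a minimal sketch. This is not the book's learning algorithm; it only shows how an action might be sampled from such a policy. The softmax gating over linear scores, the linear-Gaussian option policies, and all names (`MixedOptionPolicy`, `gating_weights`, `option_gains`, `option_stds`) are assumptions chosen for illustration.

```python
import math
import random


def softmax(scores):
    """Turn arbitrary scores into probabilities that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


class MixedOptionPolicy:
    """Hypothetical two-level policy: gating over options, then an action.

    Assumed forms (not taken from the book): the gating is a softmax over
    linear scores of the state, and each option is a linear-Gaussian policy.
    """

    def __init__(self, gating_weights, option_gains, option_stds, seed=0):
        self.gating_weights = gating_weights  # one weight vector per option
        self.option_gains = option_gains      # one gain vector per option
        self.option_stds = option_stds        # one exploration std per option
        self.rng = random.Random(seed)

    def gating_probs(self, state):
        """Probability of each option given the state."""
        scores = [
            sum(w * s for w, s in zip(weights, state))
            for weights in self.gating_weights
        ]
        return softmax(scores)

    def sample(self, state):
        """Stage 1: draw an option. Stage 2: draw an action from that option."""
        probs = self.gating_probs(state)
        option = self.rng.choices(range(len(probs)), weights=probs, k=1)[0]
        mean = sum(g * s for g, s in zip(self.option_gains[option], state))
        action = self.rng.gauss(mean, self.option_stds[option])
        return option, action


# Two options that solve the same task with different actions.
policy = MixedOptionPolicy(
    gating_weights=[[2.0, 0.0], [0.0, 0.0]],
    option_gains=[[1.0, 0.0], [-1.0, 0.0]],
    option_stds=[0.1, 0.1],
)
state = [1.0, 0.5]
option, action = policy.sample(state)
print(policy.gating_probs(state), option, action)
```

Because each option keeps its own action distribution, a policy of this shape can represent several distinct solutions at once, which is the multimodality the synopsis refers to.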

"synopsis" may belong to another edition of this title.

About the Author

Christian Daniel studied computational engineering at Technische Universitaet Darmstadt and EPFL Lausanne and is pursuing a PhD in Robot Learning. His research focuses on developing new learning algorithms for autonomous robots, especially in the field of robot skill learning and hierarchical reinforcement learning.

"About this title" may belong to another edition of this title.

Search results for Hierarchical Relative Entropy Policy Search: An Information...

Christian Daniel
Published by AV Akademikerverlag, Jan 2014
ISBN 10: 3639475992 ISBN 13: 9783639475999
New Paperback
Print on Demand

Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany

Seller rating: 5 out of 5 stars

Paperback. Condition: New. This item is printed on demand, which takes 3-4 days longer. New item. 68 pp. English. Seller Inventory # 9783639475999

Price (new): £ 28.74
Shipping: £ 9.60, from Germany to United Kingdom

Quantity: 2 available

Christian Daniel
Published by AV Akademikerverlag, 2014
ISBN 10: 3639475992 ISBN 13: 9783639475999
New Paperback
Print on Demand

Seller: AHA-BUCH GmbH, Einbeck, Germany

Seller rating: 5 out of 5 stars

Paperback. Condition: New. Printed after ordering. New item. Seller Inventory # 9783639475999

Price (new): £ 28.74
Shipping: £ 12.22, from Germany to United Kingdom

Quantity: 1 available

Christian Daniel, Gerhard Neumann
Published by AV Akademikerverlag, 2014
ISBN 10: 3639475992 ISBN 13: 9783639475999
New Softcover
Print on Demand

Seller: moluna, Greven, Germany

Seller rating: 5 out of 5 stars

Condition: New. This is a print-on-demand item and is printed for you after you order. Author: Christian Daniel. Seller Inventory # 4991377

Price (new): £ 24.41
Shipping: £ 21.82, from Germany to United Kingdom

Quantity: Over 20 available

Christian Daniel
Published by AV Akademikerverlag, Jan 2014
ISBN 10: 3639475992 ISBN 13: 9783639475999
New Paperback

Seller: buchversandmimpf2000, Emtmannsberg, BAYE, Germany

Seller rating: 5 out of 5 stars

Paperback. Condition: New. New item. VDM Verlag, Dudweiler Landstraße 99, 66123 Saarbrücken. 68 pp. English. Seller Inventory # 9783639475999

Price (new): £ 28.74
Shipping: £ 30.56, from Germany to United Kingdom

Quantity: 2 available
