220603728

Preferred Forms

4xx's: Alternate Name Forms (6)

Works

Title	Sources
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning
Active exploration using trajectory optimization for robotic grasping in the presence of occlusions
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs
An Application of Reinforcement Learning to Aerobatic Helicopter Flight
Apprenticeship learning via inverse reinforcement learning
Automatic Curriculum Learning through Value Disagreement
Autonomous multilateral debridement with the Raven surgical robot
AvE: Assistance via Empowerment
Backprop KF: Learning Discriminative Deterministic State Estimators
A biological micro actuator: graded and closed-loop control of insect leg motion by electrical stimulation of muscles
Combinatorial Energy Learning for Image Segmentation
Compositional Plan Vectors
Compression with Flows via Local Bits-Back Coding
Deciphering the role of a coleopteran steering muscle via free flight stimulation
Denoising Diffusion Probabilistic Models
End-to-End Training of Deep Visuomotor Policies
Evaluating Protein Transfer Learning with TAPE
Evolved Policy Gradients
Gaussian belief space planning with discontinuities in sensing domains
A geometric approach to robotic laundry folding
Geometry-Aware Neural Rendering
GP-GPIS-OPT: Grasp planning with shape uncertainty using Gaussian process implicit surfaces and Sequential Convex Programming
Gradient Estimation Using Stochastic Computation Graphs
Gravity-Based Robotic Cloth Folding
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
Hindsight Experience Replay
The Importance of Sampling in Meta-Reinforcement Learning
Insect-machine hybrid system
Inverse Reward Design
Learning accurate kinematic control of cable-driven surgical robots using data cleaning and Gaussian Process Regression
Learning by observation for surgical subtasks: Multilateral cutting of 3D viscoelastic and 2D Orthotropic Tissue Phantoms
Learning first-order Markov models for control
Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics
Learning Plannable Representations with Causal InfoGAN
Learning to Poke by Poking: Experiential Learning of Intuitive Physics
Learning vehicular dynamics, with application to modeling helicopters
Link Prediction in Relational Data
LQG-Based Planning, Sensing, and Control of Steerable Needles
LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information
Max-margin classification of incomplete data
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments
Multi-armed bandit models for 2D grasp planning with uncertainty
On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient
On the Utility of Learning about Humans for Human-AI Coordination
One-Shot Imitation Learning
Planning Curvature and Torsion Constrained Ribbons in 3D With Application to Intracavitary Brachytherapy
Planning locally optimal, curvature-constrained trajectories in 3D using sequential convex optimization
Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds
A robot path planning framework that learns from experience
Robotics Science and Systems VII, c2012:
Scaling up Gaussian Belief Space Planning Through Covariance-Free Trajectory Optimization and Automatic Differentiation
Sigma hulls for Gaussian belief space planning for imprecise articulated robots amid obstacles
A single-use haptic palpation probe for locating subcutaneous blood vessels in robot-assisted minimally invasive surgery
Sparse Graphical Memory for Robust Planning
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model
Superhuman performance of surgical tasks by robots using iterative learning from human-guided demonstrations
A Survey of Research on Cloud Robotics and Automation
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning
Value Iteration Networks
VIME: Variational Information Maximizing Exploration

Selected Co-authors

Countries and Regions of Publication (1)

Publication Statistics

Selected Publishers (1)

About

Record Views

History of VIAF ID:220603728 (8)

Record ID	Action	Time
LC\|n 2011083087	add	2012-01-24T10:40:15+00:00
BNF\|16694634	add	2013-08-19T18:59:48+00:00
ISNI\|0000000359541234	add	2014-12-15T16:26:32+00:00
DNB\|1112731873	delete	2017-07-03T23:52:57.146315+00:00
J9U\|987007386793605171	add	2020-02-02T17:32:38.879011+00:00
NLI\|004760867	delete	2020-04-12T17:42:26.026419+00:00
CAOONL\|ncf11619682	add	2020-07-14T14:40:23.362631+00:00
WKP\|Q33374357	add	2022-04-27T13:49:32.338513+00:00

VIAF

Virtual International Authority File

Search

Pieter Abbeel

Abbeel, Pieter

VIAF ID: 220603728 (Personal)

Permalink: http://viaf.org/viaf/220603728

Preferred Forms

4xx's: Alternate Name Forms (6)

Works

Selected Co-authors

Countries and Regions of Publication (1)

Publication Statistics

Publication History

Identifiers

Selected Publishers (1)

About

Personal Information

Gender : Male

Nationality or associated country:

Language:

External Links

Record Views

History of VIAF ID:220603728 (8)