Pieter Abbeel
Abbeel, Pieter
VIAF ID: 220603728 (Personal)
Permalink: http://viaf.org/viaf/220603728
Preferred Forms
- 200 _ | ‡a Abbeel ‡b Pieter
-
- 100 1 _ ‡a Abbeel, Pieter
- 100 1 _ ‡a Abbeel, Pieter
-
- 100 0 _ ‡a Pieter Abbeel
4xx's: Alternate Name Forms (6)
Works
Title | Sources |
---|---|
#Exploration: A Study of Count-Based Exploration for Deep Reinforcement Learning | |
Active exploration using trajectory optimization for robotic grasping in the presence of occlusions | |
Addressing Sample Complexity in Visual Tasks Using HER and Hallucinatory GANs | |
An Application of Reinforcement Learning to Aerobatic Helicopter Flight | |
Apprenticeship learning via inverse reinforcement learning | |
Automatic Curriculum Learning through Value Disagreement | |
Autonomous multilateral debridement with the Raven surgical robot | |
AvE: Assistance via Empowerment | |
Backprop KF: Learning Discriminative Deterministic State Estimators | |
A biological micro actuator: graded and closed-loop control of insect leg motion by electrical stimulation of muscles | |
Combinatorial Energy Learning for Image Segmentation | |
Compositional Plan Vectors | |
Compression with Flows via Local Bits-Back Coding | |
Deciphering the role of a coleopteran steering muscle via free flight stimulation | |
Denoising Diffusion Probabilistic Models | |
End-to-End Training of Deep Visuomotor Policies | |
Evaluating Protein Transfer Learning with TAPE | |
Evolved Policy Gradients | |
Gaussian belief space planning with discontinuities in sensing domains | |
A geometric approach to robotic laundry folding | |
Geometry-Aware Neural Rendering | |
GP-GPIS-OPT: Grasp planning with shape uncertainty using Gaussian process implicit surfaces and Sequential Convex Programming | |
Gradient Estimation Using Stochastic Computation Graphs | |
Gravity-Based Robotic Cloth Folding | |
Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion | |
Hindsight Experience Replay | |
The Importance of Sampling in Meta-Reinforcement Learning | |
Insect-machine hybrid system | |
Inverse Reward Design | |
Learning accurate kinematic control of cable-driven surgical robots using data cleaning and Gaussian Process Regression | |
Learning by observation for surgical subtasks: Multilateral cutting of 3D viscoelastic and 2D Orthotropic Tissue Phantoms | |
Learning first-order Markov models for control | |
Learning Neural Network Policies with Guided Policy Search under Unknown Dynamics | |
Learning Plannable Representations with Causal InfoGAN | |
Learning to Poke by Poking: Experiential Learning of Intuitive Physics | |
Learning vehicular dynamics, with application to modeling helicopters | |
Link Prediction in Relational Data | |
LQG-Based Planning, Sensing, and Control of Steerable Needles | |
LQG-MP: Optimized path planning for robots with motion uncertainty and imperfect state information | |
Max-margin classification of incomplete data | |
MCP: Learning Composable Hierarchical Control with Multiplicative Compositional Policies | |
Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments | |
Multi-armed bandit models for 2D grasp planning with uncertainty | |
On a Connection between Importance Sampling and the Likelihood Ratio Policy Gradient | |
On the Utility of Learning about Humans for Human-AI Coordination | |
One-Shot Imitation Learning | |
Planning Curvature and Torsion Constrained Ribbons in 3D With Application to Intracavitary Brachytherapy | |
Planning locally optimal, curvature-constrained trajectories in 3D using sequential convex optimization | |
Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds | |
A robot path planning framework that learns from experience | |
Robotics Science and Systems VII, c2012: | |
Scaling up Gaussian Belief Space Planning Through Covariance-Free Trajectory Optimization and Automatic Differentiation | |
Sigma hulls for Gaussian belief space planning for imprecise articulated robots amid obstacles | |
A single-use haptic palpation probe for locating subcutaneous blood vessels in robot-assisted minimally invasive surgery | |
Sparse Graphical Memory for Robust Planning | |
Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model | |
Superhuman performance of surgical tasks by robots using iterative learning from human-guided demonstrations | |
A Survey of Research on Cloud Robotics and Automation | |
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning | |
Value Iteration Networks | |
VIME: Variational Information Maximizing Exploration |