DAPPER: Discriminability-Aware Policy-to-Policy Preference-Based Reinforcement Learning for Query-Efficient Robot Skill Acquisition
| Title: | DAPPER: Discriminability-Aware Policy-to-Policy Preference-Based Reinforcement Learning for Query-Efficient Robot Skill Acquisition |
|---|---|
| Authors: | Kadokawa, Y.; Frey, J.; Miki, T.; Matsubara, T.; Hutter, M. |
| Source: | IEEE Robotics & Automation Magazine (IEEE Robot. Automat. Mag.), 33(1):151-166, Mar. 2026 |
| Database: | IEEE Xplore Digital Library |