Minimax Off-Policy Evaluation for Multi-Armed Bandits
| Title: | Minimax Off-Policy Evaluation for Multi-Armed Bandits |
|---|---|
| Authors: | Ma, C.; Zhu, B.; Jiao, J.; Wainwright, M.J. |
| Source: | IEEE Transactions on Information Theory IEEE Trans. Inform. Theory Information Theory, IEEE Transactions on. 68(8):5314-5339 Aug, 2022 |
| Database: | IEEE Xplore Digital Library |