Manish Gupta, Jing Gao, et al.
IEEE TKDE
In a descending price multi-unit Dutch auction over the Internet, auctioneer gradually decrements per unit price of the item during the course of the auction. We investigate the problem of finding a decrementing price sequence that maximizes auctioneer’s total expected revenue using single-agent Reinforcement Learning. We contrast actual-return (Monte-Carlo based) learning methods with one step Q-learning and also with other adaptive strategies and report extensive comparative performance study. In our experimental design, we model bidders’ strategies in a unique way using various bid functions that capture realistic strategic behavior of bidders in such auction games. Monte-Carlo control algorithm developed here offers consistent performance and yields high average returns in all the experiments.
Manish Gupta, Jing Gao, et al.
IEEE TKDE
Ayushi Dalmia, J. Ganesh, et al.
WWW 2018
Manish Gupta, Charu C. Aggarwal, et al.
ASONAM 2011
Dong Wang, Tarek Abdelzaher, et al.
FUSION 2011