Publication: Infinite Horizon Multi-armed Bandits with Reward Vectors: Exploration/Exploitation Trade-off.