Publication: Tradeoff between exploration and exploitation of OQ(lambda) with non-Markovian update in dynamic environments.