Abstract: The dual control problem, introduced by Feldbaum in the 1960s, is recognized as encapsulating the "exploration versus exploitation" dilemma that is central to online learning and control.
Abstract: Dynamic pickup and delivery problems (DPDPs) with various constraints, such as docks, time windows, capacity, and last-in-first-out loading, have posed significant challenges for existing ...