Our paper "Flow to Better: Offline Preference-based Reinforcement Learning via Preferred Trajectory Generation" has been accepted to ICLR 2024!