Design and build reinforcement learning environments that model real-world customer interaction workflows. Design RL agents that learn from these environments using real-world interaction data, rewards, and feedback loops. Define reward models and feedback loops using real-world signals such as outcomes and human feedback. Enable learning from production data by structuring interaction traces into training-ready datasets for offline and online learning. Experiment with multi‑agent systems and simulation frameworks for complex coordination and decision‑making. Collaborate with engineering and…