Multiverse Computing Interview Question

How did you use RL for your LLM project?