Stellantis Interview Question

Explain Reinforcement Learning from Human Feedback (RLHF).