See also Interactive NLP Workshop - References and Awesome RLHF
RLHF: Reinforcement Learning with Human Feedback. This definition is quite general, and would apply to any method of reinforcement learning with human feedback. Actually RLHF usually refers to a specific method of doing this. For a quick a overview, see section 3 of Rafailov 2023.