OpenAI Releases New InstructGPT 3.5 Model OpenAI announced the release of a new member of the GPT family today, InstructGPT-3.5-Turbo.
What is RLHF? - Reinforced Learning From Human Feedback RLHF means training an AI model with human feedback. By putting humans in the training loop to grade AI output, LLMS (Large Language Models) can learn to give more accurate and natural responses.