Improve LLM performance with human and AI feedback on Amazon SageMaker for Amazon Engineering
AWS Machine Learning
APRIL 24, 2024
To increase training samples for better learning, we also used another LLM to generate feedback scores. This method was described in "A generative AI-powered solution on Amazon SageMaker to help Amazon EU Design and Construction." RLHF is widely used throughout generative artificial intelligence (AI) and LLM applications.
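As a rough illustration of using a second LLM to generate feedback scores, the sketch below fills in scores for samples that have no human feedback and leaves human-labeled samples untouched. Everything in it is an assumption made for illustration: `FeedbackSample`, `augment_with_ai_feedback`, and `toy_scorer` are hypothetical names, and `toy_scorer` is a trivial stand-in for a call to a scoring LLM, not the method used in the post.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class FeedbackSample:
    prompt: str
    response: str
    score: Optional[float] = None  # feedback score; None means unlabeled
    source: str = "human"          # "human" or "ai"


def augment_with_ai_feedback(
    samples: List[FeedbackSample],
    ai_scorer: Callable[[str, str], float],
) -> List[FeedbackSample]:
    """Return a new list in which unlabeled samples receive an AI-generated score."""
    augmented = []
    for s in samples:
        if s.score is not None:
            # Keep existing (human) feedback unchanged.
            augmented.append(s)
        else:
            # Hypothetical: ai_scorer stands in for a second LLM that rates the response.
            ai_score = ai_scorer(s.prompt, s.response)
            augmented.append(FeedbackSample(s.prompt, s.response, ai_score, "ai"))
    return augmented


def toy_scorer(prompt: str, response: str) -> float:
    """Placeholder for the scoring LLM: 1.0 for a non-empty response, else 0.0."""
    return 1.0 if response.strip() else 0.0
```

Tagging each sample with its `source` keeps the two kinds of feedback distinguishable, so they can be weighted or audited separately later.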