Natural Language Processing
Benchmarking LLMs on Text Generation
This course explores benchmarking for open-ended generation tasks like summarization. You'll experiment with different prompting styles, compare models like GPT-3.5 and GPT-4, and evaluate results using both fuzzy string similarity and semantic similarity via embeddings.
Python
3 lessons
10 practices
1 hour
Badge for NLP Model Evaluation and Optimization
Course details
Prompting for Summarization with LLMs
Reading CSV Files with Python
Simplifying Prompts for Better Summaries
Crafting Custom Prompts for Better Summaries
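The two evaluation approaches named in the course description, fuzzy string similarity and semantic similarity via embeddings, can be illustrated with a minimal sketch. This is not the course's own code: the function names `fuzzy_similarity` and `cosine_similarity` are hypothetical, the fuzzy score uses the standard library's `difflib.SequenceMatcher` as one possible choice, and the embedding vectors are assumed to have been computed elsewhere by whatever embedding model you use.

```python
import math
from difflib import SequenceMatcher


def fuzzy_similarity(reference: str, candidate: str) -> float:
    """Character-level similarity in [0, 1], ignoring case (hypothetical helper)."""
    return SequenceMatcher(None, reference.lower(), candidate.lower()).ratio()


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two precomputed embedding vectors (hypothetical helper)."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        # A zero vector has no direction, so treat it as having no similarity.
        return 0.0
    return dot / (norm_a * norm_b)


if __name__ == "__main__":
    reference = "The report summarizes quarterly sales growth."
    candidate = "The report summarises quarterly sales growth."
    print(f"fuzzy score: {fuzzy_similarity(reference, candidate):.3f}")

    # Placeholder vectors standing in for real embeddings of two summaries.
    print(f"cosine score: {cosine_similarity([0.1, 0.3, 0.5], [0.2, 0.1, 0.4]):.3f}")
```

A fuzzy score rewards summaries that reuse the reference wording, while a cosine score over embeddings can credit a paraphrase that shares the meaning but not the words, which is why the two are often reported side by side.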