Natural Language Processing
Benchmarking LLMs on Text Generation
This course explores benchmarking for open-ended generation tasks like summarization. You'll experiment with different prompting styles, compare models like GPT-3.5 and GPT-4, and evaluate results using both fuzzy string similarity and semantic similarity via embeddings.
Python
3 lessons
10 practices
1 hour
Course details
Reading CSV Files with Python
Simplifying Prompts for Better Summaries
Crafting Custom Prompts for Better Summaries
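The two evaluation approaches named in the course description, fuzzy string similarity and semantic similarity via embeddings, can be illustrated with a minimal sketch. It is not taken from the course materials: the function names are hypothetical, `difflib.SequenceMatcher` is one possible fuzzy-matching choice from the standard library, and the embedding vectors are made-up placeholders standing in for the output of a real embedding model.

```python
from difflib import SequenceMatcher
import math


def fuzzy_similarity(reference: str, candidate: str) -> float:
    """Character-level similarity ratio in [0, 1], ignoring case."""
    return SequenceMatcher(None, reference.lower(), candidate.lower()).ratio()


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two embedding vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as having no similarity
    return dot / (norm_a * norm_b)


# Hypothetical reference and model-generated summaries.
reference_summary = "The city council approved the new budget."
generated_summary = "The new budget was approved by the city council."

# Placeholder vectors (assumption): in practice these would come from
# an embedding model applied to each summary.
reference_embedding = [0.12, 0.80, 0.35]
generated_embedding = [0.10, 0.78, 0.40]

print(f"fuzzy:  {fuzzy_similarity(reference_summary, generated_summary):.2f}")
print(f"cosine: {cosine_similarity(reference_embedding, generated_embedding):.2f}")
```

Fuzzy matching rewards overlapping character sequences, so a reworded summary with the same meaning may score lower than expected; comparing embeddings is intended to capture that shared meaning instead.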
