Troubleshooting
Troubleshooting
Common Issues
"Module not found" errors
# Ensure virtual environment is activated
source venv/bin/activate # or venv\Scripts\activate on Windows
# Reinstall dependencies
pip install -r requirements.txt --upgrade
API Key errors
Symptoms: AuthenticationError, Invalid API key
Solutions:
- Verify
.envfile is in project root - Check API keys are correct (no extra spaces)
- Ensure provider in YAML matches
.envconfiguration - Test API key manually with provider's playground
No quizzes found
Symptoms: Loaded 0 quizzes
Solutions:
- Verify quiz JSON files are in correct directory
- Check JSON format validity:
python -m json.tool data/quizzes/quiz.json - Ensure
source_materialfield references existing file - Review directory paths in config YAML
High variance in results
Symptoms: Std dev > 10
Possible causes:
- Ambiguous metric prompts
- Complex questions with multiple interpretations
- Insufficient runs
Solutions:
- Increase number of runs (5-10)
- Refine metric prompt for clarity
- Review raw LLM responses for patterns
- Try different evaluator models
Unexpected scores
Symptoms: Scores don't match manual assessment
Diagnostic steps:
- Review raw LLM responses in
<run-bundle>/results.json - Test metric prompt manually in LLM playground
- Verify source material quality and completeness
- Check if questions align with learning objectives
- Compare across multiple evaluators
API rate limit errors
Symptoms: RateLimitError, 429 Too Many Requests
Solutions:
- Add delays between API calls (implement in provider)
- Use batch processing with smaller batches
- Check your tier limits with provider
- Spread evaluation across longer time period
Memory issues
Symptoms: MemoryError, system slowdown
Solutions:
- Process quizzes in batches
- Reduce
max_tokensin config - Use lighter models (gpt-3.5-turbo vs gpt-4)
- Clear results between runs