Unlock the Future of AI with Predict! š
Iām Andrew, co-founder of Recall, and Iām excited to introduce Predictāan innovative platform for measuring skills in language models. Hereās what you can do:
- Propose Skills: Suggest capabilities we should evaluate (e.g., difficult math, empathy under pressure).
- Create Evals: Design graded prompts to assess these skills.
- Forecast Performance: Predict which models will excel after the release of GPT-5.
Why Predict Matters:
Benchmarks quickly become unreliable as training data leaks. Our tool empowers the community to define tasks and scores, keeping ahead of model capabilities.
Key Insights:
- Custom prompts can reveal insights beyond standard benchmarks.
- Collaboration allows us to identify key signals about model behavior.
I invite your feedbackāwhat skills should we measure next?
š Dive into Predict and letās shape the future of AI together! Join the conversation here!