About the Evaluation category

gradyacc February 5, 2026, 6:11pm 1

This subforum is dedicated to measurement and benchmarking. Discussions may include datasets, benchmarks, evaluation methodologies, reproducibility, comparisons between models or systems, and critiques of existing metrics.

Topic	Replies	Views
About the Training category Training	5	February 5, 2026
About the Model Releases category Model Releases	17	September 17, 2025
About the Inference category Inference	4	February 5, 2026
About the Prompting category Prompting	5	February 5, 2026
About the Theory category Theory	4	February 5, 2026
About the Architecture category Architecture	11	February 5, 2026
About the Compute Contributions category Compute Contributions	18	February 5, 2026
Nous Research presents Hermes 4, our latest line of open-source models Model Releases model , release , nous-chat	116	September 22, 2025

About the Evaluation category

Related topics