About the Architecture category

gradyacc February 5, 2026, 6:04pm 1

This subforum is for discussions about model structure and representation. Topics include attention mechanisms, architectural innovations, scaling patterns, inductive biases, and new ways of organizing computation inside neural networks.

Topic		Replies	Views
About the Training category Training	0	4	February 5, 2026
About the Prompting category Prompting	0	4	February 5, 2026
About the Theory category Theory	0	4	February 5, 2026
About the Inference category Inference	0	4	February 5, 2026
About the Evaluation category Evaluation	0	4	February 5, 2026
DSA attention (DeepSeek Sparse Attention) Architecture llm-research , training , inference	1	76	January 17, 2026
About the Model Releases category Model Releases	0	17	September 17, 2025
Nous Research presents Hermes 4, our latest line of open-source models Model Releases model , release , nous-chat	0	116	September 22, 2025

About the Architecture category

Related topics