About the Architecture category

This subforum is for discussions about model structure and representation. Topics include attention mechanisms, architectural innovations, scaling patterns, inductive biases, and new ways of organizing computation inside neural networks.