[1] F. Xu, W. Hou, and L. Wei, “An Empirical Investigation into the Limitations of Sparse Mixture of Experts for Small Scale Character Level Modeling,” IJAIGM, vol. 2, no. 1, Apr. 2026, doi: 10.67119/1107ve04.