Ai2 Blog17d ago

EMO: Pretraining mixture of experts for emergent modularity

EMO is a new mixture-of-experts model trained so modular expert groups emerge from data, enabling users to select small task-specific expert subsets while preserving near full-model performance.