Mixture of Experts (MoE)

Mixture of experts (MoE) is a neural network architecture that divides a model into many specialized sub-networks called "experts" and uses a routing mechanism to activate only a small subset of them for each input. This design allows models to have trillions of total parameters while only using a fraction during each prediction, dramatically reducing computational cost without sacrificing capability.

Example

DeepSeek-R1 has 671 billion total parameters spread across hundreds of expert sub-networks per layer, but only 37 billion are active for any single token. A learned router decides which experts to activate for each token (in practice, experts tend to specialize during training in patterns such as code, mathematical notation, or particular languages), keeping inference fast and cost-effective despite the model's massive total size.
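The routing idea above can be sketched in a few lines of NumPy. This is a minimal toy, not any real model's implementation: the dimensions, weight names, and single-token forward pass are illustrative assumptions, and real MoE layers use feed-forward experts, batched dispatch, and load-balancing losses.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy dimensions (illustrative only, vastly smaller than any real MoE model)
d_model, n_experts, top_k = 8, 4, 2

# Router: a single linear layer that scores each expert for a given token
W_router = rng.normal(size=(d_model, n_experts))

# Experts: here plain linear maps; real experts are feed-forward blocks
W_experts = rng.normal(size=(n_experts, d_model, d_model))

def moe_forward(x):
    """Route token x to its top-k experts and mix their outputs by gate weight."""
    logits = x @ W_router                      # one score per expert
    top = np.argsort(logits)[-top_k:]          # indices of the k highest-scoring experts
    gates = np.exp(logits[top])
    gates /= gates.sum()                       # softmax over the chosen k only
    # Only the selected experts compute anything; the rest stay idle for this token
    return sum(g * (x @ W_experts[i]) for g, i in zip(gates, top))

token = rng.normal(size=d_model)
out = moe_forward(token)
print(out.shape)  # (8,)
```

With `top_k = 2` of 4 experts, each token touches only half the expert parameters per layer; the same principle, scaled up, is how a 671B-parameter model activates only 37B per token.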
