Grain placement

Orleans ensures that when a grain call is made, an instance of that grain is available in memory on some server in the cluster to handle the request. If the grain isn't currently active in the cluster, Orleans picks a server to activate the grain on. This process is called grain placement. Placement is also one way Orleans balances load: placing busy grains evenly helps distribute the workload across the cluster.

The placement process in Orleans is fully configurable. Choose from out-of-the-box placement policies such as random, prefer-local, and load-based, or configure custom logic. This allows full flexibility in deciding where grains are created. For example, place grains on a server close to resources they need to operate on or close to other grains they communicate with.

Note

Starting in Orleans 9.2, the default placement strategy changed from random placement to resource-optimized placement. This provides better load distribution based on CPU and memory utilization across silos. If you require the previous random placement behavior, explicitly configure it as described in Configure the default placement strategy.

By default, Orleans picks a random compatible server.

Configure the placement strategy Orleans uses globally or per grain class.

Resource-optimized placement (default)

Note

Starting in Orleans 9.2, resource-optimized placement is the default placement strategy. You don't need to explicitly configure it unless you want to customize the options.

To use resource-optimized placement, add the ResourceOptimizedPlacementAttribute to your grain class.

Resource-optimized placement is not available in this version of Orleans. Consider upgrading to Orleans 8.1 or later to use this feature.

The resource-optimized placement strategy attempts to optimize cluster resources by balancing grain activations across silos based on available memory and CPU usage. It assigns weights to runtime statistics to prioritize different resources and calculates a normalized score for each silo. The silo with the lowest score is chosen for placing the upcoming activation. Normalization ensures each property contributes proportionally to the overall score. Adjust weights via ResourceOptimizedPlacementOptions based on specific requirements and priorities for different resources.

Additionally, this placement strategy exposes an option to build a stronger preference for the local silo (the one receiving the request to make a new placement) to be picked as the target for the activation. Control this via the LocalSiloPreferenceMargin property, part of the options.

Also, an online, adaptive algorithm provides a smoothing effect avoiding rapid signal drops by transforming the signal into a polynomial-like decay process. This is especially important for CPU usage and contributes overall to avoiding resource saturation on silos, particularly newly joined ones.

This algorithm is based on Resource-based placement with cooperative dual-mode Kalman filtering.

Configure this placement strategy by adding the ResourceOptimizedPlacementAttribute to the grain class.

Resource-optimized placement options

Configure the resource-optimized placement strategy using ResourceOptimizedPlacementOptions:

siloBuilder.Configure<ResourceOptimizedPlacementOptions>(options =>
{
    options.CpuUsageWeight = 40;
    options.MemoryUsageWeight = 20;
    options.AvailableMemoryWeight = 20;
    options.MaxAvailableMemoryWeight = 5;
    options.ActivationCountWeight = 15;
    options.LocalSiloPreferenceMargin = 5;
});

Option	Type	Default	Description
`CpuUsageWeight`	`int`	40	The importance of CPU usage. Higher values favor silos with lower CPU usage. Valid range: 0-100.
`MemoryUsageWeight`	`int`	20	The importance of memory usage. Higher values favor silos with lower memory usage. Valid range: 0-100.
`AvailableMemoryWeight`	`int`	20	The importance of available memory. Higher values favor silos with more available memory. Valid range: 0-100.
`MaxAvailableMemoryWeight`	`int`	5	The importance of maximum available memory. Higher values favor silos with higher physical memory capacity. Useful in clusters with unevenly distributed resources. Valid range: 0-100.
`ActivationCountWeight`	`int`	15	The importance of activation count. Higher values favor silos with fewer active grains. Valid range: 0-100.
`LocalSiloPreferenceMargin`	`int`	5	Margin for preferring the local silo. When set to 0, always selects the silo with lowest utilization. When set to 100, always prefers the local silo (equivalent to PreferLocalPlacement). Recommended: 5-10. Valid range: 0-100.

Note

The weight values don't need to sum to 100. Orleans normalizes them automatically, so they represent relative importance rather than absolute percentages.

Random placement

Orleans randomly selects a server from the compatible servers in the cluster. To configure this placement strategy, add the RandomPlacementAttribute to the grain class.

Local placement

If the local server is compatible, Orleans selects the local server; otherwise, it selects a random server. Configure this placement strategy by adding the PreferLocalPlacementAttribute to the grain class.

Hash-based placement

Orleans hashes the grain ID to a non-negative integer and applies a modulo operation with the number of compatible servers. It then selects the corresponding server from the list of compatible servers ordered by server address. Note that this placement isn't guaranteed to remain stable as cluster membership changes. Specifically, adding, removing, or restarting servers can alter the server selected for a given grain ID. Because grains placed using this strategy register in the grain directory, this change in placement decision as membership changes typically doesn't have a noticeable effect.

Configure this placement strategy by adding the HashBasedPlacementAttribute to the grain class.

Activation-count-based placement

This placement strategy attempts to place new grain activations on the least heavily loaded server based on the number of recently busy grains. It includes a mechanism where all servers periodically publish their total activation count to all other servers. The placement director then selects a server predicted to have the fewest activations by examining the most recently reported activation count and predicting the current count based on recent activations made by the placement director on the current server. The director selects several servers randomly when making this prediction to avoid multiple separate servers overloading the same server. By default, two servers are selected randomly, but this value can be configured via ActivationCountBasedPlacementOptions.

This algorithm is based on the thesis The Power of Two Choices in Randomized Load Balancing by Michael David Mitzenmacher. It's also used in Nginx for distributed load balancing, as described in the article NGINX and the "Power of Two Choices" Load-Balancing Algorithm.

Configure this placement strategy by adding the ActivationCountBasedPlacementAttribute to the grain class.

Stateless worker placement

Stateless worker placement is a special placement strategy used by stateless worker grains. This placement operates almost identically to PreferLocalPlacement, except each server can have multiple activations of the same grain, and the grain isn't registered in the grain directory since there's no need.

Configure this placement strategy by adding the StatelessWorkerAttribute to the grain class.

Silo-role-based placement

This is a deterministic placement strategy placing grains on silos with a specific role. Configure this placement strategy by adding the SiloRoleBasedPlacementAttribute to the grain class.

Choose a placement strategy

Choosing the appropriate grain placement strategy, beyond the defaults Orleans provides, requires monitoring and evaluation. The choice should be based on the app's size and complexity, workload characteristics, and deployment environment.

Random placement relies on the Law of Large Numbers, so it's usually a good default for unpredictable loads spread across many grains (10,000 or more).

Activation-count-based placement also has a random element, relying on the Power of Two Choices principle. This is a commonly used algorithm for distributed load balancing and is used in popular load balancers. Silos frequently publish runtime statistics to other silos in the cluster, including:

Available memory, total physical memory, and memory usage.
CPU usage.
Total activation count and recent active activation count.
- A sliding window of activations active in the last few seconds, sometimes referred to as the activation working set.

From these statistics, only activation counts are currently used to determine the load on a given silo.

Ultimately, experiment with different strategies and monitor performance metrics to determine the best fit. Selecting the right grain placement strategy optimizes the performance, scalability, and cost-effectiveness of Orleans apps.

Configure the default placement strategy

Starting in Orleans 9.2, Orleans uses resource-optimized placement by default. Override the default placement strategy by registering an implementation of PlacementStrategy during configuration.

To revert to the previous random placement behavior:

siloBuilder.Services.AddSingleton<PlacementStrategy, RandomPlacement>();

To use a custom placement strategy:

siloBuilder.Services.AddSingleton<PlacementStrategy, MyPlacementStrategy>();

Orleans uses random placement unless the default is overridden. Override the default placement strategy by registering an implementation of PlacementStrategy during configuration:

siloBuilder.Services.AddSingleton<PlacementStrategy, MyPlacementStrategy>();

Orleans uses random placement unless the default is overridden. Override the default placement strategy by registering an implementation of PlacementStrategy during configuration:

siloBuilder.ConfigureServices(services =>
    services.AddSingleton<PlacementStrategy, MyPlacementStrategy>());

Configure the placement strategy for a grain

Configure the placement strategy for a grain type by adding the appropriate attribute to the grain class. Relevant attributes are specified in the placement strategies sections above.

Sample custom placement strategy

First, define a class implementing the IPlacementDirector interface, requiring a single method. In this example, assume a function GetSiloNumber is defined that returns a silo number given the Guid of the grain about to be created.

public class SamplePlacementStrategyFixedSiloDirector : IPlacementDirector
{
    public Task<SiloAddress> OnAddActivation(
        PlacementStrategy strategy,
        PlacementTarget target,
        IPlacementContext context)
    {
        var silos = context.GetCompatibleSilos(target).OrderBy(s => s).ToArray();
        int silo = GetSiloNumber(target.GrainIdentity.PrimaryKey, silos.Length);

        return Task.FromResult(silos[silo]);
    }
}

Then, define two classes to allow assigning grain classes to the strategy:

[Serializable]
public sealed class SamplePlacementStrategy : PlacementStrategy
{
}

[AttributeUsage(AttributeTargets.Class, AllowMultiple = false)]
public sealed class SamplePlacementStrategyAttribute : PlacementAttribute
{
    public SamplePlacementStrategyAttribute() :
        base(new SamplePlacementStrategy())
    {
    }
}

Then, tag any grain classes intended to use this strategy with the attribute:

[SamplePlacementStrategy]
public class MyGrain : Grain, IMyGrain
{
    // ...
}

Finally, register the strategy when building the ISiloHost:

private static async Task<ISiloHost> StartSilo()
{
    var builder = new HostBuilder(c =>
    {
        // normal configuration methods omitted for brevity
        c.ConfigureServices(ConfigureServices);
    });

    var host = builder.Build();
    await host.StartAsync();

    return host;
}

private static void ConfigureServices(IServiceCollection services)
{
    services.AddPlacementDirector<SamplePlacementStrategy, SamplePlacementStrategyFixedSiloDirector>();
}

For a second simple example showing further use of the placement context, refer to PreferLocalPlacementDirector in the Orleans source repository.

Placement filtering

Placement filtering allows you to filter which silos are eligible for grain placement before the placement strategy selects a target. This works in conjunction with placement strategies and enables scenarios like:

Zone-aware placement: Place grains on silos in the same availability zone
Tiered deployments: Direct certain grains to premium or standard tier silos
Hardware affinity: Place compute-intensive grains on silos with specific capabilities

Orleans provides built-in filters based on silo metadata, including required-match and preferred-match filtering. You can also implement custom placement filters.

For detailed information on configuring silo metadata, using built-in filters, and implementing custom filters, see Grain placement filtering.

Activation repartitioning

Activation repartitioning is a feature that automatically optimizes grain call locality by migrating grain activations to be closer to the grains they communicate with most frequently. This feature can significantly improve performance by reducing network hops for inter-grain communication.

Warning

Activation repartitioning is an experimental feature. Use the #pragma warning disable ORLEANSEXP001 directive or add <NoWarn>ORLEANSEXP001</NoWarn> to your project file to suppress the experimental warning.

How it works

The repartitioner monitors grain-to-grain communication patterns and builds a graph of "edges" representing how frequently grains communicate. Periodically, it analyzes this graph to identify opportunities to improve locality by migrating grains to silos where their communication partners are located, while maintaining an approximately equal distribution of activations across silos.

Key characteristics:

Probabilistic tracking: Uses a probabilistic data structure to track the heaviest communication links while conserving memory
Balanced distribution: Attempts to keep activation counts balanced across silos
Recovery period: After migrating grains, a silo waits before participating in another repartitioning round to allow the system to stabilize
Anchoring filter: Identifies well-partitioned grains (those already close to their communication partners) and avoids migrating them

Enable activation repartitioning

Enable activation repartitioning using the AddActivationRepartitioner extension method:

#pragma warning disable ORLEANSEXP001
siloBuilder.AddActivationRepartitioner();
#pragma warning restore ORLEANSEXP001

Configuration options

Configure the repartitioner using ActivationRepartitionerOptions:

#pragma warning disable ORLEANSEXP001
siloBuilder.AddActivationRepartitioner();
siloBuilder.Configure<ActivationRepartitionerOptions>(options =>
{
    options.MaxEdgeCount = 10_000;
    options.MinRoundPeriod = TimeSpan.FromMinutes(1);
    options.MaxRoundPeriod = TimeSpan.FromMinutes(2);
    options.RecoveryPeriod = TimeSpan.FromMinutes(1);
    options.AnchoringFilterEnabled = true;
});
#pragma warning restore ORLEANSEXP001

Option	Type	Default	Description
`MaxEdgeCount`	`int`	10,000	Maximum number of edges (grain communication links) to track. Higher values improve accuracy but use more memory. Values under 100 are not recommended.
`MaxUnprocessedEdges`	`int`	100,000	Maximum number of unprocessed edges to buffer. Oldest edges are discarded when exceeded.
`MinRoundPeriod`	TimeSpan	1 minute	Minimum time between repartitioning rounds.
`MaxRoundPeriod`	TimeSpan	2 minutes	Maximum time between repartitioning rounds. Actual timing is random between min and max. For optimal results, aim for ~10 seconds multiplied by the maximum silo count.
`RecoveryPeriod`	TimeSpan	1 minute	Time a silo waits after a repartitioning round before participating in another. Must be less than or equal to `MinRoundPeriod`.
`AnchoringFilterEnabled`	`bool`	`true`	Enables tracking of well-partitioned grains to avoid unnecessarily migrating them. Reduces accuracy slightly but improves effectiveness.
`ProbabilisticFilteringMaxAllowedErrorRate`	`double`	0.01	Maximum error rate for the probabilistic filter (0.1% to 1% range). Only applies when `AnchoringFilterEnabled` is `true`.

Performance considerations

Convergence time: The repartitioner gradually improves locality over multiple rounds. If the system isn't converging fast enough, consider increasing MaxEdgeCount.
Memory usage: Higher MaxEdgeCount values consume more memory. The probabilistic data structure helps limit memory usage while maintaining reasonable accuracy.
Cluster stability: Avoid very short RecoveryPeriod values to prevent excessive grain migration.
Workload patterns: Works best with workloads that have stable communication patterns. Highly dynamic workloads may see less benefit.

When to use activation repartitioning

Consider enabling activation repartitioning when:

Your grains have predictable communication patterns (grain A frequently calls grain B)
You have a multi-silo cluster where network latency between silos is significant
Benchmarking shows improved throughput with the feature enabled

Avoid activation repartitioning when:

Grains communicate with many different grains randomly
Your cluster is small (2-3 silos) where migration overhead may outweigh benefits
Your grains frequently deactivate and reactivate, preventing stable patterns from emerging

Activation repartitioning was introduced as an experimental feature in Orleans 8.2. For the latest documentation on this feature, see the Orleans 9.0+ documentation.

Activation repartitioning is available in Orleans 8.2 and later.

Activation rebalancing

Activation rebalancing is a cluster-wide feature that automatically redistributes grain activations across silos to optimize both memory usage and activation count balance. Unlike activation repartitioning (which optimizes for communication locality), the rebalancer focuses on keeping silos evenly loaded in terms of resource consumption.

Warning

Activation rebalancing is an experimental feature. Use the #pragma warning disable ORLEANSEXP002 directive or add <NoWarn>ORLEANSEXP002</NoWarn> to your project file to suppress the experimental warning.

How it works

The activation rebalancer runs as a singleton grain that coordinates rebalancing across the cluster:

Statistics collection: Each silo periodically publishes its memory usage and activation count to the rebalancer.
Entropy calculation: The rebalancer calculates the current "entropy" of the cluster—a measure of how evenly distributed resources are. Maximum entropy means perfect balance.
Migration decisions: When entropy is below the target, the rebalancer identifies "heavy" silos (high memory/activation count) and "light" silos, then instructs heavy silos to migrate activations to light silos.
Session-based execution: Rebalancing runs in sessions with multiple cycles. A session ends when the cluster reaches acceptable balance or when no further improvement is detected.

The algorithm considers both memory consumption and activation counts, using the harmonic mean of memory usage to compute ideal activation distribution. This means silos with less available memory receive fewer activations.

Enable activation rebalancing

Enable activation rebalancing using the AddActivationRebalancer extension method:

#pragma warning disable ORLEANSEXP002
siloBuilder.AddActivationRebalancer();
#pragma warning restore ORLEANSEXP002

Configuration options

Configure the rebalancer using ActivationRebalancerOptions:

#pragma warning disable ORLEANSEXP002
siloBuilder.AddActivationRebalancer();
siloBuilder.Configure<ActivationRebalancerOptions>(options =>
{
    options.RebalancerDueTime = TimeSpan.FromSeconds(60);
    options.SessionCyclePeriod = TimeSpan.FromSeconds(15);
    options.MaxStagnantCycles = 3;
});
#pragma warning restore ORLEANSEXP002

Option	Type	Default	Description
`RebalancerDueTime`	TimeSpan	60 seconds	Initial delay before the first rebalancing session starts after the cluster stabilizes.
`SessionCyclePeriod`	TimeSpan	15 seconds	Time between rebalancing cycles within a session. Must be at least twice the statistics publishing interval.
`MaxStagnantCycles`	`int`	3	Maximum consecutive cycles without improvement before a session ends.
`EntropyQuantum`	`double`	0.0001	Minimum entropy change considered an improvement. Smaller values make the rebalancer more sensitive.
`AllowedEntropyDeviation`	`double`	0.0001	Base threshold for acceptable deviation from maximum entropy. When deviation is below this threshold, the cluster is considered balanced.
`ScaleAllowedEntropyDeviation`	`bool`	`true`	Whether to dynamically scale the allowed deviation based on cluster size and activation count.
`CycleNumberWeight`	`double`	0.1	Weight factor for cycle number in migration rate calculation. Higher values accelerate migration in later cycles.
`SiloNumberWeight`	`double`	0.1	Weight factor for silo count in migration rate calculation. Higher values migrate more activations in larger clusters.
`ActivationMigrationCountLimit`	`int`	int.MaxValue	Maximum number of activations to migrate in a single cycle. Use to throttle migration rate.

Requirements

Minimum cluster size: Requires at least 2 silos to function.
Memory statistics: Each silo must report valid (non-zero) memory usage. If any silo reports zero memory, rebalancing cycles are skipped.
Grain migratability: Grains marked with [Immovable] or [Immovable(ImmovableKind.Rebalancer)] are excluded from rebalancing.

When to use activation rebalancing

Consider enabling activation rebalancing when:

Your cluster has silos with uneven resource utilization (some silos running hot while others are underutilized)
You have long-running grain activations that may accumulate unevenly over time
Your workload patterns cause activation imbalances that placement strategies alone can't address

Avoid activation rebalancing when:

Your cluster is small (2-3 silos) where the overhead may outweigh benefits
Grains have very short lifetimes and deactivate quickly
You're already using activation repartitioning and want to avoid conflicting migrations—though the two features can coexist, as the repartitioner adjusts its behavior based on rebalancer reports

Comparison with activation repartitioning

Feature	Activation Rebalancing	Activation Repartitioning
Optimizes for	Memory and activation count balance	Communication locality
Scope	Cluster-wide coordination	Pairwise silo coordination
Experimental ID	`ORLEANSEXP002`	`ORLEANSEXP001`
Migration trigger	Resource imbalance	Communication patterns

Both features can be enabled simultaneously. The repartitioner automatically adjusts its tolerance based on the cluster imbalance reported by the rebalancer.

Activation rebalancing is available in Orleans 10.0 and later.

Athugasemdir

Var þessi síða gagnleg?

Last updated on 2026-01-23

Deila með

Grain placement

Resource-optimized placement (default)

Resource-optimized placement options

Random placement

Local placement

Hash-based placement

Activation-count-based placement

Stateless worker placement

Silo-role-based placement

Choose a placement strategy

Configure the default placement strategy

Configure the placement strategy for a grain

Sample custom placement strategy

Placement filtering

Activation repartitioning

How it works

Enable activation repartitioning

Configuration options

Performance considerations

When to use activation repartitioning

Activation rebalancing

How it works

Enable activation rebalancing

Configuration options

Requirements

When to use activation rebalancing

Comparison with activation repartitioning

Athugasemdir

Fleiri tilföng