React Compiler: Rust Port Feasibility Research

Executive Summary
Key Data Structures
The Shared Mutable Reference Problem
Environment as Shared Mutable State
Side Maps: Passes Storing HIR References
AliasingEffect: Shared References and Rust Ownership
Recommended Rust Architecture
Input/Output Format
Error Handling
Structural Similarity: TypeScript ↔ Rust Alignment
Pipeline Overview
Pass-by-Pass Analysis

Phase 1: Lowering (AST to HIR)
Phase 2: Normalization
Phase 3: SSA Construction
Phase 4: Optimization (Pre-Inference)
Phase 5: Type and Effect Inference
Phase 6: Mutation/Aliasing Analysis
Phase 7: Optimization (Post-Inference)
Phase 8: Reactivity Inference
Phase 9: Scope Construction
Phase 10: Scope Alignment and Merging
Phase 11: Scope Terminal Construction
Phase 12: Scope Dependency Propagation
Phase 13: Reactive Function Construction
Phase 14: Reactive Function Transforms
Phase 15: Codegen
Validation Passes

External Dependencies
Risk Assessment
Recommended Migration Strategy

Executive Summary

Porting the React Compiler from TypeScript to Rust is feasible and the Rust code can remain structurally very close to the TypeScript. The compiler's algorithms are well-suited to Rust. The TypeScript implementation relies on patterns that conflict with Rust's ownership model, but all have clean, well-understood solutions using arenas and indirect references:

Shared Identifier references: Multiple Place objects reference the same Identifier object. Solution: Arena-allocated identifiers on Environment, referenced by copyable IdentifierId index.
Shared ReactiveScope references: Multiple identifiers share the same ReactiveScope object (including its mutable range). Solution: Arena-allocated scopes on Environment, referenced by ScopeId.
Inner function storage: FunctionExpression/ObjectMethod instructions store inner HIRFunction values inline. Solution: Arena-allocated functions on Environment, referenced by FunctionId.
Type storage: Types stored inline on identifiers. Solution: Arena-allocated types on Environment, referenced by TypeId.
Instructions stored inline in blocks: BasicBlock.instructions stores Instruction objects directly. Solution: Flat instruction table on HIRFunction, referenced by InstructionId. The existing InstructionId (evaluation order counter) is renamed to EvaluationOrder since it applies to both instructions and terminals.
Environment as shared mutable singleton: The Environment object is threaded through the entire compilation via fn.env and mutated by many passes. Solution: Remove HIRFunction.env and pass env: &mut Environment separately. Maintain existing fields (no sub-struct grouping) to allow precise sliced borrows via direct field access.

Key finding on structural similarity: After deep analysis of every pass, the vast majority of compiler passes can be ported to Rust with ~85-95% structural correspondence — meaning you could view the TypeScript and Rust side-by-side and easily trace the logic. The main mechanical differences are:

match instead of switch (exhaustive by default in Rust)
HashMap<IdentifierId, T> instead of Map<Identifier, T> (reference identity → value identity)
Vec::retain() instead of delete-during-Set-iteration
std::mem::replace / std::mem::take for in-place enum variant swaps
Two-phase collect/apply instead of mutate-through-stored-references

Complexity breakdown (revised after deep per-pass analysis):

~25 passes are straightforward to port (simple traversal, local mutation, ID-only side maps)
~13 passes require moderate refactoring (stored references → IDs, iteration order changes)
~4 passes require significant redesign (InferMutationAliasingRanges, BuildHIR, CodegenReactiveFunction, AnalyseFunctions)
Input/output boundaries use JSON AST interchange via serde, with a Rust Babel AST type

Input/output format: Define a Rust representation of the Babel AST format using serde with custom serialization/deserialization (ensuring the "type" field is always produced, even outside of enum positions). Include full information from Babel, including source locations. A Scope type encodes the tree of scope information mapping to Babel's scope tree. The main public API is compile(BabelAst, Scope) -> Option<BabelAst>, returning None if no changes.

Error handling: Two categories — errors that would have thrown in TypeScript (invariants, todo errors, short-circuiting) return Err(CompilerDiagnostic) via Result, while non-throwing accumulated diagnostics are recorded directly on Environment. TypeScript non-null assertions become .unwrap() panics.

Note on InferMutationAliasingEffects: Previously categorized as "significant redesign" due to maps using JS reference identity with InstructionValue keys. An upstream refactor (PR #33650) replaces InstructionValue with interned AliasingEffect as allocation-site keys, eliminating synthetic InstructionValues and the effectInstructionValueCache. Since effects are already interned by content hash, they map directly to a copyable EffectId index in Rust. Additionally, AliasingEffect variants share Place references with InstructionValue fields — in Rust, Places are cloned cheaply (with arena-based IdentifierId). The CreateFunction variant's FunctionExpression reference is replaced with a FunctionId referencing the function arena on Environment. See §AliasingEffect section for the full analysis. This is "moderate refactoring" — no algorithmic redesign needed.

Key Data Structures

HIRFunction

HIRFunction {
  body: HIR {
    entry: BlockId,
    blocks: Map<BlockId, BasicBlock>    // ordered map, reverse postorder
  },
  instructions: Vec<Instruction>,        // flat instruction table, indexed by InstructionId
  params: Array<Place | SpreadPattern>,
  returns: Place,
  context: Array<Place>,                 // captured variables from outer scope
  aliasingEffects: Array<AliasingEffect> | null,
}

Note: env is removed from HIRFunction and passed separately as env: &mut Environment. Inner functions are stored in the function arena on Environment (see §Recommended Rust Architecture).

BasicBlock

BasicBlock {
  id: BlockId,
  kind: 'block' | 'value' | 'loop' | 'sequence' | 'catch',
  instructions: Vec<InstructionId>,      // indices into HIRFunction.instructions
  terminal: Terminal,                    // control flow (goto, if, for, return, etc.)
  preds: Set<BlockId>,
  phis: Set<Phi>,                        // SSA join points
}

Instruction

Instruction {
  order: EvaluationOrder,                // evaluation order (renamed from InstructionId)
  lvalue: Place,                         // destination
  value: InstructionValue,               // discriminated union (~40 variants)
  effects: Array<AliasingEffect> | null, // populated by InferMutationAliasingEffects
  loc: SourceLocation,
}

Note: The previous InstructionId type is renamed to EvaluationOrder because it represents evaluation order and is not instruction-specific (terminals also carry it). A new InstructionId type is introduced as an index into the HIRFunction.instructions table, allowing passes to reference instructions by a single copyable ID rather than (BlockId, usize).

Place (CRITICAL for Rust port)

Place {
  kind: 'Identifier',
  identifier: IdentifierId,  // ← index into Identifier arena on Environment (shared reference in TS)
  effect: Effect,             // Read, Mutate, Capture, Freeze, etc.
  reactive: boolean,          // set by InferReactivePlaces
  loc: SourceLocation,
}

Identifier (CRITICAL for Rust port)

Identifier {
  id: IdentifierId,           // unique after SSA (opaque number)
  declarationId: DeclarationId,
  name: IdentifierName | null, // null for temporaries, mutated by RenameVariables
  mutableRange: MutableRange,  // { start, end } — mutated by InferMutationAliasingRanges
  scope: ScopeId | null,       // index into scope arena — mutated by InferReactiveScopeVariables
  type: TypeId,                // index into type arena — mutated by InferTypes
  loc: SourceLocation,
}

FunctionExpression / ObjectMethod

FunctionExpression {
  loweredFunc: FunctionId,     // index into function arena on Environment
  ...                          // other fields remain inline
}

Note: Inner HIRFunction values are stored in a function arena on Environment, referenced by FunctionId. This replaces inline storage and provides a stable, copyable reference for passes that need to cache or access inner functions.

ReactiveScope

ReactiveScope {
  id: ScopeId,
  range: MutableRange,                              // mutated by alignment passes
  dependencies: Set<ReactiveScopeDependency>,        // populated by PropagateScopeDependencies
  declarations: Map<IdentifierId, ReactiveScopeDeclaration>,
  reassignments: Set<IdentifierId>,
  earlyReturnValue: { value: IdentifierId, loc, label } | null,
  merged: Set<ScopeId>,
}

MutableRange

MutableRange {
  start: EvaluationOrder,  // inclusive (renamed from InstructionId)
  end: EvaluationOrder,    // exclusive
}

The Shared Mutable Reference Problem

This is the central challenge for a Rust port. In TypeScript, the compiler relies on JavaScript's reference semantics in three pervasive patterns:

Pattern 1: Shared Identifier Mutation

// Multiple Place objects share the SAME Identifier object
const place1: Place = { identifier: someIdentifier, ... };
const place2: Place = { identifier: someIdentifier, ... }; // same object!

// A pass mutates the identifier through one place...
place1.identifier.mutableRange.end = 42;

// ...and the change is visible through the other
console.log(place2.identifier.mutableRange.end); // 42

Used by: InferMutationAliasingRanges, InferReactiveScopeVariables, InferTypes, InferReactivePlaces, RenameVariables, PromoteUsedTemporaries, EnterSSA, EliminateRedundantPhi, AnalyseFunctions, and many more.

Pattern 2: Shared ReactiveScope References

// Multiple Identifiers share the same ReactiveScope AND MutableRange
identifier.mutableRange = scope.range;  // line 132 of InferReactiveScopeVariables

// Now identifier.mutableRange IS scope.range (same JS object)
// A pass expands the scope range...
scope.range.end = 100;

// ...visible through the identifier
console.log(identifier.mutableRange.end); // 100

This is explicitly noted in AnalyseFunctions.ts (line 30-34): "NOTE: inferReactiveScopeVariables makes identifiers in the scope point to the same mutableRange instance."

Used by: AlignMethodCallScopes, AlignObjectMethodScopes, AlignReactiveScopesToBlockScopesHIR, MergeOverlappingReactiveScopesHIR, MemoizeFbtAndMacroOperandsInSameScope.

Pattern 3: Iterate-and-Mutate / Side Map References

// Store a reference to an HIR object in a side map
const nodes: Map<Identifier, Node> = new Map();
nodes.set(identifier, { id: identifier, ... });

// Later, mutate the object through the stored reference
node.id.mutableRange.end = 42; // mutates HIR through map reference

Used by: InferMutationAliasingRanges (AliasingState.nodes), EnterSSA (SSABuilder.#states.defs), InferMutationAliasingEffects (Context caches — see note below about upstream simplification), DropManualMemoization (sidemap.manualMemos), InlineIIFEs (functions map), AlignReactiveScopesToBlockScopesHIR (activeScopes), and others.

Environment as Shared Mutable State

Complete Environment Analysis

Environment is created once per top-level function compilation and stored on HIRFunction.env. It is shared via reference across the entire compilation, including nested functions.

Mutable State (mutated by passes)

Field	Mutated by	Pattern
`#nextIdentifer: number`	BuildHIR, EnterSSA, OutlineJSX, InferMutationAliasingEffects (via `createTemporaryPlace`)	Auto-increment counter
`#nextBlock: number`	BuildHIR, InlineIIFEs	Auto-increment counter
`#nextScope: number`	InferReactiveScopeVariables	Auto-increment counter
`#errors: CompilerError`	All validation passes, DropManualMemoization, InferMutationAliasingRanges, CodegenReactiveFunction	Append-only accumulator
`#outlinedFunctions: Array`	OutlineJSX, OutlineFunctions	Append-only list
`#moduleTypes: Map`	`getGlobalDeclaration` (lazy cache fill)	One-time lazy initialization

Read-Only State (accessed but never mutated)

Field	Accessed by
`config: EnvironmentConfig`	Pipeline.ts (feature flags), InferMutationAliasingEffects, DropManualMemoization, MemoizeFbtAndMacroOperandsInSameScope, InferReactiveScopeVariables
`fnType: ReactFunctionType`	Pipeline.ts
`outputMode: CompilerOutputMode`	Pipeline.ts, DeadCodeElimination
`#globals: GlobalRegistry`	InferTypes (via `getGlobalDeclaration`), DropManualMemoization
`#shapes: ShapeRegistry`	InferTypes (via `getPropertyType`, `getFunctionSignature`), InferMutationAliasingEffects, InferReactivePlaces, FlattenScopesWithHooksOrUseHIR, NameAnonymousFunctions
`logger`	Pipeline.ts, AnalyseFunctions
`programContext`	BuildHIR, CodegenReactiveFunction, OutlineJSX

How Environment is Shared with Nested Functions

Parent and nested functions share the exact same Environment instance. When lower() is called for a nested function expression, it receives the same env. This means:

ID counters are globally unique across the entire function tree
Errors from inner function compilation are visible to the parent
Outlined functions from inner compilations accumulate on the shared list
Configuration is shared (same feature flags everywhere)

This sharing is sequential, not concurrent: AnalyseFunctions processes each child function synchronously before returning to the parent.

Recommended Rust Representation

Remove HIRFunction.env and pass env: &mut Environment as a separate parameter to passes. Maintain the existing fields and types of the Environment struct — do not group them into sub-structs. Use direct field access (rather than methods) to allow precise sliced borrows of portions of the environment.

struct Environment {
    // Configuration (read-only after construction)
    config: EnvironmentConfig,
    fn_type: ReactFunctionType,
    output_mode: CompilerOutputMode,

    // Type registries (read-only after lazy init)
    globals: GlobalRegistry,
    shapes: ShapeRegistry,
    module_types: HashMap<String, Option<Global>>,

    // Mutable counters
    next_identifier: IdentifierId,
    next_block: BlockId,
    next_scope: ScopeId,

    // Arenas
    identifiers: Vec<Identifier>,         // indexed by IdentifierId
    scopes: Vec<ReactiveScope>,           // indexed by ScopeId
    functions: Vec<HIRFunction>,          // indexed by FunctionId
    types: Vec<Type>,                     // indexed by TypeId

    // Accumulated state
    errors: Vec<CompilerDiagnostic>,
    outlined_functions: Vec<OutlinedFunction>,

    // Other
    logger: Option<Logger>,
    program_context: ProgramContext,
}

Why no sub-structs: Keeping all fields flat on Environment allows Rust's borrow checker to reason about independent field borrows. For example, a pass can simultaneously borrow env.identifiers and env.config without conflict, because the borrow checker can see they are distinct fields. Grouping fields into sub-structs would require borrowing the entire sub-struct even when only one field is needed.

Pass signatures return Result for errors that would have thrown in TypeScript:

// Most passes: need mutable HIR + mutable environment
fn enter_ssa(func: &mut HIRFunction, env: &mut Environment) -> Result<(), CompilerDiagnostic> { ... }

// Validation passes:
fn validate_hooks_usage(func: &HIRFunction, env: &mut Environment) -> Result<(), CompilerDiagnostic> { ... }

// Passes that don't use env at all (many!):
fn merge_consecutive_blocks(func: &mut HIRFunction) { ... }
fn constant_propagation(func: &mut HIRFunction) { ... }

Key insight from per-pass analysis: The majority of passes (PruneMaybeThrows, MergeConsecutiveBlocks, ConstantPropagation, EliminateRedundantPhi, OptimizePropsMethodCalls, DeadCodeElimination, RewriteInstructionKinds, PruneUnusedLabelsHIR, FlattenReactiveLoopsHIR, and all reactive function transforms) do NOT use Environment at all. Only ~12 passes need env, and most only read config flags or call getHookKind().

For the AnalyseFunctions recursive pattern (where parent and child share the same Environment), &mut Environment works naturally because the recursive call completes before the parent continues — there is only one &mut active at a time.

Side Maps: Passes Storing HIR References

The Core Problem

Many passes store references to HIR values (Places, Identifiers, Instructions, InstructionValues, ReactiveScopes) in "side maps" (HashMaps, Sets, arrays) while simultaneously mutating the HIR. In Rust, this creates borrow conflicts because you cannot hold an immutable reference (in the map) while mutating through a different path.

Classification of Side Map Patterns

After analyzing every pass, side map patterns fall into four categories:

Category 1: ID-Only Maps (No Borrow Issues)

Maps keyed and valued by opaque IDs (IdentifierId, BlockId, ScopeId, InstructionId, DeclarationId). These are Copy types with no aliasing concerns.

Passes: PruneMaybeThrows, MergeConsecutiveBlocks, ConstantPropagation, DeadCodeElimination, RewriteInstructionKinds, InferReactivePlaces (reactive set), PruneUnusedLabelsHIR, FlattenReactiveLoopsHIR, FlattenScopesWithHooksOrUseHIR, StabilizeBlockIds, and most reactive function transforms.

Rust approach: Direct HashMap<IdType, T> / HashSet<IdType>. No changes needed.

Category 2: Reference-Identity Maps (Replace Keys with IDs)

Maps using JavaScript object identity (===) as the key, typically Map<Identifier, T> or Map<BasicBlock, T> or DisjointSet<Identifier> / DisjointSet<ReactiveScope>.

Passes: EnterSSA (Map<BasicBlock, State>, Map<Identifier, Identifier>), EliminateRedundantPhi (Map<Identifier, Identifier>), InferMutationAliasingRanges (Map<Identifier, Node>), InferReactiveScopeVariables (DisjointSet<Identifier>), InferReactivePlaces (DisjointSet<Identifier>), AlignMethodCallScopes (DisjointSet<ReactiveScope>), AlignObjectMethodScopes (Set<Identifier>, DisjointSet<ReactiveScope>), MergeOverlappingReactiveScopes (DisjointSet<ReactiveScope>).

Rust approach: Replace with HashMap<IdentifierId, T>, HashMap<BlockId, T>, DisjointSet<IdentifierId>, DisjointSet<ScopeId>. This is always simpler and more correct than the TypeScript — it eliminates an entire class of bugs where cloned objects silently fail identity checks.

Category 3: Instruction/Value Reference Maps (Store Indices Instead)

Maps that store references to actual Instruction, FunctionExpression, or InstructionValue objects, then later access fields on those objects or mutate them.

Passes: InferMutationAliasingEffects (Map<Instruction, InstructionSignature>, Map<FunctionExpression, AliasingSignature>), DropManualMemoization (Map<IdentifierId, TInstruction<FunctionExpression>>, ManualMemoCallee.loadInstr), InlineIIFEs (Map<IdentifierId, FunctionExpression>), NameAnonymousFunctions (Node.fn: FunctionExpression).

Note: InferMutationAliasingEffects currently uses Map<InstructionValue, AbstractValue> and Map<IdentifierId, Set<InstructionValue>> with InstructionValue objects as allocation-site identity tokens (JS reference identity), including both real InstructionValues from the HIR (for CreateFunction) and synthetic objects fabricated as allocation-site markers. An upstream refactor (PR #33650) replaces all InstructionValue keys with interned AliasingEffect objects, eliminating the synthetic InstructionValues and effectInstructionValueCache entirely. Since effects are already interned by content hash, reference identity equals content identity — exactly what's needed for Rust. In Rust, the EffectId (index into the interning table) serves as the allocation-site key directly. See §AliasingEffect section for the full analysis.

Rust approach: Store only what is actually needed:

If the map is for existence checking: use HashSet<IdentifierId>
If specific fields are needed later: extract and store those fields (e.g., store InstructionId to reference the instruction table)
Instructions are stored in a flat table on HIRFunction, referenced by InstructionId — passes can reference any instruction by a single copyable ID
FunctionExpression/ObjectMethod inner functions are accessed via FunctionId referencing the function arena on Environment
For InferMutationAliasingEffects: use InstructionId for instruction signature cache, EffectId (interning table index) for value-identity maps, FunctionId for function signature caches

Category 4: Scope Reference Sets with In-Place Mutation (Arena Access)

Sets or maps of ReactiveScope references where the scope's range fields are mutated while the scope is in the collection.

Passes: AlignReactiveScopesToBlockScopesHIR (Set<ReactiveScope> iterated while mutating scope.range), AlignMethodCallScopes (DisjointSet forEach with range mutation), AlignObjectMethodScopes (same pattern), MergeOverlappingReactiveScopesHIR (DisjointSet with range mutation), MemoizeFbtAndMacroOperandsInSameScope (scope range mutation).

Rust approach: Store ScopeId in sets/DisjointSets. Mutate through arena: env.scopes[scope_id].range.start = .... The set holds copyable IDs, and the mutation goes through the arena — completely disjoint borrows.

Critical Insight: The Shared MutableRange Aliasing

The most architecturally significant side map pattern is in InferReactiveScopeVariables (line 132):

identifier.mutableRange = scope.range;

This makes ALL identifiers in a scope share the SAME MutableRange object as the scope. Every subsequent scope-alignment pass relies on this: mutating scope.range.start automatically updates all identifiers' mutableRange.

Recommended Rust approach: Identifiers store scope: Option<ScopeId>. The "effective mutable range" is always accessed through the scope arena:

fn effective_mutable_range(id: &Identifier, scopes: &[ReactiveScope]) -> MutableRange {
    match id.scope {
        Some(scope_id) => scopes[scope_id.index()].range,
        None => id.mutable_range, // pre-scope original range
    }
}

All downstream passes that read identifier.mutableRange (like isMutable(), inRange()) would need access to env.scopes. This is a mechanical refactor — every call site accesses the scope arena via Environment.

AliasingEffect: Shared References and Rust Ownership

Overview

AliasingEffect is a discriminated union (17 variants) that describes data flow, mutation, and other side effects of instructions and terminals. Effects are created by InferMutationAliasingEffects, stored on Instruction.effects and Terminal.effects, and consumed by InferMutationAliasingRanges, AnalyseFunctions, validation passes, and PrintHIR. This section analyzes the shared references between AliasingEffect variants, Instruction, and InstructionValue, and how they map to Rust ownership.

Shared Reference Inventory

Every AliasingEffect variant contains Place objects. In the TypeScript implementation, these are the same JS object references as the Places in the InstructionValue and Instruction.lvalue — not copies. This creates a web of shared references:

Category A: Place Sharing (Instruction/InstructionValue → Effect)

Nearly every instruction kind in computeSignatureForInstruction creates effects that directly reference Places from the instruction:

InstructionValue Kind	Effect Created	Shared Place Fields
`ArrayExpression`	`Create into:lvalue`, `Capture from:element into:lvalue`	`lvalue`, each `element` from `value.elements`
`ObjectExpression`	`Create into:lvalue`, `Capture from:property.place into:lvalue`	`lvalue`, each `property.place` from `value.properties`
`PropertyStore/ComputedStore`	`Mutate value:object`, `Capture from:value into:object`	`value.object`, `value.value`, `lvalue`
`PropertyLoad/ComputedLoad`	`CreateFrom from:object into:lvalue`	`value.object`, `lvalue`
`PropertyDelete/ComputedDelete`	`Mutate value:object`	`value.object`, `lvalue`
`Destructure`	`CreateFrom from:value.value into:place` per pattern item	`value.value`, each pattern item place
`JsxExpression`	`Freeze value:operand`, `Capture`, `Render place:tag/child`	`lvalue`, `value.tag`, each child, each prop place
`GetIterator`	`Alias/Capture from:collection into:lvalue`	`value.collection`, `lvalue`
`IteratorNext`	`MutateConditionally value:iterator`, `CreateFrom from:collection`	`value.iterator`, `value.collection`, `lvalue`
`StoreLocal`	`Assign from:value.value into:value.lvalue.place`	`value.value`, `value.lvalue.place`, `lvalue`
`LoadLocal`	`Assign from:value.place into:lvalue`	`value.place`, `lvalue`
`Await`	`MutateTransitiveConditionally value:value.value`, `Capture`	`value.value`, `lvalue`

Category B: Call Instructions — Deep Sharing via Apply

For CallExpression, MethodCall, and NewExpression, a single Apply effect is created that shares multiple fields including the args array itself:

// From computeSignatureForInstruction (line 1832-1841)
effects.push({
  kind: 'Apply',
  receiver,              // same Place as value.receiver or value.callee
  function: callee,      // same Place as value.callee or value.property
  mutatesFunction: ...,
  args: value.args,      // THE SAME ARRAY REFERENCE from InstructionValue
  into: lvalue,          // same Place as instruction.lvalue
  signature,             // shared FunctionSignature from type registry
  loc: value.loc,
});

The args field is the exact same array object as the InstructionValue's args. In Rust, this must be either cloned or accessed via the instruction.

Category C: FunctionExpression — The Deepest Sharing

The CreateFunction variant holds a direct reference to the FunctionExpression or ObjectMethod InstructionValue:

// From computeSignatureForInstruction (line 1946-1953)
effects.push({
  kind: 'CreateFunction',
  into: lvalue,
  function: value,  // THE SAME FunctionExpression/ObjectMethod InstructionValue
  captures: value.loweredFunc.func.context.filter(
    operand => operand.effect === Effect.Capture,
  ),
});

This is the most architecturally significant sharing because effect.function is used in three distinct ways:

As an allocation-site token in abstract interpretation (reference identity):
- state.initialize(effect.function, {...}) → #values.set(value, kind) — FunctionExpression as map key
- state.define(effect.into, effect.function) → #variables.set(id, new Set([value])) — FunctionExpression as set value
For deep structural access:
- effect.function.loweredFunc.func.aliasingEffects — reads the nested function's inferred effects
- effect.function.loweredFunc.func.context — iterates captured variables
For mutation of the nested function's context:
- operand.effect = Effect.Read (line 838) — mutates Place.effect on the nested function's context variables

Rust approach: CreateFunction stores a FunctionId referencing the function arena on Environment. Allocation-site identity uses EffectId (from effect interning), deep structural access uses env.functions[function_id], and context mutation uses &mut env.functions[function_id].context.

Allocation-Site Identity: InstructionValue → AliasingEffect (PR #33650)

The abstract interpretation in InferenceState tracks the abstract kind (Mutable, Frozen, Primitive, etc.) of each "allocation site" and which allocation sites each identifier points to. Currently this uses InstructionValue objects as allocation-site identity tokens via JS reference identity:

#values: Map<InstructionValue, AbstractValue>   // InstructionValue as KEY (reference identity)
#variables: Map<IdentifierId, Set<InstructionValue>>  // InstructionValue as SET VALUE

Allocation sites are created from:

Params/context variables: Synthetic {kind: 'Primitive'} or {kind: 'ObjectExpression'} objects
Create/CreateFrom effects: Synthetic InstructionValues via effectInstructionValueCache (maps interned effect → synthetic InstructionValue)
CreateFunction effects: The actual FunctionExpression InstructionValue from the HIR

Upstream simplification (facebook/react#33650): This PR replaces InstructionValue with the interned AliasingEffect itself as the allocation-site key:

#values: Map<AliasingEffect, AbstractValue>     // interned AliasingEffect as KEY
#variables: Map<IdentifierId, Set<AliasingEffect>>

The changes:

Params/context: Synthetic InstructionValue objects are replaced with AliasingEffect objects (e.g., {kind: 'Create', into: place, value: ValueKind.Context, reason: ValueReason.Other})
Create/CreateFrom effects: effectInstructionValueCache is eliminated entirely. state.initialize(effect, ...) and state.define(place, effect) use the interned effect directly as the key/value
CreateFunction effects: state.initialize(effect.function, ...) → state.initialize(effect, ...) — the CreateFunction effect itself is the key, not the FunctionExpression
state.values() return type: Changes from Array<InstructionValue> to Array<AliasingEffect>. Code that checks function values now uses values[0].kind === 'CreateFunction' and accesses values[0].function for the FunctionExpression
freezeValue method: Checks value.kind === 'CreateFunction' and accesses value.function.loweredFunc.func.context instead of value.kind === 'FunctionExpression'

Since effects are already interned by content hash (via context.internEffect()), reference identity equals content identity. This means the interned AliasingEffect maps directly to a copyable EffectId index in Rust — no separate AllocationSiteId type is needed.

Key insight for CreateFunction: After PR #33650, the CreateFunction effect's function field (the FunctionExpression/ObjectMethod reference) is no longer used as a map key for allocation-site tracking. It is only used for:

Deep structural access: effect.function.loweredFunc.func.context and .aliasingEffects
As a key in functionSignatureCache: Map<FunctionExpression, AliasingSignature> (the one remaining reference-identity map using FunctionExpression)
Mutation: operand.effect = Effect.Read on context variables

In Rust, CreateFunction stores a FunctionId referencing the function arena on Environment. The function's context and aliasing effects are accessed via env.functions[function_id]. The allocation-site identity is the EffectId of the interned CreateFunction effect. The functionSignatureCache keys by FunctionId instead of FunctionExpression reference.

Effect Interning

Effects are interned by content hash in Context.internEffect():

internEffect(effect: AliasingEffect): AliasingEffect {
  const hash = hashEffect(effect);           // hash based on identifier IDs, not Place references
  let interned = this.internedEffects.get(hash);
  if (interned == null) {
    this.internedEffects.set(hash, effect);
    interned = effect;
  }
  return interned;
}

The hash uses place.identifier.id (a number) rather than Place reference identity. The interned effect retains the Place references from whichever instruction first created that hash. In the fixpoint loop, re-processing an instruction may produce an effect with the same hash but different Place objects; interning returns the original effect with its original Place references. This is safe in TypeScript (both Places point to the same shared Identifier), but in Rust it means the interned effect's Places may not be the "current" instruction's Places — they are equivalent by ID but different allocations.

With PR #33650, the interned effect is also the allocation-site key. Since interning guarantees that the same EffectId is returned for structurally identical effects, the fixpoint loop correctly converges — the same allocation site is used across iterations.

Consumers: How Effects Are Read

InferMutationAliasingRanges (primary consumer)

Iterates instr.effects for every instruction and reads Place fields:

effect.into.identifier → used as key in AliasingState.nodes and to call state.create()
effect.from.identifier → used in state.assign(), state.capture(), state.maybeAlias()
effect.value.identifier → stored in mutations array, passed to state.mutate()
effect.function.loweredFunc.func → used in state.create() for Function nodes
effect.place.identifier → stored in renders array for Render effects
effect.error → for MutateFrozen/MutateGlobal/Impure, recorded on Environment

Also reads terminal effects: block.terminal.effects for Alias and Freeze effects on maybe-throw/return terminals.

Also reads effects a second time (Part 2, lines 359-421) to compute legacy per-operand Effect enum values. This pass accesses effect.*.identifier.id and effect.*.identifier.mutableRange.end through effect Places.

Key observation: InferMutationAliasingRanges reads identifier.id, identifier (for the reference-identity map key), and identifier.mutableRange from effect Places. It never mutates them through the effect's Places (mutations go through the graph nodes). With arena-based identifiers, place.identifier is an IdentifierId (Copy), and mutableRange is accessed via the identifier arena. No Place reference comparison is done — all passes access identifiers through their IDs, never by comparing Place object references.

AnalyseFunctions

Reads fn.aliasingEffects (the function-level effects from InferMutationAliasingRanges) to populate context variable effect annotations:

effect.from.identifier.id — for Assign/Alias/Capture/CreateFrom/MaybeAlias variants
effect.value.identifier.id — for Mutate/MutateConditionally/MutateTransitive/MutateTransitiveConditionally

Only reads identifier IDs. Does not access Places beyond .identifier.id.

ValidateNoFreezingKnownMutableFunctions

Reads fn.aliasingEffects on nested FunctionExpression values:

Stores Mutate/MutateTransitive effects in Map<IdentifierId, AliasingEffect>
Reads effect.value.identifier.id, effect.value.identifier.name, effect.value.loc

Accesses Identifier fields (name, loc) beyond just the ID, but these are read-only.

Other Passes (do NOT read AliasingEffects)

ValidateLocalsNotReassignedAfterRender, ValidateNoImpureFunctionsInRender, and PruneNonEscapingScopes import from AliasingEffects.ts or InferMutationAliasingEffects.ts but only use getFunctionCallSignature or the legacy Effect enum on Places — they do not read instr.effects or fn.aliasingEffects.

PrintHIR

Reads all effect fields for debug output. Read-only.

Recommended Rust Representation

AliasingEffect Enum

With arena-based identifiers, Place becomes a small Copy/Clone struct. Effects can own cloned Places:

#[derive(Clone)]
enum AliasingEffect {
    Freeze { value: Place, reason: ValueReason },
    Mutate { value: Place, reason: Option<MutationReason> },
    MutateConditionally { value: Place },
    MutateTransitive { value: Place },
    MutateTransitiveConditionally { value: Place },
    Capture { from: Place, into: Place },
    Alias { from: Place, into: Place },
    MaybeAlias { from: Place, into: Place },
    Assign { from: Place, into: Place },
    Create { into: Place, value: ValueKind, reason: ValueReason },
    CreateFrom { from: Place, into: Place },
    ImmutableCapture { from: Place, into: Place },
    Render { place: Place },

    Apply {
        receiver: Place,
        function: Place,
        mutates_function: bool,
        args: Vec<PlaceOrSpreadOrHole>,    // cloned from InstructionValue
        into: Place,
        signature: Option<FunctionSignature>,
        loc: SourceLocation,
    },
    CreateFunction {
        into: Place,
        /// Index into function arena on Environment.
        /// Used to access context variables, aliasing effects, etc.
        function: FunctionId,
        captures: Vec<Place>,              // cloned from context, filtered
    },

    MutateFrozen { place: Place, error: CompilerDiagnostic },
    MutateGlobal { place: Place, error: CompilerDiagnostic },
    Impure { place: Place, error: CompilerDiagnostic },
}

Key design decisions:

Place is cloned, not shared: Since Place stores IdentifierId (a Copy type) + Effect + bool + SourceLocation, it is small enough to clone cheaply. No shared references needed.
CreateFunction.function stores a FunctionId referencing the function arena on Environment. Code that needs func.context or func.aliasingEffects accesses env.functions[function_id] directly (see Accessing Functions from CreateFunction below).
Apply.args is a cloned Vec, not a shared reference to the InstructionValue's args. This is a shallow clone of Place/SpreadPattern/Hole values (all small, copyable types with arena IDs).

EffectId as Allocation-Site Identity

With PR #33650, the interned AliasingEffect replaces InstructionValue as the allocation-site key. In Rust, the EffectId (index into the interning table) serves directly as the allocation-site identity — no separate AllocationSiteId is needed:

struct InferenceState {
    /// The kind of each value, keyed by the EffectId of its creation effect
    values: HashMap<EffectId, AbstractValue>,
    /// The set of allocation sites pointed to by each identifier
    variables: HashMap<IdentifierId, SmallVec<[EffectId; 2]>>,
}

impl InferenceState {
    /// Initialize a value at the given allocation site
    fn initialize(&mut self, effect_id: EffectId, kind: AbstractValue) {
        self.values.insert(effect_id, kind);
    }

    /// Define a variable to point at an allocation site
    fn define(&mut self, place: &Place, effect_id: EffectId) {
        self.variables.insert(place.identifier, smallvec![effect_id]);
    }

    /// Look up which allocation sites a place points to
    fn values(&self, place: &Place) -> &[EffectId] {
        self.variables.get(&place.identifier).expect("uninitialized").as_slice()
    }
}

Each call to state.initialize(effect, kind) / state.define(place, effect) in TypeScript becomes state.initialize(effect_id, kind) / state.define(place, effect_id) in Rust, where effect_id is the EffectId returned by the effect interner. This applies uniformly to all creation effects:

Create/CreateFrom: The interned effect's EffectId is both the interning key and the allocation-site key
CreateFunction: Same — the interned CreateFunction effect's EffectId is the allocation-site key (the FunctionExpression reference is no longer used as a key)
Params/context: Synthetic AliasingEffect::Create values are interned and their EffectId serves as the allocation site

The effectInstructionValueCache is eliminated entirely (PR #33650 removes it). The functionSignatureCache: Map<FunctionExpression, AliasingSignature> becomes HashMap<FunctionId, AliasingSignature> — keyed by the FunctionId rather than the FunctionExpression reference.

Effect Interning

struct EffectInterner {
    effects: Vec<AliasingEffect>,        // indexed by EffectId
    by_hash: HashMap<String, EffectId>,  // dedup by content hash
}

#[derive(Copy, Clone, Hash, Eq, PartialEq)]
struct EffectId(u32);

impl EffectInterner {
    fn intern(&mut self, effect: AliasingEffect) -> EffectId {
        let hash = hash_effect(&effect);
        *self.by_hash.entry(hash).or_insert_with(|| {
            let id = EffectId(self.effects.len() as u32);
            self.effects.push(effect);
            id
        })
    }
}

Since the interned effect IS the allocation-site key, there is no additional cache or mapping needed. The EffectId serves as interning dedup key, allocation-site identity, and cache key for applySignatureCache. The functionSignatureCache is keyed by FunctionId.

Accessing Functions from CreateFunction

In Rust, CreateFunction stores function: FunctionId, so the inner function is accessed directly from the function arena on Environment:

// Read access:
let inner_func = &env.functions[effect.function];

// Mutable access:
let inner_func = &mut env.functions[effect.function];

No instruction lookup or index is needed — the FunctionId provides direct O(1) access to the inner function's context variables, aliasing effects, and other data.

Context Variable Mutation

The mutation operand.effect = Effect.Read (in applyEffect for CreateFunction) modifies Places on the nested function's context. In Rust:

// During CreateFunction processing, after determining abstract kinds:
let inner_func = &mut env.functions[effect.function];
for operand in &mut inner_func.context {
    if operand.effect == Effect::Capture {
        let kind = state.kind(operand).kind;
        if matches!(kind, ValueKind::Primitive | ValueKind::Frozen | ValueKind::Global) {
            operand.effect = Effect::Read;
        }
    }
}

Since inner functions live in the function arena on Environment (not inline in the instruction), the borrow to env.functions[function_id] is completely disjoint from the outer HIRFunction being processed. No collect-then-apply workaround is needed.

Summary of Rust Approach for AliasingEffect

TypeScript Pattern	Rust Equivalent	Complexity
Effect Places share InstructionValue Places	Clone Places (cheap with `IdentifierId`)	Trivial
`Apply.args` shares InstructionValue's args array	Clone the `Vec<PlaceOrSpreadOrHole>`	Trivial
`CreateFunction.function` = the FunctionExpression	Store `FunctionId`, direct arena access	Trivial
`InstructionValue` as allocation-site key (→ `AliasingEffect` after #33650)	`EffectId` from interning table	Trivial
`effectInstructionValueCache` (eliminated by #33650)	Not needed — `EffectId` is the allocation site directly	N/A
`functionSignatureCache` (FunctionExpr → Signature)	`HashMap<FunctionId, AliasingSignature>`	Trivial
Effect interning by content hash	`EffectInterner` with `Vec` + `HashMap`	Low
`operand.effect = Effect.Read` mutation	`&mut env.functions[function_id].context` — disjoint borrow	Trivial
`applySignatureCache` (Signature × Apply → Effects)	`HashMap<(EffectId, EffectId), Vec<AliasingEffect>>`	Low
`state.values(place)` returning `AliasingEffect[]`	Returns `&[EffectId]`	Trivial

Overall assessment: AliasingEffect translates cleanly to Rust. With PR #33650, the interned EffectId serves as both the dedup key and allocation-site identity, eliminating the need for a separate AllocationSiteId. Place sharing is resolved by cloning (cheap with arena-based identifiers), and inner function access uses FunctionId into the function arena on Environment. No fundamental algorithmic redesign is needed. The fixpoint loop, effect interning, and abstract interpretation structure remain structurally identical.

Recommended Rust Architecture

Arena-Based Identifier Storage

Stored as identifiers: Vec<Identifier> directly on Environment.

#[derive(Copy, Clone, Hash, Eq, PartialEq)]
struct IdentifierId(u32);

#[derive(Clone)]
struct Place {
    identifier: IdentifierId,  // index into Environment.identifiers
    effect: Effect,
    reactive: bool,
    loc: SourceLocation,
}

struct Identifier {
    id: IdentifierId,
    declaration_id: DeclarationId,
    name: Option<IdentifierName>,
    mutable_range: MutableRange,
    scope: Option<ScopeId>,
    ty: TypeId,                 // index into Environment.types
    loc: SourceLocation,
}

Arena-Based Scope Storage

Stored as scopes: Vec<ReactiveScope> directly on Environment.

#[derive(Copy, Clone, Hash, Eq, PartialEq)]
struct ScopeId(u32);

Arena-Based Function Storage

Stored as functions: Vec<HIRFunction> directly on Environment. FunctionExpression and ObjectMethod instruction values store a FunctionId instead of inline function data.

#[derive(Copy, Clone, Hash, Eq, PartialEq)]
struct FunctionId(u32);

Arena-Based Type Storage

Stored as types: Vec<Type> directly on Environment. Identifier.ty stores a TypeId instead of an inline Type value.

#[derive(Copy, Clone, Hash, Eq, PartialEq)]
struct TypeId(u32);

Instructions Table

Instructions are stored in a flat table on HIRFunction (instructions: Vec<Instruction>), indexed by InstructionId. BasicBlock.instructions becomes Vec<InstructionId>, referencing into this table. The existing InstructionId type is renamed to EvaluationOrder since it represents evaluation order and is present on both instructions and terminals.

#[derive(Copy, Clone, Hash, Eq, PartialEq)]
struct InstructionId(u32);

#[derive(Copy, Clone, Hash, Eq, PartialEq, Ord, PartialOrd)]
struct EvaluationOrder(u32);

This allows passes to cache or reference an instruction's location via a single copyable ID, avoiding (BlockId, usize) tuples.

CFG Representation

/// Use IndexMap for insertion-order iteration (matching JS Map semantics)
struct HIR {
    entry: BlockId,
    blocks: IndexMap<BlockId, BasicBlock>,
}

Pass Signature Patterns

Passes return Result for errors that would have thrown in TypeScript.

/// Most passes: mutable HIR + mutable environment
fn enter_ssa(func: &mut HIRFunction, env: &mut Environment) -> Result<(), CompilerDiagnostic> { ... }

/// Validation passes
fn validate_hooks_usage(func: &HIRFunction, env: &mut Environment) -> Result<(), CompilerDiagnostic> { ... }

/// Passes that don't need env at all (many!)
fn merge_consecutive_blocks(func: &mut HIRFunction) { ... }
fn constant_propagation(func: &mut HIRFunction) { ... }

Key Rust Patterns for Common TypeScript Idioms

Pattern A: InstructionValue Variant Swap (`std::mem::replace`)

// TypeScript: instr.value = { kind: 'CallExpression', callee: instr.value.property, ... }
// Rust: take ownership, destructure, construct new variant
let old = std::mem::replace(&mut instr.value, InstructionValue::Tombstone);
if let InstructionValue::MethodCall { property, args, loc, .. } = old {
    instr.value = InstructionValue::CallExpression { callee: property, args, loc };
} else {
    instr.value = old;
}

Pattern B: Place Cloning via Spread (`{...place}`)

// TypeScript: const newPlace = { ...place, effect: Effect.Read }
// Rust: Place is Clone (or Copy if small enough)
let new_place = Place { effect: Effect::Read, ..place.clone() };

Pattern C: Delete-During-Set-Iteration (`retain`)

// TypeScript: for (const phi of block.phis) { if (dead) block.phis.delete(phi); }
// Rust: retain is the idiomatic equivalent
block.phis.retain(|phi| !is_dead(phi));

Pattern D: Map Iteration with Block Deletion

// TypeScript: for (const [, block] of fn.body.blocks) { fn.body.blocks.delete(id); }
// Rust: collect keys first, then remove + get_mut
let block_ids: Vec<BlockId> = blocks.keys().copied().collect();
for block_id in block_ids {
    if should_merge(block_id) {
        let removed = blocks.remove(&block_id).unwrap();
        let pred = blocks.get_mut(&pred_id).unwrap();
        pred.instructions.extend(removed.instructions);
    }
}

Pattern E: Closure Variables Set Inside Builder Callbacks

// TypeScript: let callee = null; builder.enter(() => { callee = ...; return terminal; });
// Rust: closure returns the value, or use Option<T> initialized before
let (block_id, callee) = builder.enter(|b| {
    let callee = /* compute */;
    let terminal = /* build */;
    (terminal, callee)  // return both
});

Input/Output Format

Define a Rust representation of the Babel AST format using serde with custom serialization/deserialization in order to ensure that the "type" field is always produced, even outside of enum positions. Include full information from Babel, including source locations. Define a Scope type that encodes the tree of scope information, mapping to the information that Babel represents in its own scope tree.

The main public API is roughly:

/// Returns None if the function doesn't need changes, Some with the compiled output otherwise.
fn compile(ast: BabelAst, scope: Scope) -> Option<BabelAst>

This replaces the current Babel-plugin integration pattern where the compiler receives NodePath objects. The JSON AST interchange decouples the Rust compiler from any specific JS parser or AST format at the implementation level while maintaining Babel compatibility at the serialization boundary.

Error Handling

In general there are two categories of errors:

Anything that would have thrown, or would have short-circuited, should return an Err(...) with the single diagnostic
Otherwise, accumulate errors directly onto the environment
Error handling must preserve the full details of the errors: reason, description, location, details, suggestions, category, etc

Specific Error Patterns and Approaches

TypeScript Pattern	Example	Rust Approach
Non-null assertions (`!`)	`value!.field`	Panic via `.unwrap()` or similar
Throwing expressions	`throw ...`, `CompilerError.invariant()`, `CompilerError.throwTodo()`, `CompilerError.throw*()`	Make the function return `Result<_, CompilerDiagnostic>`, return `Err(...)`
Non-throwing (invariant)	Local `error` + `error.pushDiagnostic()` where the error IS an invariant	Make the function return `Result<_, CompilerDiagnostic>`, change `pushDiagnostic()` to `return Err(...)`
Non-throwing (non-invariant)	Local `error` + `error.pushDiagnostic()`, `env.recordError()`	Keep as-is — accumulate on environment

Pass and Pipeline Structure

// pipeline.rs
fn compile(
    ast: Ast,
    scope: Scope,
    env: &mut Environment,
) -> Result<CompileResult, CompilerDiagnostic> {
    // "?" to handle cases that would have thrown or produced an invariant
    let mut hir = lower(ast, scope, env)?;
    some_compiler_pass(&mut hir, env)?;
    // ...
    let ast = codegen(...)?;

    if env.has_errors() {
        Ok(CompileResult::Failure(env.take_errors()))
    } else {
        Ok(CompileResult::Success(ast))
    }
}

// <compiler_pass>.rs
fn pass_name(
    func: &mut HirFunction,
    env: &mut Environment,
) -> Result<(), CompilerDiagnostic>;

Structural Similarity: TypeScript ↔ Rust Alignment

Design Goal

The Rust code should be visually and structurally aligned with the original TypeScript. A developer should be able to have the TypeScript on the left side of the screen and the Rust on the right, scroll them together, and easily see how the logic corresponds.

What Looks Nearly Identical (~95% match)

Most passes consist of these patterns that translate almost line-for-line:

TypeScript Pattern	Rust Equivalent
`switch (value.kind) { case 'X': ... }`	`match &value { InstructionValue::X { .. } => ... }`
`for (const [, block] of fn.body.blocks)`	`for block in func.body.blocks.values()`
`for (const instr of block.instructions)`	`for instr in &block.instructions`
`const map = new Map<K, V>()`	`let mut map: HashMap<K, V> = HashMap::new()`
`map.get(key) ?? defaultValue`	`map.get(&key).copied().unwrap_or(default)`
`if (x === null) { ... }`	`if x.is_none() { ... }` or `let Some(x) = x else { ... }`
`CompilerError.invariant(cond, ...)`	`assert!(cond, "...")` or `panic!("...")`
`do { ... } while (changed)`	`loop { ... if !changed { break; } }`
`array.push(item)`	`vec.push(item)`
`set.has(item)`	`set.contains(&item)`

What Looks Slightly Different (~80% match)

TypeScript Pattern	Rust Equivalent	Reason
`Map<Identifier, T>` (reference keys)	`HashMap<IdentifierId, T>`	Reference identity → value identity
`DisjointSet<ReactiveScope>`	`DisjointSet<ScopeId>`	Same reason
`place.identifier.mutableRange.end = x`	`env.identifiers[place.identifier].mutable_range.end = x`	Arena indirection
`identifier.scope = sharedScope`	`identifier.scope = Some(scope_id)`	Reference → ID
`for...of` with `Set.delete()`	`set.retain(	x
`instr.value = { kind: 'X', ... }`	`instr.value = InstructionValue::X { ... }` (with `mem::replace`)	Ownership swap

What Looks Substantially Different (~60% match)

TypeScript Pattern	Rust Equivalent	Reason
Storing `&Instruction` in side map	Store `InstructionId`, access via instruction table	Cannot hold references during mutation
Builder closures capturing outer `&mut`	Return values from closures, or split borrows	Borrow checker
`node.id.mutableRange.end = x` (graph node → HIR mutation)	Collect updates, apply to `env.identifiers` after traversal	Cannot mutate HIR through graph references
`identifier.mutableRange = scope.range` (shared object aliasing)	`identifier.scope = Some(scope_id)` + lookup via arena	Fundamental ownership model difference

Passes Ranked by Structural Similarity to Rust

Nearly identical (95%+): PruneMaybeThrows, OptimizePropsMethodCalls, FlattenReactiveLoopsHIR, FlattenScopesWithHooksOrUseHIR, MergeConsecutiveBlocks, DeadCodeElimination, PruneUnusedLabelsHIR, RewriteInstructionKindsBasedOnReassignment, EliminateRedundantPhi, all validation passes, PruneUnusedLabels, PruneUnusedScopes, PruneNonReactiveDependencies, PruneAlwaysInvalidatingScopes, StabilizeBlockIds, PruneHoistedContexts

Very similar (85-95%): ConstantPropagation, EnterSSA, InferTypes, InferReactivePlaces, DropManualMemoization, InlineIIFEs, MemoizeFbtAndMacroOperandsInSameScope, AlignMethodCallScopes, AlignObjectMethodScopes, OutlineFunctions, NameAnonymousFunctions, BuildReactiveScopeTerminalsHIR, PropagateScopeDependenciesHIR, PropagateEarlyReturns, MergeReactiveScopesThatInvalidateTogether, PromoteUsedTemporaries, RenameVariables, ExtractScopeDeclarationsFromDestructuring

Moderately similar (70-85%): AnalyseFunctions, InferReactiveScopeVariables, AlignReactiveScopesToBlockScopesHIR, MergeOverlappingReactiveScopesHIR, OutlineJSX, BuildReactiveFunction, PruneNonEscapingScopes, OptimizeForSSR, PruneUnusedLValues

Moderately similar (70-85%) (additional): InferMutationAliasingEffects (after PR #33650: allocation-site keys → EffectId via interning, Place sharing → Clone, CreateFunction → FunctionId arena access — see §AliasingEffect section)

Requires redesign (50-70%): InferMutationAliasingRanges (graph-through-HIR mutation), BuildHIR (Babel AST coupling), CodegenReactiveFunction (Babel AST output)

Pipeline Overview

Babel AST
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 1: Lowering                           │
│   BuildHIR (lower)                          │
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 2-3: Normalization + SSA              │
│   PruneMaybeThrows                          │
│   DropManualMemoization                     │
│   InlineIIFEs                               │
│   MergeConsecutiveBlocks                    │
│   EnterSSA                                  │
│   EliminateRedundantPhi                     │
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 4-5: Optimization + Type Inference    │
│   ConstantPropagation                       │
│   InferTypes                                │
│   OptimizePropsMethodCalls                  │
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 6: Mutation/Aliasing Analysis         │
│   AnalyseFunctions                          │
│   InferMutationAliasingEffects              │
│   DeadCodeElimination                       │
│   InferMutationAliasingRanges               │
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 7-8: Post-Inference + Reactivity      │
│   InferReactivePlaces                       │
│   RewriteInstructionKindsBasedOnReassignment│
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 9-12: Scope Construction + Alignment  │
│   InferReactiveScopeVariables               │
│   MemoizeFbtAndMacroOperandsInSameScope     │
│   OutlineJSX / OutlineFunctions             │
│   AlignMethodCallScopes                     │
│   AlignObjectMethodScopes                   │
│   AlignReactiveScopesToBlockScopesHIR        │
│   MergeOverlappingReactiveScopesHIR          │
│   BuildReactiveScopeTerminalsHIR             │
│   FlattenReactiveLoopsHIR                    │
│   FlattenScopesWithHooksOrUseHIR             │
│   PropagateScopeDependenciesHIR              │
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 13-14: Reactive Function              │
│   BuildReactiveFunction (CFG → tree)        │
│   PruneUnusedLabels                         │
│   PruneNonEscapingScopes                    │
│   PruneNonReactiveDependencies              │
│   PruneUnusedScopes                         │
│   MergeReactiveScopesThatInvalidateTogether │
│   PruneAlwaysInvalidatingScopes             │
│   PropagateEarlyReturns                     │
│   PruneUnusedLValues                        │
│   PromoteUsedTemporaries                    │
│   ExtractScopeDeclarationsFromDestructuring │
│   StabilizeBlockIds                         │
│   RenameVariables                           │
│   PruneHoistedContexts                      │
└─────────────────────────────────────────────┘
    │
    ▼
┌─────────────────────────────────────────────┐
│ Phase 15: Codegen                           │
│   CodegenReactiveFunction (tree → Babel AST)│
└─────────────────────────────────────────────┘
    │
    ▼
Babel AST (with memoization)

Pass-by-Pass Analysis

Phase 1: Lowering

BuildHIR (`lower`)

What it does: Converts Babel AST to HIR by traversing the AST and building a control-flow graph with BasicBlocks, Instructions, and Terminals.

Environment usage: Heavy. Uses env.nextIdentifierId, env.nextBlockId for all ID allocation. Uses env.recordError() for fault-tolerant error handling. Uses env.parentFunction.scope for Babel scope analysis. Uses env.isContextIdentifier() and env.programContext. Environment is shared with nested function lowering via recursive lower() calls.

Side maps:

#bindings: Map<string, {node, identifier}> — caches Identifier objects by name, using Babel node reference equality to distinguish same-named variables in different scopes
#context: Map<t.Identifier, SourceLocation> — Babel node keys (reference identity)
#completed: Map<BlockId, BasicBlock> — ID-keyed (safe)
followups: Array<{place, path}> — temporary Place storage during destructuring

Structural similarity: ~65%. The HIRBuilder class maps to a Rust struct with &mut self methods. The enter()/loop()/label() closure patterns translate to methods taking impl FnOnce(&mut Self) -> Terminal. However, several patterns require restructuring:

Variables assigned inside closures and read outside (e.g., let callee = null; builder.enter(() => { callee = ...; })) must return values from the closure instead
resolveBinding() uses Babel node reference equality (mapping.node === node) — needs parser-specific node IDs
Recursive lower() for nested functions needs std::mem::take to extract child function data
The Babel AST input arrives as JSON (deserialized via serde), replacing direct Babel NodePath traversal

Unexpected issues: Babel bug workarounds (lines 413-418, 4488-4498) would not be needed with a different parser. The promoteTemporary() pattern is straightforward in Rust. The fbtDepth counter is trivial.

Phase 2: Normalization

PruneMaybeThrows

Env usage: None. Side maps: Map<BlockId, BlockId> (IDs only). Similarity: ~95%. Simple terminal mutation (handler = null), phi rewiring, and CFG cleanup. The phi operand mutation-during-iteration needs drain().collect() in Rust. Block iteration order must be RPO for chain resolution.

DropManualMemoization

Env usage: getGlobalDeclaration, getHookKindForType, recordError, createTemporaryPlace, config flags. Side maps: IdentifierSidemap with 6 collections — functions stores TInstruction references (use HashSet<IdentifierId> instead), manualMemos.loadInstr stores instruction reference (store InstructionId instead), others are ID-keyed. Similarity: ~85%. Two-phase collect+rewrite. In Rust, the functions map needs only existence checking (not the actual instruction reference). manualMemos.loadInstr only needs .id — store the ID directly.

InlineImmediatelyInvokedFunctionExpressions

Env usage: env.nextBlockId, env.nextIdentifierId (via createTemporaryPlace). Side maps: functions: Map<IdentifierId, FunctionExpression> stores instruction value references. Similarity: ~80%. The functions map stores FunctionExpression references — in Rust, store FunctionId for the inner function. The queue-while-iterating pattern needs index-based loop (while i < queue.len()). Block ownership transfer uses blocks.remove() + blocks.insert().

MergeConsecutiveBlocks

Env usage: None. Side maps: MergedBlocks (ID-only map), fallthroughBlocks (ID-only set). Similarity: ~90%. Main Rust challenge: iteration + deletion. Collect block IDs first, then remove() + get_mut(). Phi operand rewriting needs collect-then-apply.

Phase 3: SSA Construction

EnterSSA

Env usage: env.nextIdentifierId for fresh SSA identifiers. Side maps: #states: Map<BasicBlock, State> with defs: Map<Identifier, Identifier> (both reference-identity keyed), unsealedPreds: Map<BasicBlock, number>, #unknown/#context: Set<Identifier>. Similarity: ~85%. All reference-identity maps become ID-keyed: Vec<State> indexed by BlockId, HashMap<IdentifierId, IdentifierId> for defs. The recursive getIdAt() works cleanly because IdentifierId is Copy — no borrows held across recursive calls. The enter() closure for nested functions is just save/restore of self.current. makeType() global counter must become per-compilation.

EliminateRedundantPhi

Env usage: None. Side maps: rewrites: Map<Identifier, Identifier> (reference keys). Similarity: ~95%. Becomes HashMap<IdentifierId, IdentifierId>. rewritePlace becomes place.identifier_id = new_id. Phi deletion during iteration becomes block.phis.retain(|phi| ...). The fixpoint loop and labeled continue translate directly.

Phase 4: Optimization (Pre-Inference)

ConstantPropagation

Env usage: None. Side maps: constants: Map<IdentifierId, Constant> (ID-keyed, safe). Similarity: ~90%. The fixpoint loop, evaluateInstruction() switch, and terminal rewriting all map directly. Constants map stores cloned Primitive/LoadGlobal values (small, cheap to clone). The CFG cleanup cascade after branch elimination needs shared infrastructure. The block.kind === 'sequence' guard translates to an enum check.

OptimizePropsMethodCalls

Env usage: None. Side maps: None. Similarity: ~98%. The simplest pass in the compiler. A single linear scan with one match arm and std::mem::replace for the value swap. ~20 lines of Rust.

Phase 5: Type and Effect Inference

InferTypes

Env usage: getGlobalDeclaration, getPropertyType, getFallthroughPropertyType, config flags. Side maps: Unifier.substitutions: Map<TypeId, Type> (ID-keyed), names: Map<IdentifierId, string> (ID-keyed). Similarity: ~90%. Unification-based type inference is very natural in Rust. The Type enum needs Box<Type> for recursive variants (Function.return, Property.objectType). The TypeScript generator pattern for constraint generation can be replaced with direct unifier.unify() calls during the walk. The apply() phase is straightforward mutable traversal. makeType() global counter needs per-compilation scope.

Phase 6: Mutation/Aliasing Analysis

AnalyseFunctions

Env usage: Shares Environment between parent and child via fn.env. Uses logger. Side maps: None (operates entirely through in-place HIR mutation). Similarity: ~85%. The recursive lowerWithMutationAliasing pattern works with &mut because it is sequential. Inner functions are stored in the function arena on Environment and accessed via FunctionId, so no extraction/replacement is needed. The mutableRange reset (identifier.mutableRange = {start: 0, end: 0}) is a simple value write in Rust (no aliasing to break because Rust uses values, not shared objects).

InferMutationAliasingEffects

Env usage: env.config (3 reads), env.getFunctionSignature, env.enableValidations, createTemporaryPlace. InferenceState stores env as read-only reference. Side maps: statesByBlock/queuedStates (BlockId-keyed), Context class with caches (Map<Instruction, InstructionSignature>, Map<FunctionExpression, AliasingSignature>, Map<AliasingSignature, Map<AliasingEffect, ...>>), InferenceState with #values: Map<InstructionValue, AbstractValue> and #variables: Map<IdentifierId, Set<InstructionValue>>. Similarity: ~80%.

Shared references in AliasingEffect (see §AliasingEffect: Shared References and Rust Ownership for full analysis): computeSignatureForInstruction creates effects that share Place objects with the Instruction's lvalue and InstructionValue fields. The Apply effect shares the args array reference. The CreateFunction effect stores the actual FunctionExpression/ObjectMethod InstructionValue. In Rust, Places are cloned (cheap with IdentifierId) and CreateFunction stores a FunctionId for function arena access.

Allocation-site identity: Currently uses InstructionValue as reference-identity keys. PR #33650 replaces this with interned AliasingEffect objects — since effects are already interned by content hash, the interned effect IS the allocation-site key. In Rust, this maps to EffectId (index into the interning table). No separate AllocationSiteId is needed.

Reference-identity maps and their Rust equivalents (after PR #33650):

instructionSignatureCache: Map<Instruction, ...> → HashMap<InstructionId, InstructionSignature>
#values: Map<AliasingEffect, AbstractValue> → HashMap<EffectId, AbstractValue> (EffectId = interning index = allocation-site ID)
#variables: Map<IdentifierId, Set<AliasingEffect>> → HashMap<IdentifierId, SmallVec<[EffectId; 2]>>
effectInstructionValueCache → eliminated by PR #33650
functionSignatureCache: Map<FunctionExpression, ...> → HashMap<FunctionId, AliasingSignature> (key by FunctionId from arena)
applySignatureCache: Map<AliasingSignature, Map<AliasingEffect, ...>> → HashMap<EffectId, HashMap<EffectId, ...>>
internedEffects: Map<string, AliasingEffect> → EffectInterner { effects: Vec<AliasingEffect>, by_hash: HashMap<String, EffectId> }

All keys become Copy types (InstructionId, EffectId, IdentifierId), trivially Hash + Eq, with no reference identity needed.

The overall structure (fixpoint loop, InferenceState clone/merge, applyEffect recursion, Context caching) can remain nearly identical. The applyEffect recursive method works with &mut InferenceState + &mut Context parameters — Rust's reborrowing handles the recursion naturally.

Context variable mutation: During CreateFunction processing, operand.effect = Effect.Read mutates Places on the nested function's context. In Rust, the inner function is accessed via &mut env.functions[function_id], which is completely disjoint from the outer HIRFunction being processed.

DeadCodeElimination

Env usage: env.outputMode (one read for SSR hook pruning). Side maps: State.identifiers: Set<IdentifierId>, State.named: Set<string> (both value-keyed, safe). Similarity: ~95%. Two-phase mark-and-sweep is perfectly natural in Rust. Vec::retain replaces retainWhere. Destructuring pattern rewrites use iter_mut() + truncate().

InferMutationAliasingRanges (HIGH COMPLEXITY)

Env usage: env.enableValidations (one read), env.recordError (error recording). Side maps: AliasingState.nodes: Map<Identifier, Node> (reference-identity keys), each Node containing createdFrom/captures/aliases/maybeAliases: Map<Identifier, number> and edges: Array<{node: Identifier, ...}>. Also mutations/renders arrays storing Place references. Similarity: ~75%.

Effect consumption: Iterates instr.effects for every instruction, reading Place fields (effect.into, effect.from, effect.value, effect.place). For CreateFunction effects, accesses effect.function.loweredFunc.func to create Function graph nodes. In Rust, CreateFunction stores FunctionId; the function is accessed via env.functions[function_id] (see §AliasingEffect section). All other effect Place accesses only need place.identifier (an IdentifierId in Rust), with no shared reference concerns.

All Identifier-keyed maps become HashMap<IdentifierId, T>. The critical node.id.mutableRange.end = ... pattern (mutating HIR through graph node references) needs restructuring: either store computed range updates on the Node and apply after traversal (recommended), or use arena-based identifiers. The BFS in mutate() collects edge targets into temporary Vec<IdentifierId> before pushing to queue, resolving borrow conflicts. The two-part structure (build graph → apply ranges) maps well to Rust's two-phase pattern. The temporal index counter and edge ordering translate directly.

Potential latent issue: The edges array uses break (line 763) assuming monotonic insertion order, but pending phi edges from back-edges could break this ordering. The Rust port should consider using continue instead of break for safety.

Phase 7: Optimization (Post-Inference)

OptimizeForSSR

Env usage: None directly (conditional on pipeline outputMode check). Side maps: inlinedState: Map<IdentifierId, InstructionValue> (ID-keyed). Similarity: ~90%. Stores cloned InstructionValue objects. The two-pass pattern translates directly.

Phase 8: Reactivity Inference

InferReactivePlaces

Env usage: getHookKind(fn.env, ...) for hook detection. Side maps: ReactivityMap.reactive: Set<IdentifierId> (safe), ReactivityMap.aliasedIdentifiers: DisjointSet<Identifier> (reference-identity), StableSidemap.map: Map<IdentifierId, {isStable}> (ID-keyed). Similarity: ~85%. DisjointSet becomes DisjointSet<IdentifierId>. The isReactive() side-effect pattern (sets place.reactive = true during reads) works in Rust as fn is_reactive(&self, place: &mut Place) -> bool — the ReactivityMap holds only IDs while place is mutably borrowed from the HIR, so borrows are disjoint. The fixpoint loop translates directly.

RewriteInstructionKindsBasedOnReassignment

Env usage: None. Side maps: declarations: Map<DeclarationId, LValue | LValuePattern> stores references to lvalue objects for retroactive .kind mutation. Similarity: ~85%. The aliased-mutation-through-map pattern is best handled with a two-pass approach: Pass 1 collects HashSet<DeclarationId> of reassigned variables, Pass 2 assigns InstructionKind values. Or use HashMap<DeclarationId, InstructionKind> and apply in a final pass.

Phase 9: Scope Construction

InferReactiveScopeVariables

Env usage: env.nextScopeId, env.config.enableForest, env.logger. Side maps: scopeIdentifiers: DisjointSet<Identifier> (reference-identity), declarations: Map<DeclarationId, Identifier> (stores Identifier references), scopes: Map<Identifier, ReactiveScope> (reference keys). Similarity: ~75%.

THE CRITICAL ALIASING PASS: Line 132 identifier.mutableRange = scope.range creates the shared-MutableRange aliasing that all downstream scope passes depend on. In Rust with arenas: identifiers store scope: Option<ScopeId>. The "effective mutable range" is accessed via scope lookup. All downstream passes that read mutableRange access the scope arena via env.scopes. DisjointSet becomes DisjointSet<IdentifierId>, scopes map becomes HashMap<IdentifierId, ScopeId>.

MemoizeFbtAndMacroOperandsInSameScope

Env usage: fn.env.config.customMacros (one read). Side maps: macroKinds: Map<string, MacroDefinition> (string keys), macroTags: Map<IdentifierId, MacroDefinition> (ID keys), macroValues: Set<IdentifierId> (IDs). Similarity: ~90%. All ID-keyed. The scope mutation (operand.identifier.scope = scope, expandFbtScopeRange) becomes identifier.scope = Some(scope_id) + env.scopes[scope_id].range.start = min(...). The cyclic MacroDefinition structure can use arena indices or hardcoded match logic.

Phase 10: Scope Alignment and Merging

AlignMethodCallScopes

Env usage: None. Side maps: scopeMapping: Map<IdentifierId, ReactiveScope | null> (ID keys), mergedScopes: DisjointSet<ReactiveScope> (reference-identity). Similarity: ~90%. DisjointSet becomes DisjointSet<ScopeId>. Range merging through arena: env.scopes[root_id].range.start = min(...). Scope rewriting: identifier.scope = Some(root_id).

AlignObjectMethodScopes

Env usage: None. Side maps: objectMethodDecls: Set<Identifier> (reference-identity), DisjointSet<ReactiveScope>. Similarity: ~88%. Same patterns as AlignMethodCallScopes. Set<Identifier> becomes HashSet<IdentifierId>. Porting hazard: The lvalue-only scope repointing (Phase 2b) relies on shared Identifier references. With arena-based identifiers where each Place has its own copy, repointing must cover ALL occurrences, not just lvalues. If using a central identifier arena (recommended), lvalue-only repointing is fine.

AlignReactiveScopesToBlockScopesHIR

Env usage: None. Side maps: activeScopes: Set<ReactiveScope> (reference-identity, iterated while mutating scope.range), seen: Set<ReactiveScope>, placeScopes: Map<Place, ReactiveScope> (dead code — never read), valueBlockNodes: Map<BlockId, ValueBlockNode>. Similarity: ~85%. activeScopes becomes HashSet<ScopeId>. Scope mutation through arena: for &scope_id in &active_scopes { env.scopes[scope_id].range.start = min(...); } — perfectly clean borrows (HashSet is immutable, arena is mutable). The placeScopes map can be omitted entirely.

MergeOverlappingReactiveScopesHIR

Env usage: None. Side maps: joinedScopes: DisjointSet<ReactiveScope> (reference-identity), placeScopes: Map<Place, ReactiveScope> (Place reference keys). Similarity: ~85%. DisjointSet becomes DisjointSet<ScopeId>. Same arena-based range merging pattern. Place-keyed maps become unnecessary with identifier-arena approach.

Phase 11: Scope Terminal Construction

BuildReactiveScopeTerminalsHIR

Env usage: None. Side maps: rewrittenFinalBlocks: Map<BlockId, BlockId> (IDs), nextBlocks: Map<BlockId, BasicBlock> (block storage), queuedRewrites. Similarity: ~85%. Complete blocks map replacement (fn.body.blocks = nextBlocks). Block splitting creates new blocks from instruction slices. Phi rewriting across old/new blocks. All structurally translatable.

FlattenReactiveLoopsHIR

Env usage: None. Side maps: activeLoops: Array<BlockId> (IDs only). Similarity: ~98%. Simple terminal variant replacement (scope → pruned-scope). Uses Vec::retain for the active loops stack. ~40 lines of Rust logic. The terminal swap uses std::mem::replace or shared inner data struct.

FlattenScopesWithHooksOrUseHIR

Env usage: getHookKind(fn.env, ...) (one hook resolution call). Side maps: activeScopes: Array<{block, fallthrough}>, prune: Array<BlockId> (both ID-only). Similarity: ~95%. Two-phase detect/rewrite. Stack-based scope tracking with Vec::retain. Terminal variant conversion. Very clean Rust translation.

Phase 12: Scope Dependency Propagation

PropagateScopeDependenciesHIR

Env usage: None directly. Side maps: temporaries: Map<IdentifierId, ReactiveScopeDependency> (ID-keyed, but ReactiveScopeDependency contains identifier: Identifier reference), DependencyCollectionContext with #declarations: Map<DeclarationId, Decl>, #reassignments: Map<Identifier, Decl> (reference keys), deps: Map<ReactiveScope, Array<...>> (reference keys). Similarity: ~80%. Reference-keyed maps become ID-keyed. deps becomes HashMap<ScopeId, Vec<ReactiveScopeDependency>>. The PropertyPathRegistry tree with parent pointers needs arena allocation. Scope mutation (scope.declarations.set(...), scope.dependencies.add(...)) through arena.

Phase 13: Reactive Function Construction

BuildReactiveFunction

Env usage: Copies fn.env to reactive function. Side maps: Scheduling/traversal state during CFG-to-tree conversion. Similarity: ~80%. Major structural transformation (CFG → tree). The builder pattern works with &mut state. Deep recursion for value blocks is bounded by CFG depth. Shared Places/scopes/identifiers use arena indices in the new tree structure.

Phase 14: Reactive Function Transforms

All reactive function transforms use the ReactiveFunctionVisitor / ReactiveFunctionTransform pattern.

ReactiveFunctionVisitor/Transform pattern → Rust traits:

trait ReactiveFunctionTransform {
    type State;
    fn transform_terminal(&mut self, stmt: &mut ReactiveTerminalStatement, state: &mut Self::State)
        -> Transformed<ReactiveStatement> { Transformed::Keep }
    fn transform_instruction(&mut self, stmt: &mut ReactiveInstructionStatement, state: &mut Self::State)
        -> Transformed<ReactiveStatement> { Transformed::Keep }
    // ... default implementations for traversal ...
}

enum Transformed<T> {
    Keep,
    Remove,
    Replace(T),
    ReplaceMany(Vec<T>),
}

The traverseBlock method handles ReplaceMany by lazily building a new Vec (only allocating on first mutation). This maps to Rust's Option<Vec<T>> pattern.

Individual passes:

Pass	Env	Side Maps	Similarity
PruneUnusedLabels	None	`Set<BlockId>`	~95%
PruneNonEscapingScopes	None	Dependency graph with cycle detection	~85%
PruneNonReactiveDependencies	None	None significant	~95%
PruneUnusedScopes	None	None significant	~95%
MergeReactiveScopesThatInvalidateTogether	None	Scope metadata comparison	~85%
PruneAlwaysInvalidatingScopes	None	None significant	~95%
PropagateEarlyReturns	None	Early return tracking state	~85%
PruneUnusedLValues	None	Lvalue usage tracking	~90%
PromoteUsedTemporaries	None	Identifier name mutation	~90%
ExtractScopeDeclarationsFromDestructuring	None	None significant	~90%
StabilizeBlockIds	None	`Map<BlockId, BlockId>` remapping	~95%
RenameVariables	None	Name collision tracking	~90%
PruneHoistedContexts	None	Context declaration tracking	~95%

Phase 15: Codegen

CodegenReactiveFunction

Env usage: env.programContext (imports, bindings), env.getOutlinedFunctions(), env.recordErrors(), env.config. Side maps: Context class with cache slot management, scope metadata tracking. Similarity: ~60%.

The most significantly different pass due to AST output generation. 1000+ lines of t.*() Babel API calls are replaced with constructing Rust Babel AST types that serialize to JSON via serde. Core scope logic (cache slot allocation, dependency checking, memoization code structure) can look structurally similar.

The uniqueIdentifiers and fbtOperands parameters translate directly.

Validation Passes

~15 validation passes share a common pattern: read-only HIR/ReactiveFunction traversal + error reporting via env.recordError(). They are the easiest passes to port. Common structure:

fn validate_hooks_usage(func: &HIRFunction, env: &mut Environment) -> Result<(), ()> {
    for block in func.body.blocks.values() {
        for instr in &block.instructions {
            match &instr.value {
                // check for violations, record errors
            }
        }
    }
    Ok(())
}

All use HashMap<IdentifierId, T> for state tracking (ID-keyed, safe). Some return CompilerError directly instead of recording. The tryRecord() wrapping pattern maps to Result in Rust.

External Dependencies

Input/Output: JSON AST Interchange

The Rust compiler defines its own representation of the Babel AST format using serde with custom serialization/deserialization, ensuring the "type" field is always produced (even outside of enum positions). Input ASTs are deserialized from JSON, and output ASTs are serialized back to JSON for consumption by the Babel plugin. A Scope type encodes the scope tree information that Babel provides. The main public API is compile(BabelAst, Scope) -> Option<BabelAst>, returning None if no changes are needed.

This approach decouples the Rust compiler from any specific JS parser — the JSON boundary handles the translation. The resolveBinding() pattern in BuildHIR (which uses Babel node reference equality in TypeScript) maps to scope-tree lookups via the Scope type.

Risk Assessment

Low Risk (straightforward port)

All validation passes
Simple transformation passes (PruneMaybeThrows, PruneUnusedLabelsHIR, FlattenReactiveLoopsHIR, FlattenScopesWithHooksOrUseHIR, StabilizeBlockIds, RewriteInstructionKindsBasedOnReassignment, OptimizePropsMethodCalls, MergeConsecutiveBlocks)
Reactive pruning passes (PruneUnusedLabels, PruneUnusedScopes, PruneAlwaysInvalidatingScopes, PruneNonReactiveDependencies)

Medium Risk (requires systematic refactoring)

SSA passes (EnterSSA, EliminateRedundantPhi) — reference-identity maps → ID maps
Scope construction passes — centralized scope arena with ID-based references
Type inference (InferTypes) — arena-based Type storage, TypeId generation
Constant propagation — separated constants map, CFG cleanup infrastructure
Dead code elimination — two-phase collect/apply
Scope alignment passes — DisjointSet, arena-based range mutation
Reactive function transforms — Visitor/MutVisitor trait design with Transformed enum

Medium Risk (additional)

InferMutationAliasingEffects: After PR #33650, allocation-site identity uses interned AliasingEffect (→ EffectId), eliminating InstructionValue keys and effectInstructionValueCache. Remaining reference-identity maps use Instructions (→ InstructionId) and FunctionExpressions (→ FunctionId). All become copyable ID-keyed maps. Place sharing between effects and instructions is resolved by cloning (cheap with arena-based identifiers). CreateFunction's FunctionExpression reference becomes a FunctionId referencing the function arena. Fixpoint loop and abstract interpretation structure port directly. See §AliasingEffect section for full analysis.

High Risk (significant redesign)

BuildHIR: JSON AST deserialization, scope tree integration, closure-heavy builder patterns
InferMutationAliasingRanges: Graph-through-HIR mutation, temporal reasoning, deferred range updates
CodegenReactiveFunction: JSON AST output construction via serde, 1000+ lines of AST building
AnalyseFunctions: Recursive nested function processing via function arena, shared mutableRange semantics

Critical Architectural Decisions (must be designed upfront)

Arena-based storage on Environment: Identifiers, scopes, functions, and types are stored as flat Vec fields on Environment, referenced by copyable ID types (IdentifierId, ScopeId, FunctionId, TypeId). Affects every pass.
Instructions table: Instructions stored in flat Vec<Instruction> on HIRFunction, referenced by InstructionId. Old InstructionId renamed to EvaluationOrder.
Scope-based mutableRange access: After InferReactiveScopeVariables, effective mutable range = scope's range. All downstream isMutable()/inRange() calls access the scope arena via env.scopes.
JSON AST interchange: Input/output via serde-serialized Babel AST types and a Scope type for scope tree information.
Environment as single &mut: No sub-struct grouping — flat fields allow precise sliced borrows. Passed separately from HIRFunction.
Error handling: Result<_, CompilerDiagnostic> for thrown errors, accumulated errors on Environment.

Recommended Migration Strategy

Phase 1: Foundation

Define Rust data model (flat Environment with arena fields for Identifiers/Scopes/Functions/Types, all ID newtypes)
Define HIR types as Rust enums/structs (InstructionValue ~40 variants, Terminal ~20 variants)
Define flat Environment struct with arena fields, counters, config, and accumulated state
Implement shared infrastructure: DisjointSet<T: Copy>, IndexMap wrappers, visitor utilities
Define Babel AST types with serde serialization/deserialization for JSON AST interchange
Build JSON serialization for HIR (enables testing against TypeScript implementation)

Phase 2: Core Pipeline

Port BuildHIR (highest effort, most value — requires JSON AST deserialization and Scope type integration)
Port normalization passes (PruneMaybeThrows, MergeConsecutiveBlocks — simple, builds confidence)
Port SSA (EnterSSA, EliminateRedundantPhi — establishes arena patterns)
Port ConstantPropagation, InferTypes
Validate output matches TypeScript via JSON comparison at each stage

Phase 3: Analysis Engine

Port AnalyseFunctions (establishes recursive compilation pattern)
Port InferMutationAliasingEffects (establish EffectId interning table — EffectId serves as allocation-site identity, FunctionId-based function arena access for CreateFunction)
Port DeadCodeElimination
Port InferMutationAliasingRanges (establish deferred-range-update pattern)
Port InferReactivePlaces

Phase 4: Scope System

Port InferReactiveScopeVariables (establishes ScopeId → mutableRange indirection)
Port scope alignment passes (Align*, Merge* — establish DisjointSet pattern)
Port BuildReactiveScopeTerminalsHIR
Port PropagateScopeDependenciesHIR

Phase 5: Output

Port BuildReactiveFunction (establishes reactive tree representation)
Port reactive function transforms (Prune*, Promote*, Rename* — use trait-based visitor)
Port CodegenReactiveFunction with JSON AST output
Port validation passes (easiest, can be done in parallel)
End-to-end integration testing

React Compiler: Rust Port Feasibility Research

Table of Contents

Executive Summary

Key Data Structures

HIRFunction

BasicBlock

Instruction

Place (CRITICAL for Rust port)

Identifier (CRITICAL for Rust port)

FunctionExpression / ObjectMethod

ReactiveScope

MutableRange

The Shared Mutable Reference Problem

Pattern 1: Shared Identifier Mutation

Pattern 2: Shared ReactiveScope References

Pattern 3: Iterate-and-Mutate / Side Map References

Environment as Shared Mutable State

Complete Environment Analysis

Mutable State (mutated by passes)

Read-Only State (accessed but never mutated)

How Environment is Shared with Nested Functions

Recommended Rust Representation

Side Maps: Passes Storing HIR References

The Core Problem

Classification of Side Map Patterns

Category 1: ID-Only Maps (No Borrow Issues)

Category 2: Reference-Identity Maps (Replace Keys with IDs)

Category 3: Instruction/Value Reference Maps (Store Indices Instead)

Category 4: Scope Reference Sets with In-Place Mutation (Arena Access)

Critical Insight: The Shared MutableRange Aliasing

AliasingEffect: Shared References and Rust Ownership

Overview

Shared Reference Inventory

Category A: Place Sharing (Instruction/InstructionValue → Effect)

Category B: Call Instructions — Deep Sharing via Apply

Category C: FunctionExpression — The Deepest Sharing

Allocation-Site Identity: InstructionValue → AliasingEffect (PR #33650)

Effect Interning

Consumers: How Effects Are Read

InferMutationAliasingRanges (primary consumer)

AnalyseFunctions

ValidateNoFreezingKnownMutableFunctions

Other Passes (do NOT read AliasingEffects)

PrintHIR

Recommended Rust Representation

AliasingEffect Enum

EffectId as Allocation-Site Identity

Effect Interning

Accessing Functions from CreateFunction

Context Variable Mutation

Summary of Rust Approach for AliasingEffect

Recommended Rust Architecture

Arena-Based Identifier Storage

Arena-Based Scope Storage

Arena-Based Function Storage

Arena-Based Type Storage

Instructions Table

CFG Representation

Pass Signature Patterns

Key Rust Patterns for Common TypeScript Idioms

Pattern A: InstructionValue Variant Swap (std::mem::replace)

Pattern B: Place Cloning via Spread ({...place})

Pattern C: Delete-During-Set-Iteration (retain)

Pattern D: Map Iteration with Block Deletion

Pattern E: Closure Variables Set Inside Builder Callbacks

Input/Output Format

Error Handling

Specific Error Patterns and Approaches

Pass and Pipeline Structure

Structural Similarity: TypeScript ↔ Rust Alignment

Design Goal

What Looks Nearly Identical (~95% match)

What Looks Slightly Different (~80% match)

What Looks Substantially Different (~60% match)

Passes Ranked by Structural Similarity to Rust

Pipeline Overview

Pass-by-Pass Analysis

Phase 1: Lowering

BuildHIR (lower)

Phase 2: Normalization

Pattern A: InstructionValue Variant Swap (`std::mem::replace`)

Pattern B: Place Cloning via Spread (`{...place}`)

Pattern C: Delete-During-Set-Iteration (`retain`)

BuildHIR (`lower`)