Describe drop elaboration (#1240)
This commit is contained in:
parent
d4f300263e
commit
6643adf704
|
|
@ -116,6 +116,7 @@
|
||||||
- [Opaque Types](./opaque-types-type-alias-impl-trait.md)
|
- [Opaque Types](./opaque-types-type-alias-impl-trait.md)
|
||||||
- [Pattern and Exhaustiveness Checking](./pat-exhaustive-checking.md)
|
- [Pattern and Exhaustiveness Checking](./pat-exhaustive-checking.md)
|
||||||
- [MIR dataflow](./mir/dataflow.md)
|
- [MIR dataflow](./mir/dataflow.md)
|
||||||
|
- [Drop elaboration](./mir/drop-elaboration.md)
|
||||||
- [The borrow checker](./borrow_check.md)
|
- [The borrow checker](./borrow_check.md)
|
||||||
- [Tracking moves and initialization](./borrow_check/moves_and_initialization.md)
|
- [Tracking moves and initialization](./borrow_check/moves_and_initialization.md)
|
||||||
- [Move paths](./borrow_check/moves_and_initialization/move_paths.md)
|
- [Move paths](./borrow_check/moves_and_initialization/move_paths.md)
|
||||||
|
|
|
||||||
|
|
@ -0,0 +1,195 @@
|
||||||
|
# Drop elaboration
|
||||||
|
|
||||||
|
<!-- toc -->
|
||||||
|
|
||||||
|
## Dynamic drops
|
||||||
|
|
||||||
|
According to the [reference][reference-drop]:
|
||||||
|
|
||||||
|
> When an initialized variable or temporary goes out of scope, its destructor
|
||||||
|
> is run, or it is dropped. Assignment also runs the destructor of its
|
||||||
|
> left-hand operand, if it's initialized. If a variable has been partially
|
||||||
|
> initialized, only its initialized fields are dropped.
|
||||||
|
|
||||||
|
When building the MIR, the `Drop` and `DropAndReplace` terminators represent
|
||||||
|
places where drops may occur. However, in this phase, the presence of these
|
||||||
|
terminators does not guarantee that a destructor will run. That's because the
|
||||||
|
target of a drop may be uninitialized (usually because it has been moved from)
|
||||||
|
before the terminator is reached. In general, we cannot know at compile-time whether a
|
||||||
|
variable is initialized.
|
||||||
|
|
||||||
|
```rust
|
||||||
|
let mut y = vec![];
|
||||||
|
|
||||||
|
{
|
||||||
|
let x = vec![1, 2, 3];
|
||||||
|
if std::process::id() % 2 == 0 {
|
||||||
|
y = x; // conditionally move `x` into `y`
|
||||||
|
}
|
||||||
|
} // `x` goes out of scope here. Should it be dropped?
|
||||||
|
```
|
||||||
|
|
||||||
|
In these cases, we need to keep track of whether a variable is initialized
|
||||||
|
*dynamically*. The rules are laid out in detail in [RFC 320: Non-zeroing
|
||||||
|
dynamic drops][RFC 320].
|
||||||
|
|
||||||
|
## Drop obligations
|
||||||
|
|
||||||
|
From the RFC:
|
||||||
|
|
||||||
|
> When a local variable becomes initialized, it establishes a set of "drop
|
||||||
|
> obligations": a set of structural paths (e.g. a local `a`, or a path to a
|
||||||
|
> field `b.f.y`) that need to be dropped.
|
||||||
|
>
|
||||||
|
> The drop obligations for a local variable x of struct-type `T` are computed
|
||||||
|
> from analyzing the structure of `T`. If `T` itself implements `Drop`, then `x` is a
|
||||||
|
> drop obligation. If `T` does not implement `Drop`, then the set of drop
|
||||||
|
> obligations is the union of the drop obligations of the fields of `T`.
|
||||||
|
|
||||||
|
When a structural path is moved from (and thus becomes uninitialized), any drop
|
||||||
|
obligations for that path or its descendants (`path.f`, `path.f.g.h`, etc.) are
|
||||||
|
released. Types with `Drop` implementations do not permit moves from individual
|
||||||
|
fields, so there is no need to track initializedness through them.
|
||||||
|
|
||||||
|
When a local variable goes out of scope (`Drop`), or when a structural path is
|
||||||
|
overwritten via assignment (`DropAndReplace`), we check for any drop
|
||||||
|
obligations for that variable or path. Unless that obligation has been
|
||||||
|
released by this point, its associated `Drop` implementation will be called.
|
||||||
|
For `enum` types, only fields corresponding to the "active" variant need to be
|
||||||
|
dropped. When processing drop obligations for such types, we first check the
|
||||||
|
discriminant to determine the active variant. All drop obligations for variants
|
||||||
|
besides the active one are ignored.
|
||||||
|
|
||||||
|
Here are a few interesting types to help illustrate these rules:
|
||||||
|
|
||||||
|
```rust
|
||||||
|
struct NoDrop(u8); // No `Drop` impl. No fields with `Drop` impls.
|
||||||
|
|
||||||
|
struct NeedsDrop(Vec<u8>); // No `Drop` impl but has fields with `Drop` impls.
|
||||||
|
|
||||||
|
struct ThinVec(*const u8); // Custom `Drop` impl. Individual fields cannot be moved from.
|
||||||
|
|
||||||
|
impl Drop for ThinVec {
|
||||||
|
fn drop(&mut self) { /* ... */ }
|
||||||
|
}
|
||||||
|
|
||||||
|
enum MaybeDrop {
|
||||||
|
Yes(NeedsDrop),
|
||||||
|
No(NoDrop),
|
||||||
|
}
|
||||||
|
```
|
||||||
|
|
||||||
|
## Drop elaboration
|
||||||
|
|
||||||
|
One valid model for these rules is to keep a boolean flag (a "drop flag") for
|
||||||
|
every structural path that is used at any point in the function. This flag is
|
||||||
|
set when its path is initialized and is cleared when the path is moved from.
|
||||||
|
When a `Drop` occurs, we check the flags for every obligation associated with
|
||||||
|
the target of the `Drop` and call the associated `Drop` impl for those that are
|
||||||
|
still applicable.
|
||||||
|
|
||||||
|
This process—transforming the newly built MIR with its imprecise `Drop` and
|
||||||
|
`DropAndReplace` terminators into one with drop flags—is known as drop
|
||||||
|
elaboration. When a MIR statement causes a variable to become initialized (or
|
||||||
|
uninitialized), drop elaboration inserts code that sets (or clears) the drop
|
||||||
|
flag for that variable. It wraps `Drop` terminators in conditionals that check
|
||||||
|
the newly inserted drop flags.
|
||||||
|
|
||||||
|
Drop elaboration also splits `DropAndReplace` terminators into a `Drop` of the
|
||||||
|
target and a write of the newly dropped place. This is somewhat unrelated to what
|
||||||
|
we've discussed above.
|
||||||
|
|
||||||
|
Once this is complete, `Drop` terminators in the MIR correspond to a call to
|
||||||
|
the "drop glue" or "drop shim" for the type of the dropped place. The drop
|
||||||
|
glue for a type calls the `Drop` impl for that type (if one exists), and then
|
||||||
|
recursively calls the drop glue for all fields of that type.
|
||||||
|
|
||||||
|
## Drop elaboration in `rustc`
|
||||||
|
|
||||||
|
The approach described above is more expensive than necessary. One can imagine
|
||||||
|
a few optimizations:
|
||||||
|
|
||||||
|
- Only paths that are the target of a `Drop` (or have the target as a prefix)
|
||||||
|
need drop flags.
|
||||||
|
- Some variables are known to initialized (or uninitialized) when they are
|
||||||
|
dropped. These do not need drop flags.
|
||||||
|
- If a set of paths are only dropped or moved from via a shared prefix, those
|
||||||
|
paths can share a single drop flag.
|
||||||
|
|
||||||
|
A subset of these are implemented in `rustc`.
|
||||||
|
|
||||||
|
In the compiler, drop elaboration is split across several modules. The pass
|
||||||
|
itself is defined [here][drops-transform], but the [main logic][drops] is
|
||||||
|
defined elsewhere since it is also used to build [drop shims][drops-shim].
|
||||||
|
|
||||||
|
Drop elaboration designates each `Drop` in the newly built MIR as one of four
|
||||||
|
kinds:
|
||||||
|
|
||||||
|
- `Static`, the target is always initialized.
|
||||||
|
- `Dead`, the target is always **un**initialized.
|
||||||
|
- `Conditional`, the target is either wholly initialized or wholly
|
||||||
|
uninitialized. It is not partly initialized.
|
||||||
|
- `Open`, the target may be partly initialized.
|
||||||
|
|
||||||
|
For this, it uses a pair of dataflow analyses, `MaybeInitializedPlaces` and
|
||||||
|
`MaybeUninitializedPlaces`. If a place is in one but not the other, then the
|
||||||
|
initializedness of the target is known at compile-time (`Dead` or `Static`).
|
||||||
|
In this case, drop elaboration does not add a flag for the target. It simply
|
||||||
|
removes (`Dead`) or preserves (`Static`) the `Drop` terminator.
|
||||||
|
|
||||||
|
For `Conditional` drops, we know that the initializedness of the variable as a
|
||||||
|
whole is the same as the initializedness of its fields. Therefore, once we
|
||||||
|
generate a drop flag for the target of that drop, it's safe to call the drop
|
||||||
|
glue for that target.
|
||||||
|
|
||||||
|
### `Open` drops
|
||||||
|
|
||||||
|
`Open` drops are the most complex, since we need to break down a single `Drop`
|
||||||
|
terminator into several different ones, one for each field of the target whose
|
||||||
|
type has drop glue (`Ty::needs_drop`). We cannot call the drop glue for the
|
||||||
|
target itself because that requires all fields of the target to be initialized.
|
||||||
|
Remember, variables whose type has a custom `Drop` impl do not allow `Open`
|
||||||
|
drops because their fields cannot be moved from.
|
||||||
|
|
||||||
|
This is accomplished by recursively categorizing each field as `Dead`,
|
||||||
|
`Static`, `Conditional` or `Open`. Fields whose type does not have drop glue
|
||||||
|
are automatically `Dead` and need not be considered during the recursion. When
|
||||||
|
we reach a field whose kind is not `Open`, we handle it as we did above. If the
|
||||||
|
field is also `Open`, the recursion continues.
|
||||||
|
|
||||||
|
It's worth noting how we handle `Open` drops of enums. Inside drop elaboration,
|
||||||
|
each variant of the enum is treated like a field, with the invariant that only
|
||||||
|
one of those "variant fields" can be initialized at any given time. In the
|
||||||
|
general case, we do not know which variant is the active one, so we will have
|
||||||
|
to call the drop glue for the enum (which checks the discriminant) or check the
|
||||||
|
discriminant ourselves as part of an elaborated `Open` drop. However, in
|
||||||
|
certain cases (within a `match` arm, for example) we do know which variant of
|
||||||
|
an enum is active. This information is encoded in the `MaybeInitializedPlaces`
|
||||||
|
and `MaybeUninitializedPlaces` dataflow analyses by marking all places
|
||||||
|
corresponding to inactive variants as uninitialized.
|
||||||
|
|
||||||
|
### Cleanup paths
|
||||||
|
|
||||||
|
TODO: Discuss drop elaboration and unwinding.
|
||||||
|
|
||||||
|
## Aside: drop elaboration and const-eval
|
||||||
|
|
||||||
|
In Rust, functions that are eligible for evaluation at compile-time must be
|
||||||
|
marked explicitly using the `const` keyword. This includes implementations of
|
||||||
|
the `Drop` trait, which may or may not be `const`. Code that is eligible for
|
||||||
|
compile-time evaluation may only call `const` functions, so any calls to
|
||||||
|
non-const `Drop` implementations in such code must be forbidden.
|
||||||
|
|
||||||
|
A call to a `Drop` impl is encoded as a `Drop` terminator in the MIR. However,
|
||||||
|
as we discussed above, a `Drop` terminator in newly built MIR does not
|
||||||
|
necessarily result in a call to `Drop::drop`. The drop target may be
|
||||||
|
uninitialized at that point. This means that checking for non-const `Drop`s on
|
||||||
|
the newly built MIR can result in spurious errors. Instead, we wait until after
|
||||||
|
drop elaboration runs, which eliminates `Dead` drops (ones where the target is
|
||||||
|
known to be uninitialized) to run these checks.
|
||||||
|
|
||||||
|
[RFC 320]: https://rust-lang.github.io/rfcs/0320-nonzeroing-dynamic-drop.html
|
||||||
|
[reference-drop]: https://doc.rust-lang.org/reference/destructors.html
|
||||||
|
[drops]: https://github.com/rust-lang/rust/blob/master/compiler/rustc_mir_dataflow/src/elaborate_drops.rs
|
||||||
|
[drops-shim]: https://github.com/rust-lang/rust/blob/master/compiler/rustc_mir_transform/src/shim.rs
|
||||||
|
[drops-transform]: https://github.com/rust-lang/rust/blob/master/compiler/rustc_mir_dataflow/src/elaborate_drops.rs
|
||||||
Loading…
Reference in New Issue