Optimize common usages of `AssetReader` #14082

JoJoJet · 2024-06-30T23:01:12Z

Objective

The `AssetReader` trait allows customizing the behavior of fetching bytes for an `AssetPath`, and expects implementors to return `dyn AsyncRead + AsyncSeek`. This gives implementors of `AssetLoader` great flexibility to tightly integrate their asset loading behavior with the asynchronous task system.

However, almost all implementors of `AssetLoader` don't use the async functionality at all, and just call `AsyncReadExt::read_to_end(&mut Vec<u8>)`. This is incredibly inefficient, as this method repeatedly calls `poll_read` on the trait object, filling the vector 32 bytes at a time. At my work we have assets that are hundreds of megabytes, which makes this a meaningful overhead.

Solution

Turn the `Reader` type alias into an actual trait, with a provided method `read_to_end`. This provided method should be more efficient than the existing extension method, as the compiler will know the underlying type of `Reader` when generating this function, which removes the repeated dynamic dispatches and allows the compiler to make further optimizations after inlining. Individual implementors are able to override the provided implementation -- for simple asset readers that just copy bytes from one buffer to another, this allows removing a large amount of overhead from the provided implementation.
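
The shape of this change (a provided method with a generic fallback, which a concrete reader may override with a bulk copy) can be sketched synchronously on top of `std::io::Read`. This is only an illustration under assumptions: `SyncReader`, `read_all`, `ChunkedReader` and `SliceReader` are hypothetical names that do not exist in Bevy, and the real `Reader` trait is asynchronous.

```rust
use std::io::{self, Read};

/// Hypothetical synchronous stand-in for an asset reader trait (not Bevy's API).
trait SyncReader: Read {
    /// Provided method: generic fallback that copies in small fixed-size chunks.
    fn read_all(&mut self, buf: &mut Vec<u8>) -> io::Result<usize> {
        let mut chunk = [0u8; 32];
        let mut total = 0;
        loop {
            let n = self.read(&mut chunk)?;
            if n == 0 {
                return Ok(total);
            }
            buf.extend_from_slice(&chunk[..n]);
            total += n;
        }
    }
}

/// Reader that keeps the provided (chunked) implementation.
struct ChunkedReader<'a>(&'a [u8]);

impl Read for ChunkedReader<'_> {
    fn read(&mut self, out: &mut [u8]) -> io::Result<usize> {
        // Delegate to the `Read` impl for byte slices.
        self.0.read(out)
    }
}

impl SyncReader for ChunkedReader<'_> {}

/// In-memory reader that overrides the provided method with a single bulk copy.
struct SliceReader<'a> {
    bytes: &'a [u8],
    pos: usize,
}

impl Read for SliceReader<'_> {
    fn read(&mut self, out: &mut [u8]) -> io::Result<usize> {
        let remaining = &self.bytes[self.pos..];
        let n = remaining.len().min(out.len());
        out[..n].copy_from_slice(&remaining[..n]);
        self.pos += n;
        Ok(n)
    }
}

impl SyncReader for SliceReader<'_> {
    fn read_all(&mut self, buf: &mut Vec<u8>) -> io::Result<usize> {
        // Override: copy everything that is left in one call.
        let remaining = &self.bytes[self.pos..];
        buf.extend_from_slice(remaining);
        self.pos = self.bytes.len();
        Ok(remaining.len())
    }
}
```

Both readers produce the same bytes; only the number of intermediate copies differs, which is the kind of overhead the override is meant to remove.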

Now that `Reader` is an actual trait, I also improved the ergonomics for implementing `AssetReader`. Currently, implementors are expected to box their reader and return it as a trait object, which adds unnecessary boilerplate to implementations. This PR changes that trait method to return a pseudo trait alias, which allows implementors to return `impl Reader` instead of `Box<dyn Reader>`. Now, the boilerplate for boxing occurs in `ErasedAssetReader`.
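
The split between an ergonomic trait that returns `impl Trait` and an object-safe counterpart that does the boxing can be sketched as follows. This is a minimal synchronous sketch, not Bevy's code: `Source`, `ErasedSource`, `InMemory` and `load_all` are hypothetical names, and it assumes a toolchain that supports `impl Trait` in trait method return position (Rust 1.75 or later).

```rust
use std::io::{self, Cursor, Read};

/// Hypothetical ergonomic trait: implementors return any concrete reader type.
trait Source {
    fn open<'a>(&'a self, path: &'a str) -> io::Result<impl Read + 'a>;
}

/// Hypothetical object-safe counterpart; the boxing boilerplate lives here.
trait ErasedSource {
    fn open_boxed<'a>(&'a self, path: &'a str) -> io::Result<Box<dyn Read + 'a>>;
}

/// Blanket impl: every `Source` is automatically usable as `dyn ErasedSource`.
impl<T: Source> ErasedSource for T {
    fn open_boxed<'a>(&'a self, path: &'a str) -> io::Result<Box<dyn Read + 'a>> {
        let reader = self.open(path)?;
        Ok(Box::new(reader))
    }
}

/// Example implementor: no `Box` and no cast in user code.
struct InMemory {
    bytes: Vec<u8>,
}

impl Source for InMemory {
    fn open<'a>(&'a self, _path: &'a str) -> io::Result<impl Read + 'a> {
        Ok(Cursor::new(self.bytes.as_slice()))
    }
}

/// Code that needs dynamic dispatch goes through the erased trait.
fn load_all(source: &dyn ErasedSource, path: &str) -> io::Result<Vec<u8>> {
    let mut reader = source.open_boxed(path)?;
    let mut out = Vec::new();
    reader.read_to_end(&mut out)?;
    Ok(out)
}
```

With this arrangement the allocation still happens wherever a trait object is required, but it is written once in the blanket impl rather than in every implementor.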

Testing

I made identical changes to my company's fork of bevy. Our app, which makes heavy use of `read_to_end` for asset loading, still worked properly after this. I am not aware of a more systematic way of testing asset loading for correctness.

Migration Guide

The trait methods `bevy_asset::io::AssetReader::read` and `read_meta` now return an opaque type instead of a boxed trait object. Implementors of these methods should change the type signatures appropriately:

impl AssetReader for MyReader {
    // Before
    async fn read<'a>(&'a self, path: &'a Path) -> Result<Box<Reader<'a>>, AssetReaderError> {
        let reader = /* construct a reader */;
        Ok(Box::new(reader) as Box<Reader<'a>>)
    }

    // After
    async fn read<'a>(&'a self, path: &'a Path) -> Result<impl Reader + 'a, AssetReaderError> {
        let reader = /* construct a reader */;
        Ok(reader)
    }
}

`bevy::asset::io::Reader` is now a trait, rather than a type alias for a trait object. Implementors of `AssetLoader::load` will need to adjust the method signature accordingly:

impl AssetLoader for MyLoader {
    async fn load<'a>(
        &'a self,
        // Before:
        reader: &'a mut bevy::asset::io::Reader,
        // After (drop any `<'_>` lifetime argument on the reader):
        reader: &'a mut dyn bevy::asset::io::Reader,
        _: &'a Self::Settings,
        load_context: &'a mut LoadContext<'_>,
    ) -> Result<Self::Asset, Self::Error> {
        // ...
    }
}

Additionally, implementors of `AssetReader` that return a type implementing `futures_io::AsyncRead` and `AsyncSeek` might need to explicitly implement `bevy::asset::io::Reader` for that type:

impl bevy::asset::io::Reader for MyAsyncReadAndSeek {}

alice-i-cecile

Nice to see this getting stress-tested and polished :) Thanks!

Minor nits about docs.

crates/bevy_asset/src/io/mod.rs

UkoeHB · 2024-07-10T04:50:49Z

EDIT: Solved this by removing `<'_>` from the reader. It should be mentioned in the migration guide.

This PR messed up my (very basic) asset loaders:

return type references an anonymous lifetime, which is not constrained by the fn input types
lifetimes appearing in an associated or opaque type are not considered constrained
consider introducing a named lifetime parameter

It's claiming the `'_` lifetime in `Reader<'_>` is being captured. My loader does call `reader.read_to_end(...).await?;`, so presumably the lifetime is now leaking into the future.

Example:

struct CobwebAssetLoader;

impl AssetLoader for CobwebAssetLoader
{
    type Asset = CobwebAssetFile;
    type Settings = ();
    type Error = CobwebAssetLoaderError;

    async fn load<'a>(
        &'a self,
        reader: &'a mut dyn Reader<'_>,
        _settings: &'a (),
        load_context: &'a mut LoadContext<'_>,
    ) -> Result<Self::Asset, Self::Error>
    {
        let mut bytes = Vec::new();
        reader.read_to_end(&mut bytes).await?;
        let data: serde_json::Value = from_slice(&bytes)?;
        Ok(CobwebAssetFile {
            file: LoadableFile::new(&load_context.asset_path().path().to_string_lossy()),
            data,
        })
    }

    fn extensions(&self) -> &[&str]
    {
        &[".caf.json"]
    }
}

JoJoJet · 2024-07-10T17:24:04Z

Seems like a misleading error message for what should be a simple fix (just remove the explicit elided lifetime). Do you mind making a rustc issue?
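
For reference, the shape of the fix can be sketched with a mock trait. Everything here is a hypothetical stand-in: this `Reader` is a local synchronous trait, not `bevy::asset::io::Reader`, and the sketch does not reproduce the compiler error. It only shows that a trait with no lifetime parameter is written as `dyn Reader` in the parameter type, with nothing after the trait name.

```rust
use std::io;

/// Mock stand-in for the reader trait; like the trait after this PR,
/// it takes no lifetime parameter.
trait Reader {
    fn read_to_end(&mut self, buf: &mut Vec<u8>) -> io::Result<usize>;
}

impl Reader for &[u8] {
    fn read_to_end(&mut self, buf: &mut Vec<u8>) -> io::Result<usize> {
        // Delegate to the standard library's implementation for byte slices.
        io::Read::read_to_end(self, buf)
    }
}

/// After the fix: `&'a mut dyn Reader`, with no `<'_>` on the trait object.
fn load<'a>(reader: &'a mut dyn Reader) -> io::Result<Vec<u8>> {
    let mut bytes = Vec::new();
    reader.read_to_end(&mut bytes)?;
    Ok(bytes)
}
```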

UkoeHB · 2024-07-10T18:26:33Z

Ok, rust-lang/rust#127585

JoJoJet added 3 commits June 30, 2024 14:59

return an opque type from read and read_meta

97f2b77

add trait method read_to_end

1d95a42

make read_to_end overrideable

27cd5dd

alice-i-cecile added this to the 0.15 milestone Jun 30, 2024

JoJoJet added the C-Performance label (a change motivated by improving speed, memory usage or compile times) Jun 30, 2024

alice-i-cecile reviewed Jun 30, 2024

crates/bevy_asset/src/io/mod.rs (resolved)

alice-i-cecile reviewed Jun 30, 2024

crates/bevy_asset/src/io/mod.rs (resolved)

JoJoJet added 18 commits June 30, 2024 16:10

document Reader

9c8d16f

document STACK_FUTURE_SIZE

9605d98

remove unused import

a7df529

fix a fn

8d83edc

fix trait objects

51bf9cc

fix a typo

5c3583c

remove lifetime

37537e7

remove unused import

be3e33c

remove anonymous lifetimes

ef97501

fix an import

01d00a6

remove more unused imports

1b72790

import

67caaea

readd import

0ca07e8

try

7a0375b

fixes

985e7fe

fix test

758cbef

example

0d8270c

import path

3628e4e

JoJoJet added 8 commits June 30, 2024 18:32

impl Reader for Box<dyn Reader>

f5e3474

impl keyword

e74acea

remove another import

ac2aac1

remove more lifetimes

0b670be

specify type for doc

45ccf67

fix a typo

bbc500d

fix a method name

ad67e45

use extend_from_slice

3c6d353

alice-i-cecile approved these changes Jul 1, 2024

use associated type bounds to improve style

8b0d587

cart approved these changes Jul 1, 2024

cart added this pull request to the merge queue Jul 1, 2024

Merged via the queue into bevyengine:main with commit 5876352 Jul 1, 2024
31 checks passed

JoJoJet deleted the optimize-asset-read branch July 1, 2024 23:45

UkoeHB mentioned this pull request Jul 10, 2024

Misleading error message for remove explicit elided lifetime rust-lang/rust#127585

Open

This was referenced Nov 12, 2024

0.15 Migration: Missing guide for 14082 (Optimize common usages of AssetReader) bevyengine/bevy-website#1793

Closed

generate-release-notes tool missed some PRs that need stubs bevyengine/bevy-website#1724

Closed

alice-i-cecile mentioned this pull request Nov 21, 2024

Add missing migration guides for 0.15 bevyengine/bevy-website#1831

Closed

6 tasks
