-
-
Notifications
You must be signed in to change notification settings - Fork 4.2k
Improve frustum culling of skinned meshes through per-joint bounds #21837
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
…ional. This is arguably unnecessary as the `Mesh3d` component currently requires a transform, but maybe that changes in future.
…nent + asset. So `SkinnedMeshBounds` is now a component with a `Handle<SkinnedMeshBoundsAsset>`. This is annoying, but fixes render-world-only meshes breaking, and means `update_skinned_mesh_bounds` can efficiently ignore skinned meshes without skinned bounds.
…added bounds rendering for testing, but might remove it later.
…joint indices rather than keeping them separate. This is a little bit wasteful (two bytes per joint), but much simpler.
…e `SkinnedMeshBoundsAsset::from_mesh`.
|
I very much like the approach presented here and think it's a very reasonable default. The policy enum should be expanded to include @pcwalton's approach: don't generate any AABB, but keep frustum culling, since all AABBs will be artist-authored and inserted manually |
|
Oh nice, I just filed #21845 for the rest pose approach. |
I'd like to consider some different approaches: enum GltfSkinnedMeshBoundsPolicy {
...
/// Skinned meshes are assigned this `Aabb` component.
Fixed { aabb: Aabb },
/// Skinned meshes are assigned a `SkinnedMeshBounds` component, which will
/// be used by the `bevy_camera` plugin to update the `Aabb` component.
///
/// The `Aabb` component will be updated to enclose the given AABB relative to the named joint.
/// If the joint name is `None`, the AABB is relative to the root joint.
FixedJointRelative { aabb: Aabb3d, joint_name: Option<String> },
}So I'm not sure if I'll try adding these options in this PR as it's already a bit chonky. |
Objective
Mostly fix #4971 by adding a new option for updating skinned mesh
Aabbcomponents from joint transforms.skinned_bounds_pr.mp4
This fixes cases where vertex positions are only modified through skinning. It doesn't fix other cases like morph targets and vertex shaders.
The PR kind of upstreams
bevy_mod_skinned_aabb, but with some changes to make it simpler and more reliable.Why A Draft?
I'm fishing for design feedback and any other concerns before I finish up the PR. The current state is functional and mostly tested. The remaining work is documentation, error handling, final testing and release notes. There also needs to be a decision on whether the new option is enabled by default in the glTF importer.
Background
If a main world entity has a
Mesh3dcomponent then it's automatically assigned anAabbcomponent. This is done bybevy_cameraorbevy_gltf. TheAabbis used bybevy_camerafor frustum culling. It can also be used bybevy_pickingas an optimization, and by third party crates.But there's a problem - the
Aabbcan be wrong if something changes the mesh's vertex positions after theAabbis calculated. The most common culprits are skinning or morph targets. Vertex positions can also be changed by mutating theMeshasset (#4294), or by vertex shaders.The most common solution has been to disable frustum culling via the
NoFrustumCullingcomponent. This is simple, and might even be the most efficient approach for apps where meshes tend to stay on-screen. But it's annoying to implement, bad for apps where meshes are often off-screen, and it only fixes frustum culling - it doesn't help other systems that use theAabb.Solution
This PR adds a reliable and reasonably efficient option for updating the
Aabbof a skinned mesh from its animated joint transforms. See the "How does it work" section for more detail.The glTF loader can add skinned bounds automatically if a new
GltfSkinnedMeshBoundsPolicysetting is enabled inGltfPluginorGltfLoaderSettings:For non-glTF cases, the user can create a
SkinnedMeshBoundsAssetfrom aMeshand bind it to an entity with theSkinnedMeshBoundscomponent.See the
custom_skinned_meshexample for real code.I'm unsure if the
SkinnedMeshBoundsname is right. It's not "the bounds of the skinned mesh" - that's theAabb. It's the things that contribute to the skinned mesh bounds.Bonus Features
GltfSkinnedMeshBoundsPolicy::NoFrustumCullingThis is a convenience for users who prefer the
NoFrustumCullingworkaround, but want to avoid the hassle of adding it after a glTF scene has been spawned.Note that PR #21787 is doing something similar, or maybe adding a different option. I'm fine if that PR lands first and this PR is adjusted to fit.
Gizmos
bevy_gizmos::SkinnedMeshBoundsGizmoPlugincan draw the per-joint AABBs.Like with
SkinnedMeshBounds, I'm unsure if the name is right. It's not technically drawing the bounds of the skinned mesh - it's drawing the per-joint bounds that contribute to the bounds of the skinned mesh.Testing
I also hacked
custom_skinned_meshto simulate awkward cases like rotated and off-screen entities.How Does It Work?
Click to expand
Summary
When created from a
Mesh, theSkinnedBoundsMeshAssetcalculates an AABB for each joint - the AABB encloses all the vertices skinned to that joint. Then every frame,bevy_camera::update_skinned_mesh_boundsuses the current joint transforms to calculate anAabbthat encloses all the joint AABBs.This approach is reliable, in that the final
Aabbwill always enclose the skinned vertices. But it can be larger than necessary. In practice it's tight enough to be useful, and rarely more than 50% bigger.This approach works even with non-rigid transforms and soft skinning. If there's any doubt then I can add more detail.
Awkward Bits
There's a few issues that stop the solution being as simple and efficient as it could be.
Problem 1: Joint transforms are world-space,
Aabbis entity-space.Aabb, but that's not possible.Aabbis directly calculated in entity-space.Aabbin world-space and then transforms it to entity-space.Aabb.Problem 2: Joint AABBs are in a surprising(?) space.
MeshandSkinnedMeshInverseBindposes- and those assets can be mixed and matched.SkinnedMeshBoundsAssetfor each combination ofMeshandSkinnedMeshInverseBindposes.bevy_mod_skinned_aabbuses this approach - it's slow and fragile.)Meshasset, so for eachMeshthere's a matchingSkinnedMeshBoundsAsset.Pseudo-Code
Here's a pseudo-code version of the update so we can compare against other options.
Future Options
For frustum culling there's a cheeky way to optimize and simplify skinned bounds - put frustum culling in the renderer and calculate a world-space AABB during
extract_skins. It's a good fit because skin extraction has already grabbed the joint transform and calculatedworld_from_mesh. I estimate this would make skinned bounds 3x faster.(Note that putting frustum culling into the renderer doesn't mean removing the current main world visibility system - it just means the main world system would be separate opt-in system)
Another option is to change main world frustum culling to use a world-space AABB. So there would be a new
GlobalAabbcomponent that gets updated each frame fromAabband the entity transform (which is basically the same as transform propagation and the relationship betweenTransformandGlobalTransform). This has some advantages and disadvantages but I won't get into them here - I think putting frustum culling into the renderer is a better option.Performance
Click to expand
Initialization
Creating the skinned bounds asset for
Fox.glb(576 verts, 22 skinned joints) takes 0.03ms. Loading the whole glTF takes 8.7ms, so this is a <1% increase. The relative increase could be higher for glTFs that need less processing (e.g. pre-compressed textures).Per-Frame
The
many_foxesexample has 1000 skinned meshes, each with 22 skinned joints. Updating the skinned bounds take 0.086ms. This is a throughput of roughly 250,000 joints per millisecond, using two threads.The whole animation update takes 3.67ms (where "animation update" = advancing players + graph evaluation + transform propagation). So we can kinda sorta claim that this PR increases the cost of skinned animation by roughly 3%. But that's very hand-wavey and situation dependent.
This was tested on an AMD Ryzen 7900 but with
TaskPoolOptions::with_num_threads(6)to simulate a lower spec CPU. Comparing against a few other threading options:So the parallel iterator is better but quickly hits diminishing returns as the number of threads increases.
Future Options
The "How Does It Work" section mentions moving skinned mesh bounds into the renderer's skin extraction. Based on some microbenchmarks, I estimate this would reduce non-parallel
many_foxeson my CPU from 0.141ms to 0.049ms, so roughly 3x faster. Requiring AVX2 (to enable broadcast loads) or pre-splatting (to fake broadcast loads for SSE) would knock off another 25%. And fancier SIMD approaches could do better again.There's also approaches that trade reliability for performance. For character rigs, an effective optimization is to fold face and finger joints into a single bound on the head and hand joints. This can reduce the number of joints required by 50-80%. Another option is to use bounding spheres instead of AABBs.
FAQ
Click to expand
Should the new option be enabled by default in the glTF importer?
I think so. Bugs caused by skinned mesh culling have been a regular pain for both new and experienced users. I'm guessing most people would pay the CPU cost to have skinned meshes Just Work(tm). Sophisticated users in search of performance can choose to disable the option and use a fixed AABB.
Why is the automatic option tied to the glTF importer? Why do custom meshes have to be done manually? Shouldn't this be automatic for any mesh?
bevy_mod_skinned_aabbtook the automatic approach, and I don't think the outcome was good. It needs some surprisingly fiddly and fragile logic to decide when an entity has the right combination of assets in the right loaded state. And its broken byRenderAssetUsages::RENDER_WORLD.So this PR takes a more modest and manual approach. I think there's plenty of scope to make things more general and automated in future, particularly as the asset pipeline matures. So if the glTF importer becomes a purer glTF -> BSN transform, then adding skinned bounds could be a general scene transform or asset transform that's shared with other importers and custom mesh generators.
Why is there a separate
SkinnedMeshBoundsAsset? Couldn't that data go in theMeshorSkinnedMeshInverseBindposesassets?The data can't go in
SkinnedMeshInverseBindposes- it has to match theMeshit's used with, and a singleSkinnedMeshInverseBindposescan be shared between multipleMeshassets (see the "Awkward Bits" section above for more background).The first version of this PR did put the data in
Mesh. But a separate asset plus component has two advantages:MeshisRenderAssetUsages::RENDER_WORLD.update_skinned_mesh_boundscan efficiently skip any skinned meshes without skinned bounds.The situation could change. Maybe frustum culling moves to the render world and can safely use data from
Mesh- although that could be a problem if some other system still wants skinned bounds in the main world. Another path is forRenderAssetUsagesto be redesigned, so the renderer and main worlds can negotiate which parts they need.Why
SkinnedMeshBounds? Shouldn't it beSkinnedMeshAabbs?Playing it safe in case things change, e.g. bounding spheres might become an an option.
Why is the bounds update in
bevy_camera? Shouldn't it be inbevy_mesh?This gets tricky because the behavior depends on who wants to use
Aabbcomponents. Maybe an app usesbevy_meshbut doesn't usebevy_camera, so updating theAabbwould be redundant. Or maybe the app doesn't usebevy_camera, but does usebevy_picking. Or maybe it usesbevy_pickingbut not the mesh picking parts. Etc etc. Plugins are hard!Putting the update in
bevy_cameraseems like the path of least resistance for now. Maybe that changes if plugins can negotiate features.What Do Other Engines Do?
Click to expand
An approach that's been proposed several times for Bevy is copying Unity's "fixed AABB from animation poses". I think this is more complicated and less reliable than many people expect. More complicated because linking animations to meshes can often be difficult. Less reliable because it doesn't account for ragdolls and procedural animation. But it could still be viable for for simple cases like a single self-contained glTF with basic animation.