GSoC 2025: GPU-accelerated raster ops #2658
- Preparation: anything before week 1.
- Week 1: being sick. Am sick this week, likely caught something last week at the (amazing) RustWeek conference. Still made some progress.
- Week 2: cargo-gpu naga transpile vertical slice. A vertical slice of using cargo-gpu to compile rust-gpu shaders and transpile them to WGSL with naga, so they can be passed to wgpu. Based on a game jam game of mine with just 4 shaders, small enough to allow quick iteration and API exploration.
- Week 3: invert_gpu.
- Week 4.
- Week 5: refactoring.
- Week 8: rust-gpu edition 2024.
- Week 9: clippy and miri with cargo-gpu.
- Week 10: prep graster-nodes for shaders.
- Week 11: first WGSL compilation.
- 2 week break: not advancing the week counter, just taking 2 weeks off.
- Week 12. Demo: invert node — the invert node works again. The rest of the nodes are non-functional, as the runtime does not yet support non-texture arguments. (The "Extract Executor" is copied from within the "Upload Texture" node.) Demo: posterize node — we can now pass parameters to shaders, allowing nodes like posterize to work! Though any node using floating point numbers is still broken, since I had to switch them from f64 to f32* and neither Graphite's UI nor the node graph seems to support that properly. (*f64 on GPUs: not only is it typically 64 times slower on consumer-level GPUs, but the WebGPU spec doesn't expose f64 either. The wgpu impl exposes a […])
- Week 13.
Problem
Graphite used to have no hardware acceleration and composed the final image entirely on the CPU, but in the past year the vello renderer was implemented to accelerate image composition on the GPU. However, vello is only well suited for composing vector elements and plain raster images. It does not implement any operations for (post-)processing raster images, like adjusting hue or saturation, blending images, applying a posterization effect, etc. Currently, these "raster ops" are still evaluated on the CPU, and since Graphite runs within the WebAssembly (wasm) environment of the browser, they are limited to a single thread processing one pixel at a time. This significantly slows editing whenever raster-based content is present, in practice to 2-3 fps on a typical desktop system.
Benefit
I want to accelerate the processing of raster ops using the GPU to achieve acceptable levels of performance.
Instead of porting the existing raster ops written in Rust to a shader language, I want to use the rust-gpu shader compiler to compile the existing Rust-based raster ops into shaders. This allows Graphite to continue using the existing CPU path if the WebGPU or WebGL API is not available, without duplicating the code across several languages.
Final Report
graphite_final_report.mp4
The video above compares the previous CPU-based raster image manipulation, shown first, to the new GPU/shader-based path. By moving the image around, the image adjustments are reevaluated constantly, and we can observe how fast these reevaluations are performed. The CPU path is only able to emit a new frame every second, whereas the GPU shaders evaluate the adjustments close to real-time. However, we can observe significant hitching on the GPU path because GPU images are inefficiently reallocated every frame, which would require some further investigation.
The source image is a 720p PNG export of the isometric fountain artwork that has its brightness and contrast adjusted, is posterized, and finally has some color level adjustments applied. The top graph, shown first, uses CPU nodes to evaluate the adjustments; the bottom graph uploads the image to the GPU and evaluates all the adjustments in GPU shaders.
How to make new shader nodes
Note that GPUs have limitations in the kinds of operations they can perform. Your node may only use the primitive types `u32` or `f32`; you must not use `u8`, `u16`, `u64`, `usize` or `f64` anywhere. Enums must be C-like enums (they must not carry values, like `Option` does) and must be `#[repr(u32)]`. Many of the "node::registry" types like `Percentage` or `Angle` have equivalent f32 variants named `PercentageF32` and `AngleF32`; you must use those to avoid f64. Shaders are `#[no_std]`, so you must only use symbols from `core` and replace all use statements from `std` with `core`. Notably, this excludes `Vec`, `Box`, `Arc` and any derivative types using them. If your crate contains nodes that require symbols from `std`, I recommend giving them their own module and excluding the entire module with `#[cfg(feature = "std")]`. This way you don't need to feature gate individual use statements and functions, which is often error prone.

To declare a node as some shader node type, add `shader_node(<node name>)` to the node macro. The shader system is designed to allow many kinds of shader nodes, though only `None` and `PerPixelAdjust` have been implemented so far. `None` marks the node as "not a shader node", but importantly adds the required `std` feature gates to the node implementation; see "Graphite integration" below for how the feature gate works. `PerPixelAdjust` runs the node once per pixel, where it is passed the color from the input images and must return the resulting color. This functions similarly to the `Adjust` trait, though it is not limited to that trait specifically. It differentiates between two kinds of node params: uniform params that are the same for each pixel, and image params marked with `#[gpu_image]` that are passed the `Color` of some input image at that pixel's location.

All uniform params must implement `BufferStruct`, which may be `#[derive(BufferStruct)]` on any struct or enum. Enums additionally require `#[repr(u32)]` and `#[derive(num_enum::FromPrimitive, num_enum::IntoPrimitive)]`. Structs or enums only used within the function, but not passed as parameters, may be but don't need to be a `BufferStruct`. (Small detail: any `BufferStruct::Buffer` must also have an alignment of 4, otherwise you may (not must) get padding issues within the generated `Uniform` struct. Not sure if there's an easy way around that issue.)

Image params take a `Raster<GPU>` image as an input on the node graph, but expect that parameter to accept an instance of `Color` and the function to return `Color`. Typically, you implement this by giving your node function a `T: MyTrait` generic where `MyTrait` is implemented for `Color`, defining the type of all image params as `T` and the return type also as `T`. The node macro will generate a new "gpu" node that mirrors the original node, with the image params replaced by `Raster<GPU>` (and a `&WgpuExecutor` param appended), and it will also codegen the associated shader entry point that loads the colors from the input images, calls the node function once for each pixel, and stores the returned `Color` to the output image. Currently, you may only have exactly one image param in a node function, due to some limitations in the shader runtime. I hope to clean that up soon.
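Putting a few of these rules together, here is a small, hedged sketch of the kind of per-pixel logic a shader node contains. It deliberately avoids Graphite's real macros and types (the `Rgba` stand-in and the function and enum names are made up), so it only illustrates the constraints: C-like `#[repr(u32)]` enums, f32/u32-only math, and no `std`/`alloc` symbols. In an actual node, the color would arrive through a `#[gpu_image]` param and `levels`/`mode` through uniform params carrying the `BufferStruct` and num_enum derives described above.

```rust
// Self-contained sketch of the per-pixel math a shader node boils down to.
// In Graphite this logic would live in a node function annotated with the node
// macro plus `shader_node(PerPixelAdjust)`. Std-only nodes would instead sit in
// a gated module, e.g.:
//
//     #[cfg(feature = "std")]
//     mod io_nodes; // free to use Vec, Arc, std::fs, ...

/// Uniform-param-style enum: C-like variants only, u32 representation.
#[repr(u32)]
#[derive(Clone, Copy, PartialEq, Eq)]
pub enum PosterizeMode {
    PerChannel = 0,
    Luminance = 1,
}

/// Minimal stand-in for a color value (rgba in 0.0..=1.0, f32 per channel).
#[derive(Clone, Copy)]
pub struct Rgba {
    pub r: f32,
    pub g: f32,
    pub b: f32,
    pub a: f32,
}

/// Quantize a single channel into `levels` discrete steps (core-only math).
fn quantize(value: f32, levels: f32) -> f32 {
    ((value * levels) as u32) as f32 / levels
}

/// The per-pixel body: called once per pixel with that pixel's color.
pub fn posterize(color: Rgba, levels: f32, mode: PosterizeMode) -> Rgba {
    match mode {
        PosterizeMode::PerChannel => Rgba {
            r: quantize(color.r, levels),
            g: quantize(color.g, levels),
            b: quantize(color.b, levels),
            a: color.a,
        },
        PosterizeMode::Luminance => {
            let luma = 0.2126 * color.r + 0.7152 * color.g + 0.0722 * color.b;
            let scale = if luma > 0.0 { quantize(luma, levels) / luma } else { 0.0 };
            Rgba {
                r: color.r * scale,
                g: color.g * scale,
                b: color.b * scale,
                a: color.a,
            }
        }
    }
}
```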
The technical bits
We want to use the rust-gpu shader compiler so we don't duplicate code between shader and CPU nodes. A typical shader compiler like glslc (or shaderc) takes input files written in the C-like GLSL language and turns them into SPIRV, a binary intermediate representation (IR) for shaders. Think of it like Java or C# bytecode that the graphics driver accepts and compiles down to the machine code needed for your graphics card. rust-gpu works quite similarly, except that its input language is ordinary Rust and not some C-like custom language.
Unlike a typical shader compiler, rust-gpu is not a full compiler but merely a "codegen backend" for the rustc compiler. This allows rust-gpu to reuse all the tokenization, parsing, type system, borrow checking etc. of the standard rustc compiler, and thus parse all the constructs of the Rust programming language. At the very end of the rustc compiler pipeline, the Middle IR (MIR) is passed into a codegen backend to generate machine code. This codegen backend is typically LLVM, but it is replaceable by a dynamically loaded library, like the `rustc_codegen_spirv` crate of rust-gpu. Codegen backends are built against an unstable internal interface that can change on a whim. This has a few important implications:
rust-gpu edition 2024
With the release of rustc 1.85.0 the Rust 2024 edition was stabilized, but it also came with significant changes to the codegen backend interface. This proved to be a significant challenge: while edition 2024 released in February, it took us until July to port rust-gpu to newer toolchain versions. This was a huge blocker for Graphite, as they and the entire Rust ecosystem had already moved to edition 2024, which the older compiler could not support. We had to wait for rust-gpu to support edition 2024, as backporting all of the crates to edition 2021 was not an option.
The porting has primarily been done by @eddyb in Rust-GPU/rust-gpu#249 while I was mostly testing the branch against various projects and debugging issues as they popped up. Notably, this update also required changing the "target specs", a set of JSON specification files the rustc compiler requires of every codegen backend.
cargo-gpu
When I first looked at cargo-gpu, it was a command line application to compile a shader crate to SPIRV, the shader IR. Notably, it did not care what toolchain you used in your project and automated the entire process of setting up rust-gpu: it would download rust-gpu and the required toolchain, compile rust-gpu with that toolchain and cache the build, and finally compile the shaders using the selected rust-gpu and toolchain version.
However, it had a few issues that needed to be resolved before we could use it:

- `rustc_backend_spirv` dylibs Rust-GPU/cargo-gpu#69

While integrating these needed changes, I effectively refactored the entire codebase in 12 PRs, adding 3,005 and removing 4,610 lines.
wgsl transpiling with naga
SPIRV is an open standard shader IR for exchanging shaders between various compilers and graphics APIs. It was originally invented for the OpenCL compute API, but has evolved into the shader IR of the Vulkan API, which is the main graphics API on both Linux and Android systems. Even Windows is replacing its DXIL with SPIRV, making it the primary cross-platform shader IR to use. It was even set to be the main shader IR for the WebGPU API, but some parties were strictly against using that open IR standard. And so we got WGSL, a new C-like shading language with a Rust-like syntax that's not anything like Rust itself. And as is typical of the web, it is sent to the browser as source code, so we may see some platform differences between Firefox, Chrome and Safari.

But since rust-gpu emits SPIRV, how do we convert from SPIRV to WGSL? Firefox implements the WebGPU API with their `wgpu` crate, which has a shader transpiler called `naga`. It was primarily built to convert WGSL into the different output shader languages needed for Firefox to run on all platforms: SPIRV for Vulkan on Linux and Windows, MSL on Mac and HLSL on Windows. But it also supports SPIRV input and WGSL output, so we can chain the compilers to go from Rust to SPIRV to WGSL. This isn't anything new: schell's renderling has been doing this quite successfully and has contributed many fixes to the SPIRV input module.

But I didn't just want to set up WGSL transpiling for Graphite; it would be much nicer if rust-gpu could handle that internally. This would also allow us to specialize the SPIRV we're emitting for naga, if necessary. So I got to implementing a `spirv-unknown-naga-wgsl` target, both parts of which need a bit more work before they can be merged:

- `enum SpirvTargetEnv` containing all available targets Rust-GPU/rust-gpu#311
- `spirv-unknown-naga-wgsl` target via naga Rust-GPU/rust-gpu#280

To test this entire stack without having to integrate it directly into Graphite, I chose to reuse a game jam game of mine called Colorbubble. It uses wgpu, hand-written WGSL and can be deployed to the web, just like Graphite should. And with only a single hand-written WGSL file of 75 lines, it is the perfect small project to test the entire tech stack of cargo-gpu and naga-wgsl transpiling. The result can be found on the branch `cargo-gpu`, which can be compiled on stable and deployed to the web. I also built a vertical slice in week 3 to compile Graphite's invert node into a shader, as it's the simplest node without any parameters.
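For reference, the SPIRV-to-WGSL step itself is fairly little code when driven through naga as a library. Here is a hedged sketch of that conversion (error handling and the surrounding build plumbing omitted; it assumes naga is built with the `spv-in` and `wgsl-out` features and is not the exact code used in the project):

```rust
use naga::back::wgsl;
use naga::front::spv;
use naga::valid::{Capabilities, ValidationFlags, Validator};

/// Convert a compiled SPIR-V binary (as produced by rust-gpu) into WGSL text.
fn spirv_to_wgsl(spirv_bytes: &[u8]) -> Result<String, Box<dyn std::error::Error>> {
    // Parse the SPIR-V words into naga's IR module.
    let module = spv::parse_u8_slice(spirv_bytes, &spv::Options::default())?;

    // Validation also produces the ModuleInfo that the WGSL backend needs.
    let info = Validator::new(ValidationFlags::all(), Capabilities::all()).validate(&module)?;

    // Write the module back out as WGSL source text.
    Ok(wgsl::write_string(&module, &info, wgsl::WriterFlags::empty())?)
}
```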
Graphite integration
The proper integration into Graphite was more difficult than initially expected due to rust-gpu's `no_std` requirement. From rustc's perspective, rust-gpu is essentially an embedded target that does not have access to the standard library or a global memory allocator, so it requires every crate to be `no_std` (and not need the `alloc` crate either). This makes symbols like `Vec`, `Box` and `Arc` unavailable, since they require a global allocator. If you still want to use some symbol from std when it is available, you have to mask it out with a feature, typically the `std` feature on libraries. But correctly masking out std symbols is a quite error prone process, and it is easy to accidentally import from `std` instead of `core`.

I was told that Graphite supported a no-std compile without the `std` feature, from a previous but quite different rust-gpu integration attempt. But alas, that support had withered away, with almost nothing being masked out correctly. So I decided to pursue a different approach to no-std compatibility: instead of masking out all the std symbols, we have a separate `shader` crate that is (almost) completely no-std and is re-exported in the std crate. This allows me to move certain no-std symbols from the std crate to the shader crate without breaking the paths of upstream crates. And since the `shader` crate is no-std by default, you will notice immediately if something isn't masked correctly.
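As a tiny, hedged sketch of that layout (the crate and module names here are illustrative, not Graphite's actual ones):

```rust
// my-nodes-shaders/src/lib.rs - the shader crate: no-std by default, so any
// stray `std::` import fails to compile right away.
#![no_std]

pub mod color {
    // shader-safe types and functions only (f32/u32, core-only imports)
}

// my-nodes/src/lib.rs - the std crate re-exports the shader crate's modules,
// so downstream code keeps using `my_nodes::color::...` paths unchanged.
pub use my_nodes_shaders::color;
```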
The original plan was to split out code from the `gcore` crate into a variety of smaller crates, leaving `gcore` with just no-std types. I've extracted `gapplication_io`, `gpath_bool`, `gsvg_renderer`, `gmath_nodes`, `gelement-nodes`, `graster-nodes`, `gbrush`, and `gtext` (never PRed), but it became obvious that this is a monumental task that I could not finish within the given time frame. But with `graster-nodes` extracted, we could start integrating shaders into it without affecting everything surrounding it.
Since `gcore` wasn't going to be no-std anytime soon, we decided on creating the `gcore-shaders` crate that operates as the no-std-only part of `gcore`, and moved over Graphite's `color` and `blending` modules. Most of this was in PR #2925, though it did require multiple fixups for smaller breakages before the shader code was merged. Another difficulty were the types that give node parameters units, such as percentage or angle, since they were f64. In #3095 I created alternative f32 variants and @0HyperCube managed to fix up the UI to support f32 types again. A similar story ensued with `Table<Color>` parameters, since it contains a `Vec`, but since the nodes only ever used the first Color anyway, we decided to switch to a plain `Color` in #3096.
The `graster-nodes` crate has a similar-looking `graster-nodes-shaders` crate, although it functions quite differently. Since the node macros can only emit tokens where they are in the code, we need to compile the entirety of `graster-nodes` as a shader and use careful feature gating to exclude std symbols. The node macro has been modified to wrap the CPU and GPU node implementations in a feature gate on the (hardcoded) `std` feature if a shader node declaration is present. The shaders crate has a build script that compiles `graster-nodes` into a WGSL shader and includes the result as a string. We could put the build script on `graster-nodes` directly; however, building and executing build scripts can't be feature gated. We would then always compile the `cargo-gpu` dependency, even when not building with rust-gpu shaders, and build it a second time when compiling the shaders, since that uses a different target directory.
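A hedged sketch of how such a shaders crate can expose the compiled result (the output file name is illustrative; the build script itself, which drives cargo-gpu and the naga transpile, is omitted here):

```rust
// graster-nodes-shaders/src/lib.rs (illustrative): the build script is assumed
// to have written the transpiled WGSL into OUT_DIR, and we embed it as a
// string constant that the runtime crate re-exports and hands to wgpu.
pub const WGSL_SHADER: &str = include_str!(concat!(env!("OUT_DIR"), "/shader.wgsl"));
```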
A new crate with shader node support could easily be added by copying much from `graster-nodes`: it needs a copy of the `graster-nodes-shaders` crate, an `std` feature that enables the dependency on the shader crate, a re-export of the `WGSL_SHADER` from the shader crate in `lib.rs`, and feature gates for everything std-dependent behind the `std` feature. Node functions can be feature gated by adding a standard `#[cfg(...)]` to gate the entire function, or by adding either a `cfg()` or a `shader_node(None)` to the node macro to keep the function but gate everything the node macro emits. At the moment, you'd also have to copy the `mod fullscreen_vertex`; this should likely be moved to a macro when a second crate becomes necessary.
Runtime
It was specifically requested that this be compatible with WebGL, since WebGPU support still isn't widespread enough, so we're limited to fragment shaders because compute shaders are unavailable in WebGL. I've therefore opted for a pretty standard rasterization pipeline, with a vertex shader emitting a fullscreen triangle and the fragment shader calling the node function for each pixel in the output image. A storage buffer (not a uniform buffer, since that has extra alignment requirements) is bound at binding 0 to pass a `Uniform` struct containing all uniform params, and the input images start at binding 1 counting upwards (or start at 0 if there are no uniform params). The fragment shader simply calls the node function and returns its `Color` as output, with uniform params loading their values from the storage buffer and image params `texelFetch`-ing their colors from their associated image at the `gl_FragCoord` (GLSL) / `@builtin(position)` (WGSL) integer pixel coordinates.

When executed, each GPU operation allocates an output image and draws into it on its own; there is no "global reasoning" or execution system present. This means multiple sequential GPU operations simply execute one after another, which is quite inefficient in terms of not just memory consumption but also memory bandwidth. I proposed building a shader composition system to merge shaders and reduce these bottlenecks, but didn't have enough time to implement it. And the current system is plenty faster than the CPU one anyway.
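To make the binding layout concrete, here is a hedged, rust-gpu-flavoured sketch of roughly what the generated entry points amount to. The `Uniform` contents and the `per_pixel` body are placeholders and the real generated code differs; the parts taken from the description above are the fullscreen-triangle vertex shader, the storage buffer at binding 0, input images from binding 1, and the per-pixel fetch at the fragment coordinate.

```rust
use spirv_std::glam::{IVec2, Vec2, Vec4};
use spirv_std::{spirv, Image};

/// Placeholder uniform struct; the real one is generated from the node's
/// uniform params (and must satisfy the BufferStruct/alignment rules).
pub struct Uniform {
    pub strength: f32,
}

/// Placeholder for the per-pixel node function invoked for each fragment.
fn per_pixel(color: Vec4, strength: f32) -> Vec4 {
    Vec4::new(
        color.x * (1.0 - strength),
        color.y * (1.0 - strength),
        color.z * (1.0 - strength),
        color.w,
    )
}

#[spirv(vertex)]
pub fn fullscreen_vertex(
    #[spirv(vertex_index)] vertex_index: i32,
    #[spirv(position)] out_position: &mut Vec4,
) {
    // One triangle covering the whole viewport: (-1,-1), (3,-1), (-1,3).
    let uv = Vec2::new(((vertex_index << 1) & 2) as f32, (vertex_index & 2) as f32);
    *out_position = Vec4::new(uv.x * 2.0 - 1.0, uv.y * 2.0 - 1.0, 0.0, 1.0);
}

#[spirv(fragment)]
pub fn fragment(
    #[spirv(frag_coord)] frag_coord: Vec4,
    // Uniform params packed into a storage buffer at binding 0.
    #[spirv(storage_buffer, descriptor_set = 0, binding = 0)] uniform: &Uniform,
    // Input images start at binding 1 (or 0 if there are no uniform params).
    #[spirv(descriptor_set = 0, binding = 1)] input: &Image!(2D, type=f32, sampled),
    out_color: &mut Vec4,
) {
    // Fetch this pixel's color at the integer fragment coordinate and run the
    // node function once for it.
    let texel: Vec4 = input.fetch(IVec2::new(frag_coord.x as i32, frag_coord.y as i32));
    *out_color = per_pixel(texel, uniform.strength);
}
```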
Related PRs:

- `BufferStruct` to support bool and enums #3109

Summary
Overall, the project was a success in that the infrastructure to support shader nodes has been set up. Unfortunately, the Graphite integration took quite a bit longer than initially expected, which resulted in only a handful of shader nodes being available as of right now. But with the supporting infrastructure present and most blockers resolved, porting further nodes to shaders should be very straightforward.
All code changes summarized (between 2025-04-15 and 2025-09-01):
Future directions
This is a list of things that could be implemented in the future to further advance the system:

- `WgpuExecutor` input on GPU nodes and connect it automatically

old notes, please ignore
Constraints
Deliverables

- `nightly-2025-05-09` (~1.88) and Rust 2024 edition. Rust-GPU/rust-gpu#249
- `rustc_backend_spirv` dylibs Rust-GPU/cargo-gpu#69