Guilherme Avila

@ScieCode

Graphics Engineer

Guilherme Avila reposted

New blog post! "Load store conflicts", in which we look at some performance sensitive code that has surprisingly dramatic performance swings based on the compiler and the microarchitecture used. RTs appreciated! zeux.io/2025/05/03/loa…


Guilherme Avila reposted

GH stars / retweets welcome as usual ;) GitHub release link: github.com/zeux/meshoptim…


Found another potential optimization: breaking up the instruction dependency allows better pipelining while the quarter-rate rcp runs. Got mixed results here, either no net gain or ~5% gain. Probably depends on the compiler, but should use this formulation either way.

I see Stubbe's octahedral decode opt in almost every usage I come across. But I've never seen it used for encoding. The same symmetry applies here, and it's ~50% faster. Especially important for octahedral-encoded textures, as it's required for every sample! Please RT.
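
The code from the post's image is not reproduced here, so the following is only a minimal sketch of what such an encode could look like, assuming the usual octahedral mapping. The function names `oct_encode_reference` and `oct_encode_symmetric` are hypothetical. After L1 normalization, `1 - |y|` equals `|x| + |z|`, so the lower-hemisphere fold reduces to pushing `x` and `y` outward by `max(-z, 0)`, which removes the branch and the `sign()` multiply. Python is used only to check that the two forms agree; it says nothing about the speedup, which would depend on the shader compiler and GPU.

```python
import math

def oct_encode_reference(n):
    """Common formulation: L1-normalize, then fold the lower hemisphere."""
    x, y, z = n
    s = abs(x) + abs(y) + abs(z)
    x, y, z = x / s, y / s, z / s
    if z < 0.0:
        # Both right-hand sides use the pre-fold x and y.
        x, y = ((1.0 - abs(y)) * math.copysign(1.0, x),
                (1.0 - abs(x)) * math.copysign(1.0, y))
    return x, y

def oct_encode_symmetric(n):
    """Assumed branch-free form: shift xy outward by t = max(-z, 0)."""
    x, y, z = n
    s = abs(x) + abs(y) + abs(z)
    x, y, z = x / s, y / s, z / s
    t = max(-z, 0.0)          # zero for the upper hemisphere
    x += math.copysign(t, x)  # x + sign(x) * t
    y += math.copysign(t, y)  # y + sign(y) * t
    return x, y
```

Both functions return coordinates in [-1, 1]; remapping to [0, 1] for texture lookups is left out.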


Guilherme Avila reposted

meshoptimizer v0.21 is out! This release has many improvements related to meshlets: a new function to optimize meshlets for HW efficiency, extra features for meshopt_simplify* to make Nanite style processing easier to implement and faster, and more! RTs and ⭐️ appreciated :)


Example of full padded octahedral-encoded mipchain. Pre-convolved with GGX roughness lobes.

different spectral band


plasma tornado. I love the sun.



