[WOLF] Vulkan Accelerated Render of GLTF - a fresnel based renderer

rsahlin/varg-engine

[WOLF] Vulkan Accelerated Render of GLTF

Copyright Rickard Sahlin
This project is licensed under the terms of the MIT license.

This project is a continuation of:
https://github.com/rsahlin/graphics-by-opengl
https://github.com/rsahlin/gltf-viewer

Just looking for prebuilt binaries?

If you just want to test the engine out by running prebuilt binaries,
head over to:
executables

PBR and Physically Correct Shading

First some words about 'Physically Based Rendering' - or PBR for short.

I think it is time to move past the PBR moniker.
Sure, it's served us well for the past 20-odd years - with its origin in the 'dark ages' when everyone was using fudge values to get renderers to create somewhat plausible results.
Along came PBR as a concept and instead of having arbitrary values we got to choose parameters (loosely) based on physical parameters.
This was a huge improvement and a major leap for designers, modellers and material technicians - basically anyone using content creation tools to produce models or materials.

However....

With physical properties chosen mostly based on what made sense in editors, we ended up with parameters such as:
Metalness, or dielectrics/insulators
Gloss
Albedo or basecolor
Roughness

All of these make sense when used in an editor by someone who wants to create the 3D appearance of something realistic-looking.
But, come on - metalness??? That's not a good parameter to put at the core of your physically correct renderer - maybe if you are communicating with a welder or a mining operation - but not for calculating the light interactions of the material.

The solution?
Let's take a holistic look at how these models come to be.

3D Workflow

To end up with something awesome rendered on the screen we need to support a 3D workflow:

Content Creation

The creation of the 3D data - this encompasses everything done in different editors such as 3D modelling packages or image tools.
Here it makes perfect sense to use simple to understand names for parameters - I would argue that this is where most of the influence on PBR comes from.
This is everything from material creation to modelling of 3D topology.

The Dataset

This is the data that is saved/exported or converted from the content creation step.
The dataset absolutely must be firmly based in the physics of light, not electricity or metallurgy.
It should be totally decoupled from both the previous content creation step and the following render step.
With current PBR proposals I would say that this step is somewhat broken: it has deviated from the light/surface interactions that it must model.

Rendering

The final step, or at least the almost final step - before post-processing - taking the dataset and calculating an output that is physically correct.
To do this correctly we must take a step back and look at the single most important event in all light calculations:
What happens when light travels between different media - in our case - as the light 'hits' the surface defined by the dataset.
This has been known since the 1820s and is given by the Fresnel equations - the fulcrum of physically correct shading.

So, let's use the following:
- IOR - together with the angle of incidence this gives us reflection and transmission.
- Absorption factor - the amount of transmitted light that is absorbed in the material.
- Reflection color - how the material colors the reflected light.
- Transmission color - how the material colors the transmitted light.
- Surface dispersion - how the material disperses light on a micro-surface level. In my opinion this is an anomaly.
It is needed for some cases of roughness, though I really dislike the roughness property - we are not really modelling the surface roughness per se.
What we are modelling is a theoretical model of how light is scattered by a micro-surface geometry.
Using this model light is broken down into fractions - maybe somewhat plausible for some cases of 'diffuse' refraction - the incoming light is scaled [0.0 - 1.0] based on roughness.
This is clearly not what happens to reflected light - the issue is also connected to the omission of the light source solid angle, for instance in the glTF data model.
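
As a minimal sketch of the IOR item in the list above, this is how IOR and angle of incidence give the reflected fraction, using the exact Fresnel equations for unpolarized light at a boundary between two non-absorbing media. The class and method names are made up for this illustration and are not taken from the VARG source, which according to the feature list uses the Schlick approximation instead.

```java
/** Illustration only: unpolarized Fresnel reflectance between two dielectrics. */
final class FresnelSketch {

    /**
     * @param n1   IOR of the medium the light travels in
     * @param n2   IOR of the medium the light enters
     * @param cosI cosine of the angle of incidence, in [0, 1]
     * @return fraction of the light that is reflected, the rest (1 - R) is transmitted
     */
    static double reflectance(double n1, double n2, double cosI) {
        double eta = n1 / n2;
        // Snell's law: sin(t) = (n1 / n2) * sin(i)
        double sinT2 = eta * eta * (1.0 - cosI * cosI);
        if (sinT2 >= 1.0) {
            return 1.0; // total internal reflection, nothing is transmitted
        }
        double cosT = Math.sqrt(1.0 - sinT2);
        // Amplitude coefficients for the two polarizations
        double rs = (n1 * cosI - n2 * cosT) / (n1 * cosI + n2 * cosT);
        double rp = (n1 * cosT - n2 * cosI) / (n1 * cosT + n2 * cosI);
        // Unpolarized light: average of the two reflected powers
        return 0.5 * (rs * rs + rp * rp);
    }

    public static void main(String[] args) {
        double r = reflectance(1.0, 1.5, 1.0);
        System.out.println("reflected: " + r + ", transmitted: " + (1.0 - r));
    }
}
```

Going from air (1.0) to an IOR of 1.5 at normal incidence this comes out at roughly 0.04 reflected, and the reflected fraction climbs towards 1.0 at grazing angles.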

To put this into action we need to address the last 2 stages of the 3D Workflow - namely the dataset and light shading.

Media (Surface) Light Property Dataset

This is the dataset for the media properties when it comes to light calculations.
In short - physical properties that model how light interacts with the surface and inside the media.

Fresnel Based Light Shading

Physically correct light shading using the Media Light Property Dataset.
First some basic concepts:
As light travels from one media to another, i.e. light 'hits' our surface, it is either reflected or transmitted.
Reflected light is, well, reflected - think of it as bouncing off the surface.
Transmitted light interacts with the surface and is either re-transmitted or continues into the media.
I will not call re-transmitted light diffused - I prefer to say that the light bends, similar to the effect in a prism.
Think of glossy paint - transmitted light will interact with the paint and be colored by it - and then be re-transmitted probably more in the direction of the reflection vector.
The term 'specular reflection' is vaguely better.
However, since specular means mirror, there is a risk that the reflective power (fresnel) is overlooked.

Better to simply use the terms reflection and transmission.

The first rule of Fresnel Based Light Shading

The first rule of Fresnel Based Light Shading is to start with the IOR and angle of incidence to calculate the Fresnel power function.
Use the media reflection color for the reflective part and the media transmission color for re-emitted transmission.
How much of the transmitted light may be re-emitted is governed by the absorption factor (for metals this value is 1).
Transmitted light is affected by the surface dispersion factor:
- 0 means no dispersion.
- 1 means fully dispersed over 4PI.
Transmission intensity goes from 1 down to 1 / 4PI as dispersion goes up.
The Irradiance Map (Spherical Harmonics) is calculated in the same way as the transmissive light.
Environment map reflections use the first rule of FBLS.
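
The text above only gives the two end points for the dispersion factor, 1 at no dispersion and 1 / 4PI at full dispersion. The sketch below connects them with a straight line, which is purely an assumption made for this illustration - the actual curve used by the engine is not stated here. The names are hypothetical.

```java
/** Illustration only: transmission intensity as a function of surface dispersion. */
final class DispersionSketch {

    /** Solid angle of the full sphere, 4PI. */
    static final double FULL_SPHERE = 4.0 * Math.PI;

    /**
     * @param dispersion surface dispersion factor, clamped to [0, 1]
     * @return intensity scale, 1 at dispersion 0 and 1 / 4PI at dispersion 1
     */
    static double transmissionIntensity(double dispersion) {
        double d = Math.max(0.0, Math.min(1.0, dispersion));
        // ASSUMPTION: linear blend between the two documented end points
        return (1.0 - d) + d * (1.0 / FULL_SPHERE);
    }

    public static void main(String[] args) {
        for (double d = 0.0; d <= 1.0; d += 0.25) {
            System.out.println(d + " -> " + transmissionIntensity(d));
        }
    }
}
```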

One of the major problems with most of today's realtime PBR implementations is that they use a specular and diffuse model that produces way too much light.
Imagine a light hitting a media at normal incidence, going from air to an IOR of 1.5.
Using the Fresnel power function we get the reflectance at 0 degrees, R0: R0 = ((IOR1 - IOR2) / (IOR1 + IOR2))^2 = ((1.0 - 1.5) / (1.0 + 1.5))^2 = 0.04
This means that of the incoming light, 4% is reflected back. The rest is transmitted. Period.
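
To make the numbers concrete, here is a small sketch of the R0 calculation together with the Schlick approximation that the feature list mentions. The class and method names are invented for this example, this is not the engine's shader code.

```java
/** Illustration only: reflectance at normal incidence and the Schlick approximation. */
final class SchlickSketch {

    /** R0 = ((IOR1 - IOR2) / (IOR1 + IOR2))^2 */
    static double r0(double ior1, double ior2) {
        double r = (ior1 - ior2) / (ior1 + ior2);
        return r * r;
    }

    /**
     * Schlick approximation of the Fresnel reflectance.
     *
     * @param r0       reflectance at normal incidence
     * @param cosTheta cosine of the angle of incidence, in [0, 1]
     */
    static double schlick(double r0, double cosTheta) {
        double m = 1.0 - cosTheta;
        return r0 + (1.0 - r0) * m * m * m * m * m;
    }

    public static void main(String[] args) {
        double reflected = schlick(r0(1.0, 1.5), 1.0);
        // Whatever is not reflected is transmitted, the two always sum to 1
        System.out.println("reflected " + reflected + ", transmitted " + (1.0 - reflected));
    }
}
```

The point of the split is that reflected plus transmitted never exceeds the incoming light, whatever the later distribution terms do.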

All glTF implementations that I have seen mess this up by not limiting the normal distribution function (NDF) - instead it usually returns tenfold or more light.
(This has somewhat to do with using an inaccurate model for re-transmitted, or diffused, light)

The second rule of Fresnel Based Light Shading

Do not rely on the Lambertian diffuse model for transmitted light.
This, or any other diffuse light model that I know of, is an oversimplification of how re-transmitted light behaves.
I understand why it has been used!
It's simple and fast and produces somewhat good looking results, especially in cases where there is no irradiance map.
[An irradiance map captures the light reflecting off the environment. It can be a texture, spherical harmonics or some other technique]

Instead, use some algorithm that takes absorption and the dispersion of matte (rough) materials into account.

I would guess that all materials absorb light. For instance, imagine a leaf or a green plant - it will at least attenuate most if not all of the transmitted light.
After all, how can photosynthesis work if light does not enter into the material?
So how much of transmitted light is absorbed by common materials?
Very good question......

The third rule of Fresnel Based Light Shading

The third rule of Fresnel Based Light Shading is to never, and I really mean never, bake or combine factors affecting light distribution into colors.
While this may seem like an optimization at first glance it prohibits proper light calculations.
An example of this is how glTF handles metals - the IOR cannot be set, so the reflective power is baked into the material color.
This will give inconsistent results compared to having the metallic IOR and color.
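
A sketch of the difference: with a complex IOR (n, k) the reflectance of an absorbing material at normal incidence, seen from air, follows from the standard formula below, while the glTF metallic/roughness model takes it per channel from mix(0.04, baseColor, metallic). The class and method names are hypothetical, and no measured metal values are included since none are given in this text.

```java
/** Illustration only: R0 from a complex IOR compared to glTF's color-based F0. */
final class ConductorR0Sketch {

    /**
     * Reflectance at normal incidence going from air (IOR 1.0) into a material
     * with complex IOR n + ik: R0 = ((n - 1)^2 + k^2) / ((n + 1)^2 + k^2)
     */
    static double r0(double n, double k) {
        double num = (n - 1.0) * (n - 1.0) + k * k;
        double den = (n + 1.0) * (n + 1.0) + k * k;
        return num / den;
    }

    /** glTF metallic/roughness: F0 per channel is mix(0.04, baseColor, metallic). */
    static double gltfF0(double baseColor, double metallic) {
        return 0.04 * (1.0 - metallic) + baseColor * metallic;
    }

    public static void main(String[] args) {
        // With k = 0 the formula reduces to the dielectric case
        System.out.println(r0(1.5, 0.0));
        // For a full metal glTF reads the reflective power straight from the color
        System.out.println(gltfF0(0.9, 1.0));
    }
}
```

In the glTF case the color value does double duty as reflective power, which is exactly the baking that the third rule warns against.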

Another example is the glTF sheen extension - here there is no factor to specify the amount of light that interacts with the perpendicular fibres.
It's baked into the sheen color, which will also give inconsistent results - the factor is needed to calculate the amount of light that proceeds to interact with the base material.

The VARG - Engine - A Fresnel Based Light Shader

VARG is written in Java.
To interface with Vulkan, which is a C based API, LWJGL3 is used.
The dependencies are minimal - only GLFW and Vulkan packages are used, plus the needed native libraries of course.
The communication with LWJGL3 is separated into its own module.
Currently Windows and Linux platforms are supported - VARG can run on any platform that LWJGL3 [GLFW and Vulkan] can be built for.

VARG supports compute and graphics pipelines. Geometry is streamlined and made into drawcalls before being executed by the renderer.
Physically based rendering is supported using Fresnel Based Light Shading.

SPIR-V compilation is currently done on target, using the platform command 'glslc'.
Hash values are used to differentiate shader permutations.
Shaders can be pre-built if wanted.
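
A rough sketch of how hashing and on-target compilation could fit together: hash the shader source plus its macro definitions to name one permutation, then build a 'glslc' command line for it. The class, the method names and the choice of SHA-256 are assumptions for this example, the engine's actual hashing scheme is not described here. The glslc options used (-fshader-stage, -D, -o) are standard glslc flags.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

/** Illustration only: naming shader permutations by hash and building a glslc command. */
final class ShaderPermutationSketch {

    /** Hash of the source and the sorted macro definitions, as 64 hex characters. */
    static String permutationHash(String source, List<String> defines) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            md.update(source.getBytes(StandardCharsets.UTF_8));
            List<String> sorted = new ArrayList<>(defines);
            Collections.sort(sorted); // define order shall not create a new permutation
            for (String define : sorted) {
                md.update((byte) 0); // separator between entries
                md.update(define.getBytes(StandardCharsets.UTF_8));
            }
            StringBuilder hex = new StringBuilder();
            for (byte b : md.digest()) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e); // SHA-256 is required on every Java platform
        }
    }

    /** Command line for compiling one permutation to SPIR-V with glslc. */
    static List<String> glslcCommand(String input, String stage, List<String> defines, String output) {
        List<String> cmd = new ArrayList<>();
        cmd.add("glslc");
        cmd.add("-fshader-stage=" + stage);
        for (String define : defines) {
            cmd.add("-D" + define);
        }
        cmd.add(input);
        cmd.add("-o");
        cmd.add(output);
        return cmd;
    }
}
```

The command list can be handed to `new ProcessBuilder(cmd).inheritIO().start()`. Using the hash as the output filename means a permutation that is already on disk can be picked up without compiling again.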

[Images: glTF Corset model, glTF Flight Helmet model, glTF Waterbottle model]

Features

Supports PBR based on metallic/roughness parameters, Fresnel power function using the Schlick approximation. Point light support.
Irradiance Map through spherical harmonics, these can be displayed as background.
Environment map, textures stored using the KTX2 cubemap format - the environment map can be displayed as background.
Fragment shading rate can be set on background and/or geometry.
Textures are stored in bundles (2D texture arrays) with the same size and texture modes - shader usage of samplers is greatly reduced for most models.
MipMaps created on GPU.
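
The texture bundles in the list above can be pictured roughly like this: textures that share size and sampler mode end up as layers in the same 2D texture array, so the shader needs one sampler per bundle instead of one per texture. This is a sketch with made-up names and a simple string key, not the engine's implementation.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Illustration only: grouping textures into bundles by size and sampler mode. */
final class TextureBundleSketch {

    static final class Texture {
        final String name;
        final int width;
        final int height;
        final String samplerMode; // hypothetical label, e.g. wrap and filter settings

        Texture(String name, int width, int height, String samplerMode) {
            this.name = name;
            this.width = width;
            this.height = height;
            this.samplerMode = samplerMode;
        }
    }

    /** Textures with the same key can share one 2D texture array. */
    static String bundleKey(Texture t) {
        return t.width + "x" + t.height + "/" + t.samplerMode;
    }

    /** Returns bundle key -> texture names, the list index is the array layer. */
    static Map<String, List<String>> bundle(List<Texture> textures) {
        Map<String, List<String>> bundles = new LinkedHashMap<>();
        for (Texture t : textures) {
            bundles.computeIfAbsent(bundleKey(t), k -> new ArrayList<>()).add(t.name);
        }
        return bundles;
    }
}
```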

glTF models are loaded using gltf-io.

Drawcall optimization

Geometry data, textures and uniform data are pre-processed to create an optimized set of drawcalls.
Below are two examples of this:

10 000 asteroids - a model with 10 000 nodes, meshes and materials - normally 10 000 drawcalls.
The glTF Sponza model which usually renders with some 100 drawcalls (check PlayCanvas/Babylon etc)

[Images: example of drawcall optimization - 1 drawcall, example of drawcall optimization - 3 drawcalls]

Framegrabber

Load any number of glTF models - render a given set of camera angles and save the output in PNG or JPEG format.
The framebuffer content will be fetched after one frame has been rendered.
This means that the saved image will contain what you actually see on the screen. If a multisample buffer is used, the output will be the resolved buffer.
Pixel data is converted to 8 bit RGB and saved either as PNG or JPEG.
Any number of camera angles can be configured. Once these are output, the grabber will move on to the next glTF in the list.
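
The conversion and save step could look something like the sketch below, using only the standard Java image classes. It assumes that the fetched framebuffer is tightly packed 8 bit RGBA, which is a guess for the sake of the example, and the names are invented. The evenly spaced Y rotation helper is likewise only one possible way to configure camera angles.

```java
import java.awt.image.BufferedImage;
import java.io.File;
import java.io.IOException;
import javax.imageio.ImageIO;

/** Illustration only: turning fetched pixels into an 8 bit RGB image file. */
final class FrameGrabSketch {

    /** ASSUMPTION: input is tightly packed RGBA, one byte per channel, alpha is dropped. */
    static BufferedImage toRgbImage(byte[] rgba, int width, int height) {
        BufferedImage image = new BufferedImage(width, height, BufferedImage.TYPE_INT_RGB);
        for (int y = 0; y < height; y++) {
            for (int x = 0; x < width; x++) {
                int i = (y * width + x) * 4;
                int r = rgba[i] & 0xFF;
                int g = rgba[i + 1] & 0xFF;
                int b = rgba[i + 2] & 0xFF;
                image.setRGB(x, y, (r << 16) | (g << 8) | b);
            }
        }
        return image;
    }

    /** Saves using a format name such as "png" or "jpg". */
    static void save(BufferedImage image, String format, File file) throws IOException {
        if (!ImageIO.write(image, format, file)) {
            throw new IOException("No image writer for format " + format);
        }
    }

    /** Evenly spaced camera rotations around the Y axis, in degrees. */
    static double[] yRotations(int count) {
        double[] angles = new double[count];
        for (int i = 0; i < count; i++) {
            angles[i] = i * 360.0 / count;
        }
        return angles;
    }
}
```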

[Images: Framegrabber output using camera Y axis rotation]

Mesh shader experiment using voxels:

Support for mesh shaders - this is a test implementation with a task shader that assembles the shape of the face.
Mesh shader outputs colored voxels that are rendered together with a glTF model.

1.3 million voxels together with a glTF model

1.3 million voxels

Incomplete todo list, in no particular order:

  • Use LWJGL3 to compile shaders to get rid of the platform dependency on glslc.
  • Release APK with working viewer, for instance on Windows and Linux.
  • Add support for glTF BASISU compressed textures.
  • Add support for glTF clearcoat extension.
  • Implement animations.
  • Sort alpha/transmission triangles.

Some notable design changes - the project is now split up into several parts:
The glTF io abstraction (gltf-io) which handles loading of glTF and mapping of JSON to java.
Image io and KTX (gltf-imageio) - adds support for image/textures using KTX
Vulkan rendering (varg-engine) - this takes care of Vulkan rendering

The engine is likely to remain a continuous work in progress, with more features and cool shaders implemented in the future.

Build instructions

Platform dependencies on Vulkan validation layers and glslc:
Download the Vulkan SDK from https://vulkan.lunarg.com/sdk/home
Add the 'bin' folder to the path, then check by opening a command line prompt and typing: "glslc --version"

This project uses Maven to build.
In order to build, some project dependencies need to be cloned and built beforehand:
gltf-io
gltf-imageio

To be sure of project dependencies - check the pom.

gltf-io - build using 'mvn clean compile install -DskipTests'
gltf-imageio - build using 'mvn clean compile install -DskipTests'

build varg-engine using 'mvn clean compile install -DskipTests'

Fresnel Based Light Shading Testmodels

This section covers render correctness and supplies some glTF models to be used when checking for physically correct rendering.
They are placed in the "\testmodels\fresnelreflectiontest" folder.

Light reflection correctness

This is to test that Fresnel light reflection, not transmission, is calculated correctly.
Assuming light is moving from a media with an IOR of 1.0 (air) to an IOR of 1.5 - this is the default of glTF.
Reflected light is calculated at normal incidence:
R0 = ( (IOR1 - IOR2) / (IOR1 + IOR2) ) ^2 = 0.04

To test this, a model is used with one directional light facing directly into the screen at an intensity of 0.9 lumen/m2.
This light hits a material that has roughness 0.0 and an albedo/basecolor of 0.0 - thus no light will be re-transmitted or diffused.
There are 6 black material quads, facing the camera, divided over the topmost two rows. The background color is set to dark green so that the quads can be seen.
The test assumes a UNORM framebuffer [0.0 - 1.0] and no exposure (or an exposure of 1.0), since the camera in glTF does not have aperture, shutter or ISO - it's simply a viewport.

The lowest row is for comparison. These quads are emissive, black metal - the only light leaving will be the emissive light.
The emissive light is, going from left to right: [0.9 * 0.04] - [0.9] - [1.0]
To be rendered physically correct the reflected light from the black material quads shall be the same as the leftmost emissive material (sRGB 0x353535).

The second model is the same but with a higher intensity light (20 lumen/m2), which gives 20 * 0.04 = 0.8, or 80% of max light, at normal incidence (sRGB 0xE7E7E7).
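
The two expected pixel values can be checked by hand or with a few lines of code: multiply the light intensity by R0 and encode the result with the standard sRGB transfer function. The sketch below does just that, the class and method names are made up for the illustration.

```java
/** Illustration only: expected 8 bit sRGB value for the reflection test quads. */
final class ExpectedPixelSketch {

    /** Standard sRGB encoding of a linear value, input clamped to [0, 1]. */
    static double linearToSrgb(double linear) {
        double v = Math.max(0.0, Math.min(1.0, linear));
        return v <= 0.0031308 ? 12.92 * v : 1.055 * Math.pow(v, 1.0 / 2.4) - 0.055;
    }

    /** Reflected light at normal incidence as an 8 bit sRGB channel value. */
    static int expectedChannel(double lightIntensity, double r0) {
        return (int) Math.round(linearToSrgb(lightIntensity * r0) * 255.0);
    }

    public static void main(String[] args) {
        // 0.9 * 0.04 = 0.036 linear, 20 * 0.04 = 0.8 linear
        System.out.println(Integer.toHexString(expectedChannel(0.9, 0.04))); // 35
        System.out.println(Integer.toHexString(expectedChannel(20.0, 0.04))); // e7
    }
}
```

The same value goes into all three channels, which gives 0x353535 for the first model and 0xE7E7E7 for the second.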

[Images: Fresnel reflection correctness test]