Implement `f16_t`, `f16vec2_t`, `f16vec4_t` half float types for safety #728

illwieckz · 2022-11-05T11:51:30Z

Implement f16_t, f16vec2_t, f16vec4_t half float types for safety, also fix some bugs where stuff should have been converted to or from halfFloat.

Now the way the type is defined, there should be a compilation error when passing a f16_vec4t to a i16_vec4t etc.

Original mesage:

It's a very minor fix, I noticed shaderVertex_t.texCoords was defined with i16vec4_t type while being the output of floatToHalf(vec4_t, f16vec4_t).

It worked because f16vec4_t is defined as i16vec4_t.

I caught that by experimenting with different (half-)float vertex formats (alternate code for missing ARB_half_float_vertex):

src/engine/renderer/tr_surface.cpp: In function ‘void Tess_AddSprite(const vec_t*, Color::Color32Bit, float, float)’:
src/engine/renderer/tr_surface.cpp:432:28: error: no matching function for call to ‘floatToHalf(vec4_t, i16vec4_t)’
  432 |                 floatToHalf( texCoord, tess.verts[ ndx + i ].texCoords );
      |                 ~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

I added an extra commit making more obvious that f16vec4_t is defined as i16vec4_t anyway.

…ny places tr_local: define a simple f16_t type tr_local: shaderVertex_t.texCoords is f16vec4_t because it is the output of floatToHalf tr_local: make it more obvious that f16vec4_t is i16vec4_t tr_local: R_CalcTangents expects f16vec2_t not i16vec2_t tr_model_iqm: IQModel_t.texcoords is f16vec2_t because it is the output of floatToHalf tr_local,tr_model_md3,tr_model_skel: vboData_t.st is f16vec2_t because it is the output of floatToHalf tr_local: vboData_t.spriteOrientation is f16vec4_t because it is the output of floatToHalf tr_local: order data like everywhere else in code

necessarily-equal · 2022-11-05T12:51:32Z

I've come up with this too while having a look

diff --git a/src/engine/renderer/tr_local.h b/src/engine/renderer/tr_local.h
index b253b431..45e2a76a 100644
--- a/src/engine/renderer/tr_local.h
+++ b/src/engine/renderer/tr_local.h
@@ -44,7 +44,8 @@ using i16vec4_t = int16_t[4];
 using u16vec4_t = uint16_t[4];
 using i16vec2_t = int16_t[2];
 using u16vec2_t = uint16_t[2];
-using f16vec4_t = int16_t[4]; // half float vector
+using f16vec2_t = i16vec2_t; // half float vector
+using f16vec4_t = i16vec4_t; // half float vector
 
 // GL conversion helpers
 static inline float unorm8ToFloat(byte unorm8) {
@@ -3096,7 +3097,7 @@ inline bool checkGLErrors()
 
        void R_CalcTangents( vec3_t tangent, vec3_t binormal,
                             const vec3_t v0, const vec3_t v1, const vec3_t v2,
-                            const i16vec2_t t0, const i16vec2_t t1, const i16vec2_t t2 );
+                            const f16vec2_t t0, const f16vec2_t t1, const f16vec2_t t2 );
 
        /*
         * QTangent representation of tangentspace:
diff --git a/src/engine/renderer/tr_main.cpp b/src/engine/renderer/tr_main.cpp
index 6afcd563..8d466e1c 100644
--- a/src/engine/renderer/tr_main.cpp
+++ b/src/engine/renderer/tr_main.cpp
@@ -102,7 +102,7 @@ void R_CalcTangents( vec3_t tangent, vec3_t binormal,
 
 void R_CalcTangents( vec3_t tangent, vec3_t binormal,
                     const vec3_t v0, const vec3_t v1, const vec3_t v2,
-                    const i16vec2_t t0, const i16vec2_t t1, const i16vec2_t t2 )
+                    const f16vec2_t t0, const f16vec2_t t1, const f16vec2_t t2 )
 {
        vec2_t t0f, t1f, t2f;
 
diff --git a/src/engine/renderer/tr_model_md5.cpp b/src/engine/renderer/tr_model_md5.cpp
index e59652fc..701a0d38 100644
--- a/src/engine/renderer/tr_model_md5.cpp
+++ b/src/engine/renderer/tr_model_md5.cpp
@@ -325,7 +325,7 @@ bool R_LoadMD5( model_t *mod, void *buffer, const char *modName )
                        for (unsigned k = 0; k < 2; k++ )
                        {
                                token = COM_ParseExt2( &buf_p, false );
-                               v->texCoords[ k ] = atof( token );
+                               v->texCoords[ k ] = floatToHalf( atof( token ) );
                        }
 
                        // skip )

I think the last change may be an actual bugfix

illwieckz · 2022-11-05T14:03:26Z

This one breaks md5 texturing:

diff --git a/src/engine/renderer/tr_model_md5.cpp b/src/engine/renderer/tr_model_md5.cpp
index e59652fc..701a0d38 100644
--- a/src/engine/renderer/tr_model_md5.cpp
+++ b/src/engine/renderer/tr_model_md5.cpp
@@ -325,7 +325,7 @@ bool R_LoadMD5( model_t *mod, void *buffer, const char *modName )
                        for (unsigned k = 0; k < 2; k++ )
                        {
                                token = COM_ParseExt2( &buf_p, false );
-                               v->texCoords[ k ] = atof( token );
+                               v->texCoords[ k ] = floatToHalf( atof( token ) );
                        }
 
                        // skip )

Before:

After:

Edit: probably because of:

Daemon/src/engine/renderer/tr_local.h

Lines 2176 to 2182 in cdc6421

    
           ALIGNED(16, struct md5Vertex_t 
        
           { 
        
           	vec4_t      position; 
        
           	vec4_t      tangent; 
        
           	vec4_t      binormal; 
        
           	vec4_t      normal; 
        
           	vec2_t      texCoords;

illwieckz · 2022-11-05T19:11:58Z

Note to myself for things that may be done in other PRs after this one:

IQModel_t can be defined using f16vector<N>_t instead of pointers to f16_t, same for the allocation in R_LoadIQModel.
Edit: no or not in a straightforward manner, pointers are iterated on the data.
To support systems without ARB_half_float_vertex, the magic will probably happen in R_CopyVertexData by converting texCoords back to float (plus setting GL_FLOAT anytime GL_HALF_FLOAT is set).

src/engine/renderer/tr_local.h

…s texCoordsF as it's the only one to use float

…eOrientation

illwieckz · 2022-11-05T23:20:24Z

Damn, it looks like my patch to convert md5mesh texcoords to half-float as soon as possible breaks alignment…

illwieckz · 2022-11-06T00:53:22Z

I removed the commit to convert md5mesh texCoords as half-floats as soon as possible because that meant the CPU code (likely used on older and slower hardware) had to do the conversion back to float to do the computations so it would slow down this code path, and it also breaks alignment so that would also slow down this code path more.

It may be possible to convert them to half float as soon as possible when using the GPU code path in a way it may improve the GPU code path but this out of topic of this PR (there is no regression).

I renamed the md5mesh texCoords as texCoordsF to avoid confusion as it is the only one texCoords in code base to not be half-float.

illwieckz · 2022-11-06T01:23:04Z

So vboData_t::spriteOrientation is never set anywhere (meaning the faulty Vector4Copy is dead code), and it was that way since the field was introduced in b10e8c2. I guess the "autosprite" thing (no idea what it is) has probably been broken at least since then. I guess we should delete the RSF_SPRITE stuff unless somebody has any idea what it is supposed to do.

@slipher note that we have some known bugs related to sprites. Some can be seen in metro map, in catacombs: missing fire sprites, distorted chain sprite…

I would not be surprised if that was the incomplete code for those features. Actually gimhael fixed/completed some other autosprite code for other sprites in metro at this time.

slipher · 2022-11-07T13:13:29Z

Can this be squashed? Somehow there are 10 commits to do what was not very big as 1 commit...

illwieckz · 2022-11-14T21:45:15Z

I squashed all the commits that were converting i-types to f-types into one.

illwieckz · 2022-11-17T01:17:38Z

@DolceTriade any comment? 🙂️

DolceTriade · 2022-11-17T03:01:55Z

I don' t understand much of it, but it seems ok to me.

illwieckz force-pushed the illwieckz/shadervertex-texcoords-type branch from 28d162f to ea5e08d Compare November 5, 2022 17:39

tr_shade_calc: multiply float with float

62d008a

illwieckz force-pushed the illwieckz/shadervertex-texcoords-type branch from e6b78f5 to 84cf51e Compare November 5, 2022 18:50

illwieckz commented Nov 5, 2022

View reviewed changes

src/engine/renderer/tr_local.h Outdated Show resolved Hide resolved

illwieckz and others added 4 commits November 5, 2022 20:45

tr_model_md5,tr_model_skel,tr_surface: rename md5vertex_t.texCoords a…

b4df935

…s texCoordsF as it's the only one to use float

tr_shade_calc: convert 0 to half float before setting f16vec4_t sprit…

75245c2

…eOrientation

tr_local: define a half-float struct f16_t for type safety

c71f11a

tr_shade_calc: use half float sign bit to avoid calling halfToFloat

31d1339

illwieckz force-pushed the illwieckz/shadervertex-texcoords-type branch from 84cf51e to 723f8a1 Compare November 5, 2022 22:47

illwieckz changed the title ~~shaderVertex_t.texCoords is f16vec4_t because it is the output of floatToHalf~~ Implement f16_t, f16vec2_t, f16vec4_t half float types for safety Nov 5, 2022

illwieckz force-pushed the illwieckz/shadervertex-texcoords-type branch 2 times, most recently from 1e3bb11 to 4b679de Compare November 6, 2022 00:48

illwieckz added the A-Renderer label Nov 6, 2022

illwieckz force-pushed the illwieckz/shadervertex-texcoords-type branch from 4b679de to 67f4383 Compare November 14, 2022 21:38

illwieckz force-pushed the illwieckz/shadervertex-texcoords-type branch from 67f4383 to 31d1339 Compare November 17, 2022 01:16

illwieckz merged commit 75d9763 into master Nov 17, 2022

illwieckz deleted the illwieckz/shadervertex-texcoords-type branch November 17, 2022 15:12

This was referenced Jun 4, 2024

Huge performance drop on r300 with default Thunder and Vega scene #1172

Closed

Support graphics cards without ARB_half_float_vertex #1179

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `f16_t`, `f16vec2_t`, `f16vec4_t` half float types for safety #728

Implement `f16_t`, `f16vec2_t`, `f16vec4_t` half float types for safety #728

illwieckz commented Nov 5, 2022 •

edited

Loading

necessarily-equal commented Nov 5, 2022

illwieckz commented Nov 5, 2022 •

edited

Loading

illwieckz commented Nov 5, 2022 •

edited

Loading

illwieckz commented Nov 5, 2022

illwieckz commented Nov 6, 2022 •

edited

Loading

illwieckz commented Nov 6, 2022

slipher commented Nov 7, 2022

illwieckz commented Nov 14, 2022

illwieckz commented Nov 17, 2022

DolceTriade commented Nov 17, 2022

Implement f16_t, f16vec2_t, f16vec4_t half float types for safety #728

Implement f16_t, f16vec2_t, f16vec4_t half float types for safety #728

Conversation

illwieckz commented Nov 5, 2022 • edited Loading

necessarily-equal commented Nov 5, 2022

illwieckz commented Nov 5, 2022 • edited Loading

illwieckz commented Nov 5, 2022 • edited Loading

illwieckz commented Nov 5, 2022

illwieckz commented Nov 6, 2022 • edited Loading

illwieckz commented Nov 6, 2022

slipher commented Nov 7, 2022

illwieckz commented Nov 14, 2022

illwieckz commented Nov 17, 2022

DolceTriade commented Nov 17, 2022

Implement `f16_t`, `f16vec2_t`, `f16vec4_t` half float types for safety #728

Implement `f16_t`, `f16vec2_t`, `f16vec4_t` half float types for safety #728

illwieckz commented Nov 5, 2022 •

edited

Loading

illwieckz commented Nov 5, 2022 •

edited

Loading

illwieckz commented Nov 5, 2022 •

edited

Loading

illwieckz commented Nov 6, 2022 •

edited

Loading