Improve recast performance for `.ply` files by dilirity · Pull Request #10 · r5valkyrie/r5sdk

dilirity · 2026-03-29T00:04:51Z

Summary

Optimize PLY mesh loading, parallelize shared pipeline stages (normals, BVH, rendering), and harden the PLY parser against malformed files. Builds on the OBJ performance work merged in #13.

What changed

PLY loader rewrite (`MeshLoaderPly.cpp`)

Replaced std::ifstream with single fread into a memory buffer -- eliminates millions of tiny I/O calls
Bulk memcpy for vertex data (binary layout matches rdVec3D)
Validated header parsing with strtoll and INT_MAX bounds
Pre-allocation bounds check: rejects files where claimed vertex/face counts exceed actual file size before allocating
Face index validation: every triangle index checked against [0, m_vertCount) before use
Integer overflow protection on all size calculations (long long arithmetic, m_triCount <= INT_MAX/3 guard)

Parallel normal calculation (`MeshLoaderObj.cpp`, `MeshLoaderPly.cpp`)

Chunk-based work distribution (4096 triangles per chunk) with std::atomic counter
Spawns hardware_concurrency - 1 worker threads writing to non-overlapping output ranges
Applied to both OBJ and PLY loaders

Parallel BVH construction (`ChunkyTriMesh.cpp`)

Replaced qsort + C comparators with std::sort + inlined lambdas
std::execution::par for sorting arrays >= 32K items
Parallel bounds computation across all cores (same atomic chunk pattern)
Parallel tree subdivision: pre-computes subtree node counts, spawns left/right on separate threads for the top 4 levels
maxNodes guard prevents overlap if pre-computed counts exceed allocation

VBO rendering (`Editor.cpp`, `EditorInterfaces.cpp`)

Cached display list vertex data uploaded to GPU via VBOs (glBufferData with GL_DYNAMIC_DRAW)
Each frame binds and draws from GPU memory instead of transferring from CPU RAM
GL buffer extension function pointers loaded via SDL_GL_GetProcAddress with null validation at startup
VBOs properly created, re-uploaded on cache dirty, and deleted in destructor

Navmesh cache invalidation (`Editor.cpp`, `Editor_SoloMesh.cpp`, `Editor_TempObstacles.cpp`, `NavMeshPruneTool.cpp`)

Added invalidateNavMeshCache() calls after mesh rebuild, navmesh load, traverse link rebuild, and static pathing rebuild

How to test

Open the NavEditor
Load a large PLY mesh (1GB+)
Verify the mesh loads and renders at interactive frame rates (expect 60-90 FPS depending on GPU)
Build a navmesh and verify it renders correctly
Toggle "NavMesh" checkbox off/on -- cache should update without stalls
Click "Rebuild Static Pathing Data" -- navmesh display should refresh
Try loading a small/malformed PLY file -- should fail gracefully without crashing

Rewrite PLY loader to read the entire file in one fread with bulk memcpy for vertices instead of millions of tiny ifstream reads. Parallelize normal calculation across all cores for both OBJ and PLY loaders. Parallelize BVH bounds computation in chunky tri mesh builder. Replace qsort with std::sort (inlined comparators) and use std::execution::par for large arrays. Parallelize BVH subdivision by pre-computing subtree sizes and recursing left/right on separate threads for the top 4 levels. Upload cached display list vertex data to GPU via VBOs (glBufferData with GL_STATIC_DRAW) so each frame just binds and draws from GPU memory instead of transferring from CPU RAM every frame. Load GL buffer extension function pointers via SDL_GL_GetProcAddress.

Guard parallel BVH subdivision against maxNodes overflow by validating pre-computed node counts fit before spawning threads. Use GL_DYNAMIC_DRAW instead of GL_STATIC_DRAW for VBO uploads since display list data can change on cache invalidation. Validate GL extension function pointers on startup and exit gracefully if VBO support is unavailable.

Replace atoi with strtol for vertex/face count parsing to detect overflow and malformed values. Use long long instead of ssize_t for file size and vertex byte calculations to avoid truncation on 32-bit.

dilirity force-pushed the improve/recast-performance-ply branch from 6176f94 to 8f2a798 Compare April 3, 2026 19:56

dilirity changed the base branch from main to S16-S21-MERGE April 3, 2026 19:56

dilirity force-pushed the improve/recast-performance-ply branch from 8f2a798 to 790bbaf Compare April 3, 2026 20:09

dilirity added 2 commits April 3, 2026 23:37

Invalidate navmesh cache (not sure a bout this one, needs testing)

5caac7f

dilirity force-pushed the improve/recast-performance-ply branch 2 times, most recently from 029def1 to 5caac7f Compare April 3, 2026 20:47

dilirity added 4 commits April 3, 2026 23:59

Ensure the multiplication happens in 64-bit before any possible overflow

4b9f3ab

Validate each face index

7bd11ca

Harden PLY header parsing and file size handling

fc4da93

Replace atoi with strtol for vertex/face count parsing to detect overflow and malformed values. Use long long instead of ssize_t for file size and vertex byte calculations to avoid truncation on 32-bit.

dilirity marked this pull request as ready for review April 3, 2026 21:33

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve recast performance for `.ply` files#10

Improve recast performance for `.ply` files#10
dilirity wants to merge 6 commits intor5valkyrie:S16-S21-MERGEfrom
dilirity:improve/recast-performance-ply

dilirity commented Mar 29, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

dilirity commented Mar 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changed

PLY loader rewrite (MeshLoaderPly.cpp)

Parallel normal calculation (MeshLoaderObj.cpp, MeshLoaderPly.cpp)

Parallel BVH construction (ChunkyTriMesh.cpp)

VBO rendering (Editor.cpp, EditorInterfaces.cpp)

Navmesh cache invalidation (Editor.cpp, Editor_SoloMesh.cpp, Editor_TempObstacles.cpp, NavMeshPruneTool.cpp)

How to test

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dilirity commented Mar 29, 2026 •

edited

Loading

PLY loader rewrite (`MeshLoaderPly.cpp`)

Parallel normal calculation (`MeshLoaderObj.cpp`, `MeshLoaderPly.cpp`)

Parallel BVH construction (`ChunkyTriMesh.cpp`)

VBO rendering (`Editor.cpp`, `EditorInterfaces.cpp`)

Navmesh cache invalidation (`Editor.cpp`, `Editor_SoloMesh.cpp`, `Editor_TempObstacles.cpp`, `NavMeshPruneTool.cpp`)