Technology — How Simvera Works

Step 01 · Digitize

Real Objects Become Digital Assets.

We create high-fidelity 3D models of industrial hazard objects — traffic cones, barriers, warning signs, equipment. These aren't just meshes; they include physically accurate materials, textures, and reflectance properties so the simulator can render them under any lighting condition.

Photoreal 3D-rendered traffic cone asset with PBR materials

asset · cone_001.usdtris 47,318

[ scanned asset · pbr materials ]

Step 02 · Simulate

Photorealistic Environments at Scale.

3D assets are loaded into CARLA and NVIDIA Omniverse simulators. We place them in varied factory and tunnel environments with randomised lighting, camera angles, occlusions, and backgrounds. Every frame is auto-labeled by the simulator at render time.

Photoreal Omniverse render of a traffic cone under randomised lighting and weather

omniverse · scene matrixn=247 variants

[ randomised lighting · pose · weather ]

Step 03 · Train

SynYOLO — Trained on Synthetic. Deployed on Real.

SynYOLO is our detection architecture built on YOLOX, trained exclusively on synthetic data generated in photorealistic simulation. Unlike standard YOLOX models that require thousands of manually labeled real images, SynYOLO learns entirely from rendered scenes — and transfers directly to real factory cameras. No annotation teams. No labeling bias. Ground truth comes free from the simulator.

synyolo · trainmAP50 0.918

[ synthetic frames → synyolo → checkpoint ]

Step 04 · Calibrate

Camera-to-World Coordinate Mapping.

Using polynomial regression calibrated against known floor points, we translate pixel-space detections into real-world (x, y) coordinates. Our mapping handles lens distortion from wide-angle industrial cameras — what the model sees becomes where the hazard actually is.

cam-04 · calibratedRMSE 6.8 px

[ pixel → world (x, y, z) ]

Step 05 · Deploy

Real-Time Detection on Factory Cameras.

The trained model runs inference on live camera feeds. Detected objects are localised in world coordinates and fused with existing 5G / UWB positioning systems for complete situational awareness — a single, coherent picture of where every hazard is, in real time.

Live cam-04 feed with world-coordinate grid overlay — pixel-space inference fused with 5G/UWB positioning

● LIVE · cam-0416 ms / frame

[ rtsp feed · world coords · 5G/UWB fusion ]

For cloud partners

Why We Need GPU Compute.

Synthetic-data pipelines are GPU-heavy at every stage. The numbers below are the order of magnitude we're working at today, and where 2026 is heading.

01 · RENDERING

Photoreal generation

Rendering photoreal training frames in Omniverse — every object class needs broad coverage across lighting, weather, and angle.

10k+ scenes / class

02 · TRAINING

Multi-GPU SynYOLO training

Training SynYOLO on the rendered datasets. Multi-GPU runs to converge on each new hazard class and condition mix.

100k+ synthetic frames

03 · SCALING

Class library expansion

We're scaling from a single class (traffic cones, live today) toward an industrial hazard library covering safety, infrastructure, and compliance.

50+ classes planned ’26

How Simvera Works.