Technology

How Simvera Works.

A complete pipeline from physical objects to deployed AI. Five stages, end to end — every one automated, every one reproducible.

Step 01 · Digitize

Real Objects Become Digital Assets.

We create high-fidelity 3D models of industrial hazard objects — traffic cones, barriers, warning signs, equipment. These aren't just meshes; they include physically accurate materials, textures, and reflectance properties so the simulator can render them under any lighting condition.

01
Photoreal 3D-rendered traffic cone asset with PBR materials
asset · cone_001.usdtris 47,318
[ scanned asset · pbr materials ]
Step 02 · Simulate

Photorealistic Environments at Scale.

3D assets are loaded into CARLA and NVIDIA Omniverse simulators. We place them in varied factory and tunnel environments with randomised lighting, camera angles, occlusions, and backgrounds. Every frame is auto-labeled by the simulator at render time.

02
Photoreal Omniverse render of a traffic cone under randomised lighting and weather
omniverse · scene matrixn=247 variants
[ randomised lighting · pose · weather ]
Step 03 · Train

SynYOLO — Trained on Synthetic. Deployed on Real.

SynYOLO is our detection architecture built on YOLOX, trained exclusively on synthetic data generated in photorealistic simulation. Unlike standard YOLOX models that require thousands of manually labeled real images, SynYOLO learns entirely from rendered scenes — and transfers directly to real factory cameras. No annotation teams. No labeling bias. Ground truth comes free from the simulator.

03
SYNTHETIC FRAMES 100k+ RENDERED · LABELLED SynYOLO TRAINING YOLOX BACKBONE MULTI-GPU CHECKPOINT .918 mAP50
synyolo · trainmAP50 0.918
[ synthetic frames → synyolo → checkpoint ]
Step 04 · Calibrate

Camera-to-World Coordinate Mapping.

Using polynomial regression calibrated against known floor points, we translate pixel-space detections into real-world (x, y) coordinates. Our mapping handles lens distortion from wide-angle industrial cameras — what the model sees becomes where the hazard actually is.

04
Camera-to-world coordinate mapping overlay — pixel-space detections projected to real-world (x, y) floor coordinates
cam-04 · calibratedRMSE 6.8 px
[ pixel → world (x, y, z) ]
Step 05 · Deploy

Real-Time Detection on Factory Cameras.

The trained model runs inference on live camera feeds. Detected objects are localised in world coordinates and fused with existing 5G / UWB positioning systems for complete situational awareness — a single, coherent picture of where every hazard is, in real time.

05
Live cam-04 feed with world-coordinate grid overlay — pixel-space inference fused with 5G/UWB positioning
● LIVE · cam-0416 ms / frame
[ rtsp feed · world coords · 5G/UWB fusion ]
Tech stack

Industrial-grade open infrastructure.

Every component is widely used, well-supported, and battle-tested. No proprietary lock-in, no obscure dependencies.

CARLA Simulator
Autonomy-grade sim
NVIDIA Omniverse
Photoreal RTX
SynYOLO
YOLOX-based · synthetic-only
Python
PyTorch · ONNX
OpenCV
Vision · calibration
For cloud partners

Why We Need GPU Compute.

Synthetic-data pipelines are GPU-heavy at every stage. The numbers below are the order of magnitude we're working at today, and where 2026 is heading.

01 · RENDERING

Photoreal generation

Rendering photoreal training frames in Omniverse — every object class needs broad coverage across lighting, weather, and angle.

10k+ scenes / class
02 · TRAINING

Multi-GPU SynYOLO training

Training SynYOLO on the rendered datasets. Multi-GPU runs to converge on each new hazard class and condition mix.

100k+ synthetic frames
03 · SCALING

Class library expansion

We're scaling from a single class (traffic cones, live today) toward an industrial hazard library covering safety, infrastructure, and compliance.

50+ classes planned ’26