1. APPLICATION

Simply: This project aims to:

Stream webcam video from the browser to the server via WebRTC,
Capture frames from the video stream as JPEG images, run YOLOX model inference for these images to detect objects, send prediction results to client, with object type and box coordinates, then draw detected boxes on the browser.

With distributed details: This project aims to:

Web UI: Stream webcam video from the browser to the media server via WebRTC,
Media Server: Capture frames from the video stream as JPEG images, push image binary arrays to Redis Streams,
Inference Worker Instances: Consume Redis Streams (STREAM_IMAGES = "images"), take one image item from queue, run YOLOX model inference for this image to detect objects, send prediction results to another Redis Streams (STREAM_PREDICTIONS = "predictions") with object type and box coordinates,
Signaling Server: Consume Redis Streams (STREAM_PREDICTIONS = "predictions"), take prediction results to client browser via web sockets,
Web UI: Listen for Signaling web sockets, for incoming prediction results, draw coordinate boxes and prediction labels on the screen.

This project consists of WebRTC signaling and orchestrator service(Go), WebRTC media server service (Go), YOLOX model deep learning inference service (Python), and Web front-end (TypeScript).

Application topology:

Web UI Screenshot:

Client side logs:

You can track the incoming predictions via web sockets on the browser's console:

< Previous chapter: INFRASTRUCTURE | Next chapter: MONITORING >

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

01-APPLICATION.md

01-APPLICATION.md

1. APPLICATION

Files

01-APPLICATION.md

Latest commit

History

01-APPLICATION.md

File metadata and controls

1. APPLICATION