Computer Vision

The Computer Vision Team empowers Mujin’s robots to perceive the world. In unstructured environments like logistics warehouses, robots need to handle millions of different SKUs, variable lighting, and dynamic obstacles. We build the eyes that make this possible.

We use high-end 3D cameras and custom sensors to generate accurate point clouds of the workspace. These point clouds tell the robot where objects are in 3D space.
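As a rough illustration of how a depth image becomes a point cloud, the sketch below back-projects each pixel through a standard pinhole camera model. The function name, the tiny 2x2 depth image, and the intrinsics (`fx`, `fy`, `cx`, `cy`) are assumptions made up for this example; they do not describe Mujin's cameras or pipeline.

```python
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def depth_to_point_cloud(
    depth: Sequence[Sequence[float]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point3D]:
    """Back-project a row-major depth image (metres) into camera-frame points."""
    points: List[Point3D] = []
    for v, row in enumerate(depth):        # v is the pixel row
        for u, z in enumerate(row):        # u is the pixel column
            if z <= 0.0:                   # treat non-positive depth as invalid
                continue
            x = (u - cx) * z / fx          # pinhole model: u = fx * x / z + cx
            y = (v - cy) * z / fy          # pinhole model: v = fy * y / z + cy
            points.append((x, y, z))
    return points


# Hypothetical 2x2 depth image with one invalid pixel, and made-up intrinsics.
depth = [[1.0, 0.0],
         [2.0, 2.0]]
cloud = depth_to_point_cloud(depth, fx=2.0, fy=2.0, cx=0.5, cy=0.5)
```

A production system would typically vectorize this step and correct for lens distortion first; the loop form is kept here only for readability.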

Using advanced machine learning and geometric algorithms, we identify items to pick—whether they are boxes, sacks, or loose parts—and determine their precise orientation for grasping.
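One simple geometric way to estimate an item's in-plane orientation is to take the principal axis of its points as seen from above. The sketch below is only an assumed, minimal example of that idea (the function name and sample points are hypothetical), not the matching method the team actually uses.

```python
import math
from typing import Sequence, Tuple


def principal_axis_angle(points: Sequence[Tuple[float, float]]) -> float:
    """Return the angle in radians, within (-pi/2, pi/2], of the dominant axis."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(p[0] for p in points) / n
    mean_y = sum(p[1] for p in points) / n
    # Entries of the 2x2 scatter matrix (unnormalized covariance).
    sxx = sum((p[0] - mean_x) ** 2 for p in points)
    syy = sum((p[1] - mean_y) ** 2 for p in points)
    sxy = sum((p[0] - mean_x) * (p[1] - mean_y) for p in points)
    # Closed-form direction of the largest eigenvector of a symmetric 2x2 matrix.
    return 0.5 * math.atan2(2.0 * sxy, sxx - syy)


# Hypothetical top-down points lying along a 45-degree diagonal.
diagonal = [(0.0, 0.0), (1.0, 1.0), (2.0, 2.0), (3.0, 3.0)]
yaw = principal_axis_angle(diagonal)

# Corners of an axis-aligned 4 x 1 rectangle: the long side lies along x.
box_corners = [(0.0, 0.0), (4.0, 0.0), (4.0, 1.0), (0.0, 1.0)]
box_yaw = principal_axis_angle(box_corners)
```

The angle is only defined up to a half turn, which suits symmetric items such as boxes; asymmetric parts need a full model-based pose estimate.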

Machine Learning

Deep learning models for segmentation and classification of novel goods.

Geometric Vision

Model-based matching for precise industrial parts handling.

Innovation: Interactive Calibration

One of our key innovations is autonomous calibration. Traditionally, calibrating a camera to a robot is a tedious manual process. Our system calibrates itself by having the robot move a target around, achieving sub-millimeter accuracy in minutes rather than hours.
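To give a feel for the underlying math, the sketch below fits a rigid transform between target positions seen by the camera and the same positions reported by the robot. It is a deliberately simplified planar (2D) least-squares fit, chosen here for brevity; it is not Mujin's calibration procedure, and the function names and sample coordinates are assumptions for illustration.

```python
import math
from typing import List, Sequence, Tuple

Point2D = Tuple[float, float]


def fit_rigid_2d(
    camera_pts: Sequence[Point2D], robot_pts: Sequence[Point2D]
) -> Tuple[float, Point2D]:
    """Least-squares rotation angle and translation mapping camera_pts to robot_pts."""
    n = len(camera_pts)
    if n < 2 or n != len(robot_pts):
        raise ValueError("need at least two matched point pairs")
    cmx = sum(p[0] for p in camera_pts) / n
    cmy = sum(p[1] for p in camera_pts) / n
    rmx = sum(p[0] for p in robot_pts) / n
    rmy = sum(p[1] for p in robot_pts) / n
    cross = 0.0
    dot = 0.0
    for (cx, cy), (rx, ry) in zip(camera_pts, robot_pts):
        ax, ay = cx - cmx, cy - cmy        # centred camera observation
        bx, by = rx - rmx, ry - rmy        # centred robot position
        cross += ax * by - ay * bx
        dot += ax * bx + ay * by
    theta = math.atan2(cross, dot)         # optimal rotation angle
    c, s = math.cos(theta), math.sin(theta)
    # The translation aligns the rotated camera centroid with the robot centroid.
    tx = rmx - (c * cmx - s * cmy)
    ty = rmy - (s * cmx + c * cmy)
    return theta, (tx, ty)


def apply_rigid_2d(theta: float, t: Point2D, pts: Sequence[Point2D]) -> List[Point2D]:
    """Map points from the camera frame into the robot frame."""
    c, s = math.cos(theta), math.sin(theta)
    return [(c * x - s * y + t[0], s * x + c * y + t[1]) for x, y in pts]


# Hypothetical data: the robot frame is the camera frame rotated by 90 degrees
# and shifted by (1, 2).
camera_pts = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0)]
robot_pts = [(1.0, 2.0), (1.0, 3.0), (0.0, 2.0)]
theta, t = fit_rigid_2d(camera_pts, robot_pts)
mapped = apply_rigid_2d(theta, t, camera_pts)
```

A real camera-to-robot calibration works with full 3D poses and also has to account for sensor noise and lens intrinsics, so the closed form above should be read as intuition only.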

Challenges

  • Transparent/Reflective Objects: Handling difficult materials that confuse standard depth sensors.
  • Occlusion: Figuring out how to pick an object that is partially hidden by others (bin picking).
  • Speed: Processing massive point clouds in real-time to keep up with the robot’s cycle time.

Tech Stack

  • Languages: C++, Python, CUDA
  • Libraries: OpenCV, PCL (Point Cloud Library), PyTorch, TensorFlow
  • Hardware: Industrial 3D Cameras, GPU computing
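
On the speed point in the list above, a common first step for taming large point clouds is voxel-grid downsampling, which replaces all points inside each grid cell with their centroid. The sketch below is a minimal pure-Python version under assumed names and sample data; production code would more likely rely on an optimized C++ or GPU implementation, such as the voxel-grid filter in PCL.

```python
import math
from collections import defaultdict
from typing import Dict, List, Sequence, Tuple

Point3D = Tuple[float, float, float]
VoxelKey = Tuple[int, int, int]


def voxel_downsample(points: Sequence[Point3D], voxel_size: float) -> List[Point3D]:
    """Return one centroid per occupied voxel of edge length voxel_size."""
    if voxel_size <= 0.0:
        raise ValueError("voxel_size must be positive")
    buckets: Dict[VoxelKey, List[Point3D]] = defaultdict(list)
    for x, y, z in points:
        # Integer grid index of the voxel that contains this point.
        key = (
            math.floor(x / voxel_size),
            math.floor(y / voxel_size),
            math.floor(z / voxel_size),
        )
        buckets[key].append((x, y, z))
    centroids: List[Point3D] = []
    for pts in buckets.values():
        n = len(pts)
        centroids.append(
            (
                sum(p[0] for p in pts) / n,
                sum(p[1] for p in pts) / n,
                sum(p[2] for p in pts) / n,
            )
        )
    return centroids


# Hypothetical cloud: two points share a 1 m voxel, the third falls in another.
cloud = [(0.1, 0.1, 0.1), (0.3, 0.3, 0.3), (1.2, 0.0, 0.0)]
reduced = voxel_downsample(cloud, voxel_size=1.0)
```

The voxel size trades detail for speed: larger cells shrink the cloud faster but blur fine geometry that grasp planning may need.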