-
-
Welcome Page with usage instructions
-
Tab to take an image of surroundings
-
Shows detected components and scores them based on urgency
-
AI to answer questions about surroundings and potential repairs
-
Allows user to verbalize a question about the setting
-
Checklist of repairs
-
Stores all past scans and allows user to export it as CSV
CAT Lens is the combination of Cat Inspect and Cat AI Assistant into an intelligent and multimodal field agent. Instead of technicians manually tapping through checklists and typing notes, CAT Lens uses computer vision and voice AI to convert observations into structured inspection data. When a technician points their camera at a leaking hose, the system detects the issue, auto-tags the inspection item as Red inside Cat Inspect, generates a professional repair comment, and triggers a repair workflow in SIS 2.0. In “Dirty Hands Voice Mode,” a technician under a machine can simply say, “Secondary fuel filter seal looks worn but no leaks,” and the AI logs the condition as Yellow/Monitor, attaches contextual notes, and schedules a follow-up at 250 service hours, which eliminates paperwork, reduces errors, and accelerates decision-making.
Beyond automation, CAT Lens hopes to improve spatial intelligence in inspections. Using AR-based spatial tagging and a vision-language model such as Gemini 1.5 Pro, the system reasons about defects. It can compare wear depth across track shoes, identify asymmetry, and visually “pin” Green/Yellow/Red markers onto machine components in augmented reality. This turns static checklists into a live digital overlay of machine health. The result is safer and faster inspections with reduced cognitive load, fewer missed defects, and structured data generated from image and voice inputs. CAT Lens is an AI-powered inspection powerhouse that turns unstructured field data into insights, optimizes service logistics, and sets a new standard for intelligent operations at scale.
Log in or sign up for Devpost to join the conversation.