View Golf Coach Example on GitHub

Check out the complete AI Golf Coach example in our GitHub repository

In this example, we build a real-time golf coach that watches your swing through a camera and gives spoken feedback. It combines Ultralytics YOLO pose detection, which analyzes body position, with Gemini Live for real-time coaching, all running on Stream’s low-latency edge network. The same pattern applies to other video coaching use cases, such as sports training, physical therapy, workout guidance, or drone monitoring.

Vision Agents requires a Stream account for real-time transport. Most providers offer free tiers to get started.
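
To make "analyzing body position" concrete, here is a minimal sketch of one measurement a coach could derive from pose keypoints: the angle at a joint such as the elbow. It is not taken from the example repository. The function name `joint_angle` and the sample coordinates are hypothetical, and the sketch assumes each keypoint is available as an `(x, y)` pixel pair, as pose models typically provide.

```python
import math


def joint_angle(a, b, c):
    """Return the angle in degrees at point b, formed by segments b->a and b->c.

    Each argument is an (x, y) pair, e.g. shoulder, elbow, wrist.
    Hypothetical helper for illustration only.
    """
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    n1 = math.hypot(*v1)
    n2 = math.hypot(*v2)
    if n1 == 0 or n2 == 0:
        raise ValueError("keypoints must not coincide with the joint")
    cos_theta = (v1[0] * v2[0] + v1[1] * v2[1]) / (n1 * n2)
    # Clamp to guard against rounding slightly outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.degrees(math.acos(cos_theta))


# Made-up pixel coordinates for shoulder, elbow, and wrist.
shoulder, elbow, wrist = (320, 180), (360, 260), (440, 260)
elbow_angle = joint_angle(shoulder, elbow, wrist)
```

A value like `elbow_angle` could be tracked frame by frame and summarized for the language model, which then turns it into spoken advice. Which joints matter for a golf swing is a coaching decision that this sketch leaves open.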

What You Will Build

  • Analyze golf swings in real time using YOLO pose detection
  • Process video at configurable FPS with Gemini Live or OpenAI Realtime
  • Deliver spoken coaching feedback based on body position and movement
  • Hot-swap between AI providers with a one-line config change
  • Run on Stream’s low-latency edge network
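
Two items in the list above, configurable FPS and one-line provider swapping, can be sketched in plain Python. This is an illustration under stated assumptions, not the Vision Agents API: `CoachConfig`, `FrameSampler`, and `build_llm` are hypothetical names, and the "providers" are stand-in dictionaries rather than real client objects.

```python
from dataclasses import dataclass, field


@dataclass
class CoachConfig:
    provider: str = "gemini"  # hypothetical: the one line you would change
    fps: float = 4.0          # frames per second forwarded to the model


@dataclass
class FrameSampler:
    """Drop frames so that at most target_fps frames per second pass through."""

    target_fps: float
    _next_due: float = field(default=0.0, init=False)

    def __post_init__(self):
        if self.target_fps <= 0:
            raise ValueError("target_fps must be positive")

    def should_process(self, timestamp: float) -> bool:
        # timestamp is the frame's capture time in seconds.
        if timestamp >= self._next_due:
            self._next_due = timestamp + 1.0 / self.target_fps
            return True
        return False


def build_llm(config: CoachConfig) -> dict:
    """Return a stand-in description of the selected realtime provider."""
    factories = {
        "gemini": lambda fps: {"provider": "gemini", "fps": fps},
        "openai": lambda fps: {"provider": "openai", "fps": fps},
    }
    if config.provider not in factories:
        raise ValueError(f"unknown provider: {config.provider}")
    return factories[config.provider](config.fps)


config = CoachConfig(provider="gemini", fps=4.0)
llm = build_llm(config)
sampler = FrameSampler(target_fps=config.fps)
```

Keeping the provider name and frame rate in one config object means the rest of the pipeline does not change when you switch models. Lower FPS values reduce the number of frames sent to the model, at the price of coarser coverage of a fast movement like a swing.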

Next Steps

Live Sports Commentator

Use Roboflow object detection for multi-object tracking

Ultralytics Integration

Explore YOLO model options and configuration