Capability
Team — Media
Upload a photo or short video. The Python ML service runs captioning and OCR on the visuals and Whisper transcription on the audio, so the chat LLM can reason about the upload.
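One way the flow above could be wired together is sketched below. This is an illustration only, not the service's actual code: `MediaContext`, `analyze_upload`, the `kind` values, and the prompt labels are all hypothetical names, and the three extractors are passed in as plain callables so the sketch runs without any model installed. It also assumes photos get the vision steps only, while videos additionally get audio transcription.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical extractor type: takes a file path, returns extracted text.
# In a real service these would wrap a captioning model, an OCR engine,
# and a Whisper transcription call.
Extractor = Callable[[str], str]


@dataclass
class MediaContext:
    """Text extracted from one upload (illustrative structure)."""
    caption: str = ""
    ocr_text: str = ""
    transcript: str = ""

    def to_prompt(self) -> str:
        """Render the non-empty fields as labelled lines for the chat LLM."""
        parts = [
            ("Caption", self.caption),
            ("Text in image", self.ocr_text),
            ("Audio transcript", self.transcript),
        ]
        return "\n".join(f"{label}: {value}" for label, value in parts if value)


def analyze_upload(
    path: str,
    kind: str,
    caption: Extractor,
    ocr: Extractor,
    transcribe: Extractor,
) -> MediaContext:
    """Run the vision steps on every upload and the audio step on videos only.

    The photo/video split is an assumption made for this sketch.
    """
    if kind not in ("photo", "video"):
        raise ValueError(f"unsupported media kind: {kind!r}")
    ctx = MediaContext(caption=caption(path), ocr_text=ocr(path))
    if kind == "video":
        ctx.transcript = transcribe(path)
    return ctx


if __name__ == "__main__":
    # Stub extractors stand in for the real models.
    ctx = analyze_upload(
        "clip.mp4",
        "video",
        caption=lambda p: "a person holding a sign",
        ocr=lambda p: "SALE 50%",
        transcribe=lambda p: "come on in",
    )
    print(ctx.to_prompt())
```

Passing the extractors in as callables keeps the orchestration testable on its own; the resulting text block is what would be appended to the chat LLM's context.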