Image Input - MythicDot.AI

👁️

Visual Understanding

Describe scenes, identify objects, understand context

📝

Text Extraction (OCR)

Read and extract text from images and documents

🔍

Visual Reasoning

Answer questions about image content

📊

Chart Analysis

Interpret graphs, charts, and diagrams

🏷️

Object Detection

Identify and locate objects within images

🎨

Style Analysis

Recognize artistic styles, colors, and composition

How It Works

🏞️

🔎 AI Analysis

Scene Type

Outdoor landscape - mountain vista

Objects Detected

Mountains, lake, trees, clouds

Colors

Blue sky, green forest, white peaks

Text Found

None detected

Supported Formats

.PNG

PNG Images

.JPEG

JPEG Images

.GIF

GIF Images

.WEBP

WebP Images

Quick Start

Python - Analyze an Image

from mythicdot import MythicDot
import base64

client = MythicDot()

# Option 1: URL
response = client.chat.completions.create(
    model="mythic-4",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What's in this image?"},
                {
                    "type": "image_url",
                    "image_url": {"url": "https://example.com/photo.jpg"}
                }
            ]
        }
    ]
)

# Option 2: Base64
with open("image.png", "rb") as f:
    image_data = base64.b64encode(f.read()).decode()

response = client.chat.completions.create(
    model="mythic-4",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Describe this image"},
                {
                    "type": "image_url",
                    "image_url": {
                        "url": f"data:image/png;base64,{image_data}"
                    }
                }
            ]
        }
    ]
)
                

Use Cases

📄

Document Processing

Extract text from scanned documents, receipts, forms, and handwritten notes

🛒

E-commerce

Analyze product images, generate descriptions, and categorize inventory

♿

Accessibility

Generate alt text for images to improve accessibility for screen readers

🔧

Technical Support

Analyze screenshots of error messages and UI issues for troubleshooting

💡 Tips for Best Results

Use high-resolution images when possible (up to 20MB)
Provide specific questions about the image for focused analysis
Use multiple images in a single request for comparison tasks
Enable "high" detail mode for documents with small text
Combine with web search for context about unfamiliar objects

Start Analyzing Images

Give your AI applications the power of vision.

Vision Guide → Multi-Modal →

🖼️ Image Input