Skip to main content

Image-To-Text (Vision)

Vision Tasks​

Example Request: Image Analysis

Note: API only support BASE64 String as Image input.

Possible Media Types​

  • image/png
  • image/gif
  • image/webp

Example​

import requests

url = "https://api.umamiai.xyz/v1/vision"
headers = {
"Authorization": "Bearer YOUR_UMAMIAI_API_KEY",
"Content-Type": "application/json"
}
payload = {
"model": "claude-3-5-sonnet-20240620",
"max_tokens": 1024,
"messages": [
{
"role": "user",
"content": [
{
"type": "image",
"source": {
"type": "base64",
"media_type": "image/jpeg",
"data": "/9j/4QCARXhpZgAATU0AKgAAAAgABQESAAMAAAABAAEAAAEaAAUAAAABAAAASgEbAAUAAAABAAAAUgEoAAMAAAABAAIAAIdpAAQAAAABAAAAWgAAAAAAAABIAAAAAQAAAEgAAAABAAKgAgAEAAAAAQAABXigAwAEAAAAAQAAAxkAAAAA/+EA+mh0

response = requests.post(url, json=payload, headers=headers)
print(response.json())