Image-To-Text (Vision)
Vision Tasks​
Example Request: Image Analysis
Note: API only support BASE64 String as Image input.
Possible Media Types​
image/png
image/gif
image/webp
Example​
import requests
url = "https://api.umamiai.xyz/v1/vision"
headers = {
"Authorization": "Bearer YOUR_UMAMIAI_API_KEY",
"Content-Type": "application/json"
}
payload = {
"model": "claude-3-5-sonnet-20240620",
"max_tokens": 1024,
"messages": [
{
"role": "user",
"content": [
{
"type": "image",
"source": {
"type": "base64",
"media_type": "image/jpeg",
"data": "/9j/4QCARXhpZgAATU0AKgAAAAgABQESAAMAAAABAAEAAAEaAAUAAAABAAAASgEbAAUAAAABAAAAUgEoAAMAAAABAAIAAIdpAAQAAAABAAAAWgAAAAAAAABIAAAAAQAAAEgAAAABAAKgAgAEAAAAAQAABXigAwAEAAAAAQAAAxkAAAAA/+EA+mh0
response = requests.post(url, json=payload, headers=headers)
print(response.json())