Vision Models

Overview

Our API enables you to use machine learning models for tasks that require visual capabilities. These models are referred to as vision models. Within our API, we offer two categories of vision models: OCR and OFR.

OCR: Optical Character Recognition

With OCR technology, you can analyze any document and extract text as well as other characters and symbols. This allows you to detect:

Text
Paragraph blocks
Handwriting
Text inside PDF/TIFF files

OCR Optical Character Recognition

OFR: Optical Feature Recognition

In contrast to OCR, OFR allows you to analyze not just documents but also images. You can filter exactly what you want to find in the image by the features they include:

Crop hints
Faces
Image properties
Labels
Landmarks
Logos
Multiple objects
Explicit content
Web entities and pages
And many more

OFR Optical Feature Recognition

Overview​

OCR: Optical Character Recognition​

OFR: Optical Feature Recognition​

Overview

OCR: Optical Character Recognition

OFR: Optical Feature Recognition