Google Vision - Read text from image action

The Read text from image action allows you to read text from an image. Supported images include:

  • Photos: For example, a photograph might contain a street or traffic sign. Google Vision returns the extracted string, individual words, and their bounding boxes.

    Abbey Road photo

  • Scanned documents: Google Vision returns the extracted text with page, block, paragraph, word, and break information.

    Scanned document

Input

Input fieldDescription
Image contentProvide a file contents datapill, a string, a public URL, or a Google Cloud Storage URI.
Image typeSelect an image type. Google Vision optimizes the output according to the image type.

Output

Output fieldDescription
Text annotationsList of all individual words extracted from the image.
LocaleLocale of the extracted word.
TextThe extracted word.
Bounding polyThe position of the word on the image. Includes X and Y coordinates.
Full text annotationThe full text extracted from the image.
PagesIncludes details on the pages, such as width, height, and paragraph position.
Full textThe full text extracted from the image.

Last updated: