Vision extension
  • 23 Aug 2023


Article Summary

With the Vision extension, you can use the Google Cloud Vision API to get more insight out of images from your records. You can quickly classify your images into thousands of categories (like "dog," "lighthouse," or "Sahara"), extract those labels, and save them to a field in your base—meaning that you can tag hundreds of images with just a few clicks. You can also identify logos, or use optical character recognition (OCR) to extract text from images. Please note that this extension requires a Google Cloud Vision API key.


Introduction

Plan availability: All paid plans
Permissions
  • Owners/Creators - Can create, delete, and modify extensions
  • Editors - Can edit the configuration of previously configured extensions
Platform(s): Web/Browser, Mac app, and Windows app
Related reading

Guide: Build your own Airtable extensions

Airtable terminology

Extension - Extensions are modular components that add visuals or functionality to a base and are shown in the base's dashboard. Users can create custom extensions, or use extensions created by Airtable or other open-source extensions.

Install the Vision extension

  1. Click the Extensions option in the top-right corner of your base.
  2. Click Add an extension.
  3. In the extensions menu that opens, search for and install the Vision extension.


Enter your Google Cloud API key

After adding the extension to your base, you'll be prompted to enter your Google Cloud API key. To get an API key:

  1. Create a Google Cloud project. Go to the Google Cloud Platform Console and create a new project. You may also use an existing project that you own.
  2. Enable billing for your project. On the Billing page, add a payment method for your project. You will only be charged when you analyze images.
  3. Enable the Cloud Vision API. Go to the Cloud Vision API page and add it to your project. On the Credentials page, create an API key.

Once you've followed these steps, paste your API key in the space provided in the Vision extension setup window.

Note
Over time, Google may change the API and requirements of the Cloud Vision feature. We recommend you consult their Cloud Vision documentation when there are discrepancies.
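
If you want to check a new key outside Airtable, the sketch below builds a label-detection request against the public Cloud Vision REST endpoint. It is only an illustration: the function name build_annotate_request, the placeholder key, and the example image URL are assumptions, and this article does not describe how the extension itself calls the API.

```python
import json
import urllib.parse
import urllib.request

# Public REST endpoint for Cloud Vision image annotation.
VISION_ENDPOINT = "https://vision.googleapis.com/v1/images:annotate"


def build_annotate_request(api_key, image_uri):
    """Build (but do not send) a label-detection request for one image URL.

    Hypothetical helper for illustration; not part of the Vision extension.
    """
    body = {
        "requests": [
            {
                "image": {"source": {"imageUri": image_uri}},
                "features": [{"type": "LABEL_DETECTION", "maxResults": 5}],
            }
        ]
    }
    query = urllib.parse.urlencode({"key": api_key})
    return urllib.request.Request(
        f"{VISION_ENDPOINT}?{query}",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values: substitute your own key and a publicly reachable image.
    request = build_annotate_request("YOUR_API_KEY", "https://example.com/photo.jpg")
    print(request.get_method(), request.full_url)
    # Uncomment to send the request. Each analyzed image may be billed by Google.
    # with urllib.request.urlopen(request) as response:
    #     print(json.load(response))
```

A response containing labelAnnotations suggests the key, billing, and API enablement are all in place; an error response usually names which of the three is missing.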


Configure extension settings

Next, you'll need to select which images you want to analyze and what features to extract from them. To select the images to analyze, click the gear icon next to "Images to analyze" to adjust which table, view, and field you want to use.


Then, choose what features to extract from the images by selecting one or more of the three available options: labels, logos, and text.
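
For readers familiar with the Cloud Vision API, the three options plausibly correspond to the LABEL_DETECTION, LOGO_DETECTION, and TEXT_DETECTION feature types. The mapping below is an assumption made for illustration, not something this article confirms, and the names FEATURE_TYPES and build_features are hypothetical.

```python
# Assumed mapping from the extension's options to Cloud Vision feature types.
FEATURE_TYPES = {
    "labels": "LABEL_DETECTION",
    "logos": "LOGO_DETECTION",
    "text": "TEXT_DETECTION",
}


def build_features(selected):
    """Return the "features" list of an annotate request for the chosen options."""
    unknown = [name for name in selected if name not in FEATURE_TYPES]
    if unknown:
        raise ValueError(f"Unknown option(s): {', '.join(unknown)}")
    return [{"type": FEATURE_TYPES[name]} for name in selected]


if __name__ == "__main__":
    # Selecting two of the three options yields two feature entries.
    print(build_features(["labels", "text"]))
```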

Analyze images

Once you've configured the available options, click "Analyze" to begin extracting the selected information based on how Google categorizes your images. A dialog will ask you to confirm your use of the API. After you confirm by clicking the Analyze button, you'll see previews of the extracted text on the associated records.

Before saving the results, you'll need to select a destination text field (a single line text field or a long text field) to hold the extracted text. You can select the destination field next to each of the options you've selected.

Next, click Save.

Extracted logos

Extracted logos will output the company or brand name associated with any logos depicted in your attachments.


Extracted text

Extracted text will pull words from your images and put them in a designated text field.


Extracted labels

Extracted labels will pull labels from your images and place them in the designated text field as comma-separated values.
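
To make the three outputs concrete, here is a minimal sketch of how a single Cloud Vision response could be flattened into text field values. The field names labelAnnotations, logoAnnotations, textAnnotations, and description come from the Cloud Vision REST response format; the function summarize_response, the sample data, and the choice to join logos with commas are assumptions, not the extension's documented behavior.

```python
def summarize_response(response):
    """Flatten one Cloud Vision annotate response into plain text values.

    Hypothetical helper for illustration; not part of the Vision extension.
    """
    labels = [a["description"] for a in response.get("labelAnnotations", [])]
    logos = [a["description"] for a in response.get("logoAnnotations", [])]
    text_annotations = response.get("textAnnotations", [])
    # The first text annotation holds the full detected text;
    # the remaining entries are the individual words.
    text = text_annotations[0]["description"] if text_annotations else ""
    return {
        "labels": ", ".join(labels),
        "logos": ", ".join(logos),
        "text": text.strip(),
    }


if __name__ == "__main__":
    # Hand-written sample shaped like a Vision response, not real API output.
    sample = {
        "labelAnnotations": [{"description": "Dog"}, {"description": "Lighthouse"}],
        "textAnnotations": [
            {"description": "OPEN DAILY\n"},
            {"description": "OPEN"},
            {"description": "DAILY"},
        ],
    }
    print(summarize_response(sample))
```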



