Vision extension
  • 05 Jul 2022
  • 2 Minutes to read
  • Dark
    Light

Vision extension

  • Dark
    Light

With the Vision extension, you can use the Google Cloud Vision API to get more insight out of images from your records. You can quickly classify your images into thousands of categories (like "dog," "lighthouse," or "Sahara"), extract those labels, and save them to a field in your base—meaning that you can tag hundreds of images with just a few clicks. You can also identify logos, or use optical character recognition (OCR) to extract text from images. Please note that this extension requires a Google Cloud Vision API key.

NOTE

Extensions let you extend the functionality of your bases: you can use extensions to bring new information into Airtable, visualize and summarize your records in colorful ways, and even directly integrate your Airtable bases with your favorite extensions.

Install the vision extension

Start by clicking the "Apps" button at the top-right hand corner of your base. You can then click "Install an extension", and then search for and install the Vision extension.

add_app

Enter your Google Cloud API key

After adding the extension to your base, you'll be prompted to enter your Google Cloud API key. To get your API key you need to:

  1. Create a Google Cloud project. Go to the Google Cloud Platform Console and create a new project. You may also use an existing project that you own.
  2. Enable billing for your project. On the Billing page, add a payment method for your project. You will only be charged when you analyze images.
  3. Enable the Cloud Vision API. Go to the Cloud Vision API page and add it to your project. On the Credentials page, create an API key.

Once you've followed these steps, paste your API key in the space provided in the Vision extension setup window.

enter_api_key

Configure extension settings

Next, you'll need to select which images you want to analyze and what features to extract from them. To select the images to analyze, click the gear icon next to "Images to analyze" to adjust which table, view, and field you want to use.

image_options

Then, choose what features to extract from the images by select one or more of the three available options.

features.png

Analyze images

Once you've configured the available options, click "Analyze" to begin extracting the selected information based on how Google categorizes your images. This will bring up a confirmation dialog confirming your use of the API. After you confirm by clicking the Analyze button, you'll see previews of the text that the extension has extracted from your images on their associated records.

Before savings the results you'll need to select a destination text field (using a single line text field or long text field) into which the extracted text can go. You can select the destination text field next to each of the options you've selected (shown in the screenshot below).

results.png

Next, click save.

Extracted logos

Extracted logos will output the company or brand name associated with any logos depicted in your attachments.

Screen Shot 2017-11-15 at 5.15.09 PM

Extracted text

Extracted text will pull words from your images and put them in a designated text field.

Screen Shot 2017-11-15 at 5.40.06 PM

Extracted labels

Extracted labels will pull labels from your images and place them in the designated text field as comma separated values.

Screen Shot 2017-11-15 at 5.13.21 PM


Was this article helpful?