Next Gen Computer Vision Capabilities with Project Florence Foundation Model

Introduction

This article shows how to get started with Computer Vision in the Azure AI Studio. I will also explore the different capabilities of Computer Vision Studio.

Computer Vision 

Computer Vision is an area of artificial intelligence that deals with visual perception. In Azure, it is a unified service that offers Vision Studio capabilities built on the Project Florence foundation model. It gives your apps the ability to read text, analyze images, and detect faces with technologies such as OCR and Spatial Analysis.

Project Florence

Project Florence is a Microsoft AI Cognitive Services initiative that advances the state of the art of computer vision technologies such as OCR, Spatial Analysis, and Image Analysis.

Its goal is to develop a next-generation framework for visual recognition.

Computer Vision Studio Capabilities

Computer Vision Studio offers the following capabilities:

  • Video summary and frame locator: Video search and summarization uses a combination of natural language processing and computer vision techniques to analyze the content of a video.
    It can quickly and concisely summarize the main points of a video and allows you to search for specific moments within the video, making it easy to find relevant content.
  • Remove backgrounds from images: Easily remove the background and preserve foreground elements in your image.
  • Search photos with image retrieval: Retrieve specific moments within your photo album. For example, you can query a wedding you attended last summer, your pet, or your favorite city. Search for images based on the content of the image itself rather than relying solely on manually assigned keywords or tags.
  • Add captions to images: Generate a human-readable sentence that describes the content of an image.
  • Add dense captions to images: Generate human-readable captions for all important objects detected in your image.
  • Detect common objects in images: Detect and extract bounding boxes based on thousands of recognizable objects and living beings.
  • Extract common tags from images: Extract tags based on thousands of recognizable objects, living beings, scenery, and actions.
  • Detect sensitive content in images: Detect sensitive content in images so you can moderate their usage in your applications. You can use this feature to detect when images contain Adult, Racy, or Gory content.
  • Recognize and analyze shelf products: Find specific objects within a single image for use cases, such as locating products on shelves, merchandise on a store display, or items in an assembly line.
  • Detect faces in an image: Detect the location of one or more human faces in images, along with attributes such as pose, face mask, and facial landmarks.
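Several of the capabilities listed above (captions, dense captions, tags, and object detection) can also be requested from code. The following is a minimal sketch, not an official sample. It assumes the Image Analysis 4.0 REST endpoint, the API version 2023-10-01, and a resource endpoint derived from the resource name used later in this article. The key and image URL are placeholders. The request is only built and printed, not sent.

```python
import json
import urllib.parse
import urllib.request

# Assumed placeholders: replace with your own resource endpoint and key.
ENDPOINT = "https://retailvision26.cognitiveservices.azure.com"
API_KEY = "<your-key>"

# Assumption: feature names and API version follow the Image Analysis 4.0 REST API.
FEATURES = ["caption", "denseCaptions", "tags", "objects"]
API_VERSION = "2023-10-01"


def build_analyze_request(image_url: str) -> urllib.request.Request:
    """Build (but do not send) an Image Analysis request for a public image URL."""
    query = urllib.parse.urlencode(
        {"api-version": API_VERSION, "features": ",".join(FEATURES)},
        safe=",",  # keep the comma-separated feature list readable
    )
    url = f"{ENDPOINT}/computervision/imageanalysis:analyze?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    headers = {
        "Ocp-Apim-Subscription-Key": API_KEY,
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


request = build_analyze_request("https://example.com/sample.jpg")
print(request.get_method(), request.full_url)
```

To actually call the service, pass the request to urllib.request.urlopen and parse the JSON response. Caption features may only be available in some regions, so check your resource's region first.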

Computer Vision Steps

Follow these steps to create a Computer Vision service in the Azure portal.

Go to the Azure portal and sign in with your Azure account.

Search "Computer Vision" in the search bar and select "Computer Vision" from the search results.

Click on the Create button for Computer Vision.

In the Basics tab, provide the following information.

Choose the Subscription.

Create a Resource Group named testRG.

Choose East US as the Region and enter retailvision26 as the name.

Select Free F0 as the Pricing tier.

Check the Acknowledge box to accept the terms and conditions.

Click the Next button on the Computer Vision page.

Click the Review + Create button.

You will see the validation message Validation Passed.

Click the Create button.

The deployment starts initializing and completes successfully within a minute or two.

Click the Go to Resource button to open the new resource, where you will find a link to Vision Studio.

Click Vision Studio from the Computer Vision page.
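The portal steps above for creating the resource can also be sketched as Azure CLI commands. The snippet below is a hedged illustration that only assembles and prints the commands. It reuses the names from this walkthrough (testRG, retailvision26, East US, F0) and assumes the Azure CLI is installed and you are signed in with az login. The function name build_create_commands is hypothetical.

```python
import shlex

# Values taken from the portal walkthrough above.
RESOURCE_GROUP = "testRG"
RESOURCE_NAME = "retailvision26"
REGION = "eastus"    # East US
PRICING_TIER = "F0"  # Free tier


def build_create_commands() -> list[list[str]]:
    """Return the Azure CLI commands as argument lists, without running them."""
    return [
        # Create the resource group.
        ["az", "group", "create",
         "--name", RESOURCE_GROUP, "--location", REGION],
        # Create the Computer Vision resource; --yes accepts the terms prompt.
        ["az", "cognitiveservices", "account", "create",
         "--name", RESOURCE_NAME, "--resource-group", RESOURCE_GROUP,
         "--kind", "ComputerVision", "--sku", PRICING_TIER,
         "--location", REGION, "--yes"],
    ]


for command in build_create_commands():
    print(shlex.join(command))
```

To run the commands instead of printing them, each argument list could be passed to subprocess.run(command, check=True).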

Glimpses of Vision Studio

Video Summary and frame locator

This feature is in preview. Choose a video clip to see the summarization and frame locator capabilities. You can also upload your own file to a Blob container from the Azure portal.

Here, we selected the Datacenter video, which showcases a data center, depicting the hardware that powers it and the people who maintain it.

Videos that have been uploaded to Vision Studio are stored in your account for 48 hours. After that, they are deleted automatically.

You can see that Computer Vision is used in various use cases like Retail, Factory, and Warehouse.

Click Run a test and then choose the option to locate specific frames using a search query.

In the Search Query box, type the query "Person with a laptop" and then click the Run button.

In the Results section, Vision Studio lists the time frames in the video that match the query "Person with a laptop".

Remove backgrounds from images

This feature is in preview. You can upload images to a Blob container from the Azure portal.

This feature easily removes backgrounds from the images.

You can now see that the trees in the background have been removed from the original image.
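Background removal can also be requested outside Vision Studio. The sketch below builds such a request without sending it. It assumes the preview segmentation endpoint of the Image Analysis REST API (imageanalysis:segment with mode=backgroundRemoval and api-version 2023-02-01-preview). Because the feature is in preview, this endpoint may change or be retired. The endpoint host, key, and image URL are placeholders.

```python
import json
import urllib.parse
import urllib.request

# Assumed placeholders: replace with your own resource endpoint and key.
ENDPOINT = "https://retailvision26.cognitiveservices.azure.com"
API_KEY = "<your-key>"


def build_background_removal_request(image_url: str) -> urllib.request.Request:
    """Build (but do not send) a background removal request for an image URL."""
    # Assumption: preview API version and mode name; verify against current docs.
    query = urllib.parse.urlencode(
        {"api-version": "2023-02-01-preview", "mode": "backgroundRemoval"}
    )
    url = f"{ENDPOINT}/computervision/imageanalysis:segment?{query}"
    body = json.dumps({"url": image_url}).encode("utf-8")
    headers = {
        "Ocp-Apim-Subscription-Key": API_KEY,
        "Content-Type": "application/json",
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


removal_request = build_background_removal_request("https://example.com/tree.jpg")
print(removal_request.get_method(), removal_request.full_url)
```

If the request is sent with urllib.request.urlopen, the response body is expected to be image bytes with the background made transparent, which can be written to a .png file.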

Search photos with image retrieval

This feature is in preview. You can upload images to a Blob container from the Azure portal.

Here, choose the Nature gallery from the menu.

Enter the query "Dew drops on a plant" and then click the Search button.

The query results are ordered from most relevant to least relevant in the dataset. Use the slider to view more or fewer images based on their relevance to the query.
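The relevance ordering and slider can be illustrated with a small sketch. It assumes that the text query and each image are represented as embedding vectors and compared with cosine similarity, which is a common approach for image retrieval. The article does not confirm how Vision Studio ranks results internally. The vectors, file names, and the min_score threshold (standing in for the slider) are made up for illustration.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_images(query_vector, image_vectors, min_score=0.0):
    """Return (name, score) pairs sorted from most to least relevant.

    min_score plays the role of the relevance slider: raising it keeps
    fewer, more relevant images.
    """
    scored = [
        (name, cosine_similarity(query_vector, vector))
        for name, vector in image_vectors.items()
    ]
    kept = [item for item in scored if item[1] >= min_score]
    return sorted(kept, key=lambda item: item[1], reverse=True)


# Toy 3-dimensional vectors; real embeddings have many more dimensions.
query = [1.0, 0.0, 0.0]
gallery = {
    "dew_drops.jpg": [0.9, 0.1, 0.0],
    "mountain.jpg": [0.1, 0.9, 0.2],
    "leaf.jpg": [0.6, 0.4, 0.0],
}

for name, score in rank_images(query, gallery, min_score=0.5):
    print(f"{name}: {score:.3f}")
```

Lowering min_score corresponds to moving the slider toward more images, at the cost of including less relevant ones.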

Summary

In this article, we created a Computer Vision resource in the Azure portal and opened Vision Studio from it. We then explored several Vision Studio capabilities: video summary and frame locator, background removal, and image retrieval. Vision Studio makes it easy to try these capabilities and opens up many opportunities for innovation.

Thanks for reading this article!

