Copilot Vision takes the generative AI concept in a new direction: Instead of creating text or images based on prompts, it can understand and react to visual input and provide context and explanations. Copilot Vision is currently in limited preview in the Edge web browser, which runs on Android, iOS, macOS, and Windows.
I got to try Copilot Vision firsthand, and it’s like nothing you’ve ever seen in a web browser. The Google Lens feature in Chrome bears a slight resemblance, letting you highlight objects on a page and get search results in a side panel, but it’s not conversational. Copilot Vision, by contrast, is a real browsing companion. It takes in everything visual and textual on a page and verbally converses with you about it. I’m here to walk you through how to get it and how it works.
How to Set Up Copilot Vision
For now, Copilot Vision works only for select Copilot Pro…