AIAutomationAPI IntegrationMachine LearningSide Project

How I Accidentally Built Visionary AI

I built an AI screen assistant to cheat on a quiz, and it accidentally became the most useful tool I've ever made. It even speaks multiple languages now.

March 19, 20264 min readBy Muhammad Hasham Khan

Look, I'm not going to pretend this was some grand vision or a "eureka moment in the shower" kind of thing.

I was bored. I had a university quiz coming up. And the thought of copy-pasting questions into ChatGPT every two minutes made me want to throw my laptop out the window.

So naturally, I did what any reasonable person would do: I spent hours building an AI agent instead of just... you know... studying.

The Problem Nobody Asked Me to Solve

Here's the thing about online quizzes — they're annoying. You're staring at questions on one tab, frantically copying text, opening ChatGPT in another tab, pasting, waiting, reading, going back... it's exhausting.

My brain is lazy. My fingers are lazy. Even my copy-paste shortcuts were getting tired.

I thought: "Wouldn't it be nice if something just watched my screen and answered my questions without me doing anything?"

And that's how Visionary AI was born. Not from genius. From pure, unadulterated laziness.

What This Thing Actually Does

Visionary is basically that friend who looks over your shoulder, except it's not annoying and it actually knows stuff.

It watches your screen. Everything. Your browser, your code editor, your error messages, whatever. Then it just... talks to you. You can ask it anything:

  • "Why is my code breaking?"
  • "What does this Windows error mean?"
  • "How do I fix this thing?"
  • "Explain this like I'm five"

And it answers. Right there. No copy-pasting. No switching tabs. No "let me Google that for you" moments.

It's like having a really smart friend who never gets tired of your dumb questions. Except this friend is powered by Google's Gemini AI and doesn't judge you for not knowing what a syntax error is.

The Plot Twist I Didn't See Coming

So I built this thing, tested it, and thought "Cool, it works in English."

Then one day I'm on a call with my friend, speaking in Urdu (because that's what normal humans do), and Visionary is just running in the background on my browser.

Suddenly, this AI agent starts responding to me in Urdu.

I didn't program that. I didn't even know it could do that.

Apparently, Google Gemini is so good at languages that my accidental creation became multilingual without me lifting a finger. It just... picked up my conversation and switched languages like it was no big deal.

That was genuinely shocking. And honestly? Kind of creepy. But mostly cool.

What I Actually Learned

  1. The best tools are built out of laziness Seriously. Necessity is the mother of invention, but laziness is the cool uncle who actually gets things done.

  2. AI is way smarter than we give it credit for I built a simple screen-watching bot and it turned into a multilingual assistant. I barely did anything. Google Gemini did all the heavy lifting.

  3. You don't need to be a genius to build cool stuff I'm not some AI researcher. I just had an API key and a problem. That's it. If I can do it, literally anyone can.

The Bottom Line

Visionary AI started as a lazy solution to avoid studying for a quiz. It ended up being one of the most useful tools I've ever made.

It helps with coding errors, Windows problems, random questions, and apparently it can switch between languages like a multilingual parrot.

Did I pass that quiz? That's not important. Did I build something accidentally awesome? Absolutely.

Sometimes the best projects are the ones you stumble into while trying to avoid real work.

Want to build your own AI agent? Get an API key, have a problem worth solving (or avoiding), and let laziness guide you. It worked for me.

Give it a try here: Visionary AI


Have questions about AI or want to share your project? Reach out to me on LinkedIn or connect via Twitter.

MH

Muhammad Hasham Khan

Hasham Khan — Google Certified AI Engineer, Full-Stack Developer & Software Engineer from Lahore, Pakistan. Specializing in LLMs, AI Agents, RAG, Python & Next.js. Available to hire.