A smart, voice-controlled desktop assistant built with Python. It can respond to your voice commands, open applications, answer general knowledge questions using AI, fetch time and date, and more — all with hands-free control.
- 🎤 Voice Recognition using
speech_recognition - 🗣️ Text-to-Speech Output using
pyttsx3 - 🧠 Smart Q&A powered by Gemini (Google's Generative AI)
- 🕒 Tells current date & time
- 📂 Opens desktop apps and files
- ❌ Voice-based control to stop responses (e.g., "ok stop")
- 🌐 Optional: Wikipedia integration (can be replaced with AI)
- Python 3.x
- SpeechRecognition
- pyttsx3
- Google Generative AI API (Gemini)
os,threading,subprocess, and other standard Python libraries
-
Clone this repo:
git clone https://github.com/<your-username>/<your-repo-name>.git cd <your-repo-name>
-
Install dependencies:
pip install -r requirements.txt
-
Set up your Gemini API key:
- Create a file named
config.py - Add your API key like this:
API_KEY = "your-gemini-api-key"
- Create a file named
-
Run the assistant:
python assistant.py
Harrshan S
Biomedical Engineering Student @ SRM IST
Passionate about Artificial Intelligence, Machine Learning, and Innovative Healthcare Solutions.
This project is open-source and available under the MIT License.
- Integrate GUI using
tkinterorPyQt - Add email/SMS functionalities
- Add task reminders and calendar support
Feel free to fork and customize it for your own use! Contributions welcome.