A Windows based AI powered desktop pet written in Python. It can move around on your screen and jump on top of active program windows. And with the help of Google's Gemini Vision model, it can generate funny comments on current on-screen activity by taking screenshots of your screen every minute.
Note
All of the cat sprites used in this project are not mine. They have been taken from here.
- Uses tkinter to display sprite images and text on screen.
- Uses win32gui library to access and utitlize the Windows API to get the active program windows and their X, Y positions.
- Uses pyttsx3 library for Text-to-speech.
- Uses Google's Gemini Vision model for generating comments.
- Add support for linux.
- Add better TTS.
- Add & use more idle animations.
- Make the sprite draggable using mouse.
- Make the sprite stick on other program window borders.
Install Python on your machine if you haven't already.
Download this repository manually, or if you have git installed::
git clone https://github.com/R37r0-Gh057/pyCatAI-pet
Once inside the directory, open your terminal enter the following commands to install the necessary libraries:
pip install -r requirements.txt
Important
Make sure that you have generated your own Gemini API key and placed it in the script:
pyCatAI-pet/lib/CommentGenerator.py
Line 10 in fa14266
Run the main.py
file from terminal:
python main.py
If you want to suggest a new feature or functionality then you can open a new issue with the "enhancement"
title.
If you want to add features or enhance existing code by yourself then feel free to open a Pull Request:
- Fork this repository
- Create a separate branch
- Make your changes
- Open pull request
You can get started by checking the currently open issues, or create new ones.
Feel free to reach out to me on discord: @retr0_gh0st