Mins Bot
A floating desktop AI assistant. A swirling orb sits on your desktop and expands into a full chat panel with voice, vision, browser automation, OS control, and 180+ skills that actually do things instead of just talking.
# Download
Grab the latest build for your machine. The Windows package is a portable ZIP — unzip and run, no installer needed. macOS and Linux builds are coming soon.
.dmg for Apple Silicon and Intel.deb · .AppImageMinsBot\MinsBot.exe. The bundled JRE means you don't need a separate Java install.# Installation
Download the package for your OS and run it. The bundled JRE means no separate Java install is required.
| OS | File | Steps |
|---|---|---|
| Windows latest | MinsBot-1.0-windows-latest.zip · always points to the latest build |
Unzip → run MinsBot\MinsBot.exe |
| macOS soon | — | Coming soon |
| Linux soon | — | Coming soon |
Run from source
# First run
On first launch a setup wizard prompts for the three primary LLM keys (OpenAI, Anthropic, Gemini). The rest is configurable from the Setup tab. Keys are written to a local application-secrets.properties file at the project root — gitignored, never bundled into installers, and rejected at build time if a populated key is detected on the classpath.
# Try saying
Type any of these into the floating chat panel:
kill port 8080how many files in my downloads foldertake a screenshotzip C:\projects\mins-bot to D:\backup.zipauto-start yourself on login
play music by radioheadnarrate me a bedtime story for my doguse a jarvis voice
explain this pageread the article aloudextract all rows from this table
audit a landing pagemarket the app and publish to bluesky
# Capabilities
The agent picks the right tool from 180+ built-in skills the moment you say what you want — no command names to remember.
01 · Chat & agent
- Multiple model providers — OpenAI, Anthropic, Gemini, Groq, plus a local model for fully offline use
- Persistent memory and chat transcripts across restarts
- Autonomous mode that works on standing instructions while you're away
02 · Voice (offline-capable)
- Local text-to-speech with 11+ curated voices, including a “Jarvis” preset
- Speeds up for status updates, slows down for narration
- Optional cloud voices (ElevenLabs, Fish Audio, OpenAI) when you want them
03 · Browser — "AI guide for any website"
- Drives your Chrome — click, fill, navigate, extract — under your own profile
- Explain the current page out loud
- Read articles aloud in your chosen voice
- Walk you step-by-step through how to use a site you've never seen
- Pull every matching row off a page into a clean spreadsheet
04 · Marketing automation
- One-shot growth flow — find trends, study competitors, draft ad copy, schedule posts, line up outreach
- Building blocks for lead lists, gigs, content research, landing-page audits, proposals, hashtag suggestions, and more
- Publishes for you — Bluesky, Mastodon, webhooks, email — only when you ask
05 · System / OS control
- Kill a stuck app, free a busy port, quit any program, control any window
- Find big files, clean up temp clutter, hunt down duplicates, tidy your disk
- Network, VPN, firewall, Docker, Git, build status
- Lock the screen, sleep, schedule a shutdown — or auto-start the bot on login
06 · Daily ops
- Multi-monitor screenshots, media keys, play music from your library or online
- Zip and unzip folders, bulk-rename files, watch a log live
- Soothing audio for pets — cat TV, dog calming music, bedtime stories for the dog
07 · Deliverables
- Generate full PDF, Word, and PowerPoint decks straight from chat — "create a deck about top EVs of 2025"
- Charts, tables, and images embedded; markdown tables become real slide tables
08 · File & text ops
- Find files by name, search inside files, compare two files, peek at the start/end of a file
- See file size, line count, and stats; open anything in its default app
- Copy, paste, and recall recent clipboard items
# Data & privacy
- Chat memory, transcripts, voice prefs, and bot data live in
~/mins_bot_data/and./memory/— local only - Cloud LLM calls go directly to the providers you configure; leave them blank to run fully offline via Ollama
- No telemetry. No outbound calls beyond the providers you've configured and skill-supplied URLs
- Installers bundle no API keys. A build-time guard fails the build if a populated key is found on the classpath
# Spec table
Quick reference of what is retained, where, and by whom.
| Item | Value | Location | Notes |
|---|---|---|---|
data.at_rest | on_device | ~/mins_bot_data/ · ./memory/ | No copy elsewhere |
credentials | local_file | application-secrets.properties | Gitignored · never bundled |
training_use | null | — | Never |
telemetry | null | — | None emitted |
model.calls | direct | Device → provider | Or fully offline via Ollama |
default.port | 8765 | localhost | Configurable in application.properties |
license.app | proprietary | EULA | See LICENSE |
license.skills_sdk | MIT | Standalone skills | Redistributable |