
Octopus AI Banner

πŸ™ Because Two Hands Just Aren't Enough




πŸ™ Meet Octopus AI: Because Two Hands Just Aren't Enough

Let's be honest: being a human is exhausting. You only have two arms, one brain, and a desperate, daily need for caffeine. How are you supposed to handle a never-ending to-do list with hardware like that?

Enter Octopus AI.

The Philosophy (Why an Octopus?)

We looked at the animal kingdom for the ultimate productivity guru and found the undisputed multitasking ninja of the sea. Why? Because octopuses are freakishly smart and boast eight highly capable arms.

They can open child-proof jars from the inside, solve puzzles, and juggle multiple tasks without breaking a sweat (mostly because they live underwater, but you get the point). We took that big-brained, multi-limbed brilliance and turned it into an AI tool designed to do your heavy lifting.

What Can the Tentacles Do for You?

🦾 Eight-Armed Multitasking: While your clumsy human hands are still typing a single sentence, Octopus AI is already crunching data, drafting emails, organizing your schedule, and virtually high-fiving itself.

🧠 Escape-Artist Intelligence: Got a problem that feels like you're stuck in a locked box? Octopus AI uses its massive, squishy digital brain to squeeze through complex problems and find elegant solutions.

🔄 Total Flexibility: It adapts to your workflow seamlessly. No rigid bones, no friction – just smooth, intelligent automation wrapping around your daily tasks.

🧹 100% Mess-Free: All the genius of a cephalopod, with absolutely zero ink squirted on your nice clean desk when it gets surprised.

Stop drowning in a sea of tabs and endless tasks. Let Octopus AI wrap its virtual tentacles around your workload, so you can go back to doing what humans do best: taking naps and drinking coffee. ☕


πŸ—οΈ Architecture

Octopus AI Architecture

```mermaid
graph TB
    subgraph Frontend["🎨 Frontend (HTML/CSS/JS)"]
        UI[Chat Interface]
        Settings[Settings Panel]
    end

    subgraph Backend["⚙️ FastAPI Backend"]
        Agent[🐙 Agent Engine]
        Config[Config Manager]
        Memory[Memory / Persistence]
    end

    subgraph LLM["🧠 LLM Providers"]
        OpenAI[OpenAI<br/>GPT-4o / GPT-4o-mini]
        Anthropic[Anthropic<br/>Claude 3.5 Sonnet]
        Gemini[Google Gemini<br/>Gemini 3 Flash]
        Ollama[Ollama<br/>Llama / Mistral]
    end

    subgraph Tools["🦑 Tentacle Tools"]
        Shell[🐚 Shell]
        FileOps[📁 File Ops]
        WebBrowse[🌐 Web Browse]
        CodeRun[💻 Code Runner]
        Search[🔍 Web Search]
    end

    UI -- WebSocket --> Agent
    Settings -- REST API --> Config
    Agent --> LLM
    Agent --> Tools
    Agent --> Memory
```

🦑 Features

🔧 Five Powerful Tentacles

| Tentacle | Capability | Description |
|---|---|---|
| 🐚 | Shell Commands | Execute system commands with real-time output streaming |
| 📁 | File Operations | Read, write, list, search, and manage files & directories |
| 🌐 | Web Browse | Fetch, parse, and summarize any web page |
| 💻 | Code Execution | Run Python code in a sandboxed environment |
| 🔍 | Web Search | Search the internet via DuckDuckGo |

🧠 Multi-Provider LLM Support

Switch between AI providers on the fly – no restart needed:

| Provider | Models | Authentication |
|---|---|---|
| OpenAI | GPT-4o, GPT-4o-mini, GPT-4-Turbo | API Key |
| Anthropic | Claude 3.5 Sonnet, Claude 3.5 Haiku, Claude 3 Opus | API Key |
| Google Gemini | Gemini 3 Flash, Gemini 2.5 Pro/Flash | API Key or Google Sign-In |
| Ollama | Llama 3.2, Mistral, Code Llama + any local model | Local (free!) |

🎨 Premium Dark-Ocean GUI

  • Glassmorphism design with deep-ocean dark theme
  • Animated octopus welcome screen with CSS tentacle animation
  • Real-time streaming chat with full Markdown rendering
  • Live tool execution visualization – see each tentacle in action
  • Settings panel with provider/model/temperature selection
  • Responsive design optimized for desktop & mobile
  • Google Sign-In for seamless Gemini integration

💾 Persistent Memory

  • Conversations automatically saved to disk as JSON
  • Auto-generated conversation titles
  • Full-text searchable conversation history
  • Configurable context window (up to 50 messages)
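The persistence features above can be sketched in a few lines. This is a minimal illustration, not the actual `memory.py`: the file layout (`data/memory/<id>.json`), the auto-title heuristic, and the function names are all assumptions:

```python
import json
import time
from pathlib import Path

MEMORY_DIR = Path("data/memory")   # assumed location of saved conversations
CONTEXT_WINDOW = 50                # most recent messages sent to the model

def save_conversation(conv_id, messages, directory=MEMORY_DIR):
    """Persist one conversation to disk as JSON."""
    directory.mkdir(parents=True, exist_ok=True)
    record = {
        "id": conv_id,
        # Auto-title heuristic: first few words of the first user message.
        "title": " ".join(messages[0]["content"].split()[:6]) if messages else "New chat",
        "updated": time.time(),
        "messages": messages,
    }
    (directory / f"{conv_id}.json").write_text(json.dumps(record, indent=2))
    return record

def context_for_llm(messages, window=CONTEXT_WINDOW):
    """Trim history to the configurable context window."""
    return messages[-window:]
```

Full-text search over history then reduces to scanning the JSON files in the memory directory.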

πŸ›‘οΈ Security & Sandboxing (v2.1+)

  • Path Isolation: All File and Shell interactions are securely jailed strictly to the data/workspace directory.
  • Network Containment: Python subprocess environments are spawned using Linux unshare -rn to sever network access completely and neutralize code-based data extraction.
  • Robust Prompt Armor: Extracted texts from DuckDuckGo queries and Web navigations are structurally isolated within XML wrappers (like <untrusted> or <external_content>), effectively stripping them of instruct-override privileges and preventing Prompt Injections.
  • Local IP Anchoring: To harden the backend against unauthenticated external hijacking, Octopus utilizes a strict LocalhostRestrictionMiddleware layer blocking all non-local connections.
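The path-isolation idea is simple enough to sketch. This is a hedged illustration of the general technique, not the project's actual code; the function name and workspace path are assumptions:

```python
from pathlib import Path

# Assumed jail root; the real value lives in the backend config.
WORKSPACE = Path("data/workspace").resolve()

def safe_path(user_path: str) -> Path:
    """Resolve a user-supplied path and refuse anything outside the jail."""
    candidate = (WORKSPACE / user_path).resolve()
    if not candidate.is_relative_to(WORKSPACE):  # Python 3.9+
        raise PermissionError(f"path escapes workspace: {user_path}")
    return candidate
```

Resolving before checking is the important part: it collapses `..` segments so traversal tricks like `../../etc/passwd` are caught.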

🚀 Installation

Prerequisites

| Requirement | Version |
|---|---|
| Python | 3.10 or higher |
| pip | Latest recommended |
| API Key | At least one (OpenAI / Anthropic / Gemini) – or Ollama for free local models |

Quick Start

```bash
# 1. Clone the repository
git clone https://github.com/Masriyan/Octopus-Ai.git
cd Octopus-Ai

# 2. Make the start script executable
chmod +x start.sh

# 3. Launch everything (auto-installs deps, starts backend + frontend)
./start.sh
```
Then open http://localhost:5500 in your browser. 🎉

Manual Setup

If you prefer to set things up manually:

```bash
# Create virtual environment
python3 -m venv venv
source venv/bin/activate

# Install dependencies
pip install -r backend/requirements.txt

# Copy environment config
cp .env.example .env
# Edit .env and add your API key(s)

# Start the backend
cd backend
python3 -m uvicorn main:app --host 0.0.0.0 --port 8000 --reload &

# Start the frontend (in another terminal)
cd frontend
python3 -m http.server 5500
```

Configure API Keys

  1. Open http://localhost:5500
  2. Click the ⚙️ Settings button in the sidebar
  3. Select your preferred LLM provider
  4. Enter your API key and click Save
  5. Start chatting! 🐙

Using Ollama (Free / Local)

```bash
# Install Ollama
curl -fsSL https://ollama.ai/install.sh | sh

# Pull a model
ollama pull llama3.2

# In Octopus AI settings → select "Ollama" as provider
```

📖 Usage

Basic Chat

Simply type your message and Octopus AI will respond. It automatically detects when tools would be helpful and uses them proactively.
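The "detect and use tools" behavior boils down to a loop in the agent engine: ask the model, execute any tool call it requests, feed the result back, and repeat until the model answers in plain text. A schematic sketch follows; `run_agent` and the message shapes are illustrative assumptions, not the actual `agent.py` API:

```python
# Schematic agent tool loop. `llm` is any callable that takes the message
# history and returns either a tool request or a plain answer; `tools`
# maps tool names to callables. All names here are made up for clarity.
def run_agent(llm, tools, user_message, max_steps=5):
    messages = [{"role": "user", "content": user_message}]
    for _ in range(max_steps):
        reply = llm(messages)
        if reply.get("tool") is None:
            return reply["content"]                  # plain answer: done
        result = tools[reply["tool"]](reply["args"])  # run the tentacle
        messages.append({"role": "tool", "content": result})
    return "step limit reached"
```

The `max_steps` cap keeps a confused model from looping through tool calls forever.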

Capability Quick-Start Cards

The welcome screen features interactive cards that demonstrate each tentacle:

| Card | Example Prompt |
|---|---|
| 🐚 Shell | "List all files in my home directory" |
| 📁 Files | "Read and summarize the README.md in the current project" |
| 🔍 Search | "Search the web for the latest AI news" |
| 💻 Code | "Write a Python script to calculate Fibonacci numbers and run it" |
| 🌐 Web | "Fetch and summarize the contents of https://news.ycombinator.com" |
| 🦑 Multi | "Help me analyze my system information" |

Settings & Configuration

| Setting | Description |
|---|---|
| LLM Provider | Switch between OpenAI, Anthropic, Gemini, or Ollama |
| Model | Choose the specific model for the selected provider |
| Temperature | Control response creativity (0.0 = focused, 1.0 = creative) |
| Tentacle Permissions | Enable/disable individual tools |
| API Keys | Securely save provider API keys |
| Google Sign-In | Authenticate with Google for Gemini access |

WebSocket Streaming

Octopus AI uses WebSocket connections for real-time, token-by-token streaming – no polling, no delays.
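The streaming contract can be sketched provider-agnostically: the backend forwards each token the moment the LLM yields it, instead of waiting for the full reply. In this illustration `send` stands in for something like the WebSocket's send method, and the token stream is faked; none of these names come from the repo:

```python
import asyncio

async def fake_llm_stream(prompt):
    """Stand-in for a provider's streaming API: yields tokens one by one."""
    for token in ["Eight ", "arms ", "are ", "better."]:
        await asyncio.sleep(0)  # yield control, as real network I/O would
        yield token

async def stream_reply(prompt, send):
    """Forward each token as it arrives; nothing is buffered or polled."""
    async for token in fake_llm_stream(prompt):
        await send(token)

received = []

async def collect(token):
    received.append(token)

asyncio.run(stream_reply("hi", collect))
```

In the real app the frontend's WebSocket client appends each token to the chat bubble as it lands, which is what makes replies appear to type themselves.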


πŸ“ Project Structure

```
Octopus-Ai/
├── backend/
│   ├── main.py              # FastAPI server + WebSocket endpoints
│   ├── agent.py             # Core agent engine with tool loop
│   ├── llm_providers.py     # OpenAI / Anthropic / Gemini / Ollama
│   ├── config.py            # Configuration manager
│   ├── memory.py            # Conversation persistence (JSON)
│   ├── requirements.txt     # Python dependencies
│   └── tools/
│       ├── __init__.py      # Tool registry & schema builder
│       ├── shell_tool.py    # 🐚 Shell command execution
│       ├── file_tool.py     # 📁 File system operations
│       ├── web_tool.py      # 🌐 HTTP page fetching
│       ├── code_tool.py     # 💻 Python code execution
│       └── search_tool.py   # 🔍 DuckDuckGo web search
├── frontend/
│   ├── index.html           # Main application page
│   ├── css/main.css         # Deep-ocean dark theme
│   └── js/app.js            # Frontend logic & WebSocket client
├── data/                    # Created at runtime (git-ignored)
│   ├── config.json          # User preferences & API keys
│   └── memory/              # Saved conversations
├── docs/
│   └── images/              # Documentation assets
├── .env.example             # Environment variable template
├── .gitignore               # Git ignore rules
├── start.sh                 # One-command launcher
├── CHANGELOG.md             # Release history
├── CONTRIBUTING.md          # Contribution guidelines
├── LICENSE                  # MIT License
└── README.md                # ← You are here
```

πŸ› οΈ Development

Backend (FastAPI)

```bash
cd backend
python3 -m uvicorn main:app --reload --port 8000
```

Frontend (Static)

```bash
cd frontend
python3 -m http.server 5500
```

API Documentation

Visit http://localhost:8000/docs for the interactive Swagger UI.

REST API Endpoints

| Method | Endpoint | Description |
|---|---|---|
| GET | /api/health | Health check |
| GET | /api/config | Get configuration (keys masked) |
| POST | /api/config | Update configuration |
| POST | /api/config/apikey | Save an API key |
| GET | /api/conversations | List all conversations |
| POST | /api/conversations | Create new conversation |
| GET | /api/conversations/{id} | Get conversation with messages |
| DELETE | /api/conversations/{id} | Delete conversation |
| GET | /api/models/{provider} | List available models |
| POST | /api/auth/google | Google OAuth authentication |
| WS | /ws/chat/{conv_id} | WebSocket for real-time chat |
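GET /api/config returns keys masked. One common way to do that is to keep only the last few characters; this is a sketch of the technique, and the exact format the endpoint uses is an assumption:

```python
# Hypothetical masking helper; the real endpoint's format may differ.
def mask_key(key: str) -> str:
    """Replace all but the last 4 characters of an API key with '*'."""
    if not key:
        return ""
    if len(key) <= 4:
        return "*" * len(key)
    return "*" * (len(key) - 4) + key[-4:]
```

Keeping a visible suffix lets users confirm which key is configured without the full secret ever leaving the backend.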

πŸ—ΊοΈ Roadmap

```mermaid
gantt
    title Octopus AI Development Roadmap
    dateFormat  YYYY-MM
    section Core
    Multi-LLM Providers       :done, 2025-01, 2025-02
    Tool System (5 tentacles) :done, 2025-01, 2025-02
    WebSocket Streaming       :done, 2025-02, 2025-03
    section Enhancements
    Plugin System             :active, 2025-03, 2025-05
    RAG / Document Chat       :2025-04, 2025-06
    Voice Input/Output        :2025-05, 2025-07
    section Infrastructure
    Docker Support            :2025-03, 2025-04
    Auth & Multi-User         :2025-05, 2025-07
    Cloud Deployment          :2025-06, 2025-08
```

🤝 Contributing

Contributions are welcome! Please see CONTRIBUTING.md for guidelines.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-tentacle)
  3. Commit your changes (git commit -m 'Add amazing tentacle')
  4. Push to the branch (git push origin feature/amazing-tentacle)
  5. Open a Pull Request

📄 License

This project is licensed under the MIT License – see the LICENSE file for details.


πŸ™ Philosophy

An octopus has eight arms, each capable of independent action – tasting, gripping, exploring. Octopus AI embodies this: many tools, each specialized, working together to accomplish any task.


Made with 🐙 by Masriyan

⭐ Star this repo • 🐛 Report Bug • 💡 Request Feature
