TermiLLM

A terminal-based LLM chat app that runs locally and interacts with your vLLM server

Backend Architecture

TermiLLM is a client. It does not run model inference itself.

The intended design is:

TermiLLM runs as a terminal client in its own Python environment
The inference backend runs as a separate service
The two communicate over HTTP using OpenAI-compatible endpoints such as /v1/models and /v1/chat/completions

This means you can:

Run vLLM in a different local Python environment
Run vLLM on another machine and point TermiLLM at it
Replace vLLM with another OpenAI-compatible backend later

vLLM Integration

TermiLLM works well with vLLM, but vLLM is expected to be started separately from the TermiLLM client. Before using TermiLLM:

Install vLLM in a separate environment if needed
Start a vLLM server with your preferred model, for example:

python -m vllm.entrypoints.api_server --model Qwen/Qwen2.5-Coder-3B-Instruct --port 8000

For local development, a common setup is:

terminal A: activate your vllm environment and start the vLLM server on port 8000
terminal B: activate TermiLLM's environment and run ./run.sh

Features

Interactive Chat Interface: Connect to your local vLLM backend with streaming responses
User Experience:
- Colorful output using Rich for a pleasant terminal experience
- Keyboard navigation to review chat history
- Stream responses from your local LLM in real-time
Command System:
- /help - Display available commands
- /clear - Clear the current conversation
- /exit - Exit the application
- /model - Change the model on the fly
- /temp - Adjust temperature setting
- /max_tokens - Change maximum token output
Configuration Management:
- Persistent settings via JSON configuration file
- Dynamic model switching without restarting

Usage

source ./venv.sh
./run.sh

By default, TermiLLM connects to http://localhost:8000.

You can also specify a different model or server:

./run.sh --model Qwen/Qwen2.5-Coder-3B-Instruct --base-url http://localhost:8000

If your inference service is already running in another local environment or on another machine, only the base_url and model name need to match that backend.

Configuration

TermiLLM creates a configuration file named termillm_config.json in the application directory that stores your settings. You can edit this file directly to customize your preferences:

{
  "model": "Qwen/Qwen2.5-Coder-3B-Instruct",
  "base_url": "http://localhost:8000",
  "temperature": 0.7,
  "max_tokens": 2048
}

Settings can also be changed from within the application using commands like /model, /temp, and /max_tokens.

Development Roadmap

Interesting? Feel free to contribute or create a PR for features you want or bugs you found! The following is the plan:

V 1.0.0

V 1.0.1 (In Progress)

Restructure the Python app into replaceable modules
Add a Python MVP agent loop
Add confirmation and safety policy for command execution
Add pytest
CI/CD

V 1.1.0 (Planned)

Future Tasks

A LangChain Mode
Moving to bubbletea style
Integrated local inference into it
Integrated model into it

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
termillm		termillm
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
TermiLLM.py		TermiLLM.py
config-sample.json		config-sample.json
requirements.txt		requirements.txt
run.sh		run.sh
termillm_config.json		termillm_config.json
venv.sh		venv.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TermiLLM

Backend Architecture

vLLM Integration

Features

Usage

Configuration

Development Roadmap

V 1.0.0

V 1.0.1 (In Progress)

V 1.1.0 (Planned)

Future Tasks

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

TermiLLM

Backend Architecture

vLLM Integration

Features

Usage

Configuration

Development Roadmap

V 1.0.0

V 1.0.1 (In Progress)

V 1.1.0 (Planned)

Future Tasks

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages