
Code stays local.AI understands
Terminal-first
developer AI.
A local-first REPL that understands your repository, remembers across sessions, and routes intelligently to the right model for the task. Built for developers who want AI that stays under their control.
Persistent REPL Session
Launch once with `velune` and stay inside. Ask questions directly, use slash commands like `/run`, `/model`, and `/optimus`, or pick from interactive menus. Your session never drops.
Codebase Understanding
Velune maps how your project connects — functions, modules, and dependencies. Ask high-level questions about your architecture, not just which file contains what.
Memory Across Sessions
Velune remembers past decisions, recurring patterns, and architectural context across days and weeks. The longer you use it, the better it understands your project.
Any Model, One CLI
Run locally with Ollama or route to OpenAI, Anthropic, Gemini, Groq, and more. Switch between speed and power with a single command. Credentials stay on your machine.
Install.Audit.Analyze.
Install
in secondsInstall Velune as a Python package. Supports Ollama (local), OpenAI, Anthropic, or any provider. Credentials stored securely in OS keyring.
Launch
the REPLStart the persistent REPL session. Direct prompts or slash commands. Arrow-key navigation, interactive pickers, live streaming.
Work
interactivelyAsk questions, use /run for autonomous tasks (Phase 2+), adjust with /optimus (fast) or /godly (powerful). Your memory persists.
See it
work.
Install, init, and run. Five specialised agents review your task from every angle — planning, writing, reviewing, challenging, and synthesising — before anything is shipped.
Connect
everything.
Route to OpenAI, Anthropic, Groq, xAI, Gemini, or run locally with Ollama. Understand your codebase via GitHub. Execute in Terminal. All in one CLI.
Privacy by
default.

Runs locally by default
Velune works entirely on your machine out of the box. Cloud providers are available when you need them, but never required.
Remembers across sessions
Velune builds up knowledge of your project over time — past decisions, recurring patterns, and architectural context that carries forward.
No lock-in, ever
Start with local models, switch to any cloud provider, and switch back — same CLI, same memory, no configuration rewrites.
Principled
from the ground up.
Velune is built on four non-negotiables: your code stays on your machine by default, you are never locked into one AI provider, every change is previewed before it happens, and the roadmap is honest about what exists today versus what is coming.
Private by default
Runs locally on your machine. Your code never leaves unless you choose to route to a cloud provider. Credentials are stored securely in your OS keyring.
Understands your project
Velune maps your codebase structure — not just text. It knows how things connect, so it can reason about the impact of any change before acting.
Always shows the diff
Before any file is changed, Velune shows exactly what it wants to do. Review line by line, accept or reject. One command to roll back.
No lock-in, ever
Start with Ollama locally, move to OpenAI, switch to Anthropic — same CLI, same memory, same session. Change your mind anytime.
Building in public — the journey.
Every great tool starts with a foundation you can trust. We took the time to get the async runtime, streaming, and security model exactly right before shipping anything else.
What's
coming.
Velune is actively developed. Here's where we are, and where we're going.
Reliable MVP
REPL, streaming, model routing
- Persistent REPL — launch once, stay inside
- Live streaming responses from Ollama and cloud providers
- Conversation memory that carries context through your session
- Automatic model selection based on task type
- /run for autonomous task execution
- /model to switch AI providers interactively
- Speed modes: /optimus (fast) and /godly (max power)
- Session history stored locally on your machine
Repository Cognition
Codebase mapping, persistent memory, smart routing
- Full codebase mapping — functions, classes, and module relationships
- Memory that recalls past sessions and architectural decisions
- Semantic memory that understands context, not just keywords
- Impact analysis before any code change
- Prompts automatically optimised per AI model
- Smart routing that learns which model suits each task best
- Memory ranked by relevance and recency
Agents & Orchestration
Multi-agent council, advanced codebase intelligence
- Deep impact analysis using commit history and dependency signals
- Scalable codebase graph built for large repositories
- Symbol tracking that stays accurate across refactors
- Memory that stays current and avoids surfacing stale context
- Safe, sandboxed execution — filesystem, git, and terminal
- Council mode: 5 agents deliberate on every complex task
- Intelligent module grouping for large, multi-team codebases
- Context pulled from your actual test run and CI history