LLM Launcher (llml)
LLM Launcher (llml) is a TUI for people who already have models on disk and are
tired of reconstructing launch commands from shell history.
It scans your local filesystem for GGUF and Hugging Face-style safetensors models,
detects installed runtimes (llama.cpp, vLLM, Ollama, and KoboldCpp),
and lets you save named parameter profiles per model — so the command that worked
last time is always one keystroke away.
Browse local models. Detect the right runtime. Launch with one key.
✨ Features
Model discovery — auto-scans common paths for GGUF files and safetensors model
directories; add extra roots via LLML_MODEL_PATHS and/or config.toml. Results are
cached under {UserConfigDir}/llml/config.toml so the next launch can skip the
filesystem walk when the cache is still valid.
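For example, to add two extra scan roots before the next scan (illustrative paths; a colon-separated, PATH-style list is assumed here):
export LLML_MODEL_PATHS="$HOME/models:/mnt/nas/gguf"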
Runtime detection — finds installed llama-server, vllm, and koboldcpp binaries,
detects an installed ollama binary plus the configured Ollama host, then maps each
model to its compatible runtimes. GGUF models can use llama.cpp or KoboldCpp via profile selection.
Named parameter profiles — save multiple profiles per model (e.g. fast-laptop,
quality, api-8080), each storing runtime args, env vars, port, and context
settings. The active profile is always one key away.
Profile export — share your parameter profiles with others via the TUI
(E) or CLI (llml export). Profiles are written to a portable TOML file
matching the same schema the llml-import skill reads. Filter by model or
profile name, toggle individual profiles or entire model groups, and handle
file collisions (overwrite or auto-suffixed save).
Portable profile format — the shared import/export contract is documented in
docs/profile-format.md and used by the repo-managed llml-import agent skill.
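As a rough sketch, an exported file might carry entries shaped like the following (every field name here is a hypothetical illustration; docs/profile-format.md is the authoritative schema):
# hypothetical exported profile entry (sketch only)
[[profiles]]
model = "~/models/mistral-7b.Q4_K_M.gguf"  # illustrative model path
name = "fast-laptop"                       # illustrative profile name
args = ["--ctx-size", "8192"]              # illustrative runtime args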
One-keystroke launch — select a model, select a profile, press R. The generated
command is shown before execution and server output streams directly in the TUI.
Persistent status and alert history — long-running work such as Ollama preloads stays
visible in a persistent status line, while warnings and errors remain inspectable in a
dedicated alert-history pane.
Ollama preload flow — Ollama models are discovered via the Ollama API. Pressing
R starts ollama serve if needed, then preloads the selected model into the
shared Ollama service with keep_alive: -1.
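The preload mirrors Ollama's documented keep-alive call; /api/generate is Ollama's standard endpoint, while the host and model name below are illustrative:
curl http://localhost:11434/api/generate -d '{"model": "llama3", "keep_alive": -1}'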
Zero required setup — common model directories and binary locations are checked
automatically; configure only what differs from the defaults.
🚀 Quick start
Requirements
A runtime engine, at least one installed: llama.cpp (llama-server) or KoboldCpp (koboldcpp)
for GGUF models, vLLM (vllm) for safetensors models, and/or Ollama (ollama) for
Ollama models (see Runtime Engines).
Models in default scan locations, or custom roots configured with LLML_MODEL_PATHS (see Model Discovery).
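A quick way to see which engine binaries are already on your PATH (standard POSIX shell builtin):
command -v llama-server vllm koboldcpp ollama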
Install
Pick one path; you only need a single install method.
Go (go install)
Requires Go 1.26+. Ensure $(go env GOPATH)/bin is on your PATH.
go install github.com/flyingnobita/llml/cmd/llml@latest
Homebrew
brew tap flyingnobita/tap
brew install llml
Upgrade later with brew upgrade llml.
Scoop
llml is published to the maintainer's bucket, not the default Scoop main bucket.
winget
After the package manifest merges into the public Windows Package Manager catalog:
winget install --id FlyingNobita.llml
Upgrade later with:
winget upgrade --id FlyingNobita.llml
Pre-built binaries
For each GitHub release, archives are published for Linux and macOS (tar.gz) plus Windows (zip). Names follow GoReleaser’s pattern, for example llml_1.2.3_Linux_x86_64.tar.gz, llml_1.2.3_Darwin_arm64.tar.gz, or llml_1.2.3_Windows_x86_64.zip (adjust version and OS/arch to match your download). Extract the llml binary or llml.exe.
# Example: Linux x86_64 — use the archive name from the release you downloaded
tar -xzf llml_1.2.3_Linux_x86_64.tar.gz
chmod +x llml
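Install on your PATH if you like (Linux/macOS/WSL); /usr/local/bin is one common choice:
sudo install -m 0755 llml /usr/local/bin/llml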
Build from source (Linux/macOS/WSL):
git clone https://github.com/flyingnobita/llml.git
cd llml
go build -o llml ./cmd/llml
To install a development build from your clone, use go install ./cmd/llml from the repo root, or copy the llml binary onto your PATH.
Start
llml
llml will automatically scan common locations for models and binaries. If your setup is non-standard (e.g., binaries not on PATH or models in custom folders), see the Configuration section to point the app to the right directories. Select a model in the UI and press R to launch.
⌨️ Usage
Key — Action
hjkl/↑↓←→ — Move selection; horizontal scroll when the path column is wider than the terminal
E — Open profile export modal: select profiles by model group, filter by name or backend, and save to a portable TOML file you can share across machines
r — Reload [runtime] from config.toml and re-detect binaries (no model rescan)
S — Full model filesystem rescan; refresh cached [[models]] in config.toml
R — Run server (split view: table + log pane)
ctrl+R — Run server full-screen
c — Edit runtime environment (paths, ports)
p — Edit parameter profiles for the selected model
m — Edit extra model search paths (saved in config.toml)
, / . — Change sort column / reverse sort direction
enter — Copy the launch command for the selected row to the clipboard
a — Toggle alert history pane
t — Cycle theme (dark → light → auto → …)
? — Toggle the full shortcut help overlay
q — Quit
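Pulling the r and S rows together: the cache file they touch lives at {UserConfigDir}/llml/config.toml. A minimal sketch of its shape (only the [runtime] and [[models]] table names appear in this README; everything inside them is an illustrative assumption):
# {UserConfigDir}/llml/config.toml (sketch only)
[runtime]
# binary locations, ports, Ollama host overrides, ...

[[models]]
# one cached entry per discovered model (path, format, ...)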
Server output
R runs the server in a split layout: the model table stays in the upper half and server logs stream into a scrollable pane below. tab switches focus between the two panes; esc, q, or ctrl+c stops the server.
ctrl+R runs full-screen: the TUI is suspended and the server process is attached directly to your terminal. On Linux/macOS, after the server exits you are prompted to press Enter before the TUI redraws. On Windows there is no Enter prompt; you return when the server process exits.
For Ollama rows, R and ctrl+R do not start a dedicated per-model server on its own port.
Instead, llml ensures the shared Ollama daemon is running on the configured host
and preloads the selected model into memory with keep_alive: -1. The selected row
still matters, but the service endpoint remains the shared Ollama host.
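You can confirm the shared endpoint is reachable with Ollama's standard list-models call (default host shown; substitute your configured host):
curl http://localhost:11434/api/tags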
Status and alerts
llml separates active work from alert history:
a persistent status line shows in-flight operations such as starting Ollama or loading a model
a opens a bottom alert-history pane with timestamped INFO, WARN, and ERROR entries
when the pane is closed, the footer shows an unread alert count
Minor confirmations such as copy-to-clipboard remain transient. Important failures and lifecycle
events stay inspectable in alert history until you dismiss them or they are replaced by later work.
Parameter profiles (p)
Each model path can have multiple named profiles. Each profile stores:
Environment variables (KEY=value per line).
Extra arguments (--flag value per line).
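For instance, a profile for a GGUF model might hold the following (illustrative values; --ctx-size and --port are real llama-server flags):
# environment variables, one per line
CUDA_VISIBLE_DEVICES=0
# extra arguments, one per line
--ctx-size 8192
--port 8080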
R / ctrl+R use the active profile; in the p profile list, the active profile's name is prefixed with (active). Changes persist automatically. tab cycles focus: profile list → env → extra args. On the profile list: a adds a profile, c clones (duplicates) the highlighted profile, d deletes it (you cannot delete the last one), r renames. esc closes the panel, and n cancels a delete confirmation.
Model formats
GGUF: single .gguf files.
Safetensors: directories containing config.json and *.safetensors files (Hugging Face style).
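On disk, a scan root holding one of each might look like this (hypothetical layout):
~/models/
├── mistral-7b.Q4_K_M.gguf            # GGUF: a single file
└── Llama-3-8B-Instruct/              # safetensors: a directory
    ├── config.json
    └── model-00001-of-00002.safetensors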
Data Integrity & Backups
To protect your settings and cache, llml maintains a history of your configuration:
Atomic Writes: Files are written to a temporary location before being moved, preventing corruption.
Automatic Backups: The newest 10 versions of each file are kept in the backups/ directory.
Upgrade Snapshots: When the llml version changes, a snapshot of both config.toml and model-params.json is created automatically so you can roll back if needed. The file .last-run-version in the same directory records the last run version for that behavior.
Development
Requires Go 1.26+ and Node.js (LTS) — both installed automatically by mise install.
Set up tooling
Clone the repository and install tooling:
mise install # installs Go, Node.js, pre-commit (pipx), and other tools from mise.toml
npm ci # installs Prettier + markdownlint
pre-commit install # optional: enable git pre-commit / pre-push hooks (see .pre-commit-config.yaml)
Common Tasks
mise run run # go run ./cmd/llml
mise run build # build to bin/llml
mise run format # auto-fix: gofmt + prettier + markdownlint
mise run lint # check only: gofmt + vet + prettier + markdownlint
mise run test # go test -race ./...
mise run check # lint + test (run before opening a PR)
Layout
cmd/llml — entrypoint.
internal/config — config.toml read/write and cache helpers.