Skip to content

heyllm

Talk to an LLM from your terminal. heyllm is a tiny, dependency-light CLI that streams answers from any OpenAI-compatible Chat Completions API straight to stdout — built for daily use and pipelines.

You ask a question, you get an answer. No plugins, no config files, no chat history — just one command that pipes well. And because it reads piped input as context, it slots naturally into shell workflows: hand it a diff, a log, or a file plus an instruction, and let the model do the rest.

Terminal window
$ heyllm "explain monads simply"
A monad is a way to wrap a value along with a recipe for chaining...
$ git diff | heyllm "write a conventional commit message"
feat(parser): handle SSE frames split across network chunks

The prompt is the instruction; piped stdin is the material it works on. That one idea — <context> | heyllm "<instruction>" — is most of what makes heyllm useful day to day.

Pipes like a Unix tool

Reads piped stdin as context and prints the answer to stdout. The positional prompt is the instruction; stdin is what it operates on — combined into one user message, prompt first.

Any OpenAI-compatible API

Speaks the OpenAI Chat Completions wire format, so --base-url points it at OpenAI, OpenRouter, a local Ollama, or anything else that implements it.

Streams by default

Tokens print as they arrive, parsed from server-sent events. Pass --no-stream when you’d rather wait for the full response in one shot.

Tiny & auditable

One dependency, no config files, no hidden state. A small binary that does the obvious thing — and --json hands you the raw response for scripting.


MIT   github.com/abdulmunimjemal/heyllm