Show HN: Runprompt – run .prompt files from the command line

github.com

128 points by chr15m 2 days ago

I built a single-file Python script that lets you run LLM prompts from the command line with templating, structured outputs, and the ability to chain prompts together.

When I discovered Google's Dotprompt format (frontmatter + Handlebars templates), I realized it was perfect for something I'd been wanting: treating prompts as first-class programs you can pipe together Unix-style. Google uses Dotprompt in Firebase Genkit and I wanted something simpler - just run a .prompt file directly on the command line.

Here's what it looks like:

--- model: anthropic/claude-sonnet-4-20250514 output: format: json schema: sentiment: string, positive/negative/neutral confidence: number, 0-1 score --- Analyze the sentiment of: {{STDIN}}

Running it:

cat reviews.txt | ./runprompt sentiment.prompt | jq '.sentiment'

The things I think are interesting:

* Structured output schemas: Define JSON schemas in the frontmatter using a simple `field: type, description` syntax. The LLM reliably returns valid JSON you can pipe to other tools.

* Prompt chaining: Pipe JSON output from one prompt as template variables into the next. This makes it easy to build multi-step agentic workflows as simple shell pipelines.

* Zero dependencies: It's a single Python file that uses only stdlib. Just curl it down and run it.

* Provider agnostic: Works with Anthropic, OpenAI, Google AI, and OpenRouter (which gives you access to dozens of models through one API key).

You can use it to automate things like extracting structured data from unstructured text, generating reports from logs, and building small agentic workflows without spinning up a whole framework.

Would love your feedback, and PRs are most welcome!

Barathkanna 2 days ago

This is really clever. Dotprompt as a thin, pipe-friendly layer around LLMs feels way more ergonomic than spinning up a whole agent stack. The single-file + stdlib approach is a nice touch too. How robust is the JSON schema enforcement when chaining multiple steps?

  • chr15m 2 days ago

    If the LLM returns invalid schema the next link in the chain will send that through since it defaults to string input if the JSON doesn't parse. Maybe I should make it error out in that situation.

    • anonym29 a day ago

      Is including a json schema validator and running the output through the validator against the input prompt, such that you can detect when the output doesn't match the schema, and optionally retry until it does match (possibly with a max number of attempts before it throws an error) too complex of an idea for the target implementation concept you were envisioning?

      It certainly doesn't intuitively sound like it matches the "Do one thing" part of the Unix philosophy, but it does seem to match the "and do it well" part.

      That said, I can totally understand a counterargument which proposes that schema validation and processing logic should be something else that someone desiring reliability pipes the output into.

      • chr15m 13 hours ago

        I'm not sure. I think I need to use it more to see what the LLMs do with bad data. The design you're suggesting might be the answer though.

threecheese 20 hours ago

Looks like Google has packaged dotprompt into a Python library, might allow you to make the codebase leaner: https://github.com/google/dotprompt/tree/main/python/dotprom...

I think you mentioned elsewhere that you dont want to have a lot of dependencies, but as the format evolves using the reference impl will allow you to work on real features.

  • chr15m 12 hours ago

    Will have a look at this. That could be the way to go. Thanks.

cootsnuck 2 days ago

This is pretty cool. I like using snippets to run little scripts I have in the terminal (I use Alfred a lot on macOS). And right now I just manually do LLM requests in the scripts if needed, but I'd actually rather have a small library of prompts and then be able to pipe inputs and outputs between different scripts. This seems pretty perfect for that.

I wasn't aware of the whole ".prompt" format, but it makes a lot of sense.

Very neat. These are the kinds of tools I love to see. Functional and useful, not trying to be "the next big thing".

dymk 2 days ago

Can the base URL be overridden so I can point it at eg Ollama or any other OpenAI compatible endpoint? I’d love to use this with local LLMs, for the speed and privacy boost.

  • chr15m 4 hours ago

    I've implemented this now. You can set it with BASE_URL or OPENAI_BASE_URL which seems to vaguely be the standard. I also plan to use this with local LLMs. Thanks for the suggestion!

  • chr15m 2 days ago

    Good idea. Will figure out a way to do this.

    • khimaros 2 days ago

      simple solution: honor OPENAI_API_BASE env var

    • benatkin 2 days ago

      Perhaps instead of writing an llm abstraction layer, you could use a lightweight one, such as @simonw's llm.

      • chr15m 2 days ago

        I don't want to introduce a dependency. Simon's tool is great but I don't like the way it stores template state. I want my state in a file in my working folder.

        • threecheese 21 hours ago

          Can you explain this decision a bit more? I’m using ‘llm’ and I find your project interesting.

          • chr15m 4 hours ago

            llm stores data (prompts, responses, chats, fragments, aliases, attachment metadta) in a central sqlite database outside the working directory, and you have to use the tool to view and manipulate that data. I prefer a tool like this to default to storing things in a file or files in the project directory I'm working in, in a way that is legible e.g. plain text files. Contrast with e.g. git where everything goes into .git.

            Functions require you to specify them on the command line every time they're invoked. I would prefer a tool like this to default to reading the functions from a hierarchy where it reads e.g. .llm-functions in the current folder, then ~/.config/llm-functions or something like that.

            In general I found myself baffled when trying to figure out where and how to configure things. That's probably me being impatient but I have found other tools to have more straightforward setup and less indirection.

            Basically I like things to be less centralized, magic, and less controlled by the tool.

            Another thing, which is not the fault of llm at all, is I find Python based tools annoying to install. I have to remember the env where I set them up. Contrast with a golang application which is generally a single file I can put in ~/bin. That's the reason I don't want to introduce a dep to runprompt if I can avoid it.

            The final thing that I found frustrating was the name 'llm' which makes it difficult to conduct searches as it is the generic name for what the thing is.

            It is an amazing piece of engineering and I am a huge fan of simonw's work, but I don't use llm much for these reasons.

tomComb 2 days ago

Everything seems to be about agents. Glad to see a post about enabling simple workflows!

meander_water a day ago

This is really cool and interesting timing, as I created something similar recently - https://github.com/julio-mcdulio/pmp

I've been using mlflow to store my prompts, but wanted something lightweight on the cli to version and manage prompts. I setup pmp so you can have different storage backends (file, sqlite, mlflow etc.).

I wasn't aware of dotprompt, I might build that in too.

gessha 2 days ago

Just like Linus being content with other people working on solutions to common problems, I’m so happy that you made this! I’ve had this idea for a long time but haven’t had the time to work on it.

__MatrixMan__ 2 days ago

It would be cool if there were some cache (invalidated by hand, potentially distributed across many users) so we could get consistent results while iterating on the later stages of the pipeline.

  • stephenlf 2 days ago

    That’s a great idea. Store inputs/outputs in XDG_CACHE_DIR/runprompt.sqlite

  • chr15m 2 days ago

    Do you mean you want responses cached to e.g. a file based on the inputs?

    • __MatrixMan__ a day ago

      Yeah, if it's a novel prompt, by all means send it to the model, but if its the same prompt as 30s ago, just immediately give me the same response I got 30s ago.

      That's typically how we expect bash pipelines to work, right?

      • chr15m 12 hours ago

        Bash pipelines don't do any caching and will execute fresh each time, but I understand your idea and why a cache is useful when iterating on the command line. I'll implement it. Thanks!

  • dymk a day ago

    “tee” where you want to intercept and cat that file into later stages?

    • __MatrixMan__ a day ago

      Yeah sure but it breaks the flow that makes bash pipelines so fun:

      - arrow up

      - append a stage to the pipeline

      - repeat until output is as desired

      If you're gonna write to some named location and later read from it you're drifting towards a different mode of usage where you might as well write a python script.

cedws 2 days ago

Can it be made to be directly executable with a shebang line?

  • leobuskin 2 days ago

    /usr/local/bin/promptrun

      #!/bin/bash
      file="$1"
      model=$(sed -n '2p' "$file" | sed 's/^# \*//')
      prompt=$(tail -n +3 "$file")
      curl -s https://api.anthropic.com/v1/messages \
        -H "x-api-key: $ANTHROPIC_API_KEY" \
        -H "content-type: application/json" \
        -H "anthropic-version: 2023-06-01" \
        -d "{
          \"model\": \"$model\",
          \"max_tokens\": 1024,
          \"messages\": [{\"role\": \"user\", \"content\": $(echo "$prompt" | jq -Rs .)}]
        }" | jq -r '.content[0].text'
    
    
    hello.prompt

      #!/usr/local/bin/promptrun
      # claude-sonnet-4-20250514
    
      Write a haiku about terminal commands.
  • chr15m 12 hours ago

    I added this and you can now make .prompt files with a runprompt shebang.

    #/usr/bin/env runprompt

  • _joel 2 days ago

    it already has one - https://github.com/chr15m/runprompt/blob/main/runprompt#L1

    If you curl/wget a script, you still need to chmod +x it. Git doesn't have this issue as it retains the file metadata.

    • vidarh 2 days ago

      I'm assuming the intent was to as if the *.prompt files could have a shebang line.

         #!/bin/env runprompt
         ---
         .frontmatter...
         ---
         
         The prompt.
      
      Would be a lot nicer, as then you can just +x the prompt file itself.
  • chr15m 2 days ago

    That's on my TODO list for tomorrow, thanks!

stephenlf 2 days ago

Seeing lots of good ideas in this thread. I am taking the liberty of adding them as GH issues

stephenlf 2 days ago

Fun! I love the idea of throwing LLM calls in a bash pipe

journal 2 days ago

i literally vibe coded a tool like this. it supports image in, audio out, and archiving.

  • chr15m 2 days ago

    Cool, I'm going to add file modalities too. Thanks for the validation!

ltbarcly3 2 days ago

Ooof, I guess vibecoding is only as good as the vibecoder.

orliesaurus 2 days ago

Why this over md files I already make and can be read by any agent CLI ( Claude, Gemini, codex, etc)?

  • chr15m 2 days ago

    Less typing. More control over chaining prompts together. Reproducibility. Running different prompts on different providers and models. Easy to install and runs everywhere. Inserts into scripting workflows simply. 12 factor env config.

  • jsdwarf 2 days ago

    Claude.md is an input to claude code which requires a monthly plan subscription north of 15€ / month. Same applies to Gemini.md, unless you are ok that they use your prompts for training Gemini. The python script works with a pay per use api key.

  • garfij 2 days ago

    Do your markdown files have frontmatter configuration?

swah 2 days ago

Thats pretty good, now lets see simonw's one...