Overview
Beyond the core stack (pi-ai → pi-agent-core → pi-coding-agent) and the terminal UI (pi-tui), the pi-mono monorepo includes three supporting packages that extend the coding agent into different deployment contexts: Slack integration, GPU infrastructure management, and web-based chat interfaces.
pi-mom (Slack Bot)
The pi-mom package is a Slack bot that bridges Slack messaging to the pi coding agent. It runs as a standalone process using Slack’s Socket Mode API for real-time message delivery without requiring a public HTTP endpoint.
Architecture
Slack Workspace
│
▼ (Socket Mode WebSocket)
┌──────────────┐
│ pi-mom │
│ ┌────────┐ │
│ │ Slack │ │
│ │ Socket │ │──── @slack/socket-mode
│ │ Mode │ │──── @slack/web-api
│ └───┬────┘ │
│ │ │
│ ▼ │
│ ┌────────┐ │
│ │Coding │ │
│ │Agent │ │──── @mariozechner/pi-coding-agent
│ │Runtime │ │──── @mariozechner/pi-ai
│ └───┬────┘ │
│ │ │
│ ▼ │
│ ┌────────┐ │
│ │Sandbox │ │──── @anthropic-ai/sandbox
│ └────────┘ │
└──────────────┘
Key Dependencies
| Dependency | Purpose |
|---|---|
| @slack/socket-mode | WebSocket-based real-time Slack event handling |
| @slack/web-api | Slack API calls for posting messages, reactions, and file uploads |
| @mariozechner/pi-coding-agent | Core coding agent runtime |
| @mariozechner/pi-ai | LLM provider abstraction |
| @anthropic-ai/sandbox | Sandboxed code execution for safety |
| croner | Cron-style scheduling for recurring tasks |
| chalk | Terminal output formatting |
| diff | Text diffing for code change visualization |
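To make the dependency wiring concrete, here is a minimal sketch of how Socket Mode events could be routed into per-channel agent sessions. The real pi-mom wiring is more involved; the `SlackEvent` shape, the `route` helper, and the in-memory session map are illustrative assumptions.

```typescript
// With @slack/socket-mode, the event loop would look roughly like:
//
//   const socket = new SocketModeClient({ appToken: process.env.SLACK_APP_TOKEN! });
//   socket.on("message", async ({ event, ack }) => { await ack(); route(event); });
//   await socket.start();
//
// The routing logic itself is pure and shown runnable below.

interface SlackEvent { channel: string; user?: string; text?: string; bot_id?: string; }

// One agent session per channel (see "Per-Channel Isolation" below).
const sessions = new Map<string, string[]>();

// Returns the per-channel transcript the agent would see, or null if the
// event should be ignored (bot echoes, empty messages).
function route(event: SlackEvent): string[] | null {
  if (event.bot_id || !event.text) return null; // ignore the bot's own messages
  const history = sessions.get(event.channel) ?? [];
  history.push(event.text);
  sessions.set(event.channel, history);
  return history;
}

route({ channel: "C01", user: "U1", text: "deploy the fix" });
route({ channel: "C02", user: "U1", text: "hello" });
const again = route({ channel: "C01", user: "U2", text: "status?" });
console.log(again); // ["deploy the fix", "status?"] — C01 and C02 stay separate
```

Replies would then go out through `@slack/web-api`'s `WebClient.chat.postMessage`.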
Self-Managing Architecture
What makes pi-mom distinctive is that the bot is self-managing: it installs its own tools, writes its own automation scripts (“skills”), and configures credentials autonomously. No manual setup is required beyond providing the Slack tokens.
Per-Channel Isolation
Each Slack channel gets its own isolated context:
| Resource | Scope |
|---|---|
| Conversation context | Per-channel |
| Memory (MEMORY.md) | Per-channel + global |
| Working directory | Per-channel |
| Skills | Per-channel |
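One way to picture this isolation is as a per-channel directory layout. The directory names below are assumptions for illustration, not pi-mom's actual paths:

```typescript
import { posix as path } from "node:path";

// Hypothetical layout: each channel gets its own workdir, memory file, and
// skills folder under a shared root, plus one global MEMORY.md.
function channelWorkspace(root: string, channelId: string) {
  const dir = path.join(root, "channels", channelId);
  return {
    workdir: dir,                               // per-channel working directory
    memory: path.join(dir, "MEMORY.md"),        // per-channel memory
    skills: path.join(dir, "skills"),           // per-channel automation scripts
    globalMemory: path.join(root, "MEMORY.md"), // shared across all channels
  };
}

const ws = channelWorkspace("/var/pi-mom", "C0123");
console.log(ws.memory); // /var/pi-mom/channels/C0123/MEMORY.md
```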
Dual History System
| File | Purpose |
|---|---|
| log.jsonl | Append-only source of truth for all messages |
| context.jsonl | What the LLM actually sees, subject to compaction |
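The split can be sketched as two parallel stores: every message is appended to the log verbatim, while the context view is periodically compacted. The record shape and the naive "summarize the oldest half" compaction below are assumptions; pi-mom's actual compaction strategy may differ.

```typescript
interface Msg { role: "user" | "assistant" | "system"; text: string; }

const log: Msg[] = [];   // stands in for log.jsonl (append-only, never rewritten)
const context: Msg[] = []; // stands in for context.jsonl (what the LLM sees)

function record(msg: Msg, maxContext = 4): void {
  log.push(msg); // source of truth: every message survives here
  context.push(msg);
  if (context.length > maxContext) {
    // Naive compaction: fold the oldest half into a single system note.
    const dropped = context.splice(0, Math.ceil(context.length / 2));
    context.unshift({ role: "system", text: `[${dropped.length} earlier messages compacted]` });
  }
}

for (let i = 0; i < 6; i++) record({ role: "user", text: `message ${i}` });
console.log(log.length, context.length); // log keeps all 6; context stays bounded
```

Because the log is append-only, the context can always be rebuilt (or re-compacted differently) from it.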
Event Scheduling
The bot supports scheduled wake-ups via JSON files with three trigger types: immediate, one-shot (future timestamp), and periodic (cron-based via the croner library).
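The three trigger types can be sketched as a small discriminated union plus a dispatch check. The schema below is illustrative, not pi-mom's exact file format; in the real bot, periodic triggers are evaluated by croner (e.g. `new Cron(expr).nextRun()`), which is only indicated by a marker value here.

```typescript
// Assumed trigger schema for the JSON wake-up files.
type Trigger =
  | { type: "immediate" }
  | { type: "one-shot"; at: string }     // ISO timestamp in the future
  | { type: "periodic"; cron: string };  // cron expression, handled by croner

// Decide whether a trigger should fire at `now`. Periodic scheduling is
// delegated to the croner library in the real bot.
function due(t: Trigger, now: Date): boolean | "deferred-to-croner" {
  switch (t.type) {
    case "immediate": return true;
    case "one-shot": return new Date(t.at).getTime() <= now.getTime();
    case "periodic": return "deferred-to-croner";
  }
}

const now = new Date("2025-01-01T12:00:00Z");
console.log(due({ type: "one-shot", at: "2025-01-01T09:00:00Z" }, now)); // true: timestamp has passed
```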
Security
Docker sandboxing is recommended for tool execution — the bot can execute arbitrary code, so container isolation provides the safety boundary rather than permission popups.
pi-pods (GPU Pod Provisioning)
The pi-pods package is a CLI utility for provisioning and managing vLLM (a high-throughput LLM serving framework) instances on GPU infrastructure.
Purpose
When teams need to run open-weight models rather than API-hosted ones, they need GPU compute. pi-pods automates the full workflow, from configuring model and GPU requirements through pod provisioning and vLLM deployment to lifecycle management (start, stop, status, teardown).
Supported Providers
| Provider | Description |
|---|---|
| DataCrunch | GPU cloud instances |
| RunPod | Serverless and pod-based GPU compute |
| Vast.ai | GPU marketplace |
| Prime Intellect | AI compute platform |
| AWS EC2 | Amazon GPU instances |
| Custom | Any Ubuntu machine with NVIDIA GPUs |
Pre-configured Models
The package bundles model definitions for several open-weight model families:
| Family | Tool-Call Parser |
|---|---|
| Qwen | Hermes parser |
| GPT-OSS | Standard parser |
| GLM | glm4_moe parser |
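A model definition like those above ultimately maps to a `vllm serve` invocation. The `--tool-call-parser` and `--enable-auto-tool-choice` flags are real vLLM options; the `ModelDef` shape and the other details are assumptions about how pi-pods might assemble the command, not its actual config format.

```typescript
interface ModelDef { model: string; parser: "hermes" | "glm4_moe" | string; gpus: number; }

// Build the argv a pod would run to serve the model with tool calling enabled.
function vllmArgs(def: ModelDef): string[] {
  return [
    "vllm", "serve", def.model,
    "--tensor-parallel-size", String(def.gpus), // shard the model across GPUs
    "--enable-auto-tool-choice",                // required for tool calling
    "--tool-call-parser", def.parser,           // e.g. hermes for Qwen models
  ];
}

const args = vllmArgs({ model: "Qwen/Qwen2.5-72B-Instruct", parser: "hermes", gpus: 4 });
console.log(args.join(" "));
```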
Deployment Flow
- Configure model requirements and GPU specifications
- Provision a GPU pod via the target provider’s API
- SSH into the pod and deploy vLLM with the specified model
- Expose an OpenAI-compatible API endpoint for the deployed model
- Manage lifecycle (start, stop, status, teardown)
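Supporting six providers behind one CLI suggests a common provider abstraction. The interface below is a sketch of that idea, not pi-pods' actual API; real implementations would make asynchronous HTTP calls to each provider, which the in-memory stand-in keeps synchronous so the lifecycle is runnable.

```typescript
interface PodSpec { gpuType: string; gpuCount: number; image: string; }
interface Pod { id: string; provider: string; status: "running" | "stopped"; }

// One implementation per provider (DataCrunch, RunPod, Vast.ai, ...).
interface GpuProvider {
  name: string;
  provision(spec: PodSpec): Pod;
  status(id: string): Pod["status"];
  teardown(id: string): void;
}

class InMemoryProvider implements GpuProvider {
  name = "in-memory";
  private pods = new Map<string, Pod>();
  provision(_spec: PodSpec): Pod {
    const pod: Pod = { id: `pod-${this.pods.size + 1}`, provider: this.name, status: "running" };
    this.pods.set(pod.id, pod);
    return pod;
  }
  status(id: string): Pod["status"] { return this.pods.get(id)?.status ?? "stopped"; }
  teardown(id: string): void { this.pods.delete(id); }
}

const provider = new InMemoryProvider();
const pod = provider.provision({ gpuType: "H100", gpuCount: 2, image: "vllm/vllm-openai:latest" });
console.log(pod.id, provider.status(pod.id)); // pod-1 running
provider.teardown(pod.id);
```

The "Custom" provider row fits this shape naturally: any Ubuntu machine with NVIDIA GPUs just needs an implementation that SSHes in instead of calling a cloud API.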
Smart multi-GPU allocation supports distributing multiple models across available GPUs on a single machine.
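The multi-GPU allocation can be pictured as packing model requests onto a machine's GPU indices. The greedy first-fit sketch below is purely illustrative of the idea; pi-pods' actual algorithm is not documented here.

```typescript
// Assign contiguous GPU index ranges to each model request, in order.
function allocate(totalGpus: number, requests: { model: string; gpus: number }[]): Map<string, number[]> {
  const assignment = new Map<string, number[]>();
  let next = 0; // next free GPU index
  for (const req of requests) {
    if (next + req.gpus > totalGpus) throw new Error(`not enough GPUs for ${req.model}`);
    assignment.set(req.model, Array.from({ length: req.gpus }, (_, i) => next + i));
    next += req.gpus;
  }
  return assignment;
}

// Two models sharing one 8-GPU machine:
const plan = allocate(8, [{ model: "qwen-72b", gpus: 4 }, { model: "glm-4", gpus: 2 }]);
console.log(plan.get("glm-4")); // [ 4, 5 ]
```

Each model's index list would then feed the corresponding vLLM instance (e.g. via `CUDA_VISIBLE_DEVICES`).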
pi-web-ui (Web Components)
The pi-web-ui package provides reusable web components for building browser-based AI chat interfaces. Built on Lit.js (Google’s web component framework) with Tailwind CSS for styling, it offers a component library for rendering conversational AI interactions.
Technology Stack
| Technology | Role |
|---|---|
| Lit.js | Web component framework (Shadow DOM, reactive properties, templates) |
| Tailwind CSS | Utility-first styling |
| PDF.js | PDF document rendering and text extraction |
| XLSX | Spreadsheet parsing for document-aware conversations |
| Lucide | Icon library |
AI Backend Integration
The components integrate with multiple AI backends:
| Backend | Usage |
|---|---|
| @mariozechner/pi-ai | Core LLM streaming through the unified API |
| LM Studio | Local model serving via LM Studio’s API |
| Ollama | Local model serving via Ollama’s API |
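The table above can be sketched as a small backend-selection helper. The function shape and the pi-ai proxy route are assumptions, but the LM Studio and Ollama defaults are their documented ones: LM Studio serves an OpenAI-compatible API on port 1234, and Ollama serves its native chat API on port 11434.

```typescript
type Backend = "pi-ai" | "lmstudio" | "ollama";

// Resolve the chat endpoint a component would stream from.
function chatEndpoint(backend: Backend, baseUrl?: string): string {
  switch (backend) {
    case "pi-ai":
      return baseUrl ?? "/api/chat"; // hypothetical server-side pi-ai proxy route
    case "lmstudio":
      return (baseUrl ?? "http://localhost:1234") + "/v1/chat/completions"; // OpenAI-compatible
    case "ollama":
      return (baseUrl ?? "http://localhost:11434") + "/api/chat"; // Ollama's native API
  }
}

console.log(chatEndpoint("ollama")); // http://localhost:11434/api/chat
```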
Use Cases
The web UI components enable building:
- Browser-based coding assistants
- Document-aware chat interfaces (PDF, spreadsheet support)
- Self-hosted chat applications using local models
- Custom frontends for pi-ai backed services
Monorepo Coordination
All seven packages share lockstep versioning (currently v0.66.1) managed through the root package.json:
| Command | Scope |
|---|---|
| npm run release:patch | Features and fixes (0.66.1 → 0.66.2) |
| npm run release:minor | Breaking API changes (0.66.x → 0.67.0) |
| npm run release:major | Major version bumps |
The build system uses concurrently for parallel package builds, with dependency ordering ensured by npm workspace resolution. All packages target Node.js 20+ and use ES modules exclusively.
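A plausible shape for the root package.json implied by this setup is sketched below. The script bodies are illustrative assumptions (the repo's actual scripts may differ); `npm version`/`npm publish` with `--workspaces` and the concurrently CLI are real npm-ecosystem tools.

```json
{
  "version": "0.66.1",
  "type": "module",
  "engines": { "node": ">=20" },
  "workspaces": ["packages/*"],
  "scripts": {
    "build": "concurrently \"npm run build -w packages/pi-ai\" \"npm run build -w packages/pi-tui\"",
    "release:patch": "npm version patch --workspaces && npm run build && npm publish --workspaces"
  }
}
```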