← Back to LLM Wiki
LLM Wiki · Frontier · Closed Weights

Pi

Inflection AI's personal-intelligence agent and the Inflection model family that powers it. Conversational warmth and durable memory as the core product thesis.
Pi — short for "personal intelligence" — is Inflection AI's agent product, differentiated from Claude and GPT by a deliberate bet on emotional intelligence, supportive conversation, and a persistent sense of relationship with the user. After Microsoft's 2024 acquihire of Inflection's leadership, the public Pi product persists in maintenance mode while the underlying model line (Inflection-1 → 2 → 2.5 → 3) is available through Inflection's enterprise API and through licensed deployments.
Inflection AI Conversational Empathic Long Memory Voice-Forward

Quick Facts

Vendor
Inflection AI (Palo Alto)
Founded
2022 — Mustafa Suleyman, Karén Simonyan, Reid Hoffman
Current state
Post-March 2024 Microsoft acquihire; pivoted to enterprise. Model line continues as Inflection for Enterprise.
Product
Pi (consumer personal AI); Inflection for Enterprise (B2B API)
Model line
Inflection-1 (2023) · Inflection-2 · Inflection-2.5 (2024) · Inflection-3
License
Proprietary; hosted API only
Hosting
Inflection API; licensed deployments for enterprise customers
Context window
~32K tokens (varies by tier); persistent cross-session memory at product layer
Modalities
Text and voice (real-time conversational voice is a product differentiator)

Summary

Inflection was one of the most well-funded 2022-era frontier labs, raising $1.3B by mid-2023 on the thesis that personal, emotionally-intelligent conversation was an underserved product wedge distinct from OpenAI's assistant framing or Anthropic's safety-first positioning. The flagship product — Pi — launched in 2023 with a tuned-for-warmth voice, a persistent memory of the user's ongoing context, and a real-time voice mode that was materially ahead of competitors at the time.

In March 2024 Microsoft struck a deal that moved Mustafa Suleyman and most of Inflection's research staff to lead Microsoft AI, licensed Inflection's models for Microsoft's use, and left Inflection itself as a going concern refocused on enterprise B2B. The consumer Pi product still runs, but Inflection's forward roadmap is now "Inflection for Enterprise" — licensing the model and the conversational agent stack to enterprises that want a supportive, non-technical-feeling AI layer on top of existing workflows.

For infrastructure teams, Pi's interesting properties are (1) the baseline conversational style — warmer and more naturally supportive than stock Claude or GPT without prompt-engineering effort — and (2) the product-layer memory model, where the agent remembers facts about the user across sessions without a separate RAG pipeline. These matter for workloads where the LLM faces humans, not other services.

Model Lineup

Where Pi Fits

Pi is the default pick when the agent faces end users and the quality bar is feel before it's capability — customer support, wellness / coaching products, concierge assistants, consumer onboarding flows. Out of the box, Pi is less likely to produce terse, detached, or robotic output than any of the technical-first frontier models. For workloads where the human on the other side is paying per minute of attention, that baseline matters.

It is not the default pick for coding, technical analysis, long-document reasoning, or any agent loop where tool-use reliability is the bottleneck. For those, Claude, Qwen3-Coder, or Hermes remain better picks.

Tradeoffs

Deployment Notes

Within the Claw ecosystem, Pi is routed through the provider arbitrage layer as a specialist empathic tier — requests that match human-facing support, coaching, or onboarding patterns get routed to Pi; everything else goes to Claude, Qwen, or the usual providers. On a Claw Mac Mini, Pi is reachable via the Hermes agent gateway as a delegated tool: the Hermes gateway fronts all agent traffic (including traffic from Raspberry Pi terminals — see Pi × LLM), and when a turn is classified as emotion- or support-heavy, Hermes routes it to the Pi backend rather than answering directly.

This "Hermes front door, Pi specialist behind it" pattern gives deployments a single consistent gateway while still using the right model per turn. FrawdBot sits inline on the gateway regardless of which backend handles the request.

Related

References

  1. Inflection AI
  2. Pi — Personal AI
  3. Inflection-2.5 announcement
  4. The Agent Infrastructure Stack — Organized AI