Chat Agent

Use the AI copilot through natural-language conversations to drive any Praxiom workflow.

Overview

The Praxiom AI chat agent is a conversational interface to the entire product loop. Instead of navigating menus and clicking through multi-step workflows, you can simply ask the agent to synthesize research, generate recommendations, draft documents, search insights, generate execution tickets, or answer questions about your product space — all within a conversation.

How It Works

Conversations

Every chat interaction happens within a conversation, scoped to a workspace and user.

Create a conversation

POST /api/chat/conversations

{
  "workspace_id": "your-workspace-uuid",
  "title": "Q2 Onboarding Analysis"
}

If you omit the title, it defaults to "New Conversation".

Send messages

Messages are sent through the streaming endpoint. The agent processes your message, calls tools as needed (synthesis, recommendation generation, document drafting, web search), and streams the response back in real time via Server-Sent Events (SSE).

View history

GET /api/chat/conversations/{conversation_id}/messages?limit=100

Messages are returned in chronological order. Use the before parameter (ISO datetime) for cursor-based pagination to load older messages.

Conversation Management

Operation	Endpoint	Description
List	`GET /api/chat/conversations?workspace_id={id}`	Paginated, sorted by last message. Use `include_archived=true` to see archived ones.
Search	`GET /api/chat/conversations/search?workspace_id={id}&q=onboarding`	Full-text search across all message content and conversation titles
Rename	`PATCH /api/chat/conversations/{id}`	Update `title` or `is_archived`
Archive	`POST /api/chat/conversations/{id}/archive`	Soft-delete — hides from default list
Restore	`POST /api/chat/conversations/{id}/restore`	Unarchive a conversation
Delete	`DELETE /api/chat/conversations/{id}`	Hard delete — removes conversation and all messages

The search endpoint searches across message content, not just titles. Ask "What did I discuss about pricing?" to find relevant conversations quickly.

What the Agent Can Do

The chat agent has access to the full tool set that powers the structured UI workflows:

Synthesize research — "Analyze my latest 5 interview uploads"
Search insights — "What are the top pain points around checkout?"
Generate recommendations — "Generate recommendations from the high-severity insights"
Draft documents — "Write a PRD for the search improvement recommendation"
Generate tickets — "Create execution tickets from my onboarding PRD"
Search the web — "What are best practices for onboarding flows in SaaS?"
Google Drive — "Find the competitive analysis doc in our Drive"
Answer questions — "How many users mentioned mobile issues in our research?"
Workspace context — The agent automatically loads your product context, existing insights, and recommendation history to give grounded answers

Slash Commands

Quick shortcuts for common workflows:

Command	Action
`/synthesize`	Synthesize research sources
`/recommend`	Generate recommendations
`/pipeline`	Full pipeline (synthesize → recommend → draft PRD)
`/draft`	Draft a document
`/research`	Launch deep research with depth selector and cost estimate
`/insights`	View insights board
`/documents`	View documents
`/reset`	Clear conversation

Deep Research

The /research command opens a Deep Research confirmation dialog before executing. This lets you choose a research depth and see the estimated cost upfront.

Depth options:

Depth	Target Sources	Estimated Duration
Quick	~5 sources	~2 minutes
Standard	~8 sources	~4 minutes
Deep	~12 sources	~6 minutes

The dialog shows:

Estimated credits for the selected depth
Estimated duration in minutes
Your remaining credits with a warning if the run would consume a large portion
A blocked state if you lack sufficient credits (with a link to upgrade)

The cost estimate is fetched from GET /api/workspaces/{id}/research-jobs/estimate?depth={depth}.

File Attachments

The chat input includes a categorised attach menu for uploading files directly into the conversation. Click the paperclip icon to see file type categories.

Supported file types:

Documents — PDF, DOCX, TXT, Markdown
Spreadsheets — CSV, XLSX
Images — PNG, JPG, GIF, WebP
Audio — MP3, WAV, M4A
Video — MP4, MOV

Each attachment shows as a pill below the input with:

File type icon and truncated filename
Processing status: pending → processing → completed or failed
Image preview thumbnail for image and PDF attachments (via previewUrl)

You can attach up to 10 files per message. Pass attachment_ids in the streaming request body.

Vision (Image Understanding)

When you attach images or documents containing images (PDF, PPTX, DOCX), Praxiom automatically extracts visual content and sends it to Claude as vision content blocks. This means the agent can:

Read and interpret charts, diagrams, and screenshots
Extract text from images (OCR)
Analyse UI mockups and wireframes
Understand visual data in research documents

Image extraction runs through the processing pipeline — PDFs are converted to page images via PyMuPDF, PPTX/DOCX images are extracted from embedded media. The vision model used is Claude Haiku for cost efficiency.

Conversation Intelligence

Praxiom tracks rich metadata about each conversation to provide better context and enable multi-session workflows.

Purpose & Working State

The agent infers the conversation's purpose and tracks working state automatically:

Field	Description
`purpose`	What the conversation is about (inferred from the first message and ongoing context)
`working_state`	Current activity: `planning`, `researching`, `drafting`, or `executing`

Continuity Chains

Conversations can be resumed from a previous session using the resumed_from field. When you resume a conversation, the agent receives a briefing with:

Delta — New sources, insights, and documents added since the last message
Active — Total workspace state (source count, insight count, etc.)
Memory — Key events and decisions recalled from prior interactions

This means the agent picks up where you left off with full awareness of what changed while you were away.

Persona System

The agent adapts its personality and expertise based on the inferred task:

Persona	When Active	Behaviour
`research_analyst`	During synthesis and insight exploration	Rigorous evidence standards, cross-referencing, statistical language
`strategy_advisor`	During recommendation generation	Business impact framing, opportunity sizing, competitive context
`writer`	During document drafting	Clean prose, structured sections, citation formatting
`pm_agent`	During execution and ticket generation	Scoping, acceptance criteria, priority frameworks
`explorer`	General Q&A and open-ended chat	Broad knowledge, creative connections, follow-up suggestions

The persona is selected automatically based on intent classification and can shift mid-conversation as the task evolves.

Models & Effort Levels

Choose the right model and effort level for your task:

Model	Credit Weight	Extended Thinking	Best For
Haiku	1x	No	Quick lookups, simple queries
Sonnet	1x	Yes	Balanced analysis, most workflows
Opus	2x	Yes	Complex synthesis, deep research

Effort levels control how deeply the agent reasons:

Effort	Credit Weight	Description
`fast` (low)	1x	Quick responses, minimal tool use
`thorough` (high)	2x	Standard depth, multiple tool calls
`deep` (max)	3x	Extended thinking enabled, maximum depth

Total credit cost = model weight × effort weight. For example, Opus + deep = 2 × 3 = 6 credits per operation.

Set these via the model and reasoning_mode parameters in the streaming request, or use the UI toggle.

Rich Output

The agent produces structured output blocks when appropriate:

Block Type	Description
Metrics	KPI cards with charts and trend data
Comparison	Side-by-side comparison tables
Steps	Numbered step-by-step procedures
Callout	Info, warning, or success highlight boxes
Mermaid	Diagrams rendered from Mermaid syntax
Proposal	Feature proposal cards
Context Header	Contextual information banners
Follow-ups	Suggested next actions

These are rendered as rich UI components in the frontend, not just plain Markdown.

Verification & Trust

After the agent responds, you can verify the quality of its output:

Citation coverage — Are claims backed by research sources?
Cross-source validation — Do multiple sources confirm the same finding?
Severity analysis — Are severity ratings justified by the evidence?
Density scoring — How evidence-dense is the response?

Verification results appear as a trust badge on each message, with expandable details showing per-check results and citation similarity scores.

Skill Activation

When the agent detects that an installed skill is relevant to your query, it activates the skill automatically. You'll see a skill_activated event in the stream. If a relevant skill is not installed, the agent suggests it via a skill_suggestion banner.

Context-Aware Intro

When you open a new conversation, the agent provides a workspace-aware introduction via GET /api/workspaces/{workspace_id}/agent-intro. The intro adapts to your workspace state:

Empty workspace — Guides you to upload your first research source
Has sources, no insights — Offers to analyse your sources
Has insights, no recommendations — Offers to generate recommendations
Rich workspace — Open-ended, ready to help with any task

Key Concepts

Conversation Ownership

Conversations are private to the user who created them. Other workspace members cannot view your conversations. The API enforces ownership checks on all message and conversation endpoints.

Streaming

The agent streams responses in real time. The frontend connects to the SSE stream endpoint to display the agent's thinking process, tool calls, and final response as they happen. This provides immediate feedback even when the agent takes 30-60 seconds for complex operations.

Message Types

Each message has a role:

user — Your input
assistant — The agent's response
tool_result — Output from a tool call (e.g., insight IDs saved, document content generated)

Messages also carry additional metadata:

inferred_intent — The classified intent of the message
thinking_blocks — Agent reasoning chains (when extended thinking is enabled)
decision_context — Configuration snapshot for replay and auditing
execution_metadata — Tool step details with timing

Multi-Agent Missions

For complex queries that require multiple distinct steps, the agent automatically classifies the query and proposes a Mission — a coordinated multi-agent execution plan.

Complexity Classification

Every query is classified before execution:

Level	Behaviour
Simple	Executes directly as a single agent
Medium	Runs in one extended multi-turn session
Complex	Decomposes into a DAG of subtasks and proposes a Mission

Classification uses a fast Haiku call running in parallel with intent inference — zero perceived latency.

Mission Proposal

When a complex query is detected, a Mission Proposal card appears in the conversation before any agent executes. You can:

Launch the multi-agent mission
Run as single agent to skip decomposition
Edit the proposal

Real-Time Controls During Execution

While a mission runs, you can:

Redirect a running agent with new instructions (prompt update takes effect at the next checkpoint)
Stop an individual agent
Spawn additional agents mid-mission

See the Missions guide and Command Center for full details.

When Praxiom detects data quality issues in your workspace (via entropy scanning), a dismissible Entropy Banner appears at the top of the chat panel. The banner reads:

"X high-priority issue(s) detected in your workspace data — may affect research quality"

Amber background when high-severity issues are present
Muted background for lower-severity issues only
Links to the Dashboard Quality Overview for details
Dismiss with the X button (stays dismissed for the session)

This helps you stay aware of data quality problems that could degrade synthesis and recommendation accuracy.

Context Management

For long conversations, the agent automatically compacts older messages to prevent context degradation. Compaction is lazy — it only triggers when the conversation exceeds the workflow's token threshold:

Workflow	Compaction Threshold
Chat	10,000 tokens
Recommendation	15,000 tokens
Drafting	15,000 tokens
Synthesis	20,000 tokens
Full Pipeline	25,000 tokens

When compaction triggers, older messages are summarised using a workflow-specific prompt that preserves artifact IDs, decisions, and open questions. The most recent 15 messages are always kept verbatim.

What's Next

Missions & Command Center — multi-agent parallel execution
Dashboard Quality — understand your workspace health metrics
Overnight Research Cycles — autonomous nightly experiments
Knowledge Graph — how Praxiom builds a semantic graph from your research
Streaming SSE — all stream event types including mission events

Was this helpful?

PreviousResearch Upload NextIntroduction