Chat Agent
Use the AI copilot through natural-language conversations to drive any Praxiom workflow.
Overview
The Praxiom AI chat agent is a conversational interface to the entire product loop. Instead of navigating menus and clicking through multi-step workflows, you can simply ask the agent to synthesize research, generate recommendations, draft documents, search insights, generate execution tickets, or answer questions about your product space — all within a conversation.
How It Works
Conversations
Every chat interaction happens within a conversation, scoped to a workspace and user.
Create a conversation
POST /api/chat/conversations
{
"workspace_id": "your-workspace-uuid",
"title": "Q2 Onboarding Analysis"
}
If you omit the title, it defaults to "New Conversation".
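As a rough illustration, the request above could be assembled from Python's standard library as follows. The host name and the bearer-token header are assumptions for the sake of the example and are not specified on this page; the sketch only builds the request and does not send it.

```python
import json
import urllib.request

BASE_URL = "https://praxiom.example.com"  # assumption: placeholder host


def build_create_conversation_request(workspace_id, title=None, token="YOUR_TOKEN"):
    """Build (but do not send) a POST /api/chat/conversations request."""
    payload = {"workspace_id": workspace_id}
    if title is not None:
        # Leaving "title" out lets the server apply its "New Conversation" default.
        payload["title"] = title
    return urllib.request.Request(
        url=f"{BASE_URL}/api/chat/conversations",
        data=json.dumps(payload).encode("utf-8"),
        method="POST",
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {token}",  # assumption: bearer auth
        },
    )


req = build_create_conversation_request("your-workspace-uuid", "Q2 Onboarding Analysis")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would perform the call once a real host and credentials are substituted.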
Send messages
Messages are sent through the streaming endpoint. The agent processes your message, calls tools as needed (synthesis, recommendation generation, document drafting, web search), and streams the response back in real time via Server-Sent Events (SSE).
View history
GET /api/chat/conversations/{conversation_id}/messages?limit=100
Messages are returned in chronological order. Use the before parameter (ISO datetime) for cursor-based pagination to load older messages.
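A minimal sketch of the cursor logic, assuming each returned message carries an ISO timestamp in a field named `created_at` (that field name is hypothetical). Because a page is in chronological order, its first message is the oldest, and its timestamp becomes the next `before` value.

```python
from urllib.parse import urlencode


def history_path(conversation_id, limit=100, before=None):
    """Build the history path; `before` is an ISO datetime string."""
    params = {"limit": limit}
    if before is not None:
        params["before"] = before
    return f"/api/chat/conversations/{conversation_id}/messages?" + urlencode(params)


def next_cursor(page):
    """Return the `before` value for the next-older page, or None when done."""
    # "created_at" is an assumed field name for the message timestamp.
    return page[0]["created_at"] if page else None


print(history_path("abc123"))
print(history_path("abc123", limit=50, before="2025-01-15T10:30:00+00:00"))
```

A client would keep requesting pages with the cursor from `next_cursor` until an empty page comes back.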
Conversation Management
| Operation | Endpoint | Description |
|---|---|---|
| List | GET /api/chat/conversations?workspace_id={id} | Paginated, sorted by last message. Use include_archived=true to see archived ones. |
| Search | GET /api/chat/conversations/search?workspace_id={id}&q=onboarding | Full-text search across all message content and conversation titles |
| Rename | PATCH /api/chat/conversations/{id} | Update title or is_archived |
| Archive | POST /api/chat/conversations/{id}/archive | Soft-delete — hides from default list |
| Restore | POST /api/chat/conversations/{id}/restore | Unarchive a conversation |
| Delete | DELETE /api/chat/conversations/{id} | Hard delete — removes conversation and all messages |
The search endpoint searches across message content, not just titles. Ask "What did I discuss about pricing?" to find relevant conversations quickly.
What the Agent Can Do
The chat agent has access to the full tool set that powers the structured UI workflows:
- Synthesize research — "Analyze my latest 5 interview uploads"
- Search insights — "What are the top pain points around checkout?"
- Generate recommendations — "Generate recommendations from the high-severity insights"
- Draft documents — "Write a PRD for the search improvement recommendation"
- Generate tickets — "Create execution tickets from my onboarding PRD"
- Search the web — "What are best practices for onboarding flows in SaaS?"
- Google Drive — "Find the competitive analysis doc in our Drive"
- Answer questions — "How many users mentioned mobile issues in our research?"
- Workspace context — The agent automatically loads your product context, existing insights, and recommendation history to give grounded answers
Slash Commands
Quick shortcuts for common workflows:
| Command | Action |
|---|---|
| /synthesize | Synthesize research sources |
| /recommend | Generate recommendations |
| /pipeline | Full pipeline (synthesize → recommend → draft PRD) |
| /draft | Draft a document |
| /research | Launch deep research with depth selector and cost estimate |
| /insights | View insights board |
| /documents | View documents |
| /reset | Clear conversation |
Deep Research
The /research command opens a Deep Research confirmation dialog before executing. This lets you choose a research depth and see the estimated cost upfront.
Depth options:
| Depth | Target Sources | Estimated Duration |
|---|---|---|
| Quick | ~5 sources | ~2 minutes |
| Standard | ~8 sources | ~4 minutes |
| Deep | ~12 sources | ~6 minutes |
The dialog shows:
- Estimated credits for the selected depth
- Estimated duration in minutes
- Your remaining credits with a warning if the run would consume a large portion
- A blocked state if you lack sufficient credits (with a link to upgrade)
The cost estimate is fetched from GET /api/workspaces/{id}/research-jobs/estimate?depth={depth}.
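The dialog's decision logic might look roughly like the sketch below. Two things are assumptions: the lowercase `depth` query values, and the 50% cutoff used for the "large portion" warning, which this page does not quantify.

```python
# Depth -> (target sources, estimated minutes), taken from the table above.
DEPTHS = {"quick": (5, 2), "standard": (8, 4), "deep": (12, 6)}


def estimate_path(workspace_id, depth):
    """Build the estimate path; lowercase depth values are an assumption."""
    if depth not in DEPTHS:
        raise ValueError(f"unknown depth: {depth!r}")
    return f"/api/workspaces/{workspace_id}/research-jobs/estimate?depth={depth}"


def dialog_state(estimated_credits, remaining_credits, warn_fraction=0.5):
    """Classify the dialog as 'blocked', 'warning', or 'ok'.

    warn_fraction is a hypothetical cutoff for "a large portion".
    """
    if estimated_credits > remaining_credits:
        return "blocked"
    if estimated_credits >= warn_fraction * remaining_credits:
        return "warning"
    return "ok"


print(estimate_path("ws-1", "standard"))
print(dialog_state(estimated_credits=4, remaining_credits=100))
```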
File Attachments
The chat input includes a categorised attach menu for uploading files directly into the conversation. Click the paperclip icon to see file type categories.
Supported file types:
- Documents — PDF, DOCX, TXT, Markdown
- Spreadsheets — CSV, XLSX
- Images — PNG, JPG, GIF, WebP
- Audio — MP3, WAV, M4A
- Video — MP4, MOV
Each attachment shows as a pill below the input with:
- File type icon and truncated filename
- Processing status: pending → processing → completed or failed
- Image preview thumbnail for image and PDF attachments (via previewUrl)
You can attach up to 10 files per message. Pass attachment_ids in the streaming request body.
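A client could enforce the 10-file limit before sending, along these lines. Only `attachment_ids` comes from this page; the `message` field name is an assumption about the streaming request body.

```python
MAX_ATTACHMENTS = 10  # limit stated above


def build_stream_body(message, attachment_ids=()):
    """Assemble a streaming request body, rejecting more than 10 attachments."""
    ids = list(attachment_ids)
    if len(ids) > MAX_ATTACHMENTS:
        raise ValueError(
            f"{len(ids)} attachments given; at most {MAX_ATTACHMENTS} per message"
        )
    body = {"message": message}  # assumption: hypothetical field name
    if ids:
        body["attachment_ids"] = ids
    return body


print(build_stream_body("Summarize these interviews", ["att-1", "att-2"]))
```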
Vision (Image Understanding)
When you attach images or documents containing images (PDF, PPTX, DOCX), Praxiom automatically extracts visual content and sends it to Claude as vision content blocks. This means the agent can:
- Read and interpret charts, diagrams, and screenshots
- Extract text from images (OCR)
- Analyse UI mockups and wireframes
- Understand visual data in research documents
Image extraction runs through the processing pipeline — PDFs are converted to page images via PyMuPDF, PPTX/DOCX images are extracted from embedded media. The vision model used is Claude Haiku for cost efficiency.
Conversation Intelligence
Praxiom tracks rich metadata about each conversation to provide better context and enable multi-session workflows.
Purpose & Working State
The agent infers the conversation's purpose and tracks working state automatically:
| Field | Description |
|---|---|
| purpose | What the conversation is about (inferred from the first message and ongoing context) |
| working_state | Current activity: planning, researching, drafting, or executing |
Continuity Chains
Conversations can be resumed from a previous session using the resumed_from field. When you resume a conversation, the agent receives a briefing with:
- Delta — New sources, insights, and documents added since the last message
- Active — Total workspace state (source count, insight count, etc.)
- Memory — Key events and decisions recalled from prior interactions
This means the agent picks up where you left off with full awareness of what changed while you were away.
Persona System
The agent adapts its personality and expertise based on the inferred task:
| Persona | When Active | Behaviour |
|---|---|---|
| research_analyst | During synthesis and insight exploration | Rigorous evidence standards, cross-referencing, statistical language |
| strategy_advisor | During recommendation generation | Business impact framing, opportunity sizing, competitive context |
| writer | During document drafting | Clean prose, structured sections, citation formatting |
| pm_agent | During execution and ticket generation | Scoping, acceptance criteria, priority frameworks |
| explorer | General Q&A and open-ended chat | Broad knowledge, creative connections, follow-up suggestions |
The persona is selected automatically based on intent classification and can shift mid-conversation as the task evolves.
Models & Effort Levels
Choose the right model and effort level for your task:
| Model | Credit Weight | Extended Thinking | Best For |
|---|---|---|---|
| Haiku | 1x | No | Quick lookups, simple queries |
| Sonnet | 1x | Yes | Balanced analysis, most workflows |
| Opus | 2x | Yes | Complex synthesis, deep research |
Effort levels control how deeply the agent reasons:
| Effort | Credit Weight | Description |
|---|---|---|
| fast (low) | 1x | Quick responses, minimal tool use |
| thorough (high) | 2x | Standard depth, multiple tool calls |
| deep (max) | 3x | Extended thinking enabled, maximum depth |
Total credit cost = model weight × effort weight. For example, Opus + deep = 2 × 3 = 6 credits per operation.
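The formula can be checked with a small lookup, using the weights from the two tables above. The lowercase keys are illustrative and are not necessarily the exact parameter values the API accepts.

```python
MODEL_WEIGHT = {"haiku": 1, "sonnet": 1, "opus": 2}
EFFORT_WEIGHT = {"fast": 1, "thorough": 2, "deep": 3}


def credit_cost(model, effort):
    """Credits per operation = model weight x effort weight."""
    return MODEL_WEIGHT[model] * EFFORT_WEIGHT[effort]


print(credit_cost("opus", "deep"))        # 6
print(credit_cost("sonnet", "thorough"))  # 2
```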
Set these via the model and reasoning_mode parameters in the streaming request, or use the UI toggle.
Rich Output
The agent produces structured output blocks when appropriate:
| Block Type | Description |
|---|---|
| Metrics | KPI cards with charts and trend data |
| Comparison | Side-by-side comparison tables |
| Steps | Numbered step-by-step procedures |
| Callout | Info, warning, or success highlight boxes |
| Mermaid | Diagrams rendered from Mermaid syntax |
| Proposal | Feature proposal cards |
| Context Header | Contextual information banners |
| Follow-ups | Suggested next actions |
These are rendered as rich UI components in the frontend, not just plain Markdown.
Verification & Trust
After the agent responds, you can verify the quality of its output:
- Citation coverage — Are claims backed by research sources?
- Cross-source validation — Do multiple sources confirm the same finding?
- Severity analysis — Are severity ratings justified by the evidence?
- Density scoring — How evidence-dense is the response?
Verification results appear as a trust badge on each message, with expandable details showing per-check results and citation similarity scores.
Skill Activation
When the agent detects that an installed skill is relevant to your query, it activates the skill automatically. You'll see a skill_activated event in the stream. If a relevant skill is not installed, the agent suggests it via a skill_suggestion banner.
Context-Aware Intro
When you open a new conversation, the agent provides a workspace-aware introduction via GET /api/workspaces/{workspace_id}/agent-intro. The intro adapts to your workspace state:
- Empty workspace — Guides you to upload your first research source
- Has sources, no insights — Offers to analyse your sources
- Has insights, no recommendations — Offers to generate recommendations
- Rich workspace — Open-ended, ready to help with any task
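The four cases above amount to a short decision ladder. The sketch below is only an illustration of that ordering; the function name, its inputs, and the returned labels are hypothetical, and the real endpoint may weigh other signals.

```python
def intro_state(source_count, insight_count, recommendation_count):
    """Pick which introduction applies, checking the emptiest case first."""
    if source_count == 0:
        return "empty_workspace"
    if insight_count == 0:
        return "has_sources_no_insights"
    if recommendation_count == 0:
        return "has_insights_no_recommendations"
    return "rich_workspace"


print(intro_state(source_count=3, insight_count=0, recommendation_count=0))
```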
Key Concepts
Conversation Ownership
Conversations are private to the user who created them. Other workspace members cannot view your conversations. The API enforces ownership checks on all message and conversation endpoints.
Streaming
The agent streams responses in real time. The frontend connects to the SSE stream endpoint to display the agent's thinking process, tool calls, and final response as they happen. This provides immediate feedback even when the agent takes 30-60 seconds for complex operations.
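For clients that do not use a browser `EventSource`, the SSE wire format is simple enough to parse by hand. The sketch below handles `event:` and `data:` fields and comment lines according to the general SSE format. The `skill_activated` event name appears elsewhere on this page; the payload shown is invented for illustration.

```python
def parse_sse(lines):
    """Yield (event, data) tuples from an iterable of decoded SSE lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; dispatch only if it has data.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        else:
            field, _, value = line.partition(":")
            if value.startswith(" "):
                value = value[1:]  # a single leading space is stripped
            if field == "event":
                event = value
            elif field == "data":
                data.append(value)


sample = [
    "event: skill_activated\n",
    'data: {"skill": "example"}\n',  # hypothetical payload
    "\n",
    ": keep-alive\n",
    "data: partial text\n",
    "\n",
]
for name, payload in parse_sse(sample):
    print(name, payload)
```

Events without an explicit `event:` field fall back to the default type `message`.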
Message Types
Each message has a role:
- user — Your input
- assistant — The agent's response
- tool_result — Output from a tool call (e.g., insight IDs saved, document content generated)
Messages also carry additional metadata:
- inferred_intent — The classified intent of the message
- thinking_blocks — Agent reasoning chains (when extended thinking is enabled)
- decision_context — Configuration snapshot for replay and auditing
- execution_metadata — Tool step details with timing
Multi-Agent Missions
For complex queries that require multiple distinct steps, the agent automatically classifies the query and proposes a Mission — a coordinated multi-agent execution plan.
Complexity Classification
Every query is classified before execution:
| Level | Behaviour |
|---|---|
| Simple | Executes directly as a single agent |
| Medium | Runs in one extended multi-turn session |
| Complex | Decomposes into a DAG of subtasks and proposes a Mission |
Classification uses a fast Haiku call that runs in parallel with intent inference, so it adds no perceived latency.
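The parallel pattern can be sketched with `asyncio.gather`. Both coroutines below are stand-ins that sleep instead of calling a model, their heuristics are toys, and the route labels are hypothetical names for the three behaviours in the table.

```python
import asyncio

# Hypothetical labels for the three behaviours in the table above.
ROUTES = {
    "simple": "single_agent",
    "medium": "extended_session",
    "complex": "mission_proposal",
}


async def classify_complexity(query):
    await asyncio.sleep(0.05)  # stand-in for the fast classification call
    return "complex" if " and " in query else "simple"  # toy heuristic


async def infer_intent(query):
    await asyncio.sleep(0.05)  # stand-in for intent inference
    return "synthesis" if "synthesize" in query.lower() else "chat"


async def plan(query):
    # Both calls run concurrently, so total wait is roughly the slower of the two.
    complexity, intent = await asyncio.gather(
        classify_complexity(query), infer_intent(query)
    )
    return {"intent": intent, "complexity": complexity, "route": ROUTES[complexity]}


print(asyncio.run(plan("Synthesize the interviews and draft a PRD")))
```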
Mission Proposal
When a complex query is detected, a Mission Proposal card appears in the conversation before any agent executes. You can:
- Launch the multi-agent mission
- Run as single agent to skip decomposition
- Edit the proposal
Real-Time Controls During Execution
While a mission runs, you can:
- Redirect a running agent with new instructions (prompt update takes effect at the next checkpoint)
- Stop an individual agent
- Spawn additional agents mid-mission
See the Missions guide and Command Center for full details.
Workspace Health Banner
When Praxiom detects data quality issues in your workspace (via entropy scanning), a dismissible Entropy Banner appears at the top of the chat panel. The banner reads:
"X high-priority issue(s) detected in your workspace data — may affect research quality"
- Amber background when high-severity issues are present
- Muted background for lower-severity issues only
- Links to the Dashboard Quality Overview for details
- Dismiss with the X button (stays dismissed for the session)
This helps you stay aware of data quality problems that could degrade synthesis and recommendation accuracy.
Context Management
For long conversations, the agent automatically compacts older messages to prevent context degradation. Compaction is lazy — it only triggers when the conversation exceeds the workflow's token threshold:
| Workflow | Compaction Threshold |
|---|---|
| Chat | 10,000 tokens |
| Recommendation | 15,000 tokens |
| Drafting | 15,000 tokens |
| Synthesis | 20,000 tokens |
| Full Pipeline | 25,000 tokens |
When compaction triggers, older messages are summarised using a workflow-specific prompt that preserves artifact IDs, decisions, and open questions. The most recent 15 messages are always kept verbatim.
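A minimal sketch of the lazy compaction rule, using the thresholds from the table. The token counter and the summarizer are passed in as placeholders, since the page does not describe how either is implemented; the workflow keys are illustrative.

```python
THRESHOLDS = {
    "chat": 10_000,
    "recommendation": 15_000,
    "drafting": 15_000,
    "synthesis": 20_000,
    "full_pipeline": 25_000,
}
KEEP_VERBATIM = 15  # most recent messages are never summarised


def compact(messages, workflow, count_tokens, summarize):
    """Return messages unchanged, or one summary followed by the last 15."""
    total = sum(count_tokens(m) for m in messages)
    if total <= THRESHOLDS[workflow] or len(messages) <= KEEP_VERBATIM:
        return messages  # lazy: nothing happens below the threshold
    older, recent = messages[:-KEEP_VERBATIM], messages[-KEEP_VERBATIM:]
    return [summarize(older)] + recent


history = [f"message {i}" for i in range(20)]
result = compact(
    history,
    "chat",
    count_tokens=lambda m: 1_000,                       # placeholder counter
    summarize=lambda ms: f"summary of {len(ms)} messages",  # placeholder summarizer
)
print(len(result), result[0])
```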
What's Next
- Missions & Command Center — multi-agent parallel execution
- Dashboard Quality — understand your workspace health metrics
- Overnight Research Cycles — autonomous nightly experiments
- Knowledge Graph — how Praxiom builds a semantic graph from your research
- Streaming SSE — all stream event types including mission events