Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Channel Integrations

OpenCrabs connects to multiple messaging platforms simultaneously. All channels share the TUI session by default, with per-user sessions for non-owners.

Setting Up Channels

Channels are configured through the onboarding wizard, not by editing TOML files manually.

Running the Wizard

  • First launch — the wizard runs automatically
  • Re-run — type /onboard in chat, or /onboard:channels to jump straight to the channels step
  • Quick jump/onboard:channels opens the channel picker and returns to chat when done

The channel picker is a keyboard-driven TUI screen:

KeyAction
/ or j / kMove focus between channels
SpaceToggle the focused channel on/off
Enter on an enabled channelOpen that channel’s setup screen
Enter on ContinueSkip remaining setup and advance
TabSame as Continue — advance to the next wizard step
EscGo back to the previous step

Channel Setup Screens

When you press Enter on an enabled channel, a dedicated setup screen opens with the fields needed for that platform (bot token, channel ID, allowed users, etc.). Each field:

  • Auto-detects existing values from config.toml / keys.toml (shown as masked •••••••• for secrets, plain text for IDs)
  • Tab moves to the next field
  • Enter on the last field (or the Test Connection button) saves and returns to the channel list
  • BackTab moves to the previous field

The Five Channels

#ChannelSetup FieldsTest
0TelegramBot Token, Owner User ID, Respond ToSend test message
1DiscordBot Token, Channel ID, Allowed Users, Respond ToSend test message
2WhatsAppQR Code scan, Phone AllowlistConnection status
3SlackBot Token, App Token, Channel ID, Allowed Users, Respond ToSend test message
4TrelloAPI Key, API Token, Board ID, Allowed UsersBoard access check

After enabling and configuring your channels, the wizard saves everything to config.toml and keys.toml automatically. You can always re-run /onboard:channels to modify settings.

Supported Channels

ChannelProtocolImages InVoice InImage Gen OutSetup
TelegramLong pollingVision pipelineSTTNative photoBot token
DiscordWebSocketVision pipelineSTTFile attachmentBot token
SlackSocket ModeVision pipelineSTTFile uploadBot + App token
WhatsAppQR pairingVision pipelineSTTNative imageQR code
TrelloREST APICard attachmentsCard attachmentAPI key + token

Cross-Channel Session Resolution (v0.3.29)

All messaging channels now share a stable [chat:<id>] suffix pattern for reliable session lookup. Previously only Telegram had this; Discord, Slack, and WhatsApp used exact-title matching which broke when the agent auto-renamed sessions (creating duplicates on every message).

The shared channels::session_resolve module provides:

  • Suffix-first lookup — fast path using [chat:discord-dm-<user_id>], [chat:slack-<channel_id>], [chat:wa-<phone>] etc.
  • Legacy forward-migration — pre-suffix rows are migrated to the suffix format on first lookup
  • /sessions binding — explicit chat→session binding on switch so user choices win over suffix lookup

Follow-Up Cancel (v0.3.30)

Sending a message while the agent is mid-run now acts as ESC x2 (cancel current run) across all channels. The cancelled partial content is preserved, and the new message starts a fresh agent turn.

ZIP Attachments (v0.3.30)

ZIP file attachments from users are extracted and processed inline:

  • Text files are inlined into the conversation
  • Images get vision markers for multimodal processing
  • PDFs get text extraction
  • Capped at 50 files / 10 MB per ZIP entry

Common Features

All messaging channels support:

  • Shared session with TUI (owner) or per-user sessions (non-owners)
  • Slash commands/help, /models, /new, /sessions, custom commands
  • Inline buttons — Provider picker, model picker, session switcher (Telegram, Discord, Slack)
  • User allowlists — Restrict access by user ID, chat ID, or phone number
  • respond_to filterall, dm_only, or mention (respond only when @mentioned)

File & Media Support

ChannelImages (in)Text files (in)Documents (in)Audio (in)Image gen (out)
TelegramVision pipelineExtracted inlinePDF noteSTTNative photo
WhatsAppVision pipelineExtracted inlinePDF noteSTTNative image
DiscordVision pipelineExtracted inlinePDF noteSTTFile attachment
SlackVision pipelineExtracted inlinePDF noteSTTFile upload
TrelloCard attachments → visionExtracted inlineCard attachment
TUIPaste path → visionPaste path → inlineSTT[IMG: name] display

Images are passed to the active model’s vision pipeline if it supports multimodal input, or routed to the analyze_image tool (Google Gemini vision) otherwise. Text files are extracted as UTF-8 and included inline up to 8,000 characters.

Proactive Channel Tools

The agent can send messages and take actions proactively:

ToolActions
discord_send17 actions: send, reply, react, edit, delete, pin, create_thread, send_embed, etc.
slack_send17 actions: send, reply, react, edit, delete, pin, set_topic, send_blocks, send_file (TTS voice via OGG/Opus)
trello_send22 actions: create_card, move_card, add_comment, add_checklist, search, etc.

Channel Voice Parity

All four messaging channels (Telegram, Discord, WhatsApp, Slack) now share a single code path via crate::channels::voice::{transcribe, synthesize}. Bot replies are recorded in the channel_messages table for conversation context — previously only user messages were stored.