Assistant

Set up and configure an AI assistant (or chat) cog for your server with one of OpenAI's ChatGPT language models.

Features include configurable prompt injection, dynamic embeddings, custom function calling, and more!

- [p]assistant: base command for setting up the assistant
- [p]chat: talk with the assistant
- [p]convostats: view a user's token usage/conversation message count for the channel
- [p]clearconvo: reset your conversation with the assistant in the channel

/draw (Slash Command)

Generate an image with AI

Usage: /draw <prompt> [size] [quality] [style] [model]
prompt: (Required) What would you like to draw?
size: (Optional) The size of the image to generate
quality: (Optional) The quality of the image to generate
style: (Optional) The style of the image to generate
model: (Optional) The model to use for image generation
Checks: Server Only

/tldr (Slash Command)

Summarize whats been happening in a channel

Usage: /tldr [timeframe] [question] [channel] [member] [private]
timeframe: (Optional) The number of messages to scan
question: (Optional) Ask for specific info about the conversation
channel: (Optional) The channel to summarize messages from
member: (Optional) Target a specific member
private: (Optional) Only you can see the response
Checks: Server Only

[p]chathelp

Get help using assistant

Usage: [p]chathelp

[p]chat

Chat with [botname]!

Conversations are Per user Per channel, meaning a conversation you have in one channel will be kept in memory separately from another conversation in a separate channel

Optional Arguments
--outputfile <filename> - uploads a file with the reply instead (no spaces)
--extract - extracts code blocks from the reply
--last - resends the last message of the conversation

Example
[p]chat write a python script that prints "Hello World!"

Including --outputfile hello.py will output a file containing the whole response.
Including --outputfile hello.py --extract will output a file containing just the code blocks and send the rest as text.
Including --extract will send the code separately from the reply
Usage: [p]chat <question>
Aliases: ask, escribir, razgovor, discuter, plaudern, 채팅, charlar, baterpapo, and sohbet
Cooldown: 1 per 6.0 seconds
Checks: guild_only

[p]convostats

Check the token and message count of yourself or another user's conversation for this channel

Conversations are Per user Per channel, meaning a conversation you have in one channel will be kept in memory separately from another conversation in a separate channel

Conversations are only stored in memory until the bot restarts or the cog reloads

Usage: [p]convostats [user]
Checks: guild_only

[p]convoclear

Reset your conversation with the bot

This will clear all message history between you and the bot for this channel

Usage: [p]convoclear
Aliases: clearconvo
Checks: guild_only

[p]convopop

Pop the last message from your conversation

Usage: [p]convopop
Checks: bot_has_guild_permissions and guild_only

[p]convocopy

Copy the conversation to another channel, thread, or forum

Usage: [p]convocopy <channel>
Checks: bot_has_guild_permissions and guild_only

[p]convoprompt

Set a system prompt for this conversation!

This allows customization of assistant behavior on a per channel basis!

Check out This Guide for prompting help.

Usage: [p]convoprompt [prompt]
Checks: guild_only

[p]convoshow

View the current transcript of a conversation

This is mainly here for moderation purposes

Usage: [p]convoshow [user=None] [channel=operator.attrgetter('channel')]
Restricted to: GUILD_OWNER
Aliases: showconvo
Checks: guild_only

[p]importconvo

Import a conversation from a file

Usage: [p]importconvo
Restricted to: GUILD_OWNER
Checks: guild_only

[p]query

Fetch related embeddings according to the current topn setting along with their scores

You can use this to fine-tune the minimum relatedness for your assistant

Usage: [p]query <query>

[p]assistant

Setup the assistant

You will need an api key from OpenAI to use ChatGPT and their other models.

Usage: [p]assistant
Restricted to: ADMIN
Aliases: assist
Checks: guild_only

[p]assistant autoanswermodel

Set the model used for auto-answer

Usage: [p]assistant autoanswermodel <model>

[p]assistant reasoning

Switch reasoning effort for o1 model between low, medium, and high

Usage: [p]assistant reasoning

[p]assistant questionmark

Toggle whether questions need to end with ?

Usage: [p]assistant questionmark

[p]assistant functioncalls

Toggle whether GPT can call functions

Usage: [p]assistant functioncalls
Aliases: usefunctions

[p]assistant maxretention

Set the max messages for a conversation

Conversation retention is cached and gets reset when the bot restarts or the cog reloads.

Regardless of this number, the initial prompt and internal system message are always included,
this only applies to any conversation between the user and bot after that.

Set to 0 to disable conversation retention

Note: actual message count may exceed the max retention during an API call

Usage: [p]assistant maxretention <max_retention>

[p]assistant temperature

Set the temperature for the model (0.0 - 2.0)

Defaults is 1

Closer to 0 is more concise and accurate while closer to 2 is more imaginative

Usage: [p]assistant temperature <temperature>

[p]assistant regexfailblock

Toggle whether failed regex blocks the assistant's reply

Some regexes can cause catastrophically backtracking
The bot can safely handle if this happens and will either continue on, or block the response.

Usage: [p]assistant regexfailblock

[p]assistant prompt

Set the initial prompt for GPT to use

Check out This Guide for prompting help.

Placeholders

botname: [botname]
timestamp: discord timestamp
day: Mon-Sun
date: MM-DD-YYYY
time: HH:MM AM/PM
timetz: HH:MM AM/PM Timezone
members: server member count
username: user's name
displayname: user's display name
roles: the names of the user's roles
rolementions: the mentions of the user's roles
avatar: the user's avatar url
owner: the owner of the server
servercreated: the create date/time of the server
server: the name of the server
py: python version
dpy: discord.py version
red: red version
cogs: list of currently loaded cogs
channelname: name of the channel the conversation is taking place in
channelmention: current channel mention
topic: topic of current channel (if not forum or thread)
banktype: whether the bank is global or not
currency: currency name
bank: bank name
balance: the user's current balance
Usage: [p]assistant prompt [prompt]
Aliases: pre

[p]assistant channel

Set the main auto-response channel for the assistant

Usage: [p]assistant channel [channel=None]

[p]assistant triggerignore

Ignore a channel or category for trigger phrases

Usage: [p]assistant triggerignore <channel>

[p]assistant mention

Toggle whether to ping the user on replies

Usage: [p]assistant mention

[p]assistant presence

Set the presence penalty for the model (-2.0 to 2.0)

Defaults is 0

Positive values penalize new tokens based on whether they appear in the text so far, increasing the model's likelihood to talk about new topics.

Usage: [p]assistant presence <presence_penalty>

[p]assistant resetconversations

Wipe saved conversations for the assistant in this server

This will delete any and all saved conversations for the assistant.

Usage: [p]assistant resetconversations <yes_or_no>

[p]assistant exportcsv

Export embeddings to a .csv file

Note: csv exports do not include the embedding values

Usage: [p]assistant exportcsv

[p]assistant restorecog

Restore the cog from a backup

Usage: [p]assistant restorecog
Restricted to: BOT_OWNER

[p]assistant triggerlist

View configured trigger phrases

Usage: [p]assistant triggerlist

[p]assistant relatedness

Set the minimum relatedness an embedding must be to include with the prompt

Relatedness threshold between 0 and 1 to include in embeddings during chat

Questions are converted to embeddings and compared against stored embeddings to pull the most relevant, this is the score that is derived from that comparison

Hint: The closer to 1 you get, the more deterministic and accurate the results may be, just don't be too strict or there wont be any results.

Usage: [p]assistant relatedness <mimimum_relatedness>

[p]assistant resetglobalembeddings

Wipe saved embeddings for all servers

This will delete any and all saved embedding training data for the assistant.

Usage: [p]assistant resetglobalembeddings <yes_or_no>
Restricted to: BOT_OWNER

[p]assistant view

View current settings

To send in current channel, use [p]assistant view false

Usage: [p]assistant view [private=False]
Aliases: v

[p]assistant exportexcel

Export embeddings to an .xlsx file

Note: csv exports do not include the embedding values

Usage: [p]assistant exportexcel

[p]assistant blacklist

Add/Remove items from the blacklist

channel_role_member can be a member, role, channel, or category channel

Usage: [p]assistant blacklist <channel_role_member>

[p]assistant channelpromptshow

Show the channel specific system prompt

Usage: [p]assistant channelpromptshow [channel=operator.attrgetter('channel')]

[p]assistant listen

Toggle this channel as an auto-response channel

Usage: [p]assistant listen

[p]assistant listentobots

Toggle whether the assistant listens to other bots

NOT RECOMMENDED FOR PUBLIC BOTS!

Usage: [p]assistant listentobots
Restricted to: BOT_OWNER
Aliases: botlisten and ignorebots

[p]assistant maxresponsetokens

Set the max response tokens the model can respond with

Set to 0 for response tokens to be dynamic

Usage: [p]assistant maxresponsetokens <max_tokens>

[p]assistant maxtokens

Set maximum tokens a convo can consume

Set to 0 for dynamic token usage

Tips

Max tokens are a soft cap, sometimes messages can be a little over
If you set max tokens too high the cog will auto-adjust to 100 less than the models natural cap
Ideally set max to 500 less than that models maximum, to allow adequate responses

Using more than the model can handle will raise exceptions.

Usage: [p]assistant maxtokens <max_tokens>

[p]assistant importcsv

Import embeddings to use with the assistant

Args:
overwrite (bool): overwrite embeddings with existing entry names

This will read excel files too

Usage: [p]assistant importcsv <overwrite>

[p]assistant verbosity

Switch verbosity level for gpt-5 model between low, medium, and high

This setting is exclusive to the gpt-5 model and affects how detailed the model's responses are.

Usage: [p]assistant verbosity

[p]assistant usage

View the token usage stats for this server

Usage: [p]assistant usage

[p]assistant trigger

Toggle the trigger word feature on or off

Usage: [p]assistant trigger

[p]assistant resetembeddings

Wipe saved embeddings for the assistant

This will delete any and all saved embedding training data for the assistant.

Usage: [p]assistant resetembeddings <yes_or_no>

[p]assistant autoanswerthreshold

Set the auto-answer threshold for the bot

Usage: [p]assistant autoanswerthreshold <threshold>

[p]assistant openaikey

Set your OpenAI key

Usage: [p]assistant openaikey
Aliases: key

[p]assistant embedmodel

Set the OpenAI Embedding model to use

Usage: [p]assistant embedmodel [model=None]

[p]assistant tutor

Add/Remove items from the tutor list.

If using OpenAI's function calling and talking to a tutor, the AI is able to create its own embeddings to remember later

role_or_member can be a member or role

Usage: [p]assistant tutor <role_or_member>
Aliases: tutors

[p]assistant braveapikey

Enables use of the search_web_brave function

Get your API key Here

Usage: [p]assistant braveapikey
Restricted to: BOT_OWNER
Aliases: brave

[p]assistant timezone

Set the timezone used for prompt placeholders

Usage: [p]assistant timezone <timezone>

[p]assistant model

Set the OpenAI model to use

Usage: [p]assistant model [model=None]

[p]assistant importexcel

Import embeddings from an .xlsx file

Args:
overwrite (bool): overwrite embeddings with existing entry names

Usage: [p]assistant importexcel <overwrite>

[p]assistant refreshembeds

Refresh embedding weights

This command can be used when changing the embedding model

Embeddings that were created using OpenAI cannot be use with the self-hosted model and vice versa

Usage: [p]assistant refreshembeds
Aliases: refreshembeddings, syncembeds, and syncembeddings

[p]assistant exportjson

Export embeddings to a json file

Usage: [p]assistant exportjson

[p]assistant planner

Add/Remove items from the planner list, or view current planners.

Users/roles in the planner list can use the think_and_plan tool for complex task breakdown.

If the planner list is empty, everyone can use the planning tool.
If the planner list has entries, only those users/roles can use it.

role_or_member can be a member or role. Omit to view the current list.

Usage: [p]assistant planner [role_or_member]
Aliases: planners

[p]assistant triggerphrase

Add or remove a trigger phrase (supports regex)

The bot will respond to messages containing this phrase.
Phrases are case-insensitive regex patterns.

Examples

hello - matches messages containing "hello"
\bhelp\b - matches the word "help" (word boundary)
bad.*word - matches "bad" followed by any characters then "word"
Usage: [p]assistant triggerphrase <phrase>

[p]assistant collab

Toggle collaborative conversations

Multiple people speaking in a channel will be treated as a single conversation.

Usage: [p]assistant collab

[p]assistant questionmode

Toggle question mode

If question mode is on, embeddings will only be sourced during the first message of a conversation and messages that end in ?

Usage: [p]assistant questionmode

[p]assistant regexblacklist

Remove certain words/phrases in the bot's responses

Usage: [p]assistant regexblacklist <regex>

[p]assistant wipecog

Wipe all settings and data for entire cog

Usage: [p]assistant wipecog <confirm>
Restricted to: BOT_OWNER

[p]assistant resolution

Switch vision resolution between high and low for relevant GPT-4-Turbo models

Usage: [p]assistant resolution

[p]assistant resetglobalconversations

Wipe saved conversations for the assistant in all servers

This will delete any and all saved conversations for the assistant.

Usage: [p]assistant resetglobalconversations <yes_or_no>
Restricted to: BOT_OWNER

[p]assistant channelprompt

Set a channel specific system prompt

Usage: [p]assistant channelprompt [channel=operator.attrgetter('channel')] [system_prompt]

[p]assistant toggledraw

Toggle the draw command on or off

Usage: [p]assistant toggledraw
Aliases: drawtoggle

[p]assistant autoanswerignore

Ignore a channel for auto-answer

Usage: [p]assistant autoanswerignore <channel>

[p]assistant autoanswer

Toggle the auto-answer feature on or off

Usage: [p]assistant autoanswer

[p]assistant resetusage

Reset the token usage stats for this server

Usage: [p]assistant resetusage

[p]assistant toggle

Toggle the assistant on or off

Usage: [p]assistant toggle

[p]assistant system

Set the system prompt for GPT to use

Check out This Guide for prompting help.

Placeholders

botname: [botname]
timestamp: discord timestamp
day: Mon-Sun
date: MM-DD-YYYY
time: HH:MM AM/PM
timetz: HH:MM AM/PM Timezone
members: server member count
username: user's name
displayname: user's display name
roles: the names of the user's roles
rolementions: the mentions of the user's roles
avatar: the user's avatar url
owner: the owner of the server
servercreated: the create date/time of the server
server: the name of the server
py: python version
dpy: discord.py version
red: red version
cogs: list of currently loaded cogs
channelname: name of the channel the conversation is taking place in
channelmention: current channel mention
topic: topic of current channel (if not forum or thread)
banktype: whether the bank is global or not
currency: currency name
bank: bank name
balance: the user's current balance
Usage: [p]assistant system [system_prompt]
Aliases: sys

[p]assistant seed

Make the model more deterministic by setting a seed for the model.

Default is None

If specified, the system will make a best effort to sample deterministically, such that repeated requests with the same seed and parameters should return the same result.

Usage: [p]assistant seed [seed=None]

[p]assistant mentionrespond

Toggle whether the bot responds to mentions in any channel

Usage: [p]assistant mentionrespond

[p]assistant embedmethod

Cycle between embedding methods

Dynamic embeddings mean that the embeddings pulled are dynamically appended to the initial prompt for each individual question.
When each time the user asks a question, the previous embedding is replaced with embeddings pulled from the current question, this reduces token usage significantly

Static embeddings are applied in front of each user message and get stored with the conversation instead of being replaced with each question.

Hybrid embeddings are a combination, with the first embedding being stored in the conversation and the rest being dynamic, this saves a bit on token usage.

User embeddings are injected into the beginning of the prompt as the first user message.

Dynamic embeddings are helpful for Q&A, but not so much for chat when you need to retain the context pulled from the embeddings. The hybrid method is a good middle ground

Usage: [p]assistant embedmethod

[p]assistant persist

Toggle persistent conversations

Usage: [p]assistant persist
Restricted to: BOT_OWNER

[p]assistant endpointoverride

Override the OpenAI endpoint

Notes

Using a custom endpoint is not supported!
Using an endpoing override will negate model settings like temperature and custom functions
Usage: [p]assistant endpointoverride [endpoint=None]
Restricted to: BOT_OWNER

[p]assistant maxtime

Set the conversation expiration time

Regardless of this number, the initial prompt and internal system message are always included,
this only applies to any conversation between the user and bot after that.

Set to 0 to store conversations indefinitely or until the bot restarts or cog is reloaded

Usage: [p]assistant maxtime <retention_seconds>

[p]assistant maxrecursion

Set the maximum function calls allowed in a row

This sets how many times the model can call functions in a row

Only the following models can call functions at the moment

gpt-4o-mini
gpt-4o
ect..
Usage: [p]assistant maxrecursion <recursion>

[p]assistant frequency

Set the frequency penalty for the model (-2.0 to 2.0)

Defaults is 0

Positive values penalize new tokens based on their existing frequency in the text so far, decreasing the model's likelihood to repeat the same line verbatim.

Usage: [p]assistant frequency <frequency_penalty>

[p]assistant override

Override settings for specific roles

NOTE
If a user has two roles with override settings, override associated with the higher role will be used.

Usage: [p]assistant override

[p]assistant override model

Assign a role to use a model

Specify same role and model to remove the override

Usage: [p]assistant override model <model> <role>

[p]assistant override maxretention

Assign a max message retention override to a role

Specify same role and retention amount to remove the override

Usage: [p]assistant override maxretention <max_retention> <role>

[p]assistant override maxresponsetokens

Assign a max response token override to a role

Set to 0 for response tokens to be dynamic

Specify same role and token count to remove the override

Usage: [p]assistant override maxresponsetokens <max_tokens> <role>

[p]assistant override maxtokens

Assign a max token override to a role

Specify same role and token count to remove the override

Usage: [p]assistant override maxtokens <max_tokens> <role>

[p]assistant override maxtime

Assign a max retention time override to a role

Specify same role and time to remove the override

Usage: [p]assistant override maxtime <retention_seconds> <role>

[p]assistant minlength

set min character length for questions

Set to 0 to respond to anything

Usage: [p]assistant minlength <min_question_length>

[p]assistant topn

Set the embedding inclusion amout

Top N is the amount of embeddings to include with the initial prompt

Usage: [p]assistant topn <top_n>

[p]assistant backupcog

Take a backup of the cog

This does not backup conversation data
Usage: [p]assistant backupcog
Restricted to: BOT_OWNER

[p]assistant importjson

Import embeddings to use with the assistant

Args:
overwrite (bool): overwrite embeddings with existing entry names

Usage: [p]assistant importjson <overwrite>

[p]assistant triggerprompt

Set the prompt to use when a trigger phrase is matched

This prompt will be appended to the initial prompt when the bot responds to a triggered message.

Placeholders

botname: [botname]
timestamp: discord timestamp
day: Mon-Sun
date: MM-DD-YYYY
time: HH:MM AM/PM
timetz: HH:MM AM/PM Timezone
members: server member count
username: user's name
displayname: user's display name
roles: the names of the user's roles
rolementions: the mentions of the user's roles
avatar: the user's avatar url
owner: the owner of the server
servercreated: the create date/time of the server
server: the name of the server
py: python version
dpy: discord.py version
red: red version
cogs: list of currently loaded cogs
channelname: name of the channel the conversation is taking place in
channelmention: current channel mention
topic: topic of current channel (if not forum or thread)
banktype: whether the bank is global or not
currency: currency name
bank: bank name
balance: the user's current balance
Usage: [p]assistant triggerprompt [prompt]

[p]assistant sysoverride

Toggle allowing per-conversation system prompt overriding

Usage: [p]assistant sysoverride

[p]embeddings (Hybrid Command)

Manage embeddings for training

Embeddings are used to optimize training of the assistant and minimize token usage.

By using this the bot can store vast amounts of contextual information without going over the token limit.

Note
You can enter a search query with this command to bring up the menu and go directly to that embedding selection.

Usage: [p]embeddings [query]
Slash Usage: /embeddings [query]
Restricted to: ADMIN
Aliases: emenu
Checks: guild_only

[p]customfunctions (Hybrid Command)

Add custom function calls for Assistant to use

READ

The following objects are passed by default as keyword arguments.

user: the user currently chatting with the bot (discord.Member)
channel: channel the user is chatting in (TextChannel|Thread|ForumChannel)
guild: current guild (discord.Guild)
bot: the bot object (Red)
conf: the config model for Assistant (GuildSettings)
All functions MUST include *args, **kwargs in the params and return a string

# Can be either sync or async
async def func(*args, **kwargs) -> str:

Only bot owner can manage this, guild owners can see descriptions but not code

Usage: [p]customfunctions [function_name=None]
Slash Usage: /customfunctions [function_name=None]
Aliases: customfunction and customfunc
Checks: guild_only

[p]listfunctions (Hybrid Command)

List all available functions and their enabled/disabled status

This provides a quick overview of all custom functions and 3rd party
registered functions without having to navigate through the full menu.

Usage: [p]listfunctions
Slash Usage: /listfunctions
Restricted to: ADMIN
Aliases: listfuncs and funclist
Checks: guild_only

[p]togglefunctions (Hybrid Command)

Enable or disable multiple functions at once

Arguments

enable: True to enable, False to disable. Omit to toggle current state.
functions: Comma-separated list of function names, or "all" to affect all functions

Examples

[p]togglefunctions get_time, get_weather - Toggle these functions
[p]togglefunctions True all - Enable all functions
[p]togglefunctions False get_time, get_weather - Disable specific functions
Usage: [p]togglefunctions <enable> <functions>
Slash Usage: /togglefunctions <enable> <functions>
Restricted to: ADMIN
Aliases: togglefuncs
Checks: guild_only

Uh oh!

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

Assistant

/draw (Slash Command)

/tldr (Slash Command)

[p]chathelp

[p]chat

[p]convostats

[p]convoclear

[p]convopop

[p]convocopy

[p]convoprompt

[p]convoshow

[p]importconvo

[p]query

[p]assistant

[p]assistant autoanswermodel

[p]assistant reasoning

[p]assistant questionmark

[p]assistant functioncalls

[p]assistant maxretention

[p]assistant temperature

[p]assistant regexfailblock

[p]assistant prompt

[p]assistant channel

[p]assistant triggerignore

[p]assistant mention

[p]assistant presence

[p]assistant resetconversations

[p]assistant exportcsv

[p]assistant restorecog

[p]assistant triggerlist

[p]assistant relatedness

[p]assistant resetglobalembeddings

[p]assistant view

[p]assistant exportexcel

[p]assistant blacklist

[p]assistant channelpromptshow

[p]assistant listen

[p]assistant listentobots

[p]assistant maxresponsetokens

[p]assistant maxtokens

[p]assistant importcsv

[p]assistant verbosity

[p]assistant usage

[p]assistant trigger

[p]assistant resetembeddings

[p]assistant autoanswerthreshold

[p]assistant openaikey

[p]assistant embedmodel

[p]assistant tutor

[p]assistant braveapikey

[p]assistant timezone

[p]assistant model

[p]assistant importexcel

[p]assistant refreshembeds

[p]assistant exportjson

[p]assistant planner

[p]assistant triggerphrase

[p]assistant collab

[p]assistant questionmode

[p]assistant regexblacklist

[p]assistant wipecog

[p]assistant resolution

[p]assistant resetglobalconversations

[p]assistant channelprompt

[p]assistant toggledraw

[p]assistant autoanswerignore

[p]assistant autoanswer

[p]assistant resetusage

[p]assistant toggle

[p]assistant system

[p]assistant seed

[p]assistant mentionrespond

[p]assistant embedmethod