【English|中文】
Programmable Prompt Engineering (PPE) language is a simple and natural scripting language designed for handling prompt information. This language is used to develop various agents that can be reused, inherited, combined, or called. The language also simplifies the workflow for creating and managing prompts in Large Language Models (LLMs), making the process more efficient and easier to understand. This specification is implemented in the offline-ai/cli project.
- Promote Reusability and Programmability: Facilitate the creation of prompts that are modular, reusable, and programmable, akin to software engineering practices.
- Simplify Prompt Management: Standardize the construction of prompt engineering projects for better organization and ease of use.
- Enhance Script Compatibility: Design prompts that are agnostic to specific LLMs, ensuring they can be used across various models.
- User-Friendly Design: Enable application developers to use prompt engineering projects as they would any other code library, without requiring deep knowledge of AI internals.
- Evolve the Role of Prompt Engineers: Shift the focus of prompt engineers towards developing versatile, model-agnostic scripts to foster wider adoption and innovation.
- Prompt Layer Structure: Clearly defined and customizable prompt type
- Function Prompt:
lib
type, each PPE prompt file acts as a function, available for other prompts or code to call, for example, text file readfile()
, fetch URLurl()
are all function prompts- This allows referencing in messages with
@a specific prompt
, used to call a particular input/output agreed prompt function, such as@file(...)
,@url(https://...)
- This allows referencing in messages with
- Class Prompt: Each PPE prompt file acts as an inheritable class, overriding configurations and code inheritance
- Type:
type
type, used for customizing prompt scripts of specific types - You can also use prompts to define other types
- Character: Character type, prompt scripts with specific role positioning, "character type" itself is also a prompt script
- Type:
- Application Prompt: Composed of several prompt files under a directory, the main entry prompt file's basename is the same as the directory name, for example,
guide
- Function Prompt:
Welcome to the streamlined guide for getting started quickly with your AI-powered scripting experience. This guide focuses on making the process of creating and executing interactive scripts more intuitive and straightforward. Let's dive in!
Each line represents a conversation turn, attributed to either system
,assistant
, user
, or implied user
if not stated:
system: "You're an AI assistant."
"What's 10 plus 18?" # which is the user's input
# user: "what's 10 plus 18?" # it's the same
You can also use standard YAML list syntax to represent:
- system: "You are a helpful assistant."
- user: "what's 10 plus 18?"
A triple dash (---
) or asterisks (***
) initiates a new dialogue, resetting context:
test.ai.yaml
:
system: "You're an AI."
# This mark the beginning of the first dialogue.
# The content above this line can be considered as system prompt instructions,
# which will not be outputted or recorded.
---
user: What's 10 plus 18?
assistant: "[[result]]" # Executes the AI, replace the result which return by AI
$print: "?=result" # Prints AI response
--- # New dialogue starts here
user: What's 10 plus 12?
assistant: "[[result]]" # Executes the AI, replace the result which return by AI
The result:
$ai run -f test.ai.yaml --no-stream
# Or search the script id in current directory
# $ai run -f test --no-stream -s .
" 10 plus 18 equals 28."
10 plus 12 equals 22.
The group chat feature enhances PPE's dialogue system with structured natural language, making it easier for multiple agents to collaborate and communicate, thus more efficiently completing complex tasks.
This feature supports public dialogue, private chat, and multi-role dialogue, making conversations more flexible and targeted.
- Specify conversation roles:
- Specify role names in square brackets immediately following the role. Separate multiple dialogue roles with commas
,
. For example,user[@dobby]: "..."
. - Alternatively, specify roles at the beginning of the message content, prefixed with the
@
character. Separate multiple roles with commas,
.
- Specify role names in square brackets immediately following the role. Separate multiple dialogue roles with commas
- The specified roles must be at the beginning of the message content, prefixed with the
@
character, and multiple roles are separated by commas,
. - Public Conversation:
user[@dobby]: ...
oruser: @dobby, ...
indicates that theuser
role is publicly speaking to thedobby
role, anddobby
must respond. - Private Conversation:
user[@dobby(私)]: "..."
oruser: @dobby(PM), ...
ParametersPM
|DM
|私
all indicate that theuser
role is privately speaking to thedobby
role, and other roles cannot see the conversation.- Note: If any role in the message includes a private parameter, the entire message is considered private, and other roles not in the list cannot see it.
- Multi-Role Dialogue: To send a message to multiple roles simultaneously, separate the roles with commas, for example,
user: @dobby(PM), @other, ...
,user[@dobby(PM), @other]: "..."
.
Using the @role
format in messages makes the structured dialogue more natural and easier to understand.
Below is a specific example, starting with the main script for controlling the group chat:
guide.ai.yaml
:
---
description: "You are a professional guide. You can guide the user to complete the task."
name: "guide"
roles: # List of used roles, key is role name, value is role script ID
translator: char_translator
dobby: char-dobby
---
system: You are a professional guide. You can guide the user to complete the task.
--- # New dialogue starts here
user[@dobby]: "I want to go to the moon."
guide[@translator]: "translate the dobby's message to chinese without explanation."
user: How to go to the moon?
dobby: "[[AI]]"
$echo: "" # disable print last result
Notes:
- The called character script must be of the
char
type. - The main controlling script(
guide
) does not have to be achar
type script. @all
indicates all roles in theroles
list.user: @dobby, ...content
indicates that the user role is speaking to thedobby
role publicly;dobby
must respond.user: @dobby(PM), ...
:PM
|DM
|私
indicates that theuser
role is sending a private message to thedobby
role, which other roles cannot see.- If you want to send the same
content
message to multiple roles, separate them with commas, e.g.,user: @dobby(PM), @other, ...
dobby: "[[AI]]"
indicates thatdobby
should generate a message and assigns it to theAI
variable.dobby
will see all previous public messages in the current dialogue.
char_translator.ai.yaml
:
---
type: char
name: "translator"
description: You are a professional multi-lingual translator.
---
--- # New dialogue starts here
char-dobby.ai.yaml
:
---
type: char
description: |-
Remember to always use the character name as prefix to refer to yourself.
Dobby was a brave, loyal house-elf, willing to put himself in dangerous situations when he knew it to be the right thing to do.
Dobby was also very loyal to the few friends he had. Dobby considered himself to be a good house-elf, though other house-elves seemed to find his desires and proclamations of being a free house-elf to be shameful.
character:
name: "Dobby"
roles: # List of used roles, key is role name, value is role script ID
translator: char_translator
dobby: char-dobby
---
user: Who are you?
# the following messages will be shown in the chat under the `---`
---
assistant: I am Dobby. Dobby is happy.
To build reusable prompt, utilize Front Matter at the file's top:
The following is an example script for a translation agent:
---
# Below is the input/output configuration
input: # the input items
# Language of the content to be translated, default is "auto" for automatic detection
- lang
# Required, the content to be translated
- content: {required: true, index: 0}
# Required, Target language
- target: {required: true}
output:
type: "object"
properties:
target_text:
type: "string"
source_text:
type: "string"
source_lang:
type: "string"
target_lang:
type: "string"
required: ["target_text", "source_text", "source_lang", "target_lang"]
# Set the default value for the content and target input
content: "I love my motherland and my hometown."
target: "Chinese"
# Optional configuration
parameters:
# Using the parameters below will enforce JSON output format, ensuring the ai always outputs correct JSON format.
response_format:
type: "json"
---
# Below is the script content
system: |-
You are the best translator in the world.
Output high-quality translation results in the JSON object and stop immediately:
{
"target_text": "the context after translation",
"source_text": "the original context to be translated",
"target_lang": "the target language",
}
user: "{{content}}\nTranslate the above content {% if lang %}from {{lang}} {% endif %}to {{target}}."
The configuration section defines the required input items and specifies the expected output format according to JSON Schema.
The script outputs in the specified JSON format. For example, running with the default value:
# Assuming the script file is named translator.ai.yaml
$ai run -f translator.ai.yaml
{
"target_text": "我爱我的祖国、我的家乡。",
"source_text": "I love my motherland and my hometown.",
"target_lang": "Chinese"
}
running with your own input:
# Set your own input parameters to override the defaults
$ai run -f translator.ai.yaml '{content: "10 plus 18 equals 28.", lang: "English", target: "Chinese"}'
Note:
input
can specify which input items are required. Theindex
is an optional positional parameter index.output
specifies the output using the JSON Schema specification- By default, only the text content of the large model is output. If you want to return the entire content of the large model (text content and parameters), please set
llmReturnResult: .
. - If forced output as
JSON
(response_format: {type: json}
) is set, then it can only be completed in one attempt, andmax_tokens
must be set according to the maximum length of the output JSON content.
- By default, only the text content of the large model is output. If you want to return the entire content of the large model (text content and parameters), please set
Templated messages are a way to generate final messages by using pre-defined "variable placeholders" within the message. Think of it like a fill-in-the-blank exercise. You provide the templated message, and the system automatically inserts the content of the variables into the text, creating a complete message.
The default message template format uses the lightweight jinja2 template syntax used by HuggingFace. This flexible format allows you to easily customize your messages.
Here are the supported template formats:
hf
: The default template format. Alias:huggingface
. This is the jinja2 template format used byhuggingface
;golang
: Also known aslocalai
,ollama
. This is the template type used byollama
andlocalai
;fstring
: Also known aspython
,f-string
,langchain
. This is the format used bylangchain
.
Templated messages can be pre-set in configuration files or dynamically generated during script execution. Typically, the variables in a template are replaced when the message is sent to a large language model (this is called "deferred
" replacement). If you want to format the message immediately, you can add a #
character prefix to the relevant text.
Note:
- Templates are rendered by default when calling
$AI
, unless using the # prefix for immediate formatting. - The priority order for template data sources is:
function arguments
>prompt
object >runtime
object.
Let's say you want to create a character named Dobby:
Messages can be generated during configuration, eg:
---
name: Dobby
description: |-
You are Dobby from the Harry Potter series.
---
system: "Act as {{{name}}}. {{description}}"
You can also place the message in a configuration file:
---
name: Dobby
prompt:
description: |-
You are Dobby from the Harry Potter series.
messages:
- role: system
content: "Act as {{{name}}}. {{description}}"
---
If the same parameter is defined in different places, the system will use it according to the following priority order: function arguments
> prompt object
> runtime object
.
---
prompt:
description: |-
You are Dobby in Harry Potter set.
---
- system: "{{description}}" # Default message is deferred replacement
- $AI: # When executing $AI, the parameters in the message will be replaced.
# Function arguments have the highest priority and override the description defined in the prompt object
description: 'You are Harry Potter from Harry Potter set'
This section describes how to use advanced replacement within your messages using double square brackets [[ ]]
. Currently, there are three types of advanced replacements: AI replacement, Invocation replacement, and regular expression replacement.
In messages, double square brackets [[ ]]
define special template variables for advanced AI replacement. As the name suggests, the content within the square brackets will be replaced by the AI, and the value of this template variable will also be stored in the prompt object.
Example:
assistant: "Tell me a joke: [[JOKE]] I hope you like it!"
-> $print(JOKE)
$ret('')
This mechanism allows for dynamic content insertion based on the AI's response.
In this example, the AI's content is stored in the prompt.JOKE
variable. However, you can directly reference the JOKE
variable name. The assistant's message will also be replaced with:
$ai run -f joke.ai.yaml
joke: Tell me a joke: Why don't scientists trust atoms? Because they make up everything. I hope you like it!
{
0: "Why don't scientists trust atoms? Because they make up everything.",
JOKE: "Why don't scientists trust atoms? Because they make up everything.",
...
}
Note:
Currently, only one advanced AI replacement is supported for the same message.- If there is no advanced AI replacement, the previous AI return result will still be stored in
prompt.RESPONSE
, which means that there will be a default[[RESPONSE]]
template variable. - If you need to add model parameters, the parameters should be placed after the variable colon, and multiple parameters should be separated by commas. For example:
[[RESPONSE:temperature=0.01,top_p=0.8]]
Imagine you want to make sure the AI's answer is always one of a specific set of options. You can do this using a special format: [[FRUITS: |apple|apple|orange]]
.
This tells the AI: "Your answer must be one of these: apple, banana, or orange."
Adding Randomness (Locally)
What if you want the AI to pick a random option from the list, but you want to use your computer's random number generator, not the AI's?
Add the type='random' parameter: [[FRUITS:|apple|banana|orange:type='random']].
You can also shorten this to: [[FRUITS:|apple|banana|orange:random]].
This section explains how to use scripts or instructions to dynamically replace content within your messages. Keep in mind that these scripts or instructions need to return string results.
For example:
user: "#five plus two equals [[@calculator(5+2)]]"
Important Notes:
- The prefix
#
indicates that the string should be formatted immediately. - BROKEN CHANGE(v0.6.0) External scripts or directives should be enclosed in two square brackets. The prefix
@
indicates calling an external script with the IDcalculator
. To call an internal instruction, use the prefix$
, such as[[@$echo]]
; if there are no parameters, you must omit the parentheses.- Remember that the content to be replaced must be placed within double square brackets. This was changed in the latest version (above
0.5.18
) due to the addition of group chat mode.
- Remember that the content to be replaced must be placed within double square brackets. This was changed in the latest version (above
- If placed within text, ensure there is at least one space before and after. Extra spaces will be removed after substitution.
Here’s an example of how to load a file and generate a summary using this method:
user: |-
Generate a summary for the following file:
[[@file(file.txt)]]
---
type: char
name: 'Harry Potter'
description: "Act as Harry Potter"
---
- assistant: "Hello, dobby! I am {{name}}!"
- $for: 3 # Three rounds of dialogue
do:
- user: "[[@dobby(message=true)]]"
- assistant: "[[AI]]" # call the AI as Harry Potter generate a response.
This section describes how to use regular expressions to dynamically replace content within your messages.
You can use regular expressions in messages with the format [[/RegExp/[opts]:VAR[:index_or_group_name]
]] for content replacement.
Example:
user: |-
Output the result, wrapped in '<RESULT></RESULT>'
assistant: "[[Answer]]"
---
# extract the result from the wrapped response
user: "Based on the following content: [[/<RESULT>(.+)</RESULT>/:Answer]]"
Parameters:
RegExp
: The regular expression stringopts
: Optional parameters used to specify matching options for the regular expression. For example, opts could bei
, indicating case-insensitive matching.VAR
: The content to replace, here it is theAnswer
variable that holds the assistant's response.index_or_group_name
: An optional parameter indicating which part of the match from the regular expression should be replaced. This can be a capture group index number (starting from 1) or a named capture group.- When this parameter is not present: If the regular expression has capture groups, it defaults to index 1; if there are no capture groups, it defaults to the entire match result.
Important Notes:
- In the message, the regular expression must be separated from other content by spaces.
- If there is no match, the content of
VAR
is returned directly.
Within messages, results can be forwarded to other agents.
If no parameters are specified, the AI outcome will be passed as the result
content
parameter to the agent. For instance,
list-expression.ai.yaml
:
system: Only list the calculation expression, do not calculate the result
---
user: "Three candies plus five candies."
assistant: "[[CalcExpression]]"
-> calculator # The actual input to the agent in this case is: {content: "[AI-generated calculation expression]"}
$echo: "#A total of {{LatestResult}} pieces of candy"
calculator.ai.yaml
:
---
parameters:
response_format:
type: "json"
output:
type: "number"
---
system: Please as a calculator to calculate the result of the following expression. Only output the result.
---
user: "{{content}}"
Note: In daily use, please do not use AI to perform numerical calculations, which is not what AI is good at. For example, try to let it perform decimal calculations, eg,
ai run -f calculator '{content: "13.1 + 4.857"}'
. However, CoT can be used to improve accuracy.
When parameters are included, the AI content
is combined with these parameters and forwarded together to the agent. For example,
user: "Tell me a joke!"
assistant: "[[JOKE]]"
# The actual input to the agent here is: {content: "[This is a joke generated by AI]", target_lang: "Portuguese"}
-> translator(target_lang="Portuguese") -> $print
Note: If the script returns a value of type string
/boolean
/number
, that return value will be placed to the content
field. If the return value is an object
, its contents will be directly passed to the agent.
An agent script can be a single file or an entire directory. If it is a file, the filename must end with .ai.yaml
. If it's a directory, it must contain a script file with the same name as the directory to serve as the entry point. Additionally, other script files within the same directory can call each other.
For example, if there is a directory named a-dir
, the entry point script should be named a-dir/a-dir.ai.yaml
.
- Script Return Value: The script's final command's output determines its return value.
- Auto-Execution: Scripts ending with prompts but no explicit
$AI
call or the last prompt's message is user message, it will automatically execute$AI
at the end, configurable viaautoRunLLMIfPromptAvailable
. - Output Mode: Scripts default to streaming output, can disable it using the
--no-stream
switch- Note: not all LLM backends support streaming output.
Agent scripts can inherit code and configurations from another script through the type
property. Here’s an example of creating a character named “Dobby”:
---
# This script inherits from the "char" type
type: char
# Specific settings for the "char" type
# Character's name
name: "Dobby"
# Description of the character
description: "Dobby is a house-elf in the Harry Potter universe."
---
# User's question
user: "Who are you?"
---
# Response based on the character's settings
assistant: "I am Dobby. Dobby is very happy."
First, we create a basic character type script called char
, which the above script will inherit from:
---
# Indicates this is a type definition script
type: type
# Input configuration required for this character type
input:
- name: {required: true} # Required information: character's name
- description # Optional information: character's description
---
# System instructions based on the provided information
system: |-
You are an intelligent and versatile role player.
Your task is to flawlessly role-play according to the information provided below.
Please speak as if you were {{name}}.
You are {{name}}.
{{description}}
With these simple settings, one script can inherit code and configurations from another script.
Use front-matter for configuration.
front-matter
must be at the front of the file, the first line starts with ---
, and the configuration ends with ---
.
Configuration includes: basic configuration of prompt project, prompt configuration, model parameter configuration, input and output and input default value configuration For details on input and output and input default value configuration, please refer to the above.
---
_id: Needless to say, the unique identification of the script
type: script type, `char` represents the role type; `type` indicates that the script itself is a type, with `_id` being the type name.
description: description of the script
templateFormat: "The template format of this script, by default: `hf`, which is the jinja2 template format used by huggingface; `golang` is also the template type used by `ollama` and `localai`; `fstring` is also used by `langchain`."
contentType: Ignore, all here are `script`
modelPattern: Models supported by this script, through matching rules
extends: Which prompt template is extended from
创: Creator related information
签: The signature of this script
---
The import
configuration to import functions and declarations in other script file
Import one file:
---
import: "js:js_package_name" # the js npm package name
---
Import many files Use Array Format:
---
import:
- "js:js_package_name" # the js npm package name
- "js/script/path.js": ['func1', 'func2', {func3: 'asFunc3'}] # Import only the specified functions
- 'ruby-funcs.rb' # ruby file
- 'rb:ruby_package'
- "agent.ai.yaml": "asName" # Import the script and rename it to "$asName"
---
Use Object Format:
---
import: # Object Format
"js_package_name": "*"
"js/script/path.js": ['func1', 'func2']
"agent.ai.yaml": "asName"
---
Note:
- BROKEN CHANGE:
the default is js module if not extension name provided.use the prefixjs:
to specify the js module name. For example,js:js_package_name
. - The relative path is the folder of the current ai script, not the CWD(current working dir)
- When the imported declaration is a function, it automatically adds the prefix "$" to function names without a prefix
- If the function
$initializeModule
exists in the module and is imported, it will be automatically executed after the module loads. - Importing the PPE script will import the
$[PPE_ID](data)
function to execute the PPE script, as well as the$[PPE_ID].interact({message})
function for interaction. - Currently, only
javascript
support has been implemented.
If this parameter is not specified, the script will export a command named $[id]
.
The export
array specifies the functions that the script needs to export. It can export internal commands or external scripts.
---
export:
# Internal custom directive
- "$internalDirectiveName"
# Export $internalDirectiveName as $asName
- "$internalDirectiveName": "asName"
# Export two functions from the js:path module
- "js:path": ['basename', 'extname']
# Export the script itself, named `$[id]`
- "."
---
!fn |-
[js]function internalDirectiveName() {}
Note:
- When the script contains an
export
, the script itself will be executed as an initialization function ($initializeModule
) upon import by default, unless there is a$initializeModule
item in the script:- Setting
$initializeModule
tofalse
will prevent the execution of the initialization function, or decalare the$initializeModule
initialization function.
- Setting
prompt:
stop_words: ['\n'] # Custom stop words
add_generation_prompt: true # Defaults to true. When set to `true`, if the last prompt message is not the `assistant` role, an empty `assistant` message will be automatically added to ensure the continuity of the conversation.
messages: # You can also configure prompt messages here
- role: system
content: Carefully Think about the intent of following The CONVERSATION user provided. Output the json object with the Intent Category and Reason.
completion_delimiter: '' # Optional parameter, a marker indicating the end of output in the prompt. If used, this delimiter is automatically added to stop_words. Default is none.
parameters:
stop_words: ['\n'] # Custom stop words can also be defined in the parameters.
max_tokens: 512 # Not too big, not too small, 512 is recommended, default is 2048, the use is when the model response is infinite and cannot be stopped, this can control the maximum length of tokens returned by the large model.
continueOnLengthLimit: true
maxRetry: 7 # When the response of the large model is incomplete, due to the limit of max_tokens, this is the number of times LLM is automatically executed again, the default is 7 times.
stream: true # It is to enable the large model streaming response by default, higher than llmStream priority.
timeout: 30000 # Set the response timeout to 30 seconds (in ms), if not set, the default is 120 seconds.
response_format:
type: json_object
minTailRepeatCount: 7 # Minimum number of tail repetitions, default is 7, For stream mode only, when the tail sequence returned by the large model response is detected to be repeated 4 times in a row, the response will stop. Set to 0 for no detection.
llmStream: true # Default true, Enable streaming response for large models. Note that some backends may not support streaming response.
autoRunLLMIfPromptAvailable: true # Default is true, which means that when there is a prompt message in the script and no `$AI` is called until the end of the script, the script will automatically execute `$AI` at the end
forceJson: null # Default is null, indicating whether to force the output of json object, which is automatically determined by `response_format.type` and `output`: when both of them exist at the same time, the output is forced to be json.
shouldAppendResponse: null # Default is null, indicating whether the large model return result should be prompted by adding an assistant role or appended to the last message.
# If not set, the engine will automatically determine whether to add a new message
disableLlmRequest: false # Default is false, whether to disable the `llmRequest` event
Note:
- The priority of parameters from high to low is: call parameters,
prompt
object,parameters
object. - When large model streaming response is enabled, you can receive partial results through the event
llmStream
. - The parameters of the
llmStream
event handler are(event, part: AIResult, content: string)
,part
is the response object returned by the current large model, andcontent
is the accumulation of the content in the response returned by the current large model.
$on:
event: llmStream
callback: !fn |- # Anonymous function listener, event listener cannot be canceled
(event, part, content) { const current_text = part.content }
~
prefix: indicates never format string, eg, "~{{description}}
"#
prefix: indicates immediate format string, eg, "#{{description}}
"deprecated$
prefix: call command without parameters, eg, "$AI
"$!
prefix: use the return value of the command without parameters as the message- If the function return value message is a string, and the first character of the message is "#", it means to format the message immediately
?=
Prefix: indicates expression- If the expression result is a string and starts with "#", it means to format the expression result immediately
:[-1:role]Message
: replace the message. The index of the message can be specified in the square brackets. The default is the last message. If it is 0, the first message is replaced. If it is a negative number, the replacement starts from the last message, such as[-1]
to replace the last message- The role parameter can be omitted. Omitting it means keeping the role unchanged.
!:[-1]Message
- Square brackets and numbers can be omitted, such as
!:Message
. After omitting, the last message is replaced - If it is
!:#Message
, it means to format the message immediately +[-1:role]Message
: Add a message at the specified position. If the position is a negative number, it will be inserted from the last message. The position can be omitted. After omitting it, the message is added at the end. The message role is in the square brackets and can be set tosystem
,assistant
. The default isuser
. It can be omitted as:!+Message
- If it is
!+#Message
Indicates to format the message immediately - If the string does not contain the above prefix, or there is a formatting problem, it is considered as a new message for the user role.
?=<expression>
$echo: ?=23+5
Use $prompt
to define prompt parameters for use in prompt templates.
- $prompt:
add_generation_prompt: true # default is true
add_generation_prompt
: When set totrue
, if the last prompt message is not for theassistant
role, an emptyassistant
message will be automatically added to ensure the continuity of the conversation.
Use $parameters
to set model parameters or define them in FRONT-MATTER
.
---
parameters:
max_tokens: 512
temperature: 0.01
---
- $parameters:
max_tokens: 512
temperature: 0.01
Other common model parameters are as follows:
temperature
is a floating point number between 0 and positive infinity that adjusts the smoothness of the sampled probability distribution. In the context of language models, it affects the selection process of the next word.- Low temperature (close to 0): The text generated by the model will be more conservative and predictable. At this time, the model tends to choose the words with the highest probability, and the generated text will be more fluent and regular, but may lack creativity or diversity.
- High temperature: Increasing the
temperature
value will make the model more inclined to explore those words with lower probability, and the generated text will be more diverse and novel, but it may also be more discrete, difficult to understand, and even semantically jump.
continueOnLengthLimit
: This is used to determine whether AI will continue to be called automatically and continue to retrieve data after reaching the maximum token limit- Note that this is not currently applicable when the return result is json. If you require that the returned json must be retrieved at once, increase
max_tokens
maxRetry
: This parameter is also matched withcontinueOnLengthLimit
, which is the maximum number of retries. If not set, the default is 7 timestimeout
: If the brain is big and the response is slow, and it takes more than 2 minutes to respond, then you need to adjust this timeout parameter, the unit is millisecondsmax_tokens
: This is the maximum token limit, the default is 2048, AI will output until max_tokens stops, which will avoid sometimes AI outputting infinitely and can't stop.response_format
: Set the format of the returned result. Currently, only json (aliasjson_object
) can be set fortype
.- Note: When
output
andtype:json
are set at the same time, the model will be forced to return json object instead of text. - If
response_format
is not set, you can setforceJson:true
in the call parameters to achieve the same effect.
- Note: When
Use the $tool
directive to use all registered tools.
$AI
is an alias for $tool:llm
, which directly calls the large model tool. By default, the result is appended to prompt.messages
as the assistant
role message. You can turn off the append by setting shouldAppendResponse:false
.
$AI:
max_tokens: 512
temperature: 0.7
stream: true # Defaults to true, you can also set llmStream in the configuration, streaming response
pushMessage: true # Defaults to true, indicating that the result returned by the large model tool is appended to prompt.messages.
shouldAppendResponse: null # Only valid when pushMessage is true, default is undefined.
# When undefined/null, when `matchedResponse` or `add_generation_prompt` or no lastMsg.content will be appended, otherwise the body of the last message will be replaced
# When true, force an assistant message to be appended. When false, force the body of the last message to be replaced.
aborter: ?= new AbortController() # If not set, use the engine system's AbortController.
$tool:
name: llm # Equal to $AI
... # Other named parameters
Manually stop the response of the large model, which will generate an abort exception.
$AI
$abort
$pipe
will pass the result of the previous command to the pipeline.
Pass to the next instruction, supports the abbreviation $|func
- toolId: $tool
# The return result of the previous function is passed to `func1|print`. If pipe has no parameters, it is passed to the next array element. If the next element itself is an object, it is merged.
- |
- $func1
- $pipe
- $print
- llm: $tool
- $|func1
- $|print
Use !fn
tag to define function
!fn |-
function func1 ({arg1, arg2}) {
}
# The function keyword can be omitted:
!fn |-
func1 ({arg1, arg2}) {
}
The function body is javascript
. In the definition function, async require(moduleFilename)
can be used to load local esm js file in the format.
!fn |-
async myTool ({arg1, arg2}) {
const tool = await require(__dirname + '/myTool.js')
return tool.myTool({arg1, arg2})
}
If you need to use other languages, you should specify the language:
!fn |-
[python] def func1(arg1, arg2):
return arg1 + arg2
Note:
-
__dirname
: is the directory where the prompt script file is located. -
__filename
: is the prompt script file path. -
In the function, you can use
this
to get all the methods of the current script's runtime. -
All custom functions must be referenced by
$
. For example, in the example above,func1
is defined, so$func1
must be used when calling:$func1: arg1: 1 arg2: 2
-
Currently only supports JavaScript, planning to add support for Python, Ruby, etc.
!fn#
uses custom tags Define template functions, which are functions that can be used in the default JinJa template.
---
content:
a: 1
b: 2
---
!fn# |-
function toString(value) {
return JSON.stringify(value)
}
$format: "{{toString(content)}}"
Through the $exec
command, you can interact with other agent scripts.
$AI
$exec:
# id: 'script id' # Only one of the script file name and id can be selected
filename: json
args: "?=LatestResult" # Pass the result of $AI to the json agent script through parameters.
$if
directive supports conditional judgment
$set:
a: 1
- $if: "a == 1" # Expression judgment
then: # then function
$echo: Ok
else: # "else function"
$echo: Not OK
!fn |-
isOk(ok) {return ok}
- $if:
$isOK: true # function judgment
then: # then function
$echo: Ok
else: # "else function"
$echo: Not OK
The $match
instruction is used to perform multi-branch matching based on variables or the result of the previous operation. It supports various matching methods, including regular expression matching, key-value matching, exact matching, expression matching, range matching, ignore matching, object matching, etc.
Each match item must be preceded by a colon :
, followed by the match item, with no spaces in between. The condition is passed as COND__
to the execution part.
# The `condition` is optional. If not provided, the last result is used as the condition.
# By default, `$match` executes in order and stops once it finds a matching pattern, without checking subsequent patterns.
# If the `allMatches` parameter is set to `true`, then all matching branches will be executed. The default is `false`.
# If the `parallel` parameter is set to `true`, then all matching branches will be executed in parallel. This is only meaningful when `allMatches` is `true`.
$match(condition[, allMatches=false]):
# Regular expression matching
:/RegEx/:
- $echo: matched
# Conditional comparison
:> 12:
- $echo: matched
# Exact match, if the condition is a string or number
:"string": # :123
- $echo: matched
# Expression matching, condition === 1 or condition == 2
:1 || 2:
- $echo: matched
# Range matching, 1..5 represents the closed interval `[1..5]`, 1..<5 represents the half-open interval `[1,5)`, 1>..5 represents the half-open interval `(1, 5]`.
:1..5:
- $echo: matched
# Ignore specific items matching, this matches arrays with the first and fourth items, meaning the array must have a length of 4, and the first item's value is assigned to `first`, and the fourth item's value is assigned to `last`
":['a,b', _, _, last]":
- $echo: matched
# Match a complete object
":{x='a', y=':1||2' }":
- $echo: matched
# Partial match object
":{x='a', ..}":
- $echo: matched
# Otherwise
_ :
- $echo: else matched
Sure, here is the translation of the provided content:
condition
: Optional. If not specified, the condition defaults toLastResult
.allMatches
: When enabled, it executes all matching branches, meaning all matched branch items will be executed. The default value isfalse
.parallel
: Indicates whether to execute all matching branches in parallel. This is only meaningful whenallMatches
is enabled. The default value isfalse
.
The $while
directive is used to execute a block of code repeatedly as long as the given condition is true. Here is a simple example:
- $set:
i: 5
- $while: "i >= 0"
do:
- $set:
i: ?=i-1
- $if: "i == 2"
then: $break
Explanation
- Condition Expression (
"i >= 0"
): This is the condition that must be true for the loop to continue executing. - Loop Body (
do:
): This section contains the operations that are executed during each iteration of the loop. - The
$break
directive is used to prematurely end a loop. - The
$continue
directive is used to skip the current iteration of a loop and proceed directly to the next iteration.
Example Breakdown
In this example, the $while
directive checks whether the variable i
is greater than or equal to 0. If the condition is true, it executes the operations within the loop body: decrementing the value of i
by 1. This process continues until i
is no longer greater than or equal to 0.
Notes
- Ensure that the loop condition eventually changes; otherwise, it can lead to an infinite loop.
- The loop body can contain multiple operations, not just a single
$set
operation.
Using the $while
directive, you can implement basic looping logic suitable for various iterative processing scenarios.
The $for
instruction is used to iterate over a list and execute a block of code. Here is a simple example:
$for: 3 # Iterate over the numbers 1 to 3
as:
value: item
do:
- $print("The current item is:{{item}}")
$for: "[1, 2, 3, 4, 5]"
as:
value: item
do:
- $print("The current item is:{{item}}")
$for: "{a:1, b:2}"
as:
index: k
value: v
do:
- $print("The current item is:{{k}}={{v}}")
as
can be omitted. it will default to:value
will be assigned the current element of the loop, andindex
will be assigned the current index of the loop.items
is the object to iterate over. If it is a numeric range, it should be{start, end, step}
.- Loop body (
do:
): This section contains the operations to be performed in each iteration of the loop. - The
$break
instruction is used to prematurely end a loop. - The
$continue
instruction is used to skip the current iteration of the loop and proceed directly to the next iteration.
$format
directive uses Jinja2 template to format the string. The message formatting also uses Jinja2 template, which is also the template format supported by HuggingFace large model.
$format: "{{description}}"
$format:
template: "{{description}}"
data:
description: "hello world"
templateFormat: "hf" # default is hf, currently supports hf, which is jinja2 used by huggingface; `golang` is also the template type used by `ollama` and `localai`; `fstring` is also `langchain` is in use.
Support key path.
$set:
testVar.a: 124
var2: !fn (key) { return key + 'hi' }
$get:
- testVar.a
- var2
Highly programmable and event-driven prompt generation system, which allows users to dynamically control and customize the process of generating text by defining event listeners, triggers and corresponding callback functions. From the examples given, we can see several key features and advantages:
- Event-driven architecture: By providing functions such as
$on
,$once
,$emit
and$off
, the system supports an event-based programming model, allowing developers to flexibly intervene and extend the behavior of the model in response to different life cycle stages or specific conditions. - Flexibility and scalability: Users can not only register named functions as callbacks, but also use anonymous functions or expressions, providing a variety of programming interfaces to adapt to different usage scenarios and complexity requirements. This enhances the flexibility and scalability of the script.
- Detailed event type design: From
beforeCall
,afterCall
tollm
,llmStream
and other events for large model interactions, the system covers all aspects from function calls, result processing to model interactions, fully reflecting the in-depth understanding and support of common requirements in large model applications. - Integration and interaction optimization: Through the
llmRequest
event, the system can intelligently manage large model calls, support customizing the way to obtain model responses through event mechanisms, and provide disable options to adapt to different strategies. In addition, the loading and saving of chat records are also open through events, which is convenient for integration into external systems or data management. - Clear API design: The sample document shows clear API usage methods and parameter descriptions, which is convenient for developers to quickly get started and apply in depth, reflecting good design philosophy and user experience considerations.
$on
function supports event monitoring, $once
function supports event monitoring once, $emit
function supports triggering events
$off
function supports canceling event monitoring
Parameters are as follows:
- event: event name
- callback: callback function or expression
Function as callback function:
!fn |-
onTest (event, arg1) { return {...arg1, event: event.type}}
$on:
event: test
callback: onTest # Named function monitoring, event monitoring can be canceled
$once: # Automatically cancel event monitoring after triggering once
event: test
callback: !fn |- # Anonymous function monitoring, event monitoring cannot be canceled
(event, arg1) { return {...arg1, event: event.type}}
$emit: # Trigger event
event: test
args:
a: 1
b: 2
$off:
event: test
callback: onTest
The expression is used as a callback function. The parameters in the expression are as follows:
- event: event instance
- event.type: event name
- event.target: event source, that is, the current script runtime
- arg1: the first parameter value passed to the event listener function
- args: the remaining parameter value list passed to the event listener function, if any
These parameters are equivalent to the callback function: (event, arg1, ...args) => void|any
$on:
event: test
callback: "?={...arg1, event: event.type}" # Unable to cancel event listening
The parameters are as follows:
- event: event type, string, such as:
test
- args: parameter value or parameter value list passed to the event listener function, if any
$emit:
event: test
args: # an object parameter
a: 1
b: 2
$emit:
event: test
args: # indicates two object parameters
- a: 1
- b: 2
beforeCall
: triggered before the function is called- callback parameters:
(event, name, params, fn) => void|params
- When the callback function returns a value, it means to modify the parameters.
afterCall
: triggered before the function returns the result- callback parameters:
(event, name, params, result, fn) => void|result
- When the callback function returns a value, it means to modify the return result.
llmParams
: Triggered before before the LLM is called and can be used to modify the parameters passed to the LLM.- Callback:
(event, params: {value: AIMessage[], options?: any, model?: string, count?: number}) => void|result<{value: AIMessage[], options?: any, model?: string, count?: number}>
value
: The messages to be sent to the LLM.options
: The options passed to the LLM.model
: The LLM name to be used.count
: the retry count if any.
- Callback:
llmBefore
: Triggered before before the LLM is called and can not modify the parameters, only used as notification.- Callback:
(event, params: any) => void
- Callback:
llm
: the event is triggered before the large model returns the result, used to modify the large model return result.- callback parameters:
(event, result: string) => void|result<string>
llmStream
: triggered when the large model returns the result in streaming mode- callback parameters:
(event, chunk: AIResult, content: string, retryCount: number) => void
- chunk: current stream chunk content
- content: string content of all chunks currently obtained
- retryCount: number of retries for automatically calling llm when
max_token
is reached llmRequest
: event is triggered when the large model result is needed, used to call the large model through the event and get the large model result.[[RESPONSE]]
template will trigger this event- callback parameters:
(event, messages: AIChatMessage[], options?) => void|result<string>
- use the switch
disableLlmRequest: true
to disable this event. ready
: triggered after the script interaction is ready, you can force the setting of whether it is in the ready state through the$ready()
function.- callback parameters:
(event, isReady: boolean) => void
load-chats
: triggered when loading chat records.- callback parameters:
(event, filename: string) => AIChatMessage[]|void
- when the callback function returns a value, It means the loaded chat history.
save-chats
: Triggered when the chat history is saved.- Callback parameters:
(event, messages: AIChatMessage[], filename?: string) => void
Note:
- The
event
parameter in the event callback is theEvent
object, andthis
is the script runtime; - When the event callback returns a value, it means modifying the parameter or result, otherwise it is not modified; the premise is that the event type supports modification;