An assistant and modules are created and modified via API call. This guide leads you through the calls you have to do and the settings and configurations that can be set.
...
Variable | Description | Options | optional / required |
---|---|---|---|
| Name of assistant | required | |
| fallbackModule if module selection could not find a suitable module | required | |
| Language module used for module selection | see below section with GPT-models default: | optional |
| Enable/disable upload of documents into chat |
| optional |
Collection of various setting |
|
| optional |
|
| optional | |
|
| optional | |
| List of modules to create. See Module dependent configurations for details about configuration of a module | required |
...
Code Block |
---|
{ "input": { "name": "Internal Knowledge", "fallbackModule": "SearchInVectorDB", "languageModel": "AZURE_GPT_35_TURBO_06130125", "chatUpload": "Disabled", "settings": { "showPdfHighlighting": true, "modelChoosing": "BY_FUNCTION_CALL", "isPinned": true }, "modules": { "create": [ { "name": "SearchInVectorDB", "configuration": { }, "description": null, "isExternal": false, "weight": 10000 }, { "name": "Translate", "configuration": { }, "description": null, "isExternal": false, "weight": 6000 } ] } } } |
...
Module | Example | Description | Parameter | Options | ||
---|---|---|---|---|---|---|
|
| GPT model to be used |
| see below section with GPT-models default: | ||
Scopes that the module can access |
| |||||
RAG approach to search for chunks |
|
| ||||
Describing if chunks of same document are appended as individual sources to GPT content or merged to one source |
|
| ||||
Scope restriction to documents that are uploaded. If no documents are uploaded, then scopes in scopeIds are relevant. |
|
| ||||
Flag that allows to include previous chat conversation in GPT-calls only if the new user input is a follow-up question |
|
| ||||
Max tokens used by sources and previous conversation |
| Default value depends on the used
| ||||
Specifies the primary language used for full-text search. This should match the predominant language of the documents in the knowledge centre. |
| Default: | ||||
|
| GPT model to be used |
| |||
Describing if chunks of same document are appended as individual sources to GPT content or merged to one source |
|
| ||||
|
| GPT model to be used |
| |||
|
| GPT model to be used |
| |||
|
| GPT model to be used |
| |||
Temperature (chatGPT) |
| Range: 0-1 Default: 0.5 | ||||
System prompt |
| Default system prompt is (depending of
| ||||
Maximum number of user-assistant interactions taken into account in the history. |
| Default: 2 | ||||
| ||||||
| ||||||
|
| GPT model to be used |
| |||
|
| GPT model to be used |
| |||
tbd |
| |||||
tbd |
| |||||
tbd |
| |||||
|
| GPT model to be used |
| |||
Scope that the module can access |
| |||||
Max tokens used by sources and previous conversation |
| |||||
Name of excel template file, that will be filled with extracted values. Need to uploaded to the same scopeId |
| |||||
|
| GPT model to be used |
| |||
|
| GPT model to be used |
|
...
The list below contains available GPT-models with the corresponding name that has to be used in the configurations for the assistants and modules:
Model | Key | ||||
---|---|---|---|---|---|
GPT-35-turbo (0301) | AZURE_GPT_35_TURBO | GPT-35-turbo (06130125) | AZURE_GPT_35_TURBO_0613 | GPT-35-turbo-16K (0613) | AZURE_GPT_35_TURBO_16K0125 |
GPT-4 (0613) | AZURE_GPT_4_0613 | ||||
GPT-4-32K (0613) | AZURE_GPT_4_32K_0613 | ||||
GPT-4-turbo (0409) | AZURE_GPT_4_TURBO_2024_0409 | ||||
GPT-4o (2024-0513) | AZURE_GPT_4o_2024_0513 | ||||
GPT-4o (2024-0806) | AZURE_GPT_4o_2024_0806 | ||||
GPT-4o-mini (2024-0718) | AZURE_GPT_4o_MINI_2024_0718 |
...
Panel | ||||||||
---|---|---|---|---|---|---|---|---|
| ||||||||
You can find more about modules here: Modules |
Model provisioning on Azure in terraform
When provisioning models on Azure via terraform, the following values for the name
, mode_name
and model_version
should be used. The config map of certain Unique services will contain a mapping of the provisioned models and their respective endpoints, where the keys for the supported models will be composed from the {{model_name}}-{{model_version}}
(also shown in table below).
Model | name (terraform) | model_name (terraform) | model_version (terraform) | config map key |
---|---|---|---|---|
GPT-35-turbo (0125) |
|
|
|
|
GPT-4 (0613) |
|
|
|
|
GPT-4-32K (0613) |
|
|
|
|
GPT-4-turbo (0409) |
|
|
|
|
GPT-4o (2024-0513) |
|
|
|
|
GPT-4o (2024-0806) |
|
|
|
|
GPT-4o-mini (2024-0718) |
|
|
|
|
Ada text embedding (v2) |
|
|
|
|
...
Author |
---|