Chat with an LLM

This guide contains information on how you can use the MongoDB Chatbot Server to chat with a large language model (LLM).

Configure the `ChatLlm`

The ChatLlm is the interface between the chatbot server and the LLM.

The MongoDB Chatbot Server comes with an implementation of the ChatLlm, which uses the OpenAI API. You could also implement your own ChatLlm to use a different language model or different configuration on the OpenAI API.

The following are useful things to keep in mind when using an LLM:

What model to use. This is probably the single most important decision for shaping the chatbot response. The quality and characteristics of different models vary greatly.
Model temperature. The temperature of the model determines how "creative" the model is. A higher temperature will result in more creative responses, but also more errors.
Model max tokens. The maximum number of tokens that the model will generate. This is useful for preventing the model from generating very long responses, which impacts cost and quality.
Prompt engineering. What additional information to include in the prompt to guide the model's behavior. For more information, refer to the Prompt Engineering section.
Tools. What tools to give the model to use. For more information, refer to the Tool Calling guide.

Use OpenAI API

You can use the makeOpenAiChatLlm() constructor function to create a ChatLlm that uses an OpenAI model like GPT-3.5.

makeOpenAiChatLlm() supports both the OpenAI API and Azure OpenAI Service. It wraps the openai package, which supports both of these services.

The following is an example implementation of makeOpenAiChatLlm():

import { makeOpenAiChatLlm, OpenAiChatMessage } from "mongodb-chatbot-server";
import { someTool } from "./someTool";
export const openAiClient = new OpenAIClient(
  OPENAI_ENDPOINT,
  new AzureKeyCredential(OPENAI_API_KEY)
);

export const llm = makeOpenAiChatLlm({
  openAiClient,
  deployment: OPENAI_CHAT_COMPLETION_DEPLOYMENT,
  openAiLmmConfigOptions: {
    temperature: 0,
    maxTokens: 500,
  },
  tools: [someTool],
});

Use Langchain `ChatModel`

You can use the makeLangchainChatLlm() constructor function to create a ChatLlm that uses a Langchain ChatModel. For more information on available ChatModel implementations, refer to the Chat Models in the Langchain documentation.

The following is an example implementation of using makeLangchainChatLlm() to use Anthropic's Claude family of models:

import { makeLangchainChatLlm } from "mongodb-chatbot-server";
import { ChatAnthropic } from "@langchain/anthropic";

const anthropicModel = new ChatAnthropic({
  temperature: 0.9,
  anthropicApiKey: "YOUR-API-KEY",
  maxTokensToSample: 1024,
});
const anthropicChatLlm = makeLangchainChatLlm({ chatModel: anthropicModel });

Manage Previous Messages Sent to the LLM

The MongoDB Chatbot Server always sends the current user message to the LLM.

You can also manage which previous messages in a conversation the MongoDB Chatbot Server sends to the LLM on each user message. You might want to do this to allow for appropriate context to be sent to the LLM without exceeding the maximum number of tokens in the LLM's context window.

You do this at the ConversationRouter level with the ConversationsRouterParams.filterPreviousMessages property. The filterPreviousMessages property accepts a FilterPreviousMessages function.

By default, the MongoDB Chatbot Server only send the initial system prompt and the user's current message to the LLM. You can change this behavior by implementing your own FilterPreviousMessages function.

The MongoDB Chatbot Server package also comes with a makeFilterNPreviousMessages constructor function. makeFilterNPreviousMessages creates a FilterPreviousMessages function that returns the previous n messages plus the initial system prompt.

The following is an example implementation of makeFilterNPreviousMessages():

import {
  makeFilterNPreviousMessages,
  ConversationsRouterParams,
  AppConfig,
} from "mongodb-chatbot-server";

const filter10PreviousMessages = makeFilterNPreviousMessages(10);

const conversationsRouterConfig: ConversationsRouterParams = {
  filterPreviousMessages: filter10PreviousMessages,
  ...otherConfig,
};
const appConfig: AppConfig = {
  conversationsRouterConfig,
  ...otherConfig,
};

Prompt Engineering

Prompt engineering is the process of directing the output of a language model to produce a desired response.

In the context of a chatbot server such as this, there are the following main areas for prompt engineering:

System prompt: Message at the beginning of the conversation that guides the chatbot's behavior when generating responses.
User prompt: User message that the chatbot uses to generate a response. In RAG applications, this can include adding relevant content gathered from vector search results based on the user's input.

This guide does not cover prompt engineering techniques, but rather where you can apply them in the MongoDB Chatbot Server.

Prompt engineering is a fairly new field, and best practices are still emerging. A great resource to learn more about prompt engineering is the Prompt Engineering Guide.

System Prompt

To add a system prompt, include a SystemPrompt message in your app's ConversationsRouterParams.systemPrompt.

The system prompt is one of the most powerful way to customize the way that the chatbot responds to users. You can use the system prompt to do things such as:

Control the style and personality of the chatbot.
Determine how the chatbot responds to certain types of questions.
Direct how the chatbot interprets user input and context information.

import {
  SystemPrompt,
  ConversationsRouterParams,
} from "mongodb-chatbot-server";
import { MongoClient } from "mongodb-rag-core/mongodb";

// System prompt for chatbot
const systemPrompt: SystemPrompt = {
  role: "system",
  content: `You are an assistant to users of the MongoDB Chatbot Framework.
Answer their questions about the framework in a friendly conversational tone.
Format your answers in Markdown.
Be concise in your answers.
If you do not know the answer to the question based on the information provided,
respond: "I'm sorry, I don't know the answer to that question. Please try to rephrase it. Refer to the below information to see if it helps."`,
};

const conversationsRouterConfig: ConversationsRouterParams = {
  // ...other config
  systemPrompt,
};

User Prompt

You can modify what the chatbot uses as the user prompt by implementing the GenerateUserPromptFunc function.

GenerateUserPromptFunc takes in the user's query and previous messages in the conversation, then returns a new user message. For an overview of the GenerateUserPromptFunc function, refer to the Generate User Message guide.

You might want to modify the user prompt if you're using a prompting technique like retrieval augmented generation (RAG) or chain of thought. To learn more about using RAG with the MongoDB Chatbot Server, refer to the RAG guide.

Chat with an LLM

Configure the ChatLlm​

Use OpenAI API​

Use Langchain ChatModel​

Manage Previous Messages Sent to the LLM​

Prompt Engineering​

System Prompt​

User Prompt​