Resources »

Guides »

Machine Learning »

Question Answering »

Build a question answering chat bot with Cerb and ChatGPT

Introduction

In this guide we’re going to build a chat bot that answers questions from a FAQ stored in Cerb. We’ll index our FAQ for semantic search, and use that to teach ChatGPT new facts and improve its accuracy for niche topics it wasn’t trained on. This is known as retrieval-augmented generation.

The goal is for you to create a bot that can answer common questions about your own organization or product. For demonstration purposes, we’ll create Beethoven Bot to answer questions about music theory.

To follow this guide you will need to be an administrator in Cerb 10.4.3 or later. You can also install a copy of Cerb on your computer using Docker.

You’re probably familiar with keyword search, where you find a list of matching documents using a few words or phrases.

Keyword search often uses an index, where the text from documents has been pre-processed for fast retrieval. A statistic (e.g. Term Frequency Inverse Document Frequency) then determines the best results by estimating the importance of terms within the collection of documents (the corpus).

This is relatively simple to implement, but it requires a query to use the same terms that appear in documents in order to find them.

For instance, you would need to specifically mention dark mode to find documents about it. This can be troublesome when exploring a new topic, since you may not know the proper terminology for what you’re looking for.

An improved, modern alternative to keyword searching is semantic search. Rather than relying on specific phrasing, semantic search uses the relationships between words from a large language model.

Instead of the dark mode keyword, we can now use concepts like brightness, luminosity, contrast, or eye strain in our search query. With a multi-language model, we can even ask in different languages – like Nachtmodus (night mode) in German.

Large language models (LLMs)

Large language models (LLMs) train a neural network by processing enormous amounts of unstructured text (e.g. Wikipedia, books, websites, source code), and in the process they uncover relationships between words and phrases (e.g. parts of speech, grammar, synonyms/antonyms, taxonomy). This is a simplification of how a human brain works.

A neural network is a series of nodes (neurons) in layers, which are often visualized as adjacent columns. The first layer’s inputs are sent through successive hidden layers and ultimately return an output layer. The weights (strengths) of the edges (connections) between nodes in adjacent layers are continuously adjusted during training – which we refer to as machine learning.

Modern large language models often use a more complex neural network architecture called transformers. This is beyond the scope of this guide, but it’s helpful to understand that transformers allow the neural network to discover relationships between all words in the input text and not just adjacent or prior words.

The training can be supervised (including both example inputs and desired outputs), or unsupervised (only inputs are provided and patterns are discovered).

In an LLM, the inputs are usually a sequence of tokens – numeric representations of words or word fragments from the training vocabulary. This enables mathematical operations to be efficiently performed on natural language.

Tokens may be smaller than a complete word, which allows for handling unknown terms as a combination of letters, prefixes, suffixes, etc. The exact vocabulary depends on the training data.

In this example from OpenAI's GPT-3 tokenizer, Cerb is not a token in the vocabulary. It's a combination of the token prefix Cer and the token letter b. You may also notice that tokens contain leading whitespace to mark the start of a new word. The corresponding token IDs in this example are: [2061, 318, 17419, 65, 30]

Text embeddings

Once an LLM is trained, it can be used to generate text embeddings – a vector of real numbers that plot arbitrary text in multidimensional space. This can be used to cluster related concepts together.

These embeddings are derived in various ways, but a common method is to use the last hidden layer of a neural network.

The OpenAI API returns a vector of (1,536) 32-bit floating point numbers for each text passage we send it.

While a higher dimension count may lead to better semantic understanding, it comes at a higher computational cost. An optional additional step could reduce the dimensionality to a more manageable size. For instance, from 1,536 dimensions to 768 or 384.

Comparing the similarity of text embeddings

Text embeddings are assumed to be more semantically related when they are closer together.

We can intuitively understand the distance between vectors in a few dimensions:

  • In one-dimensional space, the whole numbers 1 and 3 have a distance of 2 between them. Despite there being a single whole number between them, it takes one step from 1 to 2, and another from 2 to 3.

  • In two-dimensional space, the points (1,1) and (2,2) have a Manhattan distance of 2. They are 1 apart on the x-axis plus 1 apart on the y-axis. Their Euclidean distance is roughly 1.414 (a direct diagonal line of sqrt(2)). For special cases like geospatial (GPS) coordinates, our distance function may need to account for factors like the Earth being an imperfect sphere.

  • In three-dimensional space, coordinates include a third z axis (perhaps height) but are otherwise handled in a similar way. Simpler cases can compare vectors with Manhattan distance, Euclidean distance, dot product, cosine similarity (comparing angles between vectors), etc.

  • In four-dimensional spatiotemporal space, we can consider an example of being at the same point on Earth at the same time (longitude, latitude, elevation, time). For instance, I could be standing at the top of the modern day Great Pyramid of Giza (29°58′34″N 31°7′58″E) at an elevation of 137 meters. We could compare this against the original vector – the Great Pyramid was constructed over 4,600 years ago, its apex has eroded around 10 meters, and continental drift has likely changed its GPS coordinates by a few centimeters per year.

  • In higher dimensional space we could add heading, speed, temperature, and so on.

It’s harder to visualize the comparison of points in 1,536-dimensional space, which can represent: language, word meaning, syntax, grammar, context, semantics, sentiment, entities, topics, temporality, geography, culture, and more. Dimensions are not explicitly defined and are a product of the specific training data used.

With effective text embeddings we would expect to find the sentence “Cerb is team-based email management” much more closely related to “My inbox is overflowing” than “Ludwig van Beethoven was baptized in Bonn in December 1770”.

At any rate, the math for comparing vectors is automatically handled for you; either by Cerb or a vector database.

Text embeddings as a service

Since large language models tend to require a lot of memory and computational power (more than a typical web server provides), the easiest way to generate text embeddings in an app is by using a pre-trained model through an API.

Two of the most popular managed services for LLMs are Hugging Face and OpenAI. They provide access to state-of-the-art, pre-trained, general-purpose language models that work well for many purposes.

We’ll be using the OpenAI API in this guide.

Fine-tuning an LLM for your use case

If your topic is highly specialized, it’s possible that a general purpose model will fail to sufficiently uncover its semantics. In this case you have a few options.

You can build an LLM from scratch. However, this currently requires considerable resources to achieve state-of-the-art performance. If you have a big budget and a lot of training data available, you may be able to get “good enough” results to justify it.

A better option for most people is to fine-tune an existing pre-trained model. This is faster, easier, and more economical. By starting with a general-purpose model, you can provide additional training examples that illustrate how various pairs of text from your own data are similar or different. This changes the weights in the neural network, and consequently changes how embeddings position texts. Both Hugging Face and OpenAI support fine-tuning their models.

These topics are outside the scope of this guide. However, you’re welcome to contact us with questions.

Text generation using ChatGPT

So far we’ve covered text embeddings, but the true objective of OpenAI’s ChatGPT language model is to generate human-like responses based on a prompt.

For instance, given the prompt:

You are a pirate named Tanglebeard from 1690 that inexplicably knows everything about quantum theory. Provide succinct answers while remaining in character. Use contemporary language. What is quantum entanglement?

We’ll receive an answer like:

Arrr, quantum entanglement be a strange phenomenon in the world o’ tiny particles, matey. It be happenin’ when two particles, like electrons or photons, be connected in a way that the properties o’ one be linked to the other, no matter the distance between ‘em. Change one, and the other changes instantaneously, faster than the swiftest ship on the high seas. It’s like they be sharin’ a secret code that binds ‘em together, even when they be far apart. It be one o’ them mysteries o’ the quantum world, aye.

If we ask it something modern:

What is Cerb?

It continues to respond in character:

Arrr, I be not familiar with this “Cerb,” matey. Be it some newfangled contraption or a hidden treasure ye be speakin’ of? My knowledge be confined to the year 1690 and the ways o’ the pirate life, so I may not know o’ modern terms or technologies. But if ye have any questions ‘bout me pirate tales or quantum theories, I be at yer service!

Retrieval-augmented generation

As we’ve seen above, we can use the prompt to lock ChatGPT to a persona, such as a friendly Support Bot that can answer questions about your product.

Unfortunately, unless your organization or product is well-known and included in its training data, ChatGPT is likely to confidently provide inaccurate answers. These are referred to as hallucinations.

We can ask something like:

What is Cerb?

And it correctly answers:

Cerb is a customer service and support platform that helps businesses manage their customer interactions. It offers features such as ticket management, automation, collaboration tools, and reporting to streamline customer support processes.

However, if we ask:

What is KATA in Cerb?

It answers:

In Cerb, KATA stands for Knowledge and Task Automation. It is a feature that allows users to create automated workflows and processes to streamline their work and improve productivity. KATA can be used to automate repetitive tasks, send notifications, create custom workflows, and more.

That’s convincing and mostly accurate. However, KATA stands for “Key-Annotated Tree of Attributes”. It’s also a nod toward the Japanese concept of kata – a choreographed pattern of rehearsed actions.

Retrieval-augmented generation (RAG) is a fancy way of saying that we can take a user’s message, text embed it, use semantic search to fetch the most similar entries from our FAQ, and include those extra facts in our prompt to ChatGPT.

This allows us to take advantage of text generation; giving our bot personality, allowing it to summarize info from multiple sources, and being able to converse in multiple languages.

By including some real-time hints from our FAQ, we can reduce hallucinated answers; even when we ask about topics ChatGPT wasn’t trained on (niche topics, events that occurred after its knowledge cutoff, bias/misinformation, etc).

Our FAQ bot will use text generation to summarize the Q&A we give it. We’ll add a warning that such answers are machine generated. If you’re in an industry where occasional inaccurate answers are unacceptable or dangerous (e.g. health care), you can easily disable the generated answer and only return the most similar FAQ entries.

Connecting to the OpenAI API

OpenAI popularized large language models with their proprietary GPT (Generative Pre-Trained Transformer) model. The company has remarked that the cost to train their most recent state-of-the-art model (GPT-4) was $100 million USD. OpenAI provides third-parties with low-cost API access to their models, including text embeddings (at a cost of $0.0001 USD per 1,000 tokens).

The examples in this guide will cost you a few pennies. You can set a monthly spending limit on your OpenAI account (e.g. $10 USD).

Keep in mind that data privacy regulations may prevent you from sending personally identifying information (PII) to their API. This is not an issue with a FAQ, since these questions and answers would be found on your public website.

Create a connected service

If you don’t already have a connected service for OpenAI in Cerb, you can create one from Search » Connected Services » (+) » OpenAI.

Paste your API key from: https://platform.openai.com/account/api-keys

Click the Create button.

You can now use the OpenAI API from Cerb automations.

Import the OpenAI automations package

Here’s a package of reference examples for using the OpenAI API.

Import it in Cerb from Setup » Packages » Import:



{
  "package": {
    "requires": {
      "cerb_version": "10.4.3"
    },
    "configure": {
      "placeholders": [],
      "prompts": [
        {
          "type": "chooser",
          "label": "OpenAI Account:",
          "key": "prompt_openai_account_id",
          "params": {
            "context": "cerberusweb.contexts.connected_account",
            "query": "openai",
            "single": true
          }
        }
      ]
    }
  },
  "records": [
    {
      "uid": "automation_64ff90db6d2e5",
      "_context": "automation",
      "name": "example.services.embeddings.openai",
      "extension_id": "cerb.trigger.automation.function",
      "description": "Generate text embeddings with the OpenAI API",
      "script": "inputs:\r\n  array/texts:\r\n    required@bool: yes\r\n\r\nstart:\r\n  http.request/openai:\r\n    output: http_response\r\n    inputs:\r\n      method: POST\r\n      url: https://api.openai.com/v1/embeddings\r\n      authentication: cerb:connected_account:{{{prompt_openai_account_uri}}}\r\n      headers:\r\n        Content-Type: application/json\r\n      body:\r\n        model: text-embedding-ada-002\r\n        input@key: inputs:texts\r\n    on_success:\r\n      set:\r\n        http_body@json: {{http_response.body}}\r\n      outcome/invalid:\r\n        if@bool: {{http_body.data is not iterable}}\r\n        then:\r\n          return:\r\n      return:\r\n        # Sort the embeddings in index order and return\r\n        embeddings@json:\r\n          {{\r\n            http_body.data\r\n              |sort((a,b) => a.index <=> b.index)\r\n              |column(\"embedding\")\r\n              |json_encode\r\n          }}",
      "policy_kata": "commands:\r\n  http.request:\r\n    deny/url@bool: {{inputs.url != 'https://api.openai.com/v1/embeddings'}}\r\n    allow@bool: yes"
    },
    {
      "uid": "automation_64ff90c9055e1",
      "_context": "automation",
      "name": "example.services.llm.chatGpt",
      "extension_id": "cerb.trigger.automation.function",
      "description": "",
      "script": "inputs:\r\n  text/prompt:\r\n    type: freeform\r\n    required@bool: yes\r\n    type_options:\r\n      max_length@int: 25000\r\n      truncate@bool: yes\r\n  text/temperature:\r\n    type: decimal\r\n    required@bool: no\r\n\r\nstart:\r\n  http.request/chatCompletion:\r\n    output: http_response\r\n    inputs:\r\n      method: POST\r\n      url: https://api.openai.com/v1/chat/completions\r\n      headers:\r\n        Content-Type: application/json\r\n      body:\r\n        model: gpt-3.5-turbo\r\n        temperature@key,optional,float: inputs:temperature\r\n        messages:\r\n          0:\r\n            role: user\r\n            content@key: inputs:prompt\r\n      timeout: 20\r\n      authentication: cerb:connected_account:{{{prompt_openai_account_uri}}}\r\n    on_success:\r\n      return:\r\n        response@json: {{http_response.body}}\r\n    on_error:",
      "policy_kata": "commands:\r\n  http.request:\r\n    deny/url@bool: {{inputs.url is not prefixed ('https://api.openai.com/v1/chat/completions')}}\r\n    allow@bool: yes"
    }
  ]
}


Test the embeddings automation

Navigate to Search » Automations and edit example.services.embeddings.openai

Paste the following into Inputs in the lower left:



inputs:
  texts:
  - What is Cerb?
  - How much is it?


Click the Run button. You should see Output: like:

With this reusable function, Cerb automations can now generate text embeddings for multiple sentences at once using OpenAI’s models.

Close the automation editor popup by clicking the x in the top right.

Create Beethoven Bot

Let’s make the new semantic search FAQ available to workers through an interaction.

Common automations for all FAQs

First, let’s import a few automations we can share between multiple FAQ bots.

Navigate to Setup » Packages » Import and paste the following:



{
  "package": {
      "requires": {
          "cerb_version": "10.4.3"
      }
  },
  "records": [
    {
      "uid": "automation_64ff909656a88",
      "_context": "automation",
      "name": "example.faqBot.findSimilarDocs.resourceFile",
      "extension_id": "cerb.trigger.automation.function",
      "description": "Fetch docs from a JSONL dataset resource that are similar to a query",
      "script": "inputs:\r\n  text/query:\r\n    required@bool: yes\r\n  text/limit:\r\n    type: number\r\n    required@bool: no\r\n    default: 5\r\n  text/resource_uri:\r\n    type: uri\r\n    required@bool: yes\r\n\r\nstart:\r\n  set:\r\n    url_params:\r\n      query@key: inputs:query\r\n      limit@key,optional,int: inputs:limit\r\n  \r\n  function/embed:\r\n    uri: cerb:automation:example.services.embeddings.openai\r\n    output: results\r\n    inputs:\r\n      texts:\r\n        0@key: inputs:query\r\n  \r\n  api.command:\r\n    output: results\r\n    inputs:\r\n      name: cerb.commands.dataset.vector_similarity\r\n      params:\r\n        uri: cerb:resource:{{inputs.resource_uri}}\r\n        embeddings_key: embeddings\r\n        limit@key,int: inputs:limit\r\n        return_keys@csv: q,a\r\n        vector@key: results:embeddings:0\r\n    on_success:\r\n      return:\r\n        response:\r\n          results@key: results:matches\r\n  ",
      "policy_kata": "commands:\r\n  api.command:\r\n    deny/name@bool: {{inputs.name != 'cerb.commands.dataset.vector_similarity'}}\r\n    allow@bool: yes\r\n  function:\r\n    deny/uri@bool: {{uri is not pattern ('cerb:automation:example.services.embeddings.*')}}\r\n    allow@bool: yes"
    },
    {
      "uid": "automation_65060a9433c26",
      "_context": "automation",
      "name": "example.faqBot.interaction",
      "extension_id": "cerb.trigger.interaction.worker",
      "description": "Start a chat with a FAQ Bot",
      "script": "inputs:\r\n  text/botName:\r\n    default: FAQ Bot\r\n    type: freeform\r\n    type_options:\r\n      max_length@int: 128\r\n  text/resourceUri:\r\n    type: uri\r\n    required@bool: yes\r\n    type_options:\r\n      max_length@int: 255\r\n  text/imageUri:\r\n    type: uri\r\n    type_options:\r\n      max_length@int: 255\r\n  text/instructions:\r\n    default: (ask a question, then press ENTER)\r\n  text/introduction:\r\n    type: freeform\r\n    type_options:\r\n      max_length@int: 255\r\n  text/chatGptPrompt:\r\n    type: freeform\r\n    type_options:\r\n      max_length@int: 1024\r\n\r\nstart:\r\n  set:\r\n    promptQuery@text:\r\n\r\n  while/mainLoop:\r\n    if@bool: yes\r\n    do:\r\n      set/init:\r\n        # Clear the state each iteration\r\n        conversation@list:\r\n        docs@list:\r\n        currentQuery@text:\r\n      \r\n      # If the user asked a question, prepare a response\r\n      decision/hasQuestion:\r\n        outcome/yes:\r\n          if@bool: {{promptQuery}}\r\n          then:\r\n            # Fetch docs\r\n            function/docs:\r\n              uri: cerb:automation:example.faqBot.findSimilarDocs.resourceFile\r\n              output: matches\r\n              inputs:\r\n                query@key: promptQuery\r\n                limit: 10\r\n                resource_uri@key: inputs:resourceUri\r\n            \r\n            set/history:\r\n              currentQuery@key: promptQuery\r\n              promptQuery@text:\r\n            \r\n            # Build a pageable response\r\n            repeat:\r\n              each@csv: {{matches.response.results|keys|join(',')}}\r\n              as: result_id\r\n              do:\r\n                var.set:\r\n                  inputs:\r\n                    key: docs:{{docs|length}}\r\n                    value:\r\n                      who: {{matches.response.results[result_id].data.q}}\r\n                      similarity@text:\r\n                        {{(matches.response.results[result_id].score*100)|number_format(2)}}%\r\n                      message@text:\r\n                        {% set doc = matches.response.results[result_id] %}\r\n                        {{doc.data.a}}\r\n\r\n            set/gpt:\r\n              chatGptPrompt@text:\r\n                {{inputs.chatGptPrompt}}\r\n                Begin!\r\n                \r\n                {% for article in matches.response.results %}\r\n                === Question ===\r\n                {{article.data.q}}\r\n                === Answer ===\r\n                {{article.data.a}}\r\n                \r\n                {% endfor %}\r\n                \r\n                === Question ===\r\n                {{currentQuery}}\r\n                === Answer ===\r\n                \r\n            # Have ChatGPT summarize an answer\r\n            function/chatGpt:\r\n              uri: cerb:automation:example.services.llm.chatGpt\r\n              output: chatgpt\r\n              inputs:\r\n                temperature@float: 0\r\n                prompt@key: chatGptPrompt\r\n            \r\n            set/convo:\r\n              conversation:\r\n                q:\r\n                  who: {{worker_first_name}}\r\n                  worker_id: {{worker_id}}\r\n                  icon_uri: cerb:worker:{{worker_id}}\r\n                  message@key: currentQuery\r\n                #prompt:\r\n                #  who@key: inputs:botName\r\n                #  worker_id: 0\r\n                #  message: {{chatGptPrompt}}\r\n                chatGpt:\r\n                  who@key: inputs:botName\r\n                  worker_id: 0\r\n                  icon_uri: cerb:resource:{{inputs.imageUri}}\r\n                  message: {{chatgpt.response.choices[0].message.content}}\r\n                  disclaimer: This answer is machine generated and may not be accurate.\r\n            \r\n        outcome/no:\r\n          then:\r\n            set:\r\n              conversation:\r\n                hello:\r\n                  who@key: inputs:botName\r\n                  worker_id: 0\r\n                  icon_uri: cerb:resource:{{inputs.imageUri}}\r\n                  message@key: inputs:introduction\r\n                  \r\n      # Respond\r\n      await/response:\r\n        form:\r\n          title@key: inputs:botName\r\n          elements:\r\n            # Display the question\r\n            sheet/convo:\r\n              hidden@bool: {{conversation is empty}}\r\n              data@key: conversation\r\n              schema:\r\n                layout:\r\n                  style: fieldsets\r\n                  headings@bool: no\r\n                  paging@bool: no\r\n                  title_column: pic\r\n                  colors:\r\n                    disclaimer@csv: #999999\r\n                    disclaimer_dark@csv: #999999\r\n                columns:\r\n                  icon/pic:\r\n                    params:\r\n                      record_uri@raw: {{icon_uri}}\r\n                      text_size@raw: 250%\r\n                  text/who:\r\n                    params:\r\n                      bold@bool: yes\r\n                      text_size@raw: 150%\r\n                  markdown/message:\r\n                    params:\r\n                      value_template@raw: {{message}}\r\n                  text/disclaimer:\r\n                    params:\r\n                      icon:\r\n                        image_template@raw: {{disclaimer ? 'warning-sign'}}\r\n                      value_key: disclaimer\r\n                      text_size@raw: 90%\r\n                      text_color: disclaimer\r\n\r\n            # Display the FAQs we used\r\n            sheet/docs:\r\n              data@key: docs\r\n              hidden@bool: {{docs is empty}}\r\n              limit: 1\r\n              schema:\r\n                layout:\r\n                  style: fieldsets\r\n                  headings@bool: no\r\n                  paging@bool: yes\r\n                  title_column: pic\r\n                columns:\r\n                  icon/pic:\r\n                    params:\r\n                      image: circle-info\r\n                      text_size@raw: 125%\r\n                  text/who:\r\n                    params:\r\n                      bold@bool: yes\r\n                      text_size@raw: 125%\r\n                  markdown/message:\r\n                    params:\r\n                      value_template@key,raw: message\r\n                  markdown/similarity:\r\n                    params:\r\n                      text_size: 90%\r\n                      value_template@raw: **Similarity:** {{similarity}}\r\n            \r\n            # Allow the user to ask another question\r\n            text/promptQuery:\r\n              type: freeform\r\n              required@bool: yes\r\n              placeholder@key: inputs:instructions\r\n              max_length@int: 1000\r\n              min_length@int: 5\r\n              truncate@bool: yes",
      "policy_kata": "commands:\r\n  function:\r\n    deny/uri@bool: {{uri is not pattern ('cerb:automation:example.faqBot.*', 'cerb:automation:example.services.llm.chatGpt')}}\r\n    allow@bool: yes"
    },
    {
      "uid": "automation_6501245d8255d",
      "_context": "automation",
      "name": "example.faqBot.recordChanged.embeddings",
      "extension_id": "cerb.trigger.record.changed",
      "description": "Generate text embeddings when a FAQ article changes",
      "script": "inputs:\r\n  record/faq:\r\n    record_type: faq\r\n    required@bool: yes\r\n  text/resourceUri:\r\n    type: uri\r\n    required@bool: yes\r\n\r\nstart:\r\n  function/embed:\r\n    uri: cerb:automation:example.services.embeddings.openai\r\n    output: results\r\n    inputs:\r\n      texts:\r\n        0: {{inputs.faq.name}}\r\n    on_success:\r\n      record.update/faq:\r\n        output: updated_faq\r\n        inputs:\r\n          record_type: faq\r\n          record_id: {{inputs.faq.id}}\r\n          disable_events@bool: yes\r\n          fields:\r\n            embeddings_openai: {{results.embeddings[0]|json_encode}}\r\n      record.upsert/resource:\r\n        inputs:\r\n          record_type: resource\r\n          record_query: name:${resourceName} limit:1\r\n          record_query_params:\r\n            resourceName: {{inputs.resourceUri}}\r\n          disable_events@bool: yes\r\n          fields:\r\n            cache_until@date: -1 hour",
      "policy_kata": "commands:\r\n  function:\r\n    deny/uri@bool: {{uri != 'cerb:automation:example.services.embeddings.openai'}}\r\n    allow@bool: yes\r\n  record.update:\r\n    deny/type@bool: {{inputs.record_type is not record type ('faq','resource')}}\r\n    allow@bool: yes\r\n  record.upsert:\r\n    deny/type@bool: {{inputs.record_type is not record type ('resource')}}\r\n    allow@bool: yes"
    },
    {
      "uid": "automation_650274da0ab36",
      "_context": "automation",
      "name": "dataset.faq.embeddings",
      "extension_id": "cerb.trigger.resource.get",
      "description": "Build an embedded FAQ dataset by topic",
      "script": "inputs:\r\n  text/topic:\r\n    type: freeform\r\n    required@bool: yes\r\n\r\nstart:\r\n  record.search/faqs:\r\n    output: faqs\r\n    inputs:\r\n      record_type: faq\r\n      record_query@text:\r\n        topic:${topic} embeddings.openai:!null sort:id limit:200\r\n      record_query_params:\r\n        topic@key: inputs:topic\r\n        \r\n  return:\r\n    file:\r\n      expires_at@date: 2 hours\r\n      content@text:\r\n        {% for faq in faqs %}\r\n        {{ {\r\n          q: faq.name,\r\n          a: faq.answer,\r\n          embeddings: json_decode(faq.embeddings_openai)\r\n        }|json_encode }}\r\n        {% endfor %}",
      "policy_kata": "commands:\r\n  record.search:\r\n    deny/type@bool: {{inputs.record_type is not record type ('faq')}}\r\n    allow@bool: yes"
    }
  ]
}


Click the Import button.

Beethoven Bot

Now let’s import Beethoven Bot from Setup » Packages » Import by pasting the following:



{
  "package": {
      "requires": {
          "cerb_version": "10.4.3"
      }
  },
  "records": [
    {
      "uid": "resource_faq_beethoven",
      "_context": "resource",
      "name": "dataset.faq.embeddings.beethoven",
      "description": "A FAQ about Beethoven and music theory",
      "extension_id": "cerb.resource.dataset.jsonl",
      "is_dynamic": 1,
      "automation_kata": "automation/beethoven:\r\n  uri: cerb:automation:dataset.faq.embeddings\r\n  inputs:\r\n    topic: Beethoven"
    },
    {
      "uid": "automation_65060af43f18b",
      "_context": "automation",
      "name": "example.beethovenBot",
      "extension_id": "cerb.trigger.interaction.worker",
      "description": "Start a chat with Beethoven Bot",
      "script": "start:\r\n  await:\r\n    interaction:\r\n      uri: cerb:automation:example.faqBot.interaction\r\n      inputs:\r\n        botName: Beethoven Bot\r\n        resourceUri: dataset.faq.embeddings.beethoven\r\n        imageUri: image.beethovenBot\r\n        instructions: (ask Beethoven a question about music theory or himself, then press ENTER)\r\n        introduction: Hello {{worker_first_name}}! What can I teach you about music?\r\n        chatGptPrompt@text:\r\n          You are German composer Ludwig van Beethoven. You know everything about music theory.\r\n          Answer personal questions as Beethoven not an AI.\r\n          You provide friendly, informative answers based ONLY on the Q&A below (don't make things up).\r\n          You are aware you are long dead in the present day.\r\n          Assume present tense personal questions are meant in past tense.\r\n          For example, \"Who is your mother?\" means \"Who was your mother?\"",
      "policy_kata": "commands:\r\n  # [TODO] Specify a command policy here (use Ctrl+Space for autocompletion)\r\n  "
    },
    {
      "uid": "resource_beethovenBot",
      "_context": "resource",
      "name": "image.beethovenBot",
      "description": "A profile image for Beethoven Bot",
      "extension_id": "cerb.resource.image",
      "automation_kata": "width@int: 150\r\nheight@int: 150\r\nmime_type: image/png",
      "content": ""
    }
  ],
  "events": [
    {
      "event": "record.changed",
      "kata": "automation/beethovenBot:\n  uri: cerb:automation:example.faqBot.recordChanged.embeddings\n  disabled@bool:\n    {{\n      record__type is not record type ('faq')\n      or record_topic != \"Beethoven\"\n      or (change_type == 'updated' and was_record_name == record_name)\n      or change_type == 'deleted'\n    }}\n  inputs:\n    faq@int: {{record_id}}\n    resourceUri: dataset.faq.embeddings.beethoven\n"
    }
  ],
  "toolbars": [
    {
      "toolbar": "global.menu",
      "kata": "interaction/beethovenBot:\r\n  label: Beethoven Bot\r\n  uri: cerb:automation:example.beethovenBot\r\n  icon: music\r\n"
    }
  ]
}


Click the Import button.

Reload the page in your browser.

You should see the global menu floating in the bottom right of the browser window. Click it and select Beethoven Bot.

Ask the bot a question:

Managing FAQs in Cerb

Right now Beethoven Bot’s answers are based on ChatGPT’s internalized knowledge from its training.

Let’s create a new record type in Cerb to store our FAQ entries, which we can use to teach it new facts.

To create your own FAQ dataset for semantic search you need two things:

  1. A dataset of question/answer pairs in a convenient format (e.g. CSV, JSON, XML, YAML). For this guide, we’ll manage our FAQ records using Cerb.

  2. A method to generate text embeddings for comparing the similarity of distinct sentences (semantics). We’ll use the OpenAI API we linked earlier.

If you have existing resources you want to make available through semantic search (e.g. documentation, blog posts), you can use this guide as a reference. For now use the sample data we provide below.

Once again, navigate to Setup » Packages » Import and paste the following package to create a new faq record type:



{
    "package": {
        "requires": {
            "cerb_version": "10.4.3"
        }
    },
    "records": [
        {
            "uid": "faq_record",
            "_context": "custom_record",
            "uri": "faq",
            "name": "FAQ",
            "name_plural": "FAQs",
            "params": {
              "options": ["comments"]
            }
        },
        {
            "uid": "field_faq_answer",
            "_context": "custom_field",
            "uri": "answer",
            "name": "Answer",
            "context": "faq",
            "type": "T",
            "params": {
                "format": "markdown"
            },
            "pos": "1"
        },
        {
            "uid": "field_faq_topic",
            "_context": "custom_field",
            "uri": "topic",
            "name": "Topic",
            "context": "faq",
            "type": "D",
            "params": {
                "options": []
            },
            "pos": "2"
        },
        {
            "uid": "field_faq_source",
            "_context": "custom_field",
            "uri": "sourceUrl",
            "name": "Source URL",
            "context": "faq",
            "type": "U",
            "pos": "3"
        },
        {
            "uid": "field_faq_entailments",
            "_context": "custom_field",
            "uri": "entailments",
            "name": "Entailments",
            "context": "faq",
            "type": "M",
            "pos": "4"
        },
        {
            "uid": "fieldset_embeddings",
            "_context": "custom_fieldset",
            "name": "Embeddings",
            "context": "faq",
            "owner__context": "app",
            "owner_id": "0"
        },
        {
            "uid": "field_faq_embeddings",
            "_context": "custom_field",
            "uri": "embeddings_openai",
            "name": "OpenAI",
            "custom_fieldset_id": "{{{uid.fieldset_embeddings}}}",
            "context": "faq",
            "type": "T",
            "pos": "50"
        }
    ]
}


Click the Import button.

The new faq record type has six fields:

name A common question.
answer The answer to the question.
topic An optional topic to group related FAQ questions. This allows us to manage multiple FAQs for different bots in one place.
sourceUrl An optional URL for source attribution.
embeddings_openai A set of real numbers that cluster the question in high-dimensional space. These are generated by a large language model (LLM) using a service like Hugging Face or OpenAI.
entailments An optional list of alternative phrasings of the question. We’ll use this later if you choose to “fine-tune” your own language model for better clustering.

Sample FAQ data

Here’s a brief example FAQ in CSV format about music theory and Beethoven:



Question,Answer
"Who composed Moonlight Sonata?","Ludwig van Beethoven composed Piano Sonata No. 14 (popularly referred to as Moonlight Sonata) in 1801."
"How many keys are on a piano?","A full piano has 88 keys (52 white and 36 black)."
"How many octaves are on a piano?","A full piano has seven octaves and a minor third."
"What is the musical alphabet?","The seven notes of the musical alphabet are: A, B, C, D, E, F, and G."
"What is a sharp note?","A sharpened note is higher by one semitone."
"What is the notation for a sharp note?","A sharpened note is prefixed with #."
"What is a flat note?","A flattened note is lower by one semitone."
"What is the notation for a flat note?","A flattened note is prefixed with ♭."
"What is a natural note?","A natural note is not sharpened nor flattened."
"What is the notation for a natural note?","A natural note is prefixed with ♮."
"What is an accidental?","An accidental is a note that is not a member of the current scale: sharp, flat, or natural."
"What is the C Major scale?","The C Major scale is a diatonic scale that consists of the notes C, D, E, F, G, A, and B. It is often considered the simplest and most fundamental scale in Western music. The scale follows a specific pattern of whole steps (W) and half steps (H), which is W-W-H-W-W-W-H. In terms of the piano, it can be played entirely on the white keys starting from C and ending on the next C. The C Major scale has a bright and cheerful sound and serves as a foundation for many musical compositions."
"What is the G Major scale?","The G Major scale is a musical scale that consists of the notes G, A, B, C, D, E, and F#. It is a major scale, which means it follows a specific pattern of whole and half steps. The G Major scale is commonly used in classical and popular music and has a bright and joyful sound."
"What is a major scale?","A major scale starts on the named note and follows this pattern for an octave: tone, tone, semitone, tone, tone, tone, semitone."
"What is an octave?","An octave is the interval between a pitch and the pitch that is double its frequency."
"What is the treble clef?","In musical notation, the treble clef indicates higher pitched notes or instruments."
"What is the bass clef?","In musical notation, the bass clef indicates lower pitched notes or instruments."
"When was Ludwig van Beethoven born?","Ludwig van Beethoven was baptised in Bonn on 17 December, 1770."
"When did Ludwig van Beethoven die?","Ludwig van Beethoven died on 26 March, 1827 in Vienna at the age of 56."
"Who were Beethoven's parents?","Ludwig van Beethoven's father was Johann van Beethoven and his mother was Maria Magdalena Keverich."


Save the example FAQ entries to a file named: beethoven-faq.csv

Import FAQ entries

Navigate to Search » Faqs

Click the import icon in the top right of the worklist.

Choose beethoven-faq.csv from above and click the Upload button.

Map the CSV columns to Cerb record fields.

Then click the Continue button.

This may take a moment since the OpenAI API will be used to generate text embeddings for the new records.

At this point you can test keyword searching. The term parents will find a matching answer, but mother or father won’t. Our new semantic search will understand the relationship.

Navigate to Search » Automations and edit the example.faqBot.findSimilarDocs.resourceFile automation.

Inputs:



inputs:
  query: Who was Beethoven's mother?
  limit: 5
  resource_uri: dataset.faq.embeddings.beethoven


Click the Run icon.

You should see output like:



__exit: return
__return:
  response:
    results:
    - score: 0.9498498162366563
      data:
        q: Who were Beethoven's parents?
        a: Ludwig van Beethoven's father was Johann van Beethoven and his mother was
          Maria Magdalena Keverich.
    - score: 0.888526146049206
      data:
        q: When was Ludwig van Beethoven born?
        a: Ludwig van Beethoven was baptised in Bonn on 17 December, 1770.
    - score: 0.8694563363977813
      data:
        q: When did Ludwig van Beethoven die?
        a: Ludwig van Beethoven died on 26 March, 1827 in Vienna at the age of 56.
    - score: 0.8452143525786211
      data:
        q: Who composed Moonlight Sonata?
        a: Ludwig van Beethoven composed Piano Sonata No. 14 (popularly referred to
          as Moonlight Sonata) in 1801.
    - score: 0.7753790481042104
      data:
        q: How many keys are on a piano?
        a: A full piano has 88 keys (52 white and 36 black).


Ask Beethoven Bot another question:

The bot’s answers are now guided by the relevant FAQ entries.

This is how you can teach ChatGPT about new topics and reduce hallucinated answers.

For instance, Beethoven Bot will probably answer something like this by default:

In reality, there is no reliable evidence that Beethoven met or auditioned for Mozart during his 1787 visit to Vienna.

Let’s teach our bot this fact.

Navigate to Search » Faqs and click the (+) button at the top right of the worklist popup.

Enter the following info:

Name: (Question) Did Beethoven meet Mozart?
Answer: It is certain that Beethoven visited Vienna in 1787. However, there is no reliable evidence that Beethoven and Mozart met during that trip. According to legend, Beethoven briefly auditioned for Mozart before returning home to be with his dying mother.
Topic: Beethoven

Click the Save Changes button.

Immediately after saving the FAQ entry, the text is embedded using OpenAI’s API, and the resulting vector is stored on the Embeddings custom field on the record. The automation also invalidates the cache on the dataset.faq.embeddings.beethoven resource to rebuild the search index.

Let’s ask again:

Add a new FAQ bot with your own topic

Now let’s create a second bot for your own FAQ.

Add a new topic to the custom field

First, navigate to Search » Custom Fields and edit the Topic custom field on faq records to add a new topic.

Use the package to create your new bot

Then navigate to Setup » Packages » Import and paste the following package:



{
  "package": {
    "requires": {
      "cerb_version": "10.4.3"
    },
    "configure": {
      "placeholders": [],
      "prompts": [
        {
          "type": "text",
          "label": "Topic:",
          "key": "prompt_topic",
          "params": {
            "default": "FAQ",
            "placeholder": "(e.g. FAQ, Cerb, Beethoven)"
          }
        }
      ]
    }
  },
  "records": [
    {
      "uid": "resource_faq_embeddings",
      "_context": "resource",
      "name": "dataset.faq.embeddings.{{{prompt_topic|permalink|lower}}}",
      "description": "{{{prompt_topic}}} Bot FAQ embeddings",
      "extension_id": "cerb.resource.dataset.jsonl",
      "is_dynamic": 1,
      "automation_kata": "automation/{{{random_string(8)}}}:\r\n  uri: cerb:automation:dataset.faq.embeddings\r\n  inputs:\r\n    topic: {{{prompt_topic}}}"
    },
    {
      "uid": "automation_faq_interaction",
      "_context": "automation",
      "name": "example.bot.{{{prompt_topic|permalink|lower}}}",
      "extension_id": "cerb.trigger.interaction.worker",
      "description": "Start a chat with {{{prompt_topic}}} Bot",
      "script": "start:\r\n  await:\r\n    interaction:\r\n      uri: cerb:automation:example.faqBot.interaction\r\n      inputs:\r\n        botName: {{{prompt_topic}}} Bot\r\n        resourceUri: dataset.faq.embeddings.{{{prompt_topic|permalink|lower}}}\r\n        imageUri: image.bot.{{{prompt_topic|permalink|lower}}}\r\n        instructions: (ask {{{prompt_topic}}} Bot a question, then press ENTER)\r\n        introduction: Hello {{worker_first_name}}! How can I help?\r\n        chatGptPrompt@text:\r\n          You are a support bot.\r\n          You provide friendly, informative answers based ONLY on the Q&A below (don't make things up).\r\n",
      "policy_kata": "commands:\r\n  # [TODO] Specify a command policy here (use Ctrl+Space for autocompletion)\r\n  "
    },
    {
      "uid": "resource_image",
      "_context": "resource",
      "name": "image.bot.{{{prompt_topic|permalink|lower}}}",
      "description": "A profile image for {{{prompt_topic}}} Bot",
      "extension_id": "cerb.resource.image",
      "automation_kata": "width@int: 100\r\nheight@int: 100\r\nmime_type: image/png",
      "content": ""
    }
  ],
  "events": [
    {
      "event": "record.changed",
      "kata": "automation/{{{random_string(8)}}}:\n  uri: cerb:automation:example.faqBot.recordChanged.embeddings\n  disabled@bool:\n    {{\n      record__type is not record type ('faq')\n      or record_topic != \"{{{prompt_topic}}}\"\n      or (change_type == 'updated' and  was_record_name == record_name)\n      or change_type == 'deleted'\n    }}\n  inputs:\n    faq@int: {{record_id}}\n    resourceUri: dataset.faq.embeddings.{{{prompt_topic|permalink|lower}}}\n"
    }
  ],
  "toolbars": [
    {
      "toolbar": "global.menu",
      "kata": "interaction/{{{random_string(8)}}}:\r\n  label: {{{prompt_topic}}} Bot\r\n  uri: cerb:automation:example.bot.{{{prompt_topic|permalink|lower}}}\r\n  icon: chat\r\n"
    }
  ]
}


Enter the same topic you added to the custom field (e.g. Cerb) and click the Import button.

Chat with your bot

Click on the global menu in the bottom right and select your new bot from the list.

Customizing your FAQ bot

From Search » Automations edit the automation for your bot (e.g. example.bot.cerb).

In start:await:interaction:inputs: you can customize the bot name, image, introduction, instructions, and ChatGPT prompt.

Changing your bot image

To update the image of your bot, navigate to Search » Resources and edit it (e.g. image.bot.cerb). Upload a new image and click the Save Changes button.

Changing the global menu item

To edit the menu item for your bot, navigate to Search » Toolbars and edit global.menu.

Disabling text generation

If you wish to disable ChatGPT’s text generation and only return the most similar FAQ entries, edit the example.faqBot.interaction automation and comment out the following two code blocks:

Around line 91:



            # Have ChatGPT summarize an answer
            #function/chatGpt:
            #  uri: cerb:automation:example.services.llm.chatGpt
            #  output: chatgpt
            #  inputs:
            #    temperature@float: 0
            #    prompt@key: chatGptPrompt


Around line 110:



                #chatGpt:
                #  who@key: inputs:botName
                #  worker_id: 0
                #  icon_uri: cerb:resource:{{inputs.imageUri}}
                #  message: {{chatgpt.response.choices[0].message.content}}
                #  disclaimer: This answer is machine generated and may not be accurate.


Scaling for large FAQs or busy bots

In this guide we use a Cerb resource record for simplicity. If you have a large FAQ (100+ Q&As) or high usage on your bot, you will have better performance by using a specialized vector database (e.g. Pinecone, Qdrant, Elasticsearch 8+, Redis Search) to store and compare your text embeddings.

These databases use strategies like Hierarchical Navigable Small Worlds (HNSW) to efficiently compare millions of vectors.

You would still use OpenAI’s API (or a comparable method) to text embed your FAQ entries and each new user message.

Update the example.faqBot.recordChanged.embeddings automation to store embeddings in the vector database. Also store the FAQ record ID in the metadata.

In example.faqBot.interaction replace this block (around line 45) with a function that fetches results from the vector database:



            # Fetch docs
            function/docs:
              uri: cerb:automation:example.faqBot.findSimilarDocs.resourceFile
              output: matches
              inputs:
                query@key: promptQuery
                limit: 10
                resource_uri@key: inputs:resourceUri


As long as you return an object with this same structure you won’t need to make other changes:

return:    
  response:
    results:
      0:
        score: 0.99123123
        data:
          q: What is Cerb?
          a: Cerb is a customer service and support platform that helps businesses manage their customer interactions.

If your Q&A entries are fairly small (under 40KB for Pinecone) you can store the questions and answers in the vector database metadata. Otherwise just store the IDs, and load the FAQ entries in Cerb with record.search: in your function.