Introduction

Embeddings Generator

Processing of the source text files typically involves chunking the text into smaller pieces, such as sentences or paragraphs, and then making an OpenAI call to produce embeddings for each chunk independently. Finally, the embeddings need to be stored in a database or other data store for later use.

C# embeddings generator example

[Function(nameof(GenerateEmbeddings_Http_RequestAsync))]
public async Task GenerateEmbeddings_Http_RequestAsync(
    [HttpTrigger(AuthorizationLevel.Function, "post", Route = "embeddings")] HttpRequestData req,
    [EmbeddingsInput("{RawText}", InputType.RawText)] EmbeddingsContext embeddings)
{
    using StreamReader reader = new(req.Body);
    string request = await reader.ReadToEndAsync();

    EmbeddingsRequest? requestBody = JsonSerializer.Deserialize<EmbeddingsRequest>(request);

    this.logger.LogInformation(
        "Received {count} embedding(s) for input text containing {length} characters.",
        embeddings.Count,
        requestBody.RawText.Length);

    // TODO: Store the embeddings into a database or other storage.
}

Python example

@app.function_name("GenerateEmbeddingsHttpRequest")
@app.route(route="embeddings", methods=["POST"])
@app.embeddings_input(
    arg_name="embeddings",
    input="{rawText}",
    input_type="rawText",
    embeddings_model="%EMBEDDING_MODEL_DEPLOYMENT_NAME%",
)
def generate_embeddings_http_request(
    req: func.HttpRequest, embeddings: str
) -> func.HttpResponse:
    user_message = req.get_json()
    embeddings_json = json.loads(embeddings)
    embeddings_request = {"raw_text": user_message.get("rawText")}
    logging.info(
        f'Received {embeddings_json.get("count")} embedding(s) for input text '
        f'containing {len(embeddings_request.get("raw_text"))} characters.'
    )
    # TODO: Store the embeddings into a database or other storage.
    return func.HttpResponse(status_code=200)

Prerequisites

The sample is available in the following language stacks:

Please refer to the root README for common prerequisites that apply to all samples.

Running the sample

Clone this repo and navigate to the sample folder.
Use a terminal window to navigate to the sample directory (e.g. cd samples/embeddings/csharp-ooproc/Embeddings)
If using python, run pip install -r requirements.txt to install the correct library version.

Run func start to build and run the sample function app

If successful, you should see the following output from the func command:

Functions:

    GenerateEmbeddings_Http_RequestAsync: [POST] http://localhost:7071/api/embeddings

    GetEmbeddings_Http_FilePath: [POST] http://localhost:7071/api/embeddings-from-file

Use an HTTP client tool to send a POST request to the GenerateEmbeddings_Http_RequestAsync function. The following is an example request:
```
POST http://localhost:7071/api/embeddings
```
NOTE: All the HTTP requests in this sample can also be found in the demo.http file, which can be opened and run in most IDEs.

You should see some relevant log output in the terminal window where the app is running.
Use an HTTP client tool to send a POST request to the GetEmbeddings_Http_FilePath function. The following is an example request:
```
POST http://localhost:7071/api/embeddings-from-file
```
NOTE: All the HTTP requests in this sample can also be found in the demo.http file, which can be opened and run in most IDEs.

You should see some relevant log output in the terminal window where the app is running.