Skip to content

Latest commit

 

History

History
28 lines (22 loc) · 1.16 KB

design.md

File metadata and controls

28 lines (22 loc) · 1.16 KB

Design

Introduction

The Transformers Extension proposes to use large pre-trained models using Transformers API in Exasol Database. These models can be used for many predictive tasks such as text classification, question-answering, text generation etc. in the field of NLP.

Design Considerations:

Pre-trained models contain many parameters and can grow very large. Therefore, it is not efficient to download these large models in each predictive task. Transformer Extensions is therefore divided into two different parts:

  1. Download : It is responsible to download the specified pre-defined model into the Exasol BucketFS.
  2. Prediction: These are a group of UDFs for each supported task. Each of them caches the downloaded pre-trained model and perform prediction. Supported tasks:
    1. Sequence Classification for Single Text
    2. Sequence Classification for Text Pair
    3. Question Answering
    4. Masked Language Modeling
    5. Text Generation
    6. Token Classification
    7. Text Translation

System Design and Architecture

System Design Diagram