caten/apps.gpt2

Implements GPT2 text generation from the pre-trained model.

[function] make-gpt2

(make-gpt2 model-type &key max-seq-len)

Creates a compiled GPT2 model from the pre-trained gguf file. The model-type must be one of :gpt2, :gpt2-medium, :gpt2-large, or :gpt2-xl. The max-seq-len is the maximum sequence length for the model. Since GPT2 is a heavy model, you should consider JIT=1 as a prerequisite.

The pretrained model is downloaded from the following HuggingFace repository:

(defun url (model-type) (format nil "https://huggingface.co/hikettei/gpt2-gguf/resolve/main/~(~a~)-f32.gguf?download=true" model-type))

[function] gpt2-inference

(gpt2-inference model input)

Takes a compiled GPT2 model and a string input, and generates a text output.

caten/apps.gpt2

Share via

caten/apps.gpt2

[function] make-gpt2

[function] gpt2-inference