Type Safety for Tokenizer and Processor inputs #1131

Open
tsekiguchi opened this issue Jan 2, 2025 · 1 comment
Labels
enhancement New feature or request

Comments

@tsekiguchi

tsekiguchi commented Jan 2, 2025

Feature request

Currently, each class is constructed so that Transformers.js uses `...args: any[]` when passing arguments to the tokenizer and processor. This causes a lot of developer confusion, because each model requires different inputs. There is very little documentation beyond the single example provided for most models, which has led me to spend many hours figuring out what I can pass to the function in order to make a particular model happy.
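To illustrate the difference (this is a hypothetical sketch, not the actual Transformers.js API): a `...args: any[]` signature compiles no matter what is passed, whereas a typed options interface surfaces mistakes at compile time. The `TokenizerOptions` shape and toy `tokenize` implementation below are assumptions for demonstration only.

```typescript
// A loosely typed signature like this accepts anything at compile time,
// so typos and wrong argument shapes are only discovered at runtime.
type LooseTokenizer = (...args: any[]) => number[];

// A typed options interface lets the compiler catch mistakes immediately.
interface TokenizerOptions {
  padding?: boolean;
  truncation?: boolean;
  max_length?: number;
}

function tokenize(text: string | string[], options: TokenizerOptions = {}): number[] {
  // Toy implementation: one fake "token id" (the word length) per word.
  const texts = Array.isArray(text) ? text : [text];
  const ids = texts.flatMap((t) => t.split(/\s+/).map((w) => w.length));
  const max = options.truncation ? options.max_length ?? ids.length : ids.length;
  return ids.slice(0, max);
}
```

With this shape, `tokenize("hi", { truncatoin: true })` is a compile error rather than a silently ignored option.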

The broad approach of passing a model name string for the tokenizer / processor / model feels unstable. While it mirrors the Python transformers counterpart, it is a no-guardrails approach for such an important library.

Ideally, this would require a bit of a restructure, introducing an alternative Model class so as not to break existing code bases. This class could be instantiated from a list of currently supported models (with an option for custom models, should users want to convert specific models to ONNX on their own), and the class itself would contain the processor, tokenizer, and model with all the proper parameters and type definitions, allowing users to pass inputs accurately for encoding and to know what to expect upon decoding.

A broad example:

// Create the specific class for this model
const clipModel = await baseModel.initialize(ApprovedModels.OpenCLIP, {
     dtype: 'f16',
     device: 'webgpu'
})

// All model-specific methods are on this class
const tokens = clipModel.tokenizer(text)

const inputs = clipModel.processInput(imageInput, { padding: true, truncation: true })
// ...or some more specific methods
const imageInputs = clipModel.processorImageOnly(imageInput, { padding: true, truncation: true })

// Use descriptive names for methods to give a better idea of the output
const embeddings: ClipModelEmbeddings = clipModel.generateEmbeddings(inputs)
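A minimal, self-contained sketch of how such a class could be typed. All names here (`ApprovedModels`, `InitOptions`, `ClipModel`) are assumptions taken from the example above, not existing Transformers.js APIs; the bodies are stubs standing in for real weight loading and tokenization.

```typescript
// Hypothetical union of supported models, so invalid ids fail at compile time.
enum ApprovedModels {
  OpenCLIP = "openclip",
}

// Typed initialization options: invalid dtype/device values are compile errors.
interface InitOptions {
  dtype: "f16" | "f32" | "q8";
  device: "webgpu" | "wasm" | "cpu";
}

class ClipModel {
  private constructor(readonly options: InitOptions) {}

  static async initialize(model: ApprovedModels, options: InitOptions): Promise<ClipModel> {
    // Real code would download and load weights here; this stub just records options.
    return new ClipModel(options);
  }

  tokenizer(text: string): number[] {
    // Toy token ids: one per whitespace-separated word.
    return text.split(/\s+/).map((w) => w.length);
  }
}
```

Because `initialize` only accepts `ApprovedModels` and `InitOptions`, the editor can autocomplete every valid model id, dtype, and device, which is exactly the guardrail the string-based approach lacks.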

Motivation

It would greatly improve DX to provide a class specific to each model, with all the proper options and type definitions that devs are used to in TypeScript. I want to give devs feedback and guardrails as they implement Transformers.js, making it a seamless, delightful experience.

Your contribution

Absolutely! I'd love to help out. It will take a lot of work, but I want to make sure that I have the support of Xenova before heading down this road.

Thank you!

@tsekiguchi tsekiguchi added the enhancement New feature or request label Jan 2, 2025
@tsekiguchi (Author)

I'll also add that this would be an opportunity to provide better error handling across the board. There is a lack of error handling and error messages when something goes wrong, often causing the entire application to crash without logging anything. Many of these crashes could be prevented by generous use of try / catch (which is largely absent right now), along with system-check guardrails when loading models (for example, to avoid OOM crashes when loading large models).
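A hedged sketch of the kind of guardrail described above. `loadModelSafely`, `ModelLoadError`, and the byte-budget check are illustrative assumptions, not existing Transformers.js APIs; the idea is to fail fast with a descriptive error instead of letting an OOM crash take down the application.

```typescript
// A descriptive error type so callers can catch load failures specifically.
class ModelLoadError extends Error {
  constructor(modelId: string, cause: unknown) {
    super(`Failed to load model "${modelId}": ${String(cause)}`);
    this.name = "ModelLoadError";
  }
}

async function loadModelSafely(
  modelId: string,
  loadWeights: (id: string) => Promise<unknown>, // injected loader (hypothetical)
  estimatedBytes: number,
  availableBytes: number,
): Promise<unknown> {
  // System-check guardrail: refuse up front rather than OOM-crashing mid-load.
  if (estimatedBytes > availableBytes) {
    throw new ModelLoadError(
      modelId,
      `model needs ~${estimatedBytes} bytes but only ${availableBytes} are available`,
    );
  }
  try {
    return await loadWeights(modelId);
  } catch (err) {
    // Surface a descriptive error instead of crashing without a log.
    throw new ModelLoadError(modelId, err);
  }
}
```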
