As I'm working with AOTInductor and TorchScript for exporting models, I'm realizing that model publishers will sometimes want to reference runtime details for multiple model artifacts without duplicating all of the model extension info.

AOTInductor (.pt2) exports a model with hardware-specific optimizations, so the artifact is tied to a particular accelerator (CPU, GPU, TPU, etc.).

TorchScript tracing (.pt) is hardware-agnostic: the loaded model and model inputs just need to be moved to the target hardware before inference. Because the optimizations are not hardware-specific, accelerator utilization is lower than with models compiled by AOTInductor.
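For concreteness, here is a minimal sketch of the two export paths. The model, tensor shapes, and file names are placeholders, and the AOTInductor packaging API has changed across PyTorch releases (the form below follows torch >= 2.5), so treat this as illustrative rather than canonical:

```python
import torch

class TinyModel(torch.nn.Module):
    def forward(self, x):
        return torch.relu(x @ x.T)

model = TinyModel().eval()
example = torch.randn(4, 4)

# TorchScript tracing: a hardware-agnostic .pt artifact; the consumer
# moves the loaded module and its inputs to the target device at runtime.
traced = torch.jit.trace(model, example)
traced.save("model.pt")

# AOTInductor: a .pt2 package compiled with hardware-specific
# optimizations, tied to the device the model was on at export time.
# NOTE: API location/signature varies by PyTorch version (assumption:
# torch >= 2.5).
exported = torch.export.export(model, (example,))
torch._inductor.aoti_compile_and_package(exported, package_path="model.pt2")
```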
Model publishers might want to provide any combination of a hardware-agnostic model artifact, multiple hardware-optimized models, and the original weights.
I think we should probably accept an array of Runtime Objects instead of a single Runtime Object.
Various model artifacts should be provided as distinct Assets with the mlm:model role.
Each Asset can also provide mlm:artifact_type to be more explicit about the specific artifact content.
Other fields, such as mlm:framework, can also be applied to individual Assets, allowing multiple equivalent definitions of the model by various implementations.
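A rough sketch of what that could look like in an Item's assets, written as a JSON-equivalent Python dict. The asset keys, hrefs, media types, and mlm:artifact_type values are illustrative assumptions, not taken from the spec; only the mlm:model role and the mlm:artifact_type / mlm:framework field names come from the comment above:

```python
# Hypothetical "assets" fragment of a STAC Item: one hardware-agnostic
# TorchScript artifact and one AOTInductor-compiled artifact, each a
# distinct Asset with the mlm:model role.
assets = {
    "model-torchscript": {
        "href": "https://example.com/model.pt",        # placeholder URL
        "type": "application/octet-stream",
        "roles": ["mlm:model"],
        "mlm:artifact_type": "torch.jit.trace",        # illustrative value
        "mlm:framework": "pytorch",
    },
    "model-aotinductor-gpu": {
        "href": "https://example.com/model.pt2",       # placeholder URL
        "type": "application/octet-stream",
        "roles": ["mlm:model"],
        "mlm:artifact_type": "torch.export",           # illustrative value
        "mlm:framework": "pytorch",
    },
}
```

This keeps the extension-level metadata defined once on the Item while letting each artifact carry only the fields that differ, which addresses the duplication concern without introducing an array of Runtime Objects.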