build bert: build does not load model #2379

Alireza3242 · 2024-10-26T09:24:49Z

System Info

a100

Who can help?

@byshiue

Information

The official example scripts
My own modified scripts

Tasks

An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
My own task or dataset (give details below)

Reproduction

My bert model is model.safetensors. But examples/bert/build.py cant read it.
I changed this part of code and worked:

if args.model_dir is not None and os.path.exists(
                os.path.join(args.model_dir, "pytorch_model.bin")):
            state_dict = torch.load(
                os.path.join(args.model_dir, "pytorch_model.bin"))
            hf_bert.load_state_dict(state_dict, strict=False)

to

if args.model_dir is not None and os.path.exists(
                os.path.join(args.model_dir, "pytorch_model.bin")):
            state_dict = torch.load(
                os.path.join(args.model_dir, "pytorch_model.bin"))
            hf_bert.load_state_dict(state_dict, strict=False)
elif args.model_dir is not None and os.path.exists(
                os.path.join(args.model_dir, "model.safetensors")):
            state_dict = safetensors.torch.load_file(os.path.join(args.model_dir, "model.safetensors"))
            hf_bert.load_state_dict(state_dict, strict=False)

Expected behavior

read model

actual behavior

not read model

additional notes

nothing

The text was updated successfully, but these errors were encountered:

symphonylyh · 2024-10-28T21:48:50Z

fixed together with your other issue #2373. Thank you!!
I used a try-catch instead:

from safetensors.torch import load_file
...
if args.model_dir is not None:
            try:
                state_dict = torch.load(
                    os.path.join(args.model_dir, "pytorch_model.bin"))
            except FileNotFoundError:
                state_dict = load_file(os.path.join(args.model_dir, "model.safetensors"))
            hf_bert.load_state_dict(state_dict, strict=False)

tkhanipov · 2024-11-06T12:36:44Z

@symphonylyh
Note: the change only covers the case elif args.model == 'BertForSequenceClassification' or args.model == 'RobertaForSequenceClassification'. The if args.model == 'BertModel' or args.model == 'RobertaModel' case remains buggy. Please have another look at #2187, there is a fix for the latter case (the corresponding bug: #2197).

tkhanipov · 2024-11-06T12:37:51Z

@symphonylyh Here you said that you were planning to merge the PR

Alireza3242 added the bug Something isn't working label Oct 26, 2024

Superjomn added build triaged Issue has been triaged by maintainers labels Oct 28, 2024

symphonylyh closed this as completed Oct 28, 2024

tkhanipov mentioned this issue Nov 6, 2024

examples/bert/build.py does not use model weights #2197

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

build bert: build does not load model #2379

build bert: build does not load model #2379

Alireza3242 commented Oct 26, 2024 •

edited

Loading

symphonylyh commented Oct 28, 2024 •

edited

Loading

tkhanipov commented Nov 6, 2024

tkhanipov commented Nov 6, 2024

build bert: build does not load model #2379

build bert: build does not load model #2379

Comments

Alireza3242 commented Oct 26, 2024 • edited Loading

System Info

Who can help?

Information

Tasks

Reproduction

Expected behavior

actual behavior

additional notes

symphonylyh commented Oct 28, 2024 • edited Loading

tkhanipov commented Nov 6, 2024

tkhanipov commented Nov 6, 2024

Alireza3242 commented Oct 26, 2024 •

edited

Loading

symphonylyh commented Oct 28, 2024 •

edited

Loading