use accelerate to load model

by adolf669 - opened Mar 14, 2023

Mar 14, 2023

I use accelerate,like this:
"""
tokenizer = AutoTokenizer.from_pretrained("model/GPT-NeoXT-Chat-Base-20B")
with init_empty_weights():
model = AutoConfig.from_pretrained("model/GPT-NeoXT-Chat-Base-20B")
model = load_checkpoint_and_dispatch(model, "model/GPT-NeoXT-Chat-Base-20B", device_map="auto")
"""
but it have a error:AttributeError: 'GPTNeoXConfig' object has no attribute 'named_parameters',what can I do

juewang

Together org Mar 17, 2023

@adolf669 Hi, you seem to take a config as model. Can you try this?

tokenizer = AutoTokenizer.from_pretrained("model/GPT-NeoXT-Chat-Base-20B")
config = AutoConfig.from_pretrained("model/GPT-NeoXT-Chat-Base-20B")
with init_empty_weights():
    model = AutoModelForCausalLM.from_config(config)
model = load_checkpoint_and_dispatch(model, "model/GPT-NeoXT-Chat-Base-20B", device_map="auto")

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment