KeyError: 'tokenizer' #40

@Ther-nullptr

Description

When I run PiSSA/scripts/metamath_llama2_7b/run_qlora.sh and training tries to save a checkpoint, I get the following error:

{'loss': 0.3431, 'grad_norm': 0.09538137912750244, 'learning_rate': 1.9506710479124212e-05, 'epoch': 0.13}
 13%|████████████████████▏ | 100/781 [33:26<3:49:05, 20.18s/it]
Traceback (most recent call last):
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 303, in <module>
    train()
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 292, in train
    trainer.train(resume_from_checkpoint = resume_from_checkpoint_dir)
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2245, in train
    return inner_training_loop(
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2620, in _inner_training_loop
    self._maybe_log_save_evaluate(
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 3101, in _maybe_log_save_evaluate
    self.control = self.callback_handler.on_save(self.args, self.state, self.control)
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 545, in on_save
    return self.call_event("on_save", args, state, control)
  File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 556, in call_event
    result = getattr(callback, event)(
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 69, in on_save
    self.save_model(args, state, kwargs)
  File "/home/yujin-wa20/projects/PiSSA/train.py", line 66, in save_model
    kwargs["tokenizer"].save_pretrained(peft_model_path)
KeyError: 'tokenizer'
[rank0]: Traceback (most recent call last):
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 303, in <module>
[rank0]:     train()
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 292, in train
[rank0]:     trainer.train(resume_from_checkpoint = resume_from_checkpoint_dir)
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2245, in train
[rank0]:     return inner_training_loop(
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 2620, in _inner_training_loop
[rank0]:     self._maybe_log_save_evaluate(
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer.py", line 3101, in _maybe_log_save_evaluate
[rank0]:     self.control = self.callback_handler.on_save(self.args, self.state, self.control)
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 545, in on_save
[rank0]:     return self.call_event("on_save", args, state, control)
[rank0]:   File "/home/yujin-wa20/miniconda3/envs/halo/lib/python3.10/site-packages/transformers/trainer_callback.py", line 556, in call_event
[rank0]:     result = getattr(callback, event)(
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 69, in on_save
[rank0]:     self.save_model(args, state, kwargs)
[rank0]:   File "/home/yujin-wa20/projects/PiSSA/train.py", line 66, in save_model
[rank0]:     kwargs["tokenizer"].save_pretrained(peft_model_path)
[rank0]: KeyError: 'tokenizer'
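
A possible cause, not confirmed in this thread: the traceback shows that the callback in train.py reads kwargs["tokenizer"], and recent transformers releases appear to pass the tokenizer to callbacks under the key processing_class instead, so the old key is missing. The sketch below shows one way the lookup could be made tolerant of both keys. The helper names pick_tokenizer and save_tokenizer are hypothetical and are not part of PiSSA or transformers. Only the two kwargs keys and the save_pretrained call are taken from the traceback and the library.

```python
def pick_tokenizer(callback_kwargs):
    """Return the tokenizer-like object from callback kwargs, or None.

    Assumption: newer transformers versions use the key "processing_class",
    older ones use "tokenizer". Both are checked, newest first.
    """
    for key in ("processing_class", "tokenizer"):
        obj = callback_kwargs.get(key)
        if obj is not None:
            return obj
    return None


def save_tokenizer(callback_kwargs, peft_model_path):
    """Save the tokenizer next to the adapter if one was passed in.

    Returns True if something was saved, False if no tokenizer was found,
    so a missing key no longer aborts training with a KeyError.
    """
    tokenizer = pick_tokenizer(callback_kwargs)
    if tokenizer is None:
        return False
    tokenizer.save_pretrained(peft_model_path)
    return True
```

In train.py, the failing line kwargs["tokenizer"].save_pretrained(peft_model_path) could then be replaced with save_tokenizer(kwargs, peft_model_path). Pinning transformers to the version the repository was developed against may also avoid the error, but that version is not stated here.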
