runtime error
Exit code: 1. Reason: tokenizer.json: 0%| | 0.00/9.09M [00:00<?, ?B/s][A tokenizer.json: 100%|██████████| 9.09M/9.09M [00:00<00:00, 112MB/s] special_tokens_map.json: 0%| | 0.00/350 [00:00<?, ?B/s][A special_tokens_map.json: 100%|██████████| 350/350 [00:00<00:00, 2.05MB/s] Loading model on CPU (this takes ~5 mins)... config.json: 0%| | 0.00/1.25k [00:00<?, ?B/s][A config.json: 100%|██████████| 1.25k/1.25k [00:00<00:00, 12.9MB/s] Unused kwargs: ['_load_in_4bit', '_load_in_8bit', 'quant_method']. These kwargs are not used in <class 'transformers.utils.quantization_config.BitsAndBytesConfig'>. Traceback (most recent call last): File "/app/app.py", line 20, in <module> model = AutoModelForCausalLM.from_pretrained( base_model_name, ...<3 lines>... trust_remote_code=True ) File "/usr/local/lib/python3.13/site-packages/transformers/models/auto/auto_factory.py", line 564, in from_pretrained return model_class.from_pretrained( ~~~~~~~~~~~~~~~~~~~~~~~~~~~^ pretrained_model_name_or_path, *model_args, config=config, **hub_kwargs, **kwargs ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ ) ^ File "/usr/local/lib/python3.13/site-packages/transformers/modeling_utils.py", line 3669, in from_pretrained hf_quantizer.validate_environment( ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^ torch_dtype=torch_dtype, ^^^^^^^^^^^^^^^^^^^^^^^^ ...<3 lines>... weights_only=weights_only, ^^^^^^^^^^^^^^^^^^^^^^^^^^ ) ^ File "/usr/local/lib/python3.13/site-packages/transformers/quantizers/quantizer_bnb_4bit.py", line 74, in validate_environment raise ImportError( "Using `bitsandbytes` 4-bit quantization requires the latest version of bitsandbytes: `pip install -U bitsandbytes`" ) ImportError: Using `bitsandbytes` 4-bit quantization requires the latest version of bitsandbytes: `pip install -U bitsandbytes`
Container logs:
Fetching error logs...