## Training procedure
The following GPTQ quantization config was used during training (an equivalent `GPTQConfig` construction is sketched after the list):
- quant_method: gptq
- bits: 4
- tokenizer: None
- dataset: None
- group_size: 128
- damp_percent: 0.1
- desc_act: True
- sym: True
- true_sequential: True
- use_cuda_fp16: True
- model_seqlen: 4096
- block_name_to_quantize: model.layers
- module_name_preceding_first_block: ['model.embed_tokens']
- batch_size: 1
- pad_token_id: None
- disable_exllama: False
- max_input_length: None
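
A minimal sketch (not part of the original card) of how the settings above map onto `transformers.GPTQConfig`. Values listed as `None` in the card (`tokenizer`, `dataset`, `pad_token_id`, `max_input_length`) are the defaults and are omitted here.

```python
from transformers import GPTQConfig

# Mirrors the quantization settings listed above.
quantization_config = GPTQConfig(
    bits=4,
    group_size=128,
    damp_percent=0.1,
    desc_act=True,
    sym=True,
    true_sequential=True,
    use_cuda_fp16=True,
    model_seqlen=4096,
    block_name_to_quantize="model.layers",
    module_name_preceding_first_block=["model.embed_tokens"],
    batch_size=1,
    disable_exllama=False,  # older flag; newer transformers releases use `use_exllama` instead
)

# When quantizing a base model, this config is passed as `quantization_config=`
# to `AutoModelForCausalLM.from_pretrained`; a calibration `dataset` and a
# `tokenizer` would then need to be supplied as well.
```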
 
### Framework versions
- PEFT 0.5.0
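
A hedged sketch of attaching a PEFT 0.5.0 adapter to its GPTQ-quantized base model; the repository names below are placeholders, not taken from this card.

```python
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base_model_id = "your-org/gptq-quantized-base-model"  # placeholder
adapter_id = "your-org/this-adapter"                  # placeholder

# Load the already-quantized base model, then wrap it with the adapter weights.
base_model = AutoModelForCausalLM.from_pretrained(base_model_id, device_map="auto")
model = PeftModel.from_pretrained(base_model, adapter_id)
tokenizer = AutoTokenizer.from_pretrained(base_model_id)
```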