pkbiswas
/

DeepSeek-R1-Distill-Llama-8B-Summarization-QLoRa

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

DeepSeek-R1-Distill-Llama-8B-Summarization-QLoRa

1 contributor

History: 4 commits

pkbiswas's picture

Update README.md

365eb76 verified about 10 hours ago

runs
End of training about 10 hours ago
.gitattributes

1.57 kB

End of training about 10 hours ago
README.md

2.11 kB

Update README.md about 10 hours ago
adapter_config.json

814 Bytes

End of training about 10 hours ago
adapter_model.safetensors

168 MB
LFS

End of training about 10 hours ago
special_tokens_map.json

371 Bytes

End of training about 10 hours ago
tokenizer.json

17.2 MB
LFS

End of training about 10 hours ago
tokenizer_config.json

52.9 kB

End of training about 10 hours ago
training_args.bin
Detected Pickle imports (10)
- "accelerate.state.PartialState",
- "transformers.trainer_utils.SchedulerType",
- "transformers.trainer_utils.HubStrategy",
- "transformers.training_args.OptimizerNames",
- "accelerate.utils.dataclasses.DistributedType",
- "transformers.trainer_pt_utils.AcceleratorConfig",
- "transformers.trainer_utils.IntervalStrategy",
- "transformers.training_args.TrainingArguments",
- "transformers.trainer_utils.SaveStrategy",
- "torch.device"
How to fix it?
5.37 kB
LFS

End of training about 10 hours ago