Bloom 1b1 for Spanish text generation
This model is a fine-tuned version of bigscience/bloom-1b1 on Spanish datasets. It achieves the following results on the evaluation set:
- Loss: 2.340
Model under development. Use with caution.
Dataset Summary
Model trained with Large Spanish Corpus and a Spanish books corpus crawled from web and torrents.
Preprocessing
Preprocessing performed by spanish_nlp.
Licensing Information
The dataset is available under the Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0).
Some books may be subject to copyright. Use for academic purposes only.
Citation Information
@misc {jorge_ortiz_fuentes_2023,
author = { {Jorge Ortiz Fuentes} },
title = { Bloom 1b1 for Spanish text generation },
year = 2023,
url = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish },
doi = { 10.57967/hf/0247 },
publisher = { Hugging Face }
}