text-generation bloom

Bloom 1b1 for Spanish text generation

This model is a fine-tuned version of bigscience/bloom-1b1 on Spanish datasets. It achieves the following results on the evaluation set:

Model under development. Use with caution.

Dataset Summary

Model trained with Large Spanish Corpus and a Spanish books corpus crawled from web and torrents.

Preprocessing

Preprocessing performed by spanish_nlp.

Licensing Information

The dataset is available under the Creative Commons Attribution-ShareAlike License (CC BY-SA 4.0).

Some books may be subject to copyright. Use for academic purposes only.

Citation Information

@misc {jorge_ortiz_fuentes_2023,
	author       = { {Jorge Ortiz Fuentes} },
	title        = { Bloom 1b1 for Spanish text generation },
	year         = 2023,
	url          = { https://huggingface.co/jorgeortizfuentes/bloom-1b1-spanish },
	doi          = { 10.57967/hf/0247 },
	publisher    = { Hugging Face }
}