huggingartists lyrics lm-head causal-lm

<div class="inline-flex flex-col" style="line-height: 1.5;"> <div class="flex"> <div style="display:DISPLAY_1; margin-left: auto; margin-right: auto; width: 92px; height:92px; border-radius: 50%; background-size: cover; background-image: url('https://images.genius.com/a6115c556163f271124bacf8a07db45d.499x499x1.png')"> </div> </div> <div style="text-align: center; margin-top: 3px; font-size: 16px; font-weight: 800">🤖 HuggingArtists Model 🤖</div> <div style="text-align: center; font-size: 16px; font-weight: 800">Cocomelon</div> <a href="https://genius.com/artists/cocomelon"> <div style="text-align: center; font-size: 14px;">@cocomelon</div> </a> </div>

I was made with huggingartists.

Create your own bot based on your favorite artist with the demo!

How does it work?

To understand how the model was developed, check the W&B report.

Training data

The model was trained on lyrics from Cocomelon.

Dataset is available here. And can be used with:

from datasets import load_dataset

dataset = load_dataset("huggingartists/cocomelon")

Explore the data, which is tracked with W&B artifacts at every step of the pipeline.

Training procedure

The model is based on a pre-trained GPT-2 which is fine-tuned on Cocomelon's lyrics.

Hyperparameters and metrics are recorded in the W&B training run for full transparency and reproducibility.

At the end of training, the final model is logged and versioned.

How to use

You can use this model directly with a pipeline for text generation:

from transformers import pipeline
generator = pipeline('text-generation',
                     model='huggingartists/cocomelon')
generator("I am", num_return_sequences=5)

Or with Transformers library:

from transformers import AutoTokenizer, AutoModelWithLMHead
  
tokenizer = AutoTokenizer.from_pretrained("huggingartists/cocomelon")

model = AutoModelWithLMHead.from_pretrained("huggingartists/cocomelon")

Limitations and bias

The model suffers from the same limitations and bias as GPT-2.

In addition, the data present in the user's tweets further affects the text generated by the model.

About

Built by Aleksey Korshuk

Follow

Follow

Follow

For more details, visit the project repository.

GitHub stars