text-generation-inference

KoMETA AI models

This is a collection of models trained on KoMETA's talents' voice, manner of speech, and other things.


Model Description (SVC)

<!-- Provide a longer summary of what this model is. -->

Dataset

ElaineGeneral1

Around 3500 audio files of Elaine's voice, totaling to more than 3 hours worth of audio.

Singlaine2

130 audio files of Elaine's unarchived karaoke streams, singing voice only.


Model Description (Text Generation)

Known compatible models

7B LoRA

1.3B LoRA

Dataset

VirgilCorpus

A collection of text transcribed by OpenAI Whisper (Medium). Using 13 livestreams from Virgil's channel.

VirgilCorpusV2

A better filtered and larger collection of text, using transcription by OpenAI Whisper (Medium and Small.en). Using VirgilCorpus with an additional 13 streams.


Intended Use

For entertainment, educational, and personal use only.

Out-of-Scope Use

<!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->

Please note that these AI models are not designed for any harmful, malicious, or deceptive activities, and it is the users' responsibility to make sure that these are not used for such purposes.

Limitations and Biases

So-vits-svc (voice conversion) models can't perfectly imitate the voice of the character, this is mostly due to badly filtered data.

Text generation models have a tendency to hallucinate and spew inaccurate information or derail from the current conversation.

How to Get Started with the Model

Use so-vits-svc-fork for so-vits-svc. Use text-generation-webui for text generation. Models can be found on Huggingface.