Sygil-Muse

Model Card for Sygil-Muse


This model is based on Muse and trained using the code hosted at Sygil-Dev/muse-maskgit-pytorch, which builds on lucidrains/muse-maskgit-pytorch and is a collaboration between the Sygil-Dev and ShoukanLabs teams.

Model Details

This is a new model trained from scratch, based on Muse, on a subset of the Imaginary Network Expanded Dataset. Its main advantage is that it allows the use of multiple namespaces (labeled tags) to control various parts of the final generation. Namespaces (e.g. "species:seal" or "studio:dc") stop the model from misinterpreting a seal as the singer Seal, or DC Comics as Washington, D.C.
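
To illustrate how namespaced tags might be assembled into a prompt, here is a small hypothetical helper; the `build_prompt` function and the comma-separated joining convention are assumptions for illustration only, not part of the model's API.

```python
# Hypothetical helper: joins namespace:tag pairs into a single prompt string.
# The "namespace:value" format mirrors the examples above; the comma-separated
# joining convention is an assumption for illustration, not the model's API.
def build_prompt(tags: dict[str, str], description: str = "") -> str:
    namespaced = [f"{namespace}:{value}" for namespace, value in tags.items()]
    parts = ([description] if description else []) + namespaced
    return ", ".join(parts)

# "species:seal" tells the model this is the animal, not the singer Seal
print(build_prompt({"species": "seal", "studio": "dc"}, "a seal reading a comic"))
# -> "a seal reading a comic, species:seal, studio:dc"
```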

Note: As of right now, only the first VAE and the MaskGit have been trained, each with different configurations, as we are trying to find the best balance between quality, performance, and VRAM usage so that Muse can run on all kinds of devices. We still need to train the Super Resolution VAE for the model to be usable, even though we might be able to reuse the first VAE depending on its quality once training progresses further.
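
For context on how these pieces fit together, below is a minimal sketch of the two-stage Muse pipeline using the upstream lucidrains/muse-maskgit-pytorch API that Sygil-Dev/muse-maskgit-pytorch builds on. The constructor argument names and values here are assumptions and may differ in this fork and in the released checkpoints.

```python
from muse_maskgit_pytorch import VQGanVAE, MaskGit, MaskGitTransformer, Muse

# NOTE: sketch of the upstream lucidrains/muse-maskgit-pytorch API; argument
# names and values are assumptions and may differ in the Sygil-Dev fork.

# the image tokenizer ("first VAE" above), shared here by both stages
vae = VQGanVAE(dim = 256, vq_codebook_size = 512)

def make_transformer(seq_len):
    # transformer backbone for a MaskGit stage; seq_len must equal the number
    # of image tokens the VAE produces at that stage's image size
    return MaskGitTransformer(
        num_tokens = 512,      # must match the VAE codebook size
        seq_len = seq_len,
        dim = 512,
        depth = 8,
        dim_head = 64,
        heads = 8,
        t5_name = 't5-small',  # text encoder used for conditioning
    )

# base (low-resolution) stage -- the MaskGit currently being trained
base_maskgit = MaskGit(
    vae = vae,
    transformer = make_transformer(seq_len = 256),
    image_size = 256,
    cond_drop_prob = 0.25,     # dropout for classifier-free guidance
)

# super-resolution stage -- not yet trained for Sygil-Muse; as noted above,
# it may end up reusing the first VAE
superres_maskgit = MaskGit(
    vae = vae,
    transformer = make_transformer(seq_len = 1024),
    image_size = 512,
    cond_image_size = 256,     # conditions on the base stage's output
    cond_drop_prob = 0.25,
)

# once both stages are trained, the Muse wrapper chains them for generation
muse = Muse(base = base_maskgit, superres = superres_maskgit)
images = muse(['species:seal swimming near an iceberg'])
```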

If you find my work useful, please consider supporting me on GitHub Sponsors!

This model is still in its infancy and is meant to be constantly updated and trained with more data over time, so feel free to give us feedback on our Discord Server or in the discussions section on Hugging Face. We plan to improve it with more and better tags in the future, so any help is always welcome. Join the Discord Server.

Available Checkpoints:

Note: Checkpoints under the Beta section are updated daily, or at least 3-4 times a week. While the beta checkpoints can be used as they are, only the latest version is kept in the repo; older checkpoints are removed when a new one is uploaded to keep the repo clean.

Training

Training Data: The model was trained on a subset of the Imaginary Network Expanded Dataset.

Hardware and others

Developed by: ZeroCool at Sygil-Dev.

License

This model is open access and available to all, with a CreativeML Open RAIL++-M License further specifying rights and usage.