stable-diffusion anime aiart

This model intends to be the ultimate uma-musume model where we try to train as many things about uma-musume as possible into it.
Ok it's not true. Only characters and outfits are trained, ant not locations/objects/running-style(?) or whatever.

Example Generations

See https://civitai.com/models/13543/umamusume

Usage

You don't need to use the model as is. Treat it as lora and do either

How to prompt

Here are two example captions that are used for training

AgnesDigital; CurrenChan, tracen school uniform; fanart; 2girls, horse girl, multiple girls, character doll, phone, beads, bangs, bow, blush, animal ears, school uniform, holding phone, two side up, heart in mouth, one eye closed, white background, sweat, tracen school uniform, selfie, red bow, shirt, sailor collar, open mouth, simple background, heart, horse ears, holding, purple shirt, purple skirt, skirt, long sleeves

CurrenChan; SmartFalcon, racing suit; fanart; 2girls, horse girl, multiple girls, dress, diamond (shape), party, nail polish, red bow, see-through, teeth, frilled dress, upper teeth only, balloon, blurry, frilled collar, animal ears, timestamp, ring, black bow, pink dress, border, gem, short sleeves, puffy sleeves, heart hands, pink nails, strap, bow, lace-trimmed sleeves, black headband, chromatic aberration, white border, black dress, writing on wall, headband, frills, horse tail, chain, horse ears, puffy short sleeves, bracelet, tail, bangs, open mouth, collar, black nails, multicolored dress, multicolored clothes, depth of field, heart

Concepts

List of characters

Umamusume

Others

List of outfits

Common outfits

Character specific outfit

Racing suit (勝負服) for most characters, casual outfit for some, and also other specific costumes if they are tagged by booru. You should prompt directly with something like character, racing suit and you may need other trigger words for better results.

Styles

For richer style consider using style prompt that the model usually knows or merging with other models

Dataset description

Around 60K images containing

Training

The model is trained in two phases (resolution 512, clip skip 1) on top of ACertainty

The first phase is trained with EveryDream2

Total steps x batch is around 1.1M

The second phase is trained with naifu trainer

The goal is to train embeddings for character specific outfits. Both unets and embeddings are trained but not text encoders. This phase does not use the anime screenshots nor the regularization images. In the end, it turns out that although the model is not further trained for "charactar, racing suit" this acutally gets improved and gives more satisfying results than the embeddings. This is really mysterious and worths more investigation.