See https://huggingface.co/The-Face-Of-Goonery/Huginn-19b-prototype ?
Stheno-20B is even more stupid, uses the same technique as above, just slightly different params.
a 64-layer splice of Stheno P1 and P2.
Hey, it works... decently well.
Meme model that somehow isn't as bad as I thought.
Ty Chargoddard for mergekit.
Stheno v2 on the way soon, Euryale-70B progress stalled for now, Medusa-7B soonTM