This model is one of the result of my bachelor's thesis. It's main purpose is to detect semantic types of columns in tables containing Russian text. Also it can be used as table to vec encoder for downstream tasks.

You can find more info in this github repo https://github.com/Elluran/rudoduo.

Also check out streamlit demo https://rudoduo.streamlit.app/