SEC-BERT post-trained using company name masking on Form 10-K filings

How to use

from transformers import AutoTokenizer, AutoModel
  
tokenizer = AutoTokenizer.from_pretrained("sophia-jihye/Incorporation_of_Company-Related_Factual_Knowledge_into_Pre-trained_Language_Models")
model = AutoModel.from_pretrained("sophia-jihye/Incorporation_of_Company-Related_Factual_Knowledge_into_Pre-trained_Language_Models")

Citation

@article{park2023incorporation,
  title={Incorporation of company-related factual knowledge into pre-trained language models for stock-related spam tweet filtering},
  author={Park, Jihye and Cho, Sungzoon},
  journal={Expert Systems with Applications},
  pages={121021},
  year={2023},
  publisher={Elsevier}
}