word_sense_disambiguation

Semantic Specialization for Knowledge-based Word Sense Disambiguation

Trained Model (Projection Heads)

Argument name Value Description
max_epochs 15 Maximum number of training epochs
cfg_similarity_class.temperature ($\beta^{-1}$) 0.015625 (=1/64) Temperature parameter for the contrastive loss
batch_size ($N_B$) 256 Number of samples in each batch for the attract-repel and self-training objectives
coef_max_pool_margin_loss ($\alpha$) 0.2 Coefficient for the self-training loss
cfg_gloss_projection_head.n_layer 2 Number of FFNN layers for the projection heads
cfg_gloss_projection_head.max_l2_norm_ratio ($\epsilon$) 0.015 Hyperparameter for the distance constraint integrated in the projection heads

Sense/context embeddings

Reference

@inproceedings{Mizuki:EACL2023,
    title     = "Semantic Specialization for Knowledge-based Word Sense Disambiguation",
    author    = "Mizuki, Sakae and Okazaki, Naoaki",
    booktitle = "Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume",
    series = {EACL},
    month = may,
    year = "2023",
    address = "Dubrovnik, Croatia",
    publisher = "Association for Computational Linguistics",
    pages = "3449--3462",
}