xlm-roberta-base-tweet-sentiment-ar-trimmed-ar`

This model is a trimmed version of cardiffnlp/xlm-roberta-base-tweet-sentiment-ar by vocabtrimmer, a tool for trimming vocabulary of language models to compress the model size. Following table shows a summary of the trimming process.

	cardiffnlp/xlm-roberta-base-tweet-sentiment-ar	vocabtrimmer/xlm-roberta-base-tweet-sentiment-ar-trimmed-ar
parameter_size_full	278,045,955	124,345,347
parameter_size_embedding	192,001,536	38,300,928
vocab_size	250,002	49,871
compression_rate_full	100.0	44.72
compression_rate_embedding	100.0	19.95

Following table shows the parameter used to trim vocabulary.

language	dataset	dataset_column	dataset_name	dataset_split	target_vocab_size	min_frequency
ar	vocabtrimmer/mc4_validation	text	ar	validation		2

Vocabulary Trimmed cardiffnlp/xlm-roberta-base-tweet-sentiment-ar: vocabtrimmer/xlm-roberta-base-tweet-sentiment-ar-trimmed-ar