text-classification

debate2vec

Word-vectors created from a large corpus of competitive debate evidence, and data extraction / processing scripts

#usage

import fasttext.util
ft = fasttext.load_model('debate2vec.bin')
ft.get_word_vector('dialectics')

Download Link

Github won't let me store large files in their repos.

About

Created from all publically available Cross Examination Competitive debate evidence posted by the community on Open Evidence (From 2013-2020)

Search through the original evidence by going to debate.cards

Stats about this corpus:

Stats about debate2vec vectors:

The corpus includes the following topics

Other topics that this word vector model will handle extremely well

Initial release is of fasttext vectors without subword information. Future releases will include fine-tuned GPT-2 and other high end models as my GPU compute allows.

Screenshots