Notes on: Bojanowski, P., Grave, E., Joulin, A., & Mikolov, T. (2016): Enriching Word Vectors With Subword Information
Notation
$w_t$: target word
$w_c$: context word of $w_t$
$\mathcal{N}_{t,c}$: set of negative examples sampled from the vocabulary
$\mathcal{C}_t$: set of indices of words surrounding word $w_t$
Overview
- Consider each word to be the sum of representations of character n-grams and a special "n-gram" corresponding to the word itself
- Skipgram model, where each word is represented as a bag of character n-grams.
- A vector representation is associated with each character n-gram; a word's vector is the sum of these representations.
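
The bag-of-n-grams idea above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation. The boundary markers `<` and `>` and the n-gram lengths 3 to 6 follow the paper's description. The function names, the tiny vector dimension, and the lazy random initialisation are assumptions made for the example, and the paper's hashing of n-grams into a fixed-size table is left out.

```python
import random

_rng = random.Random(0)  # fixed seed so the sketch is reproducible


def char_ngrams(word, n_min=3, n_max=6):
    """Return the set of character n-grams of a word, plus the word itself.

    The word is wrapped in '<' and '>' so that prefixes and suffixes are
    distinguishable from n-grams in the middle of a word.
    """
    wrapped = "<" + word + ">"
    grams = {wrapped}  # the special "n-gram" for the whole word
    for n in range(n_min, n_max + 1):
        for i in range(len(wrapped) - n + 1):
            grams.add(wrapped[i:i + n])
    return grams


def word_vector(word, ngram_vectors, dim=4):
    """Sum the vectors of all n-grams of `word`.

    `ngram_vectors` is a plain dict used as a hypothetical lookup table;
    unseen n-grams get a small random vector (an assumption of this sketch).
    """
    total = [0.0] * dim
    for gram in char_ngrams(word):
        if gram not in ngram_vectors:
            ngram_vectors[gram] = [_rng.uniform(-0.5, 0.5) for _ in range(dim)]
        total = [a + b for a, b in zip(total, ngram_vectors[gram])]
    return total


table = {}
vec = word_vector("where", table)
print(sorted(char_ngrams("where", 3, 3)))
```

Because the table is keyed by n-gram rather than by word, words that share n-grams share parameters. Note that the whole-word entry `<her>` is a different key from the trigram `her` that occurs inside `where`.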