drastic impact of Changing the vocabulary on perplexity #233

Krishnkant-Swarnkar · 2020-06-13T19:07:07Z

I was trying to train ELMo on an augmented version of the 1 Billion Benchmark corpus. The augmented sentences bring some extra proper nouns into the corpus, so I added these extra proper nouns (a few thousand) to the default vocab.
I noticed that the training perplexity dropped to nearly 4 after just one epoch of training.
I noticed that the code uses a sampled softmax, so I increased "n_negative_samples_batch" by 5x. The perplexity still remains nearly the same (after 1 epoch).
Isn't that weird? Any explanations?

matt-peters · 2020-06-15T18:12:57Z

Yes that is weird. Possible explanations are:

- your augmented 1 Billion Benchmark is much easier for a language model to learn than the original 1 Billion Benchmark (and therefore perplexity really is much lower)
- it's a bug
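
For reference, a minimal sketch of how perplexity relates to per-token log-probabilities, which may help when checking either explanation by hand. The function name `perplexity` and the sample values are hypothetical and not taken from the ELMo training code; how the log-probabilities are obtained from the model is left out here. It only assumes natural-log probabilities, one per predicted token.

```python
import math


def perplexity(token_log_probs):
    """Return exp of the mean negative log-likelihood per token.

    token_log_probs: natural-log probabilities the model assigned to
    each target token (hypothetical input, one float per token).
    """
    if not token_log_probs:
        raise ValueError("need at least one token log-probability")
    mean_nll = -sum(token_log_probs) / len(token_log_probs)
    return math.exp(mean_nll)


# Illustrative values only: a perplexity near 4 means the model assigns,
# on average (geometric mean), about probability 1/4 to each target token.
example_log_probs = [math.log(0.25)] * 8
print(perplexity(example_log_probs))
```

Computing this on held-out sentences from the original (non-augmented) corpus, with probabilities from a full softmax rather than the sampled one, is one way to see whether the low number reflects an easier corpus or a problem in the pipeline.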
