query: $q_{j}$, key: $k_i$
$\Large BLEU = \frac{\text{matched number of words in candidate}}{\text{total number of words in candidate}} = \frac{7}{7}$
Summarize original sentnese by deletion-only
Summarize original sentnese by deletion and reordering
Summarize original sentnese by arbitary transformation
Ignoring properties of the original order or relationships between neighboring words
Allowing local interactions between words while also not requiring the context $y_c$ while encoding the input.
A method to evaluate a language model. A language model describes the probability distribution over whole sentnese.
If each word is specified in a sentense, the meaning of a sentense is clear. We evaluate the occurrence (probability) of words in sentense. Lower entropy means precise meaning. Small perplexity is better.
A method to evaluate machine translation and machine abstraction. It evaluates generated result and a reference (usually written by human), and further calculate the similarity between them.
the cat was found under the bed
the cat was under the bed
the
, cat
, was
, found
, under
, bed
the
, cat
, was
, under
, bed
the cat
, cat was
, was found
, found under
, under the
, the bed
the cat
, cat was
, was under
, under the
, the bed