fundamentals-machine-learning/8a-transformers

midhun 21 Reputation points
2025-06-23T14:19:23.8733333+00:00

https://learn.microsoft.com/en-us/training/modules/fundamentals-machine-learning/8a-transformers

In the image, the encoder's output is shown as a list of standalone word embeddings, e.g. "dog" - [10, 3, 2], "cat" - [10, 3, 1], "puppy" - [5, 2, 1]. However, the encoder actually produces embeddings for the tokens of the input sentence, e.g. "when" - [10, 3, 4], "my" - [10, 3, 5], "dog" - [10, 3, 6], "was" - [10, 3, 7], not a list of unrelated word embeddings like "dog", "cat", and "puppy".

This makes it confusing to understand what the decoder receives as input.

This question is related to the following Learning Module

Azure Training

Accepted answer
  1. Gowtham CP 6,020 Reputation points Volunteer Moderator
    2025-06-23T17:22:53.7633333+00:00

    Hi midhun,

    Thanks for posting on Microsoft Q&A!

    The “dog → [10,3,2]” example in the module is a static word vector, showing a word’s fixed meaning. In a real Transformer, text like “when my dog was” gets split into tokens, and each token starts with an embedding. The encoder’s self-attention then tweaks these into contextual embeddings, capturing the sentence’s meaning. The decoder uses these to generate output, like translations, by focusing on the right input parts. The module’s example is just simplified for clarity.
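    To make the distinction concrete, here is a minimal sketch of the idea described above: a toy vocabulary with made-up static 3-dimensional vectors for the tokens of "when my dog was", passed through a single unparameterized self-attention step so that each token's output vector becomes a context-weighted mix of all the input vectors. The vocabulary, vector values, and the simplified attention (no learned query/key/value projections, no positional encodings) are all illustrative assumptions, not the module's actual implementation.

    ```python
    import numpy as np

    # Static embeddings (illustrative values): one fixed vector per token,
    # regardless of the sentence the token appears in.
    static_embeddings = {
        "when": np.array([10.0, 3.0, 4.0]),
        "my":   np.array([10.0, 3.0, 5.0]),
        "dog":  np.array([10.0, 3.0, 6.0]),
        "was":  np.array([10.0, 3.0, 7.0]),
    }

    def self_attention(X):
        """Single-head self-attention without learned projections:
        each output row is a softmax-weighted mix of all input rows."""
        scores = X @ X.T / np.sqrt(X.shape[1])           # pairwise similarities
        weights = np.exp(scores - scores.max(axis=1, keepdims=True))
        weights /= weights.sum(axis=1, keepdims=True)    # row-wise softmax
        return weights @ X                               # contextual embeddings

    tokens = "when my dog was".split()
    X = np.stack([static_embeddings[t] for t in tokens])
    contextual = self_attention(X)

    for tok, vec in zip(tokens, contextual.round(2)):
        print(tok, vec)
    ```

    Note how the output vector for "dog" is no longer its static [10, 3, 6]: it has been shifted by the surrounding tokens, which is what "contextual embedding" means. The decoder attends over these per-token contextual vectors, not over a dictionary of unrelated words.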

    I hope this helps! If you have any further questions, feel free to ask.

    If the information is useful, please accept the answer and upvote it to assist other community members.


0 additional answers
