Can't position encoding be like temporal encoding

Why should the position encoding change along the dimension of the word embedding?
Shouldn't the entire embedding be multiplied elementwise by a constant value? 
Consider the sentence "john, went, to, the, hallway", doesn't it suffice to multiply element-wise "john" by a small constant value, say 0.1, and, the last word "hallway" by a larger value.
I am trying to understand the reason behind varying the weight of position encoding along the dimension of a word embedding

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can't position encoding be like temporal encoding #15

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Can't position encoding be like temporal encoding #15

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions