Theoretical Analysis of Positional Encodings in Transformer Models
17 PaulHoule 1 6/27/2025, 10:07:11 PM arxiv.org
Comments (1)
semiinfinitely · 58m ago
Kinda disappointing that RoPE, the most common PE, is given about one sentence in this work and omitted from the analysis.