RoPE - Rotary Positional Embedding
Introduction
The transformer architecture has seen a meteoric rise in its applications across various domains of machine learning. However, the architecture lacks an inherent understanding of the order or sequence of tokens. This necessitates some fo...
afterhoursresearch.com5 min read