Exploring the impact of fixed theta values in RoPE on character-level language model performance and efficiency
Rotary Positional Embedding (RoPE) is a widely used technique in Transformers, influenced by the hyperparameter theta (θ). However, the impact of varying *fixed* theta values, especially the trade-off between performance and efficiency on tasks like character-level modeling, remains under-explored....
Saved in:
| Main Authors: | Zhigao Huang, Musheng Chen, Shiyan Zheng |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Frontiers Media S.A.
2025-08-01
|
| Series: | Frontiers in Computer Science |
| Subjects: | |
| Online Access: | https://www.frontiersin.org/articles/10.3389/fcomp.2025.1626899/full |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Character Virtues: Toward a Functionalist Perspective on Character Virtue Science
by: Navrose Bajwa, et al.
Published: (2025-05-01) -
Dual vs. isolated anti-Ro antibody positivity in rheumatoid arthritis
by: Yan Ma, et al.
Published: (2025-08-01) -
Assimilating Tianmu‐1 RO Data From a 23‐Satellite Constellation to Enhance the Track Forecasts of Typhoon Gaemi (2024)
by: S. Yang, et al.
Published: (2025-07-01) -
Qualidade de vida de pessoas com pé diabético
by: Pedro Martins Lima Neto, et al.
Published: (2016-04-01) -
GRID SEARCH AND RANDOM SEARCH HYPERPARAMETER TUNING OPTIMIZATION IN XGBOOST ALGORITHM FOR PARKINSON’S DISEASE CLASSIFICATION
by: Shafa Fitria Aqilah Khansa, et al.
Published: (2025-07-01)