Are queries and keys always relevant? A case study on transformer wave functions
The dot-product attention mechanism, originally designed for natural language processing tasks, is a cornerstone of modern Transformers. It adeptly captures semantic relationships between word pairs in a sentence by computing a similarity overlap between queries and keys. In this work, we explore the...
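The core operation the abstract refers to, attention weights built from query-key similarity overlaps, can be illustrated with a minimal NumPy sketch. This is a generic scaled dot-product attention implementation under stated assumptions; the function name, array shapes, and toy inputs are illustrative and not taken from the paper.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Generic scaled dot-product attention.

    Q, K, V: arrays of shape (seq_len, d_k). Each query is compared
    against every key via a dot product; the resulting weights mix
    the value vectors.
    """
    d_k = Q.shape[-1]
    # Similarity overlap between every query and every key.
    scores = Q @ K.T / np.sqrt(d_k)               # (seq_len, seq_len)
    # Row-wise softmax turns raw scores into attention weights.
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V                            # (seq_len, d_k)

# Toy usage with random inputs (shapes are arbitrary for illustration).
rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))
out = scaled_dot_product_attention(Q, K, V)
print(out.shape)  # (4, 8)
```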
Main Authors: | Riccardo Rende, Luciano Loris Viteritti |
---|---|
Format: | Article |
Language: | English |
Published: | IOP Publishing, 2025-01-01 |
Series: | Machine Learning: Science and Technology |
Online Access: | https://doi.org/10.1088/2632-2153/ada1a0 |
Similar Items

- Study on Test‐Mass Charging for Taiji Gravitational Wave Observatory
  by: Ruilong Han, et al.
  Published: (2024-01-01)
- Enhancing Deepfake Detection Through Quantum Transfer Learning and Class-Attention Vision Transformer Architecture
  by: Bekir Eray Katı, et al.
  Published: (2025-01-01)
- METHODOLOGY FOR DATA COLLECTION IN SIMULATED IRRADIATIONS USING MONTE CARLO TECHNIQUES ON DICOM IMAGES FOR X-RAY FLUORESCENCE EMISSIONS LOCALIZATION IN HIGH ATOMIC NUMBER MATERIALS
  by: Nicolás E. Martín, et al.
  Published: (2024-12-01)
- Variation Comparison of OLS and GLS Estimators using Monte Carlo Simulation of Linear Regression Model with Autoregressive Scheme
  by: Sajid AliKhan, et al.
  Published: (2021-02-01)
- Query-Based Instance Segmentation with Dual Attention Transformer for Autonomous Vehicles
  by: Aya Taourirte, et al.
  Published: (2024-12-01)