Saved in:
Bibliographic Details
Main Authors: Sun, Chuanhao, Yuan, Zhihang, Xu, Kai, Mai, Luo, Siddharth, N., Chen, Shuo, Marina, Mahesh K.
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.09370
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917724364472320
author Sun, Chuanhao
Yuan, Zhihang
Xu, Kai
Mai, Luo
Siddharth, N.
Chen, Shuo
Marina, Mahesh K.
author_facet Sun, Chuanhao
Yuan, Zhihang
Xu, Kai
Mai, Luo
Siddharth, N.
Chen, Shuo
Marina, Mahesh K.
contents Fourier features based positional encoding (PE) is commonly used in machine learning tasks that involve learning high-frequency features from low-dimensional inputs, such as 3D view synthesis and time series regression with neural tangent kernels. Despite their effectiveness, existing PEs require manual, empirical adjustment of crucial hyperparameters, specifically the Fourier features, tailored to each unique task. Further, PEs face challenges in efficiently learning high-frequency functions, particularly in tasks with limited data. In this paper, we introduce sinusoidal PE (SPE), designed to efficiently learn adaptive frequency features closely aligned with the true underlying function. Our experiments demonstrate that SPE, without hyperparameter tuning, consistently achieves enhanced fidelity and faster training across various tasks, including 3D view synthesis, Text-to-Speech generation, and 1D regression. SPE is implemented as a direct replacement for existing PEs. Its plug-and-play nature lets numerous tasks easily adopt and benefit from SPE.
format Preprint
id arxiv_https___arxiv_org_abs_2407_09370
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding
Sun, Chuanhao
Yuan, Zhihang
Xu, Kai
Mai, Luo
Siddharth, N.
Chen, Shuo
Marina, Mahesh K.
Machine Learning
Fourier features based positional encoding (PE) is commonly used in machine learning tasks that involve learning high-frequency features from low-dimensional inputs, such as 3D view synthesis and time series regression with neural tangent kernels. Despite their effectiveness, existing PEs require manual, empirical adjustment of crucial hyperparameters, specifically the Fourier features, tailored to each unique task. Further, PEs face challenges in efficiently learning high-frequency functions, particularly in tasks with limited data. In this paper, we introduce sinusoidal PE (SPE), designed to efficiently learn adaptive frequency features closely aligned with the true underlying function. Our experiments demonstrate that SPE, without hyperparameter tuning, consistently achieves enhanced fidelity and faster training across various tasks, including 3D view synthesis, Text-to-Speech generation, and 1D regression. SPE is implemented as a direct replacement for existing PEs. Its plug-and-play nature lets numerous tasks easily adopt and benefit from SPE.
title Learning High-Frequency Functions Made Easy with Sinusoidal Positional Encoding
topic Machine Learning
url https://arxiv.org/abs/2407.09370