Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations

AuthorsKrishna Subramani†, Paris Smaragdis†, Takuya Higuchi, Mehrez Souden

View publication

This paper was accepted at the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA) 2025

Non-negative Matrix Factorization (NMF) is a powerful technique for analyzing regularly-sampled data, i.e., data that can be stored in a matrix. For audio, this has led to numerous applications using time-frequency (TF) representations like the Short-Time Fourier Transform. However extending these applications to irregularly-spaced TF representations, like the Constant-Q transform, wavelets, or sinusoidal analysis models, has not been possible since these representations cannot be directly stored in matrix form. In this paper, we formulate NMF in terms of learnable functions (instead of vectors) and show that NMF can be extended to a wider variety of signal classes that need not be regularly sampled.

† University of Illinois at Urbana-Champaign

Related readings and updates.

Learning Spatially-Aware Language and Audio Embeddings

December 9, 2024research area Methods and Algorithms, research area Speech and Natural Language Processingconference NeurIPS

Humans can picture a sound scene given an imprecise natural language description. For example, it is easy to imagine an acoustic environment given a phrase like “the lion roar came from right behind me!”. For a machine to have the same degree of comprehension, the machine must know what a lion is (semantic attribute), what the concept of “behind” is (spatial attribute) and how these pieces of linguistic information align with the semantic and…

Apple Workshop on Privacy-Preserving Machine Learning 2024

August 29, 2024research area Privacy

At Apple, we believe privacy is a fundamental human right. It’s also one of our core values, influencing both our research and the design of Apple’s products and services.

Understanding how people use their devices often helps in improving the user experience. However, accessing the data that provides such insights — for example, what users type on their keyboards and the websites they visit — can compromise user privacy. We develop system…

Rethinking Non-Negative Matrix Factorization with Implicit Neural Representations

Related readings and updates.

Learning Spatially-Aware Language and Audio Embeddings

Apple Workshop on Privacy-Preserving Machine Learning 2024

Discover opportunities in Machine Learning.