Implicit Greedy Rank Learning in Autoencoders via Overparameterized Linear Networks

AuthorsShih-Yu Sun, Vimal Thilak, Etai Littwin, Omid Saremi, Joshua M. Susskind

This paper was accepted at the workshop on Overparameterization: Pitfalls and Opportunities at the ICML 2021 conference.

Deep linear networks trained with gradient descent yield low rank solutions, as is typically studied in matrix factorization. In this paper, we take a step further and analyze implicit rank regularization in autoencoders. We show greedy learning of low-rank latent codes induced by a linear sub-network at the autoencoder bottleneck. We further propose orthogonal initialization and principled learning rate adjustment to mitigate sensitivity of training dynamics to spectral prior and linear depth. With linear autoencoders on synthetic data, our method converges stably to ground-truth latent code rank. With nonlinear autoencoders, our method converges to latent ranks optimal for downstream classification and image sampling.

Related readings and updates.

Towards Automatic Assessment of Self-Supervised Speech Models Using Rank

March 5, 2025research area Speech and Natural Language Processingconference ICASSP

This study explores using embedding rank as an unsupervised evaluation metric for general-purpose speech encoders trained via self-supervised learning (SSL). Traditionally, assessing the performance of these encoders is resource-intensive and requires labeled data from the downstream tasks. Inspired by the vision domain, where embedding rank has shown promise for evaluating image encoders without tuning on labeled downstream data, this work…

AGRaME: Any Granularity Ranking with Multi-Vector Embeddings

June 4, 2024research area Knowledge Bases and Search, research area Methods and Algorithms

Ranking is a fundamental and popular problem in search. However, existing ranking algorithms usually restrict the granularity of ranking to full passages or require a specific dense index for each desired level of granularity. Such lack of flexibility in granularity negatively affects many applications that can benefit from more granular ranking, such as sentence-level ranking for open-domain question-answering, or proposition-level ranking for…

Implicit Greedy Rank Learning in Autoencoders via Overparameterized Linear Networks

Related readings and updates.

Towards Automatic Assessment of Self-Supervised Speech Models Using Rank

AGRaME: Any Granularity Ranking with Multi-Vector Embeddings

Discover opportunities in Machine Learning.