Revisiting Non-separable Binary Classification and its Applications in Anomaly Detection

AuthorsMatthew Lau, Ismaila Seck, Athanasios Meliopoulos, Wenke Lee, Eugene Ndiaye

The inability to linearly classify XOR has motivated much of deep learning. We revisit this age-old problem and show that linear classification of XOR is indeed possible. Instead of separating data between halfspaces, we propose a slightly different paradigm, equality separation, that adapts the SVM objective to distinguish data within or outside the margin. Our classifier can then be integrated into neural network pipelines with a smooth approximation. From its properties, we intuit that equality separation is suitable for anomaly detection. To formalize this notion, we introduce closing numbers, a quantitative measure on the capacity for classifiers to form closed decision regions for anomaly detection. Springboarding from this theoretical connection between binary classification and anomaly detection, we test our hypothesis on supervised anomaly detection experiments, showing that equality separation can detect both seen and unseen anomalies.

Related readings and updates.

Reasoning-based Anomaly Detection Framework: A Real-time, Scalable, and Automated Approach to Anomaly Detection Across Domains

October 8, 2025research area Methods and Algorithms, research area Tools, Platforms, Frameworks

Detecting anomalies in large, distributed systems presents several challenges. The first challenge arises from the sheer volume of data that needs to be processed. Flagging anomalies in a high-throughput environment calls for a careful consideration of both algorithm and system design. The second challenge comes from the heterogeneity of time-series datasets that leverage such a system in production. In practice, anomaly detection systems are…

SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets

November 8, 2023research area Computer Vision, research area Methods and Algorithmsconference ACM SIGSPATIAL

*Equal Contributors

We propose a Self-supervised Anomaly Detection technique, called SeMAnD, to detect geometric anomalies in Multimodal geospatial datasets. Geospatial data comprises acquired and derived heterogeneous data modalities that we transform to semantically meaningful, image-like tensors to address the challenges of representation, alignment, and fusion of multimodal data. SeMAnD is comprised of (i) a simple data augmentation…

Revisiting Non-separable Binary Classification and its Applications in Anomaly Detection

Related readings and updates.

Reasoning-based Anomaly Detection Framework: A Real-time, Scalable, and Automated Approach to Anomaly Detection Across Domains

SeMAnD: Self-Supervised Anomaly Detection in Multimodal Geospatial Datasets

Discover opportunities in Machine Learning.