HACKOBAR_item
[r/MachineLearning]score: 0.16

Anomaly Detection Belongs in Your Database — built SIMD-accelerated isolation forests into Stratum's SQL engine [P]

May 5, 2026
Stratum, an Apache 2.0 JVM columnar analytics engine, now ships native isolation forest anomaly detection callable directly via SQL using an ANOMALY_SCORE() function. The implementation uses Java's Vector API for SIMD acceleration, achieving 6 microseconds per-record scoring fused into query execution with zone map pruning and chunked streaming. This eliminates Python export pipelines entirely, outperforming PyOD and scikit-learn baselines per published benchmarks. Data engineers running JVM-based stacks handling fraud detection or operational monitoring should evaluate this as a zero-overhead alternative to sidecar ML inference services.
project