Outlier/Anomaly Detection for large dataset

Apoorv57 · October 24, 2024, 6:41pm

Hi all, I am working on a problem to detect outliers using Anomaly detection. While working on Python we were using “Isolation Forest” for the same. Now we are moving to larger datasets, and the data would be in GBs and TBs.

What would be the best framework here Spark, Dask or Distributed Tensorflow? And what ML model would be compatible to be used with the said framework.

Do let me know if you need any more information from me.

Thanks

Topic		Replies	Views
Anomaly detection-model selection AI Discussions ai-discussions , project	2	119	August 13, 2024
Anomaly Detection Unsupervised Learning, Recommenders, Reinforcement week-1	1	520	August 11, 2022
Many outliers vs real data Unsupervised Learning, Recommenders, Reinforcement week-1	2	430	June 7, 2023
Anomaly detection using Gaussian vs isolation forest/SVM Unsupervised Learning, Recommenders, Reinforcement week-1	1	513	August 11, 2022
Anomaly Detection Algorithm Unsupervised Learning, Recommenders, Reinforcement week-1	1	504	August 31, 2022

Outlier/Anomaly Detection for large dataset

Related topics