Problem: You have a large multivariate dataset and want a list of its outliers.
Outlier detection is a substantial area of statistics with a lot of theory underpinning it, but there is a simple, quick way to do it: the interquartile range (IQR) rule, which flags any value more than 1.5 times the IQR below the first quartile or above the third quartile.
Read the linked PDF for a simple summary with an example.
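As a rough sketch of what the IQR rule computes, here is a version using only the Python standard library. The function name iqr_outliers and the sample numbers are made up for illustration, and the 1.5 multiplier is the conventional default rather than anything fixed by the rule.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # method="inclusive" interpolates linearly between data points,
    # treating the sample minimum and maximum as the 0th and 100th percentiles.
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical sample: six ordinary amounts and one extreme value.
sample = [10, 12, 11, 13, 12, 11, 95]
flagged = iqr_outliers(sample)
print(flagged)
```

Raising k makes the rule more tolerant, lowering it flags more points.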
boxplot_stats from matplotlib.cbook
Returns a list of dictionaries of statistics, one dictionary per dataset; the "fliers" entry of each dictionary holds the outliers.
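Here is a quick example:
    from matplotlib.cbook import boxplot_stats
    # data.AMT is one numeric column of the dataset
    st = boxplot_stats(data.AMT)
    # boxplot_stats returns a list, so take the first (only) dictionary
    outliers = st[0]["fliers"]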
I like this because it is quick and needs no external libraries apart from matplotlib.
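For a multivariate dataset, the same call can be repeated per column. The following is a minimal sketch under a few assumptions: the column names AMT and QTY and their values are invented for illustration, the columns are held in a plain dict of NumPy arrays (NumPy is installed alongside matplotlib), and boxplot_stats is left at its default whisker setting of 1.5 times the IQR.

```python
import numpy as np
from matplotlib.cbook import boxplot_stats

# Hypothetical columns; in practice these would come from your own dataset.
columns = {
    "AMT": np.array([10, 12, 11, 13, 12, 11, 95]),
    "QTY": np.array([5, 6, 5, 7, 6, 5, -20]),
}

outliers = {}
for name, values in columns.items():
    # One 1-D input gives a list with a single dictionary of statistics.
    stats = boxplot_stats(values)[0]
    # "fliers" holds the points beyond the whiskers.
    outliers[name] = stats["fliers"].tolist()

print(outliers)
```

Note that this treats each column separately, so it finds values that are extreme within one variable, not unusual combinations across variables.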