Skip to content

Commit

Permalink
Update Readme.md
Browse files Browse the repository at this point in the history
  • Loading branch information
pavitraag authored Jul 7, 2024
1 parent 51121df commit 1cd2d8e
Showing 1 changed file with 35 additions and 0 deletions.
35 changes: 35 additions & 0 deletions Outlier Detection/Readme.md
Original file line number Diff line number Diff line change
@@ -1 +1,36 @@
### Outliers in Data




An outlier is a data point that significantly deviates from the majority of the data, either being much higher or lower. Outliers can result from measurement or execution errors and can significantly impact machine learning algorithms. Analyzing outliers is known as outlier analysis or outlier mining.




### Importance of Outlier Detection in Machine Learning




1. **Biased Models**: Outliers can skew a model towards these extreme values, leading to poor generalization.

2. **Reduced Accuracy**: Outliers introduce noise, complicating the detection of true data patterns and reducing model accuracy.

3. **Increased Variance**: Outliers increase model sensitivity to data changes, reducing stability and reliability.

4. **Reduced Interpretability**: Outliers obscure model insights, making predictions less trustworthy and impeding performance improvements.




### Techniques for Outlier Detection




1. **Percentile Method**: Identifies outliers based on their position relative to a certain percentile range.

2. **Z-Score Method**: Detects outliers by calculating how many standard deviations a data point is from the mean.

3. **IQR Method**: Uses the interquartile range to identify outliers as data points lying outside 1.5 times the IQR above the third quartile or below the first quartile.

0 comments on commit 1cd2d8e

Please sign in to comment.