Definition
The initial investigation of data to discover patterns spot anomalies and form hypotheses.
Detailed Explanation
EDA involves using visual and statistical methods to understand data characteristics relationships between variables and potential issues. It typically includes summary statistics distribution analysis correlation studies and various plotting techniques to gain insights before formal modeling.
Use Cases
1. Research hypothesis generation 2. Data quality assessment 3. Variable relationship discovery 4. Outlier detection
