This project demonstrates the use of Pointblank for Python - a powerful library for data validation, testing and documenting data quality. As a learning exercise, I applied data validation checks to a sample dataset from the FARS (Fatality Analysis Reporting System), focusing specifically on the Drugs data. The goal is to understand how to implement column-level validation rules, build a validation report, and ensure the integrity of real-world data using Python.
Data used in this project comes from the NHTSA FARS data and can be accessed here.
The validation report generated from the pointblank package can be accessed via this URL.
```bash
git clone https://github.com/anchalab/explore-pointblank.git
cd explore-pointblank
pip install -r requirements.txt
jupyter notebook
Navigate to the code/ directory in the Jupyter interface. Open and run the drugs_data_validation.ipynb file to explore the Pointblank validations.