see case study at https://www.a-moscatelli.info/cookbook/rule-mining-1
projects based on docker:
https://hub.docker.com/r/jupyter/pyspark-notebook
for non-docker-setups:
pip install findspark
use cases
use case 1:
(financial risk mgmt)
show the change in PV for each trading desk across software versions v1/v2
(or dates d1/d2)
files:
pvdiff.ipynb + its exports: .html and .py
https://github.com/a-moscatelli/pyspark/tree/main/dataframe-pivot (jupyter/python)
files:
FPG-FPmining-on-pvdiffs.ipynb + its exports: .html and .py
https://github.com/a-moscatelli/pyspark/tree/main/MLLIB-FPmining (jupyter/python)