Quick exploratory analysis for BioKind's donor/marketing support work with NYCF (New York Cancer Foundation).
- Source workbook:
data/NYCFBiokindData.xlsx - Converted CSV:
data/NYCFBiokindData_Sheet1.csv - Analysis script:
analysis/analyze_nycf_data.py
- Total records: 4,462
- Columns: 10
- Organization field: all rows are
New York Cancer Foundation - Status split:
Active: 4,197DoNotContact: 265
- Donation distribution:
- Parsed donation values for all 4,462 rows
- Total donations: $10,658,637.30
- Mean: $2,388.76
- Median: $0.00
- Max: $325,000.00
- Rows with
$0donation: 2,494
0: 2,4941-25: 23426-100: 509101-500: 450501-2.5k: 4202.5k-10k: 18610k+: 169
- Missing values are high in location fields:
city: 56.4% missingstate: 56.7% missingzip code: 56.6% missing
extensionis 100% missingphone typeis almost entirelyUnknown
- Donor segmentation: build target groups (non-donor, small, mid, major).
- Major donor strategy: prioritize the
10k+cohort and identify upgrade candidates in501-2.5k. - Reactivation campaign: target
$0segment with tailored outreach. - Data cleanup/enrichment: improve city/state/zip and contact fields before deeper campaign attribution analysis.