18

NYC Ride Share

Super computer project using HPC/ Slurm

test

Skills

Slurm/ HPC, R (tidyverse) / RStudio, Parquet (Arrow library), Condor (initial usage), etc.

test

Summary:

  • Worked collaboratively to address: which For-Hire Ride Service in NYC is most favorable for drivers?
  • Collected data used to study congestion from NYC.gov websites containing 46 Parquet files
  • Converted source files from Apache Parquet format using an R script with the package Arrow
  • Ran parallel computation using Slurm job scheduler on the HPC at the University of Wisconsin-Madison