What is this project?
Project title: Taffic Patterns in New York City
Team members: Richard Sowers, Derrek Yager, Vaibhav Karve, Marzieh Abolhelm.
What does this code do?
- It factorizes a matrix D into two smaller matrices W and H such that:
- D, W, H all have non-negative entries
- Column sum of W is 1 for each column
- H is sparse
- It applies matrix factorization to study traffic patterns of taxi-travel in New York City.
Minimal requirements for running this code?
- System requirements:
- Python3
- Jupyter Notebook
- Go to https://databank.illinois.edu/datasets/IDB-4900670 for necessary files and import into DataFiles/:
- nodes.csv
- links.csv
- travel_times_2011.csv
How is this repository organized?
- ./ (Home directory: DataFiles -> ReadData -> MultiplicativeAlgorithm)
- ReadData
- Archive
- MultiplicativeAlgorithm
- DataFiles
- ReadData/ (Contains ReadData.ipynb for importing data into matrix format)
- MultiplicativeAlgorithm/ (Contains the meat)
- CSNMF.ipynb does most of the work
Importing into Jupyter Notebook
For example we can import everything in EndChecker.py
to a notebook by adding a line as such:
from util.EndChecker import *