Of these, full_link_ids.txt and D_trips.txt are important for running CSNMF. D_traveltimes.txt can be used in the CSNMF algorithm as well, but
that is not currently functional due to bad hyperparameters.
Of these, full_link_ids.txt and D_trips.txt are important for running CSNMF. D_traveltimes.txt can be used in the CSNMF algorithm as well
but that is not currently functional due to bad hyperparameters.
4. Run all cells in MultiplicativeAlgorithm/cSNMF.ipynb.
- config.py contains global variables set to current dataset. Some need to be fixed, e.g. there are 260855 links and 8760 hours, while others can be modified by the user like the rank of the decomposition and if the user wants to run the seeded or randomized algorithm.
- config.py contains global variables set to current dataset. Some variables are constant due to the dataset, e.g. there are 260855 links and 8760 hours, while other variables can be modified by the user, like the rank of the decomposition and if the user wants to run the seeded or randomized algorithm.
-\_\_init\_\_.py initializes logger
- The cell running cSNMF.factorize() uses global variables set in config.py, but can be overriden for experimentation.
- The W and H matrices are saved as txt files to MultiplicativeAlgorithm/.