- Apr 29, 2018
-
-
Nischol Antao authored
Finished up Performance_Tracking for question 3b. Moved py files from the notebooks folder to the src folder. Renamed results to match the naming conventions for other visualizations. Created ipython notebooks for question 3b
-
Nischol Antao authored
-
Nischol Antao authored
-
antao authored
-
Nischol Antao authored
-
- Apr 28, 2018
-
-
rkr2 authored
-
rkr2 authored
-
Nischol Antao authored
-
rkr2 authored
-
- Apr 27, 2018
-
-
rkr2 authored
-
- Apr 25, 2018
-
-
Nischol Antao authored
-
Nischol Antao authored
-
Nischol Antao authored
-
rkr2 authored
-
rkr2 authored
-
rkr2 authored
-
rkr2 authored
-
rkr2 authored
-
rkr2 authored
-
antao authored
-
antao authored
-
- Apr 24, 2018
-
-
Nischol Antao authored
-
Nischol Antao authored
1) Ran Code for questions 1-3 on pyspark in cluster mode, with multiple nodes. Measured and captured the difference in performance between running it on a single EC-2 instance, and running it on a cluster. 2) Added some screenshots for the final report, to show the cluster configuration. 3) Added ipython notebooks for performance metrics in local mode. 4) Added json files for zeppelin notebooks 5) Created new source files for Code pyspark code run in zeppelin notebooks, in cluster mode 6) Added test results for question 3 when using hive to calculate the median data. 7) Added R code from Rob for question 3 local exploration 8) Renamed some of the local exploration files
-
antao authored
-
Nischol Antao authored
-
Nischol Antao authored
-
- Apr 23, 2018
-
-
Nischol Antao authored
-
- Apr 22, 2018
-
-
Nischol Antao authored
Completed Visualizations for Question 2. Made some small edits to the Vizzes for question 1. Questions 1 and 2 are now completely ready.
-
antao authored
-
Nischol Antao authored
Finished Code, and iPython notebooks for Question 2. Will work on completing the Viz for this Question next.
-
Nischol Antao authored
-
Nischol Antao authored
-
antao authored
-
Nischol Antao authored
Added source , ipython notebook , mark down and results for question 2. Will add a bit more results and visualization tomorrow.
-
- Apr 21, 2018
-
-
Nischol Antao authored
Added Guidelines for the final report, and updated the report to include the new sections we should include.
-
Nischol Antao authored
Completed Code and Visualizations for Question 1. Need to update final report with findings of Question 1
-
antao authored
-
Nischol Antao authored
-
antao authored
-