1. In one tab run: python httpProxy.py > consoleLog # there is a lot of console log generated 2. In second tab run: ./runPhantom.sh 3. Output: mainData #contains essential data: (Request - host - subdomain - GET/POST - resource) (Response - resource size - resource type) extraData #details headers/ cookies etc. currently i am running a shuffle separately on the list and then rerunning steps 1 and 2 and fixing the script to sort the output to get domain-based, content-type-based upstream and downstream data to do: to get the sorting done during runtime itself