ctralie · Jul 16, 2016
diff --git a/4-Video.ipynb b/4-Video.ipynb (+151 -5)
diff --git a/KTH/boxing/person01_boxing_d1_uncomp.ogg b/KTH/boxing/person01_boxing_d1_uncomp.ogg (731 KB)
diff --git a/KTH/handwaving/person02_handwaving_d2_uncomp.ogg b/KTH/handwaving/person02_handwaving_d2_uncomp.ogg (1.02 MB)
diff --git a/KTH/walking/person01_walking_d1_uncomp.ogg b/KTH/walking/person01_walking_d1_uncomp.ogg (1.01 MB)
diff --git a/KTHTests.py b/KTHTests.py (-1)
@@ -13,7 +13,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 1,
+   "execution_count": null,
    "metadata": {
     "collapsed": false
    },
@@ -218,7 +218,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": null,
    "metadata": {
     "collapsed": false
    },
@@ -240,7 +240,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": null,
    "metadata": {
     "collapsed": false
    },
@@ -259,8 +259,154 @@
    "source": [
     "As you can see, the maximum persistence peaks at around 40 frames, which is the period of each hand wave.  This is what the theory we developed for 1D time series would have predicted as the roundest window.<BR>\n",
     "\n",
-    "<h2>KTH Dataset Rankings</h2><BR>\n",
-    "For the final experiment, students will split up into groups and run code on different subsets of the KTH dataset which ranks the video clips in decreasing order of periodicity.  As an example, <a href = \"VideoResults/index.html\">click here</a> to see the rankings of all activities for the first 4 subjects.  Groups will run the code in <a href = \"KTHTests.py\">KTHTests.py</a> after modifying it to go through the appropriate subset of the database by changing lines 53 through 55.  A new web page will be generated <a href = \"VideoResults/index.html\">here</a> to show the resulting rankings.  Note that a fixed window length of 20 frames is maintained throughout all of the experiments.  Feel free to tweak the window size (\"win\" on line 72), the dimension of the embedding (\"dim\" on line 73), or any other parameters you think would make the results more meaningful."
+    "<h2>KTH 4 Video Ranking</h2>\n",
+    "<BR>\n",
+    "Now, students will visually rank 4 videos from the database from most periodic to least periodic.  We will then run TDA on the videos and compare the ranking based on maximum persistence to the rank-aggregated class estimate.  Run the cells below to load the four videos.\n"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [],
+   "source": [
+    "#First video\n",
+    "videos = ['KTH/handwaving/person01_handwaving_d1_uncomp.ogg']\n",
+    "video = io.open(videos[-1], 'rb').read()\n",
+    "encoded = base64.b64encode(video)\n",
+    "HTML(data='''<h1>Video 1</h1><video alt=\"test\" controls>\n",
+    "                <source src=\"data:video/ogg;base64,{0}\" type=\"video/ogg\" />\n",
+    "             </video>'''.format(encoded.decode('ascii')))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [],
+   "source": [
+    "videos.append('KTH/boxing/person01_boxing_d1_uncomp.ogg')\n",
+    "video = io.open(videos[-1], 'rb').read()\n",
+    "encoded = base64.b64encode(video)\n",
+    "HTML(data='''<h1>Video 2</h1><video alt=\"test\" controls>\n",
+    "                <source src=\"data:video/ogg;base64,{0}\" type=\"video/ogg\" />\n",
+    "             </video>'''.format(encoded.decode('ascii')))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [],
+   "source": [
+    "videos.append('KTH/walking/person01_walking_d1_uncomp.ogg')\n",
+    "video = io.open(videos[-1], 'rb').read()\n",
+    "encoded = base64.b64encode(video)\n",
+    "HTML(data='''<h1>Video 3</h1><video alt=\"test\" controls>\n",
+    "                <source src=\"data:video/ogg;base64,{0}\" type=\"video/ogg\" />\n",
+    "             </video>'''.format(encoded.decode('ascii')))"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [],
+   "source": [
+    "videos.append('KTH/handwaving/person02_handwaving_d2_uncomp.ogg')\n",
+    "video = io.open(videos[-1], 'rb').read()\n",
+    "encoded = base64.b64encode(video)\n",
+    "HTML(data='''<h1>Video 4</h1><video alt=\"test\" controls>\n",
+    "                <source src=\"data:video/ogg;base64,{0}\" type=\"video/ogg\" />\n",
+    "             </video>'''.format(encoded.decode('ascii')))"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Now that you have loaded and watched the videos, go to the following Google spreadsheet and enter your ranking, from most periodic to least periodic:<BR><BR>\n",
+    "\n",
+    "<a href = \"https://docs.google.com/spreadsheets/d/1L59t7oO6jiFHlrxSMnMGwnrKNdqYebTO_30I7nhcFgg/edit?usp=sharing\">https://docs.google.com/spreadsheets/d/1L59t7oO6jiFHlrxSMnMGwnrKNdqYebTO_30I7nhcFgg/edit?usp=sharing</a><BR><BR>\n",
+    "\n",
+    "Now, run the code below to compute a ranking based on delay embeddings and persistent homology.  The code goes through each video in blocks of 160 frames, hopping forward 80 frames to the next block, and it uses a fixed window size of 20 frames for all videos.  You can tweak these parameters if you'd like.  At the end, it records, for each video, the largest maximum persistence over all of its 160-frame blocks."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {
+    "collapsed": false
+   },
+   "outputs": [],
+   "source": [
+    "from KTHTests import *\n",
+    "\n",
+    "#Setup video blocks\n",
+    "BlockLen = 160\n",
+    "BlockHop = 80\n",
+    "win = 20\n",
+    "dim = 20\n",
+    "\n",
+    "scores = np.zeros(len(videos))\n",
+    "#Loop through each video and record the maximum persistence score\n",
+    "for i in range(len(videos)):\n",
+    "    (XOrig, FrameDims) = loadVideo(videos[i])\n",
+    "    X = getPCAVideo(XOrig)\n",
+    "    [X, validIdx] = getTimeDerivative(X, 10)\n",
+    "\n",
+    "    idxs = []\n",
+    "    N = X.shape[0]\n",
+    "    NBlocks = int(np.ceil(1 + (N - BlockLen)/float(BlockHop)))\n",
+    "    print(\"NBlocks = %i\" % NBlocks)\n",
+    "    for k in range(NBlocks):\n",
+    "        thisidxs = np.arange(k*BlockHop, k*BlockHop+BlockLen, dtype=np.int64)\n",
+    "        thisidxs = thisidxs[thisidxs < N]\n",
+    "        idxs.append(thisidxs)\n",
+    "\n",
+    "    res = np.zeros(NBlocks)\n",
+    "\n",
+    "    #Get sliding window video in blocks\n",
+    "    maxXS = []\n",
+    "    maxPD = []\n",
+    "    for j in range(len(idxs)):\n",
+    "        idx = idxs[j]\n",
+    "        Tau = win/float(dim-1)\n",
+    "        dT = (len(idx)-dim*Tau)/float(len(idx))\n",
+    "        XS = getSlidingWindowVideo(X[idx, :], dim, Tau, dT)\n",
+    "\n",
+    "        #Mean-center and normalize sliding window\n",
+    "        XS = XS - np.mean(XS, 1)[:, None]\n",
+    "        XS = XS/np.sqrt(np.sum(XS**2, 1))[:, None]\n",
+    "\n",
+    "        PDs = doRipsFiltration(XS, 1)\n",
+    "        if len(PDs) < 2:\n",
+    "            continue\n",
+    "        if PDs[1].size > 0:\n",
+    "            res[j] = np.max(PDs[1][:, 1] - PDs[1][:, 0])\n",
+    "            if res[j] > scores[i]:\n",
+    "                scores[i] = res[j]\n",
+    "                maxXS = np.array(XS)\n",
+    "                maxPD = np.array(PDs[1])\n",
+    "\n",
+    "print(\"\\n\\n\\n\\n-----------------------\\n        RESULTS\\n-----------------------\\n\")\n",
+    "print(np.argsort(-scores)+1)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "<h2>KTH Dataset Batch Rankings (Optional)</h2><BR>\n",
+    "For an optional final experiment, students will split up into groups and run code that ranks the video clips in different subsets of the KTH dataset in decreasing order of periodicity.  As an example, <a href = \"VideoResults/index.html\">click here</a> to see the rankings of all activities for the first 4 subjects.  Groups will run the code in <a href = \"KTHTests.py\">KTHTests.py</a> after modifying it to go through the appropriate subset of the database by changing lines 53 through 55.  A new web page will be generated <a href = \"VideoResults/index.html\">here</a> to show the resulting rankings.  Note that a fixed window length of 20 frames is maintained throughout all of the experiments.  Feel free to tweak the window size (\"win\" on line 72), the dimension of the embedding (\"dim\" on line 73), or any other parameters you think would make the results more meaningful."
    ]
   },
   {
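The scoring step in the new notebook cell reduces each video to one number: the largest H1 persistence (death minus birth) seen in any of its blocks. The videos are then listed from highest to lowest score using 1-based video numbers. The sketch below illustrates only that reduction, in plain Python rather than NumPy. It assumes each block yields a diagram as a list of (birth, death) pairs; the helper names `max_persistence` and `rank_videos` and the sample diagrams are hypothetical and are not part of KTHTests.py.

```python
def max_persistence(diagram):
    """Largest (death - birth) among (birth, death) pairs; 0.0 for an empty diagram."""
    return max((death - birth for birth, death in diagram), default=0.0)


def rank_videos(block_diagrams_per_video):
    """Rank videos from most to least periodic.

    block_diagrams_per_video[i] holds one H1 diagram per block of video i.
    Returns (ranking, scores), where ranking lists 1-based video numbers.
    """
    scores = [
        max((max_persistence(d) for d in blocks), default=0.0)
        for blocks in block_diagrams_per_video
    ]
    # Sort video indices by decreasing score, then shift to 1-based numbering
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    return [i + 1 for i in order], scores


# Hypothetical diagrams: three videos with one or two blocks each
example = [
    [[(0.0, 0.2)]],                  # video 1: one block
    [[(0.1, 0.4)], [(0.2, 0.9)]],    # video 2: two blocks
    [[], [(0.3, 0.6)]],              # video 3: first block has no H1 classes
]
ranking, scores = rank_videos(example)
print(ranking)  # [2, 3, 1]
```

A block with an empty diagram contributes a score of 0, which matches the notebook cell leaving `res[j]` at its initial value of zero in that case.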
 
@@ -80,7 +80,6 @@ def getTimeDerivative(I, Win):
             thisidxs = thisidxs[thisidxs < N]
             idxs.append(thisidxs)
 
-        wins = np.arange(2, 50)
         res = np.zeros(NBlocks)
 
         #Get sliding window video in blocks
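The block layout used in this hunk and in the notebook cell (blocks of `BlockLen` frames, advancing by `BlockHop`, with the last block truncated at the end of the video) can be checked in isolation. The following is a minimal sketch under stated assumptions: `block_ranges` is a hypothetical helper, and the `max(1, ...)` guard and the early `break` are additions for short clips and large hops that the original code does not have.

```python
import math


def block_ranges(n_frames, block_len=160, block_hop=80):
    """Return (start, stop) frame ranges, stop exclusive, covering n_frames."""
    if n_frames <= 0:
        return []
    # Same block count as the notebook, with true division forced via float().
    # max(1, ...) is an added guard so very short clips still yield one block.
    n_blocks = max(1, int(math.ceil(1 + (n_frames - block_len) / float(block_hop))))
    ranges = []
    for k in range(n_blocks):
        start = k * block_hop
        if start >= n_frames:
            break  # added guard: can only trigger when block_hop > block_len
        # Truncate the final block at the end of the video
        stop = min(start + block_len, n_frames)
        ranges.append((start, stop))
    return ranges


print(block_ranges(400))  # [(0, 160), (80, 240), (160, 320), (240, 400)]
print(block_ranges(450)[-1])  # (320, 450)
```

With the default 160/80 settings, consecutive blocks overlap by half, so every frame after the first 80 is covered by two blocks, except possibly those in the truncated tail.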