Paper Reading: Thin-Slicing Network

Note: this post is only meant for personal digestion and interpretation. It is incomplete and may mislead readers.


Spatio-temporal inference layer



Solved via generalized distance transform


  1. Training fully convolutional layers
  2. Joint training with flow warping and inference layers

Training fully convolutional layers

f=i=1Kpbi(p)bi(p)2f = \sum^K_{i=1} \sum_p \left\Vert b^i (p) - b_{*}^i (p) \right\Vert^2

Joint training with flow warping and inference layers

f=i=1Kpmax(0,1bi(p)Ii(p))f = \sum_{i=1}^K \sum_p \max \left( 0, 1-b^i (p) \cdot I^i (p) \right)

Result Analysis

  • (Fact, not model feature) parts such as elbows and wrists are the most flexible joints of our body. This flexibility can yield configurations with very large variation and these joints are also prone to be occluded by other parts of the body.
  • Note that predictions for shoulders can be negatively influenced by sending or receiving messages from elbows through spatial inference only

Author: Texot
Reprint policy: All articles in this blog are used except for special statements CC BY 4.0 reprint polocy. If reproduced, please indicate source Texot !