Outline
- This paper is from Fei-Fei Li's group and was accepted at ICCV 2017. It extends ideas from perceptual losses and feed-forward style transfer to video style transfer.
- Its stability analysis is an elegant piece of theory that offers insight into why consecutive stylized frames can look unstable.
- The recurrent convolutional network is also worth discussing.
Related Work
Here we summarize some popular work and methods used in style transfer. To compare these methods, I list a table that illustrates the relations among these papers. Finally, I show a figure to help recall the network from the group's previous work on perceptual losses.
Analysis
To begin, let's recall the losses used in image style transfer. Note the Gram-matrix representation; it is closely related to the discussion below.
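For reference, a sketch of the single-layer style loss in this notation (layer index dropped; the $1/(CHW)$ normalization follows the perceptual-loss paper, and $\Phi_p$, $\Phi_s$ denote the feature maps of the stylized output and the style image):

$$
G(\Phi)_{c,c'} = \frac{1}{CHW}\sum_{h=1}^{H}\sum_{w=1}^{W}\Phi_{c,h,w}\,\Phi_{c',h,w},
\qquad
\mathcal{L}_{\text{style}} = \big\|\,G(\Phi_p) - G(\Phi_s)\,\big\|_F^2 .
$$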
As the Gram matrix and the objective function show, one of our goals is to minimize the style loss. We drop the layer index j for convenience. Consider the simplest situation first: with C = W = H = 1, the solution must be $\Phi_p=\pm\Phi_s$, as the first panel of Figure 2 shows. Next, setting H = 2 generalizes this to the 2D case, and the same reasoning extends to the N-D case. The relation is easy to see from Figure 2 and the theorem below, which we then prove.
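To make the non-uniqueness concrete, here is the 1D argument in the same notation (my restatement under the assumptions above, not the paper's exact proof): with a single channel, matching the Gram matrix only constrains the norm of the feature map, so the minimizers form a sphere:

$$
C = 1:\quad G(\Phi) = \frac{\|\Phi\|_2^2}{HW},
\qquad
\mathcal{L}_{\text{style}} = 0 \iff \|\Phi_p\|_2 = \|\Phi_s\|_2 .
$$

With H = W = 1 this reduces to $\Phi_p=\pm\Phi_s$; in N dimensions the solution set is a sphere of radius $\|\Phi_s\|_2$, a nonconvex set, so two nearly identical input frames can be pulled toward very different minimizers, which is one source of flicker.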
As Figure 3 shows, the theorem is consistent with the authors' experiments.
Algorithm
First, let's look at the network in detail. Note that the stylized output of the previous frame is concatenated with the current frame's content as the network input. There is also a new loss, the flow loss, which measures temporal consistency; a small sketch of the recurrent step follows below.
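A minimal sketch of that recurrent step, assuming a PyTorch-style feed-forward transformer (the names `StyleNet` and `stylize_video`, the toy layer stack, and the 6-channel input are my illustrative assumptions, not the authors' code):

```python
import torch
import torch.nn as nn

class StyleNet(nn.Module):
    """Stand-in for the feed-forward stylization network
    (encoder / residual blocks / decoder omitted for brevity)."""
    def __init__(self):
        super().__init__()
        # 6 input channels: 3 for the current content frame,
        # 3 for the previous stylized output
        self.net = nn.Sequential(
            nn.Conv2d(6, 32, kernel_size=9, padding=4),
            nn.ReLU(inplace=True),
            nn.Conv2d(32, 3, kernel_size=9, padding=4),
        )

    def forward(self, frame, prev_output):
        # Concatenate previous stylized frame with the current frame
        x = torch.cat([frame, prev_output], dim=1)
        return self.net(x)

def stylize_video(frames, model):
    """frames: list of (1, 3, H, W) tensors; returns stylized frames."""
    prev = torch.zeros_like(frames[0])   # blank frame for t = 0
    outputs = []
    for frame in frames:
        prev = model(frame, prev)        # output feeds the next step
        outputs.append(prev)
    return outputs
```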
To be more specific about the flow loss, here is an illustration. It is based on the brightness-constancy constraint. The mask is used to avoid artifacts at motion boundaries.
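In symbols, a hedged paraphrase of the flow loss, where $\mathcal{W}(o_{t-1}, w_t)$ denotes the previous output warped by the optical flow between the two frames, $m$ is the per-pixel occlusion/motion-boundary mask, and $D$ is the number of pixels:

$$
\mathcal{L}_{\text{flow}}(o_t, o_{t-1}) = \frac{1}{D}\sum_{k=1}^{D} m_k \,\big( o_t^{(k)} - \mathcal{W}(o_{t-1}, w_t)^{(k)} \big)^2 .
$$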
Experiments
Here the "optim" method refers to the optimization-based approach of "Artistic Style Transfer for Videos".
Because optical flow is needed only during training and not at test time, the network's output is unlikely to be affected by the accuracy of the estimated flow, whereas the optim method cannot work without it.
Discussion
Reference
- Characterizing and Improving Stability in Neural Style Transfer. A. Gupta, J. Johnson, A. Alahi, L. Fei-Fei. ICCV 2017.
- Perceptual Losses for Real-Time Style Transfer and Super-Resolution. J. Johnson, A. Alahi, L. Fei-Fei. ECCV 2016.
- Artistic Style Transfer for Videos. M. Ruder, A. Dosovitskiy, T. Brox. GCPR 2016.