Added the ability to save the visual attention of the Show, Attend and Tell architecture as an image. Adjusted the axis parameter of the softmax layer so that normalization ignores the batch dimension. Fixed a problem where some states were not initialized correctly in the C++ code after earlier changes to the inline mode.
Showing with 357 additions and 31 deletions
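
The softmax and attention-image changes described in the commit message can be illustrated with a minimal sketch. This is not the code from the commit: the framework is not shown here, so the function names `softmax_over_locations` and `attention_to_pgm`, the list-of-lists layout (batch first, image locations second), and the choice of the plain-text PGM image format are all assumptions made for illustration.

```python
import math


def softmax_over_locations(scores):
    """Normalize each batch item independently over its image locations.

    `scores` is a list (batch) of lists (locations). The softmax runs along
    the location axis only, so items in a batch never affect each other.
    Hypothetical helper, not the layer from the commit.
    """
    result = []
    for row in scores:
        m = max(row)  # subtract the max for numerical stability
        exps = [math.exp(s - m) for s in row]
        total = sum(exps)
        result.append([e / total for e in exps])
    return result


def attention_to_pgm(weights, width, height):
    """Render one item's attention weights as an ASCII PGM (P2) image string.

    Weights are scaled so that the largest weight maps to 255 (white).
    The PGM format is an assumption chosen because it needs no libraries.
    """
    if len(weights) != width * height:
        raise ValueError("weights do not match the requested image size")
    peak = max(weights)
    pixels = [round(255 * w / peak) if peak > 0 else 0 for w in weights]
    rows = [
        " ".join(str(p) for p in pixels[r * width:(r + 1) * width])
        for r in range(height)
    ]
    return "P2\n{} {}\n255\n".format(width, height) + "\n".join(rows) + "\n"


if __name__ == "__main__":
    # Two batch items, each with a 2x2 grid of attention scores.
    attention = softmax_over_locations([[0.0, 0.0, 0.0, 0.0],
                                        [1.0, 2.0, 3.0, 4.0]])
    with open("attention_item1.pgm", "w") as f:
        f.write(attention_to_pgm(attention[1], 2, 2))
```

If the softmax instead ran over the batch axis, the weights of one image would depend on the other images in the same batch, which is the kind of error an axis adjustment like the one described would avoid.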