[paper reading][CVPR 2020] Spatio-Temporal Graph for Video Captioning with Knowledge Distillation

目录

2 Related Work
- General Video Classification
3
- 3.2 Spatio-Temporal Graph

CVPR 2020
https://openaccess.thecvf.com/content_CVPR_2020/papers/Pan_Spatio-Temporal_Graph_for_Video_Captioning_With_Knowledge_Distillation_CVPR_2020_paper.pdf
spatio-temporal graph model for video captioning that exploits object interactions in space and time
two-branch, knowledge distillatio

General Video Classification

3D conv
two-stream, optical flow
wider range
SlowFast, multiple time scales, two pathways
feature bank, long-term, correlated, short-term
raw pixels, in contrast, objects within scenes

3

two-branch, distill
scene, 2D, resnet, 3D, I3D
object features: \(N_T\) objects, each \(o_t^j\) has the same dimension

3.2 Spatio-Temporal Graph

decompose our graph into two components: the spatial graph and the temporal graph
Spatial: normalized Intersection over Union (IoU) value, explicitly
temporal: object transformations, semantic similarities, \(cos\)
imagine: # - % = $ x @ structure

Paperreading cv temporal GCN representation

相关

Centos7 安装 opencv

云服务器安装opcv-python

Android Kotlin opencv MatOfPoint 转 MatOfPoint2f 报错踩坑 (解决)

weblogic-CVE-2020-2551-IIOP反序列化学习记录

【图像处理】OpenCV+Python图像处理入门教程（四）几何变换

ICCV 2021口罩人物身份鉴别全球挑战赛冠军方案分享

opencv--检测图片中的圆形

OpenCV 学习笔记（1）

EasyCVR对接华为iVS订阅摄像机和用户变更请求接口介绍

Tensorflow实现LeNet5网络并保存pb模型，实现自定义的手写数字识别（附opencv-python调用

OpenCV4【17】-DNN 之 yolov3 目标检测

编译opencv3.1.0时出现错误：error: ‘NppiGraphcutState’ has not been declared

标签