Jason J. Corso
Publications List
tag: video to text
[1] L. Zhou, Y. Kalantidis, X. Chen, J. J. Corso, and M. Rohrbach.
Grounded video description.
In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2019.
[2] L. Zhou, C. Xu, and J. J. Corso.
Towards automatic learning of procedures from web instructional videos.
In Proceedings of AAAI Conference on Artificial Intelligence, 2018.
[3] L. Zhou, Y. Zhou, J. J. Corso, R. Socher, and C. Xiong.
End-to-end dense video captioning with masked transformer.
In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2018.
[4] L. Zhou, N. Louis, and J. J. Corso.
Weakly-supervised video object grounding from text by loss weighting and object interaction.
In Proceedings of British Machine Vision Conference, 2018.
[5] L. Zhou, C. Xu, P. Koch, and J. J. Corso.
Watch what you just said: Image captioning with text-conditional attention.
In Proceedings of the Thematic Workshops of ACM Multimedia, 2017.
[6] R. Xu, C. Xiong, W. Chen, and J. J. Corso.
Jointly modeling deep video and compositional text to bridge vision and language in a unified framework.
In Proceedings of AAAI Conference on Artificial Intelligence, 2015.
[7] P. Das, R. K. Srihari, and J. J. Corso.
Translating related words to videos and back through latent topics.
In Proceedings of Sixth ACM International Conference on Web Search and Data Mining, 2013.
[8] P. Das, C. Xu, R. F. Doell, and J. J. Corso.
A thousand frames in just a few words: Lingual description of videos through latent topics and sparse object stitching.
In Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2013.