Hi, when the input is video clips, how to calculate the score of video captioning?
Hi, when the input is video clips, how to calculate the score of video captioning?