-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
evaluation on the VIP and JHMDB datasets #10
Comments
Hi @AndyTang15, Thanks for your interest, and I apologize for the late reply. I haven't re-run the JHMDB and VIP evaluations since refactoring and retraining models for the code release, so thanks for bringing this to my attention, and I will take a closer look! One detail that will improve the JHMDB result is that the radius should be (commensurately) decreased, since the input is about 4x smaller (320x320 v.s. 900x480). So, you might consider a radius of 5 instead of 12. I apologize for the confusion (and the typo in the appendix).
|
@ajabri Hi Allan, radius=5: [email protected] 58.64, [email protected] 80.54 The performances have all been improved but still lower than the results reported in your paper before refactoring, especially on [email protected], would it be possible for you to help me with this again? Many thanks! |
Hi @ajabri and @AndyTang15, Thanks! |
Hi @dmckee5, I have not yet reconciled this issue (the lower [email protected] with this repository). If you are reporting or comparing to our results, at this point, please go ahead and report the result you've reproduced. I am hoping to get to this soon. |
Hi @ajabri @AndyTang15 , |
How did you get |
Hi @AndyTang15, I used the same commands as you but looks like my results are much worse than urs. Just wondering is there other modification you have made to the code? Plus, what's the filelist you are using? The filelist I used is from the original UVC repo and it contains 268 lines. Any help would be appreciated. Thanks! |
Hi, |
Hi Allan,
Happy new year! And many thanks for releasing the code of this great work!
I used the codebase and the pretrained model provided in the repo to evaluate the VIP and JHMDB datasets, the results are:
VIP: 37.12(mIOU), JHMDB: 57.62([email protected]) and 79.59([email protected]).
They are noticeably lower than the results in your paper:
VIP: 38.6(mIOU), JHMDB: 59.3([email protected]) and 84.9([email protected]).
Could you please help to check whether I evaluated them in a right way?
For VIP, I used the command:
python test.py --filelist eval/VIP_vallist.txt --model-type scratch --resume ../pretrained.pth --save-path vip_results --topk 10 --videoLen 4 --radius 12 --temperature 0.05 --cropSize 560
For JHMDB, I used the command:
python test.py --filelist eval/jhmdb_vallist.txt --model-type scratch --resume ../pretrained.pth --save-path jhmdb_results --topk 10 --videoLen 7 --radius 12 --temperature 0.05 --cropSize 320
The hyperparameters above were selected based on your paper except temperature (I've also tried 0.07 but found 0.05 is better).
BTW, there're two bugs for JHMDB evaluation:
https://github.com/ajabri/videowalk/blob/master/code/data/jhmdb.py#L231
the "sio" should be imported in this python file
https://github.com/ajabri/videowalk/blob/master/code/test.py#L161
it should be "test_utils" rather than "utils"
The text was updated successfully, but these errors were encountered: