Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consultation on training questions #9

Open
feiyun1265 opened this issue Dec 15, 2017 · 4 comments
Open

Consultation on training questions #9

feiyun1265 opened this issue Dec 15, 2017 · 4 comments

Comments

@feiyun1265
Copy link

feiyun1265 commented Dec 15, 2017

1、Can i use video-level features train your model ?
2、Can i use rgb only(no audio) of frame-level features train your model?
thanks.

@antoine77340
Copy link
Owner

Yes you can train on rgb only but you will need to first specify it in the training command.
Also you will need to modify the architecture so it is not taking into account the audio part (just remove the audio part from the graph). The accuracy should not change a lot without audio.

@feiyun1265
Copy link
Author

Thanks for reply. @antoine77340. The following commond correct?(use rgb only(no audio) )

  1. commond to train video-level feature:
    python train.py --train_data_pattern="../dataset/Video_data/video_train/train*.tfrecord" --model=NetVLADModelLF --train_dir=gatednetvladLF-256k-1024-80-0002-300iter-norelu-basic-gatedmoe --frame_features=False --feature_names="mean_rgb" --feature_sizes="1024" --batch_size=80 --base_learning_rate=0.0002 --netvlad_cluster_size=256 --netvlad_hidden_size=1024 --moe_l2=1e-6 --iterations=300 --learning_rate_decay=0.8 --netvlad_relu=False --gating=True --moe_prob_gating=True --max_step=700000

  2. commond to train frame-level feature:
    python train.py --train_data_pattern="../dataset/Frame_data/frame_train/train*.tfrecord" --model=NetVLADModelLF --train_dir=gatednetvladLF-256k-1024-80-0002-300iter-norelu-basic-gatedmoe --frame_features=True --feature_names="rgb" --feature_sizes="1024" --batch_size=80 --base_learning_rate=0.0002 --netvlad_cluster_size=256 --netvlad_hidden_size=1024 --moe_l2=1e-6 --iterations=300 --learning_rate_decay=0.8 --netvlad_relu=False --gating=True --moe_prob_gating=True --max_step=700000

@antoine77340
Copy link
Owner

antoine77340 commented Dec 15, 2017

Yes, then you need to modify the frame_level_models.py file to take into account the fact that you only have rgb features. It means you will have to do an easy modification of the graph. I am sorry If I cannot help you more I am a bit busy but It should be very easy to modify the architecture of the model to get only RGB.
You can set --max_step=700000 to --max_step=300000, you will get the same performance and it will train faster.

@feiyun1265
Copy link
Author

feiyun1265 commented Dec 15, 2017

I see, thanks for your help.@antoine77340.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants