Refinement experiments

Juan Camilo Pérez Santamaría edited this page Jan 29, 2018 · 11 revisions

Experiments

| Machine | Description | Command |
| --- | --- | --- |
| [email protected] | Mixed layers: 3 → 5 | `CUDA_VISIBLE_DEVICES=0 nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 --lr 1e-5 --mixed-layers 5 --save-folder unc_dpn92_sru_lowres_mixed_layers_5 > unc_dpn92_sru_lowres_mixed_layers_5.log &` |
| [email protected] | Combine lang WE + Hid (`--mix-we`) | `CUDA_VISIBLE_DEVICES=1 nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 --lr 1e-5 --mix-we --save-folder unc_dpn92_sru_lowres_mixwe > unc_dpn92_sru_lowres_mixwe.log &` |
| [email protected] | Mixed size (before mSRU): 4096 | `CUDA_VISIBLE_DEVICES=0 nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 --mixed-size 4096 --save-folder unc_dpn92_sru_lowres_mixed_size_4096 > unc_dpn92_sru_lowres_mixed_size_4096.log &` |
| [email protected] | Mixed size (before mSRU): 512 | `CUDA_VISIBLE_DEVICES=1 nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 --mixed-size 512 --save-folder unc_dpn92_sru_lowres_mixed_size_512 > unc_dpn92_sru_lowres_mixed_size_512.log &` |
| [email protected] | mSRU hidden size: 4096 | `CUDA_VISIBLE_DEVICES=0 nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 --hid-mixed-size 4096 --save-folder unc_dpn92_sru_lowres_hid_mixed_size_4096 > unc_dpn92_sru_lowres_hid_mixed_size_4096.log &` |
| [email protected] | SRU hidden size: 2048 | `CUDA_VISIBLE_DEVICES=1 nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 --hid-size 2048 --save-folder unc_dpn92_sru_lowres_hid_size_2048 > unc_dpn92_sru_lowres_hid_size_2048.log &` |
| T1 | mSRU layers: 4 | `CUDA_VISIBLE_DEVICES=0 nohup python -u train.py --data ../referit_data --backend dpn92 --save-folder unc_dpn92_mixed_layers_4_weights --lr 1e-5 --mixed-layers 4 > unc_dpn92_mixed_layers_4.log &` |
| T2 | SRU layers: 3 | `CUDA_VISIBLE_DEVICES=1 nohup python -u train.py --data ../referit_data --backend dpn92 --save-folder unc_dpn92_lang_layers_3_weights --lr 1e-5 --lang-layers 3 > unc_dpn92_lang_layers_3.log &` |
| T3 | 5 filters | `CUDA_VISIBLE_DEVICES=2 nohup python -u train.py --data ../referit_data --backend dpn92 --save-folder unc_dpn92_filters5_weights --lr 1e-5 --num-filters 5 > unc_dpn92_filters5.log &` |
| T4 | 10 filters | `CUDA_VISIBLE_DEVICES=3 nohup python -u train.py --data ../referit_data --backend dpn92 --save-folder unc_dpn92_filters10_weights --lr 1e-5 --num-filters 10 > unc_dpn92_filters10.log &` |
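
The commands above repeat the same prefix and reuse the run name for both the save folder and the log file. A minimal sketch of a helper that prints such a command is given below. `build_cmd` is a hypothetical name, not part of the repository, and it assumes the `/mnt/referit_data/` data path used by the first six rows (T1 to T4 use `../referit_data` and a `_weights` suffix on the save folder, which this sketch does not cover). The flags passed through are the ones listed in the table.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the repository): prints the training
# command for one run so the save folder and the log file share one name.
# Assumes the /mnt/referit_data/ data path and the dpn92 backend.
build_cmd() {
    local gpu="$1" name="$2"
    shift 2
    # The remaining arguments are passed through to train.py unchanged.
    echo "CUDA_VISIBLE_DEVICES=${gpu} nohup python -u train.py --data /mnt/referit_data/ --backend dpn92 $* --save-folder ${name} > ${name}.log &"
}

# Print the command for the "Mixed layers: 3 → 5" run.
build_cmd 0 unc_dpn92_sru_lowres_mixed_layers_5 --lr 1e-5 --mixed-layers 5
```

The helper only prints the command, so it can be reviewed before being pasted into a shell on the target machine.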

Monitoring

| Machine | Command |
| --- | --- |
| M1 | `multitail unc_dpn92_sru_lowres_mixed_layers_5.log unc_dpn92_sru_lowres_mixwe.log` |
| M2 | `multitail unc_dpn92_sru_lowres_mixed_size_4096.log unc_dpn92_sru_lowres_mixed_size_512.log` |
| M3 | `multitail unc_dpn92_sru_lowres_hid_mixed_size_4096.log unc_dpn92_sru_lowres_hid_size_2048.log` |
| 005 | `multitail SSD1/query-objseg/unc_A.log SSD2/query-objseg/unc_B.log SSD3/query-objseg/unc_C.log SSD4/query-objseg/unc_D.log` |
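
`multitail` is interactive. For a quick, non-interactive check of the same logs, something like the following sketch may be enough. `last_lines` is a hypothetical helper, not part of the repository, and it relies only on the standard `tail` utility. It makes no assumption about what `train.py` writes to the log.

```shell
#!/usr/bin/env bash
# Hypothetical helper: print the most recent line of each log file given,
# or a placeholder when the file does not exist yet.
last_lines() {
    local log
    for log in "$@"; do
        if [ -f "$log" ]; then
            printf '%s: %s\n' "$log" "$(tail -n 1 "$log")"
        else
            printf '%s: (no log yet)\n' "$log"
        fi
    done
}

# Check the two runs monitored on M1.
last_lines unc_dpn92_sru_lowres_mixed_layers_5.log unc_dpn92_sru_lowres_mixwe.log
```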