probing-via-prompting

This repository accompanies the paper Probing via Prompting.

Dependencies

  • python 3.8.5
  • pytorch 1.7.1+cu110
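
One way to reproduce this environment (a sketch assuming conda is available; the environment name pvp is only illustrative):

conda create -n pvp python=3.8.5
conda activate pvp
# CUDA 11.0 build of PyTorch 1.7.1, in case requirements.txt does not pin it;
# pick the wheel that matches your CUDA setup
pip install torch==1.7.1+cu110 -f https://download.pytorch.org/whl/torch_stable.html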

Setup

Install required packages:

pip install -r requirements.txt

Data Processing

  1. Process your OntoNotes data into the CoNLL format (conll-formatted-ontonotes-5.0) with the conversion script
  2. Extract all tasks:
python extract_ontonotes_all.py --ontonotes /path/to/conll-formatted-ontonotes-5.0 -o ontonotes

This will create two folders under ontonotes/: one for diagnostic probing (DP) and one for probing via prompting (PP).
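
For reference, the resulting layout should look like this (the two folders correspond to the --data_dir arguments used in the commands below; the files inside each folder are whatever extract_ontonotes_all.py writes):

ls ontonotes/
# dp  pp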

Probing via Prompting

export task=
python run_pp.py \
    --num_train_epochs 1.0 \
    --do_train \
    --do_eval \
    --per_device_train_batch_size 4 \
    --per_device_eval_batch_size 4 \
    --gpt2_name_or_path gpt2 \
    --data_dir ontonotes/pp/ \
    --task $task \
    --output_dir outputs/pp/$task/ \
    --overwrite_output_dir \
    --use_fast_tokenizer False \
    --cache_dir cache/ \
    --save_strategy no \
    --prefix_len 200

task can be any one of ["pos", "const", "coref", "ner", "srl", "pos_control"].
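
For example, to run the named entity recognition probe with the command above:

export task=ner  # any of: pos, const, coref, ner, srl, pos_control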

If you want to experiment with a randomly initialized model, replace --gpt2_name_or_path gpt2 with

    --config_name gpt2 \
    --tokenizer_name gpt2 \

To prune attention heads for analysis, use --do_prune:

export task=
python run_pp.py \
    --num_train_epochs 1.0 \
    --do_train \
    --do_eval \
    --per_device_train_batch_size 4 \
    --per_device_eval_batch_size 4 \
    --gpt2_name_or_path gpt2 \
    --data_dir ontonotes/pp/ \
    --task $task \
    --output_dir outputs/pp/pruning/$task/ \
    --overwrite_output_dir \
    --use_fast_tokenizer False \
    --cache_dir cache/ \
    --save_strategy no \
    --prefix_len 200 \
    --do_prune \
    --num_of_heads 96 \
    --pruning_lr 0.1 \
    --seed 0

Diagnostic Probing

Multi-layer perceptron (MLP) probe:

export task=
python run_dp.py \
    --num_train_epochs 1.0 \
    --do_train \
    --do_eval \
    --per_device_train_batch_size 32 \
    --per_device_eval_batch_size 32 \
    --gpt2_name_or_path gpt2 \
    --data_dir ontonotes/dp/ \
    --task $task \
    --output_dir outputs/dp/mlp/$task/ \
    --overwrite_output_dir \
    --cache_dir cache/ \
    --save_strategy no 

Please note that DP (MLP) does not support multiple GPUs due to an incompatibility between nn.ParameterList in AllenNLP's ScalarMix and DataParallel.
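
On a multi-GPU machine, one workaround is to expose only a single device to the script, for example:

CUDA_VISIBLE_DEVICES=0 python run_dp.py ...  # same arguments as in the command above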

You can use a linear regression (LR) probe instead by setting --use_mlp False:

export task=
python run_dp.py \
    --num_train_epochs 1.0 \
    --do_train \
    --do_eval \
    --per_device_train_batch_size 32 \
    --per_device_eval_batch_size 32 \
    --gpt2_name_or_path gpt2 \
    --data_dir ontonotes/dp/ \
    --task $task \
    --output_dir outputs/dp/lr/$task/ \
    --overwrite_output_dir \
    --cache_dir cache/ \
    --save_strategy no \
    --mlp_dropout 0.1 \
    --use_mlp False 

DP (LR) also supports head pruning:

export task=
python run_dp.py \
    --num_train_epochs 1.0 \
    --do_train \
    --do_eval \
    --per_device_train_batch_size 32 \
    --per_device_eval_batch_size 32 \
    --gpt2_name_or_path gpt2 \
    --data_dir ontonotes/dp/ \
    --task $task \
    --output_dir outputs/dp/lr/pruning/$task/ \
    --overwrite_output_dir \
    --cache_dir cache/ \
    --save_strategy no \
    --mlp_dropout 0.1 \
    --use_mlp False \
    --do_prune \
    --num_of_heads 96 \
    --pruning_lr 0.1 \
    --seed 0

Amnesic Probing

To evaluate the language modeling loss when the essential heads stored in /path/to/head_mask are pruned, run:

python run_clm.py \
    --model_name_or_path gpt2 \
    --dataset_name wikitext \
    --dataset_config_name wikitext-103-raw-v1 \
    --do_eval \
    --output_dir outputs/lm/ \
    --overwrite_output_dir \
    --per_device_eval_batch_size 32 \
    --cache_dir cache/ \
    --head_mask_path /path/to/head_mask
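
For example, if a pruning run saved its learned mask to outputs/pp/pruning/ner/head_mask.pt (a hypothetical path and filename; use wherever your --do_prune run actually writes the mask), the call becomes:

python run_clm.py --head_mask_path outputs/pp/pruning/ner/head_mask.pt ...  # other arguments as above; mask path is hypothetical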
