This repository provides the official code and data for the following paper:
Jaejun Lee, Minsung Hwang, and Joyce Jiyoung Whang, PAC-Bayesian Generalization Bounds for Knowledge Graph Representation Learning, The 41st International Conference on Machine Learning (ICML), 2024.
All code was written by Jaejun Lee ([email protected]). If you use this code or data, please cite our paper:
@inproceedings{reed,
  author={Jaejun Lee and Minsung Hwang and Joyce Jiyoung Whang},
  title={{PAC}-{B}ayesian Generalization Bounds for Knowledge Graph Representation Learning},
  booktitle={Proceedings of the 41st International Conference on Machine Learning},
  year={2024},
  pages={26589--26620}
}
We used Python 3.8 and PyTorch 1.12.1 with cudatoolkit 11.3.
You can install all requirements with:
pip install -r requirements.txt
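If you prefer to set up the environment from scratch, one possible way (an assumption; the repository itself only documents the pip step above) is to create a conda environment with the versions listed above. The environment name "reed" is only illustrative:
conda create -n reed python=3.8                              # "reed" is an arbitrary environment name
conda activate reed
conda install pytorch==1.12.1 cudatoolkit=11.3 -c pytorch    # matches the PyTorch/cudatoolkit versions above
pip install -r requirements.txt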
We used an NVIDIA GeForce RTX 2080 Ti for all our experiments. A single run takes less than 4 minutes.
The commands we used to obtain the results reported in our paper are listed below, together with the valid values for each placeholder; a sketch of a sweep script over the placeholders follows the first command.
python train.py --data_path ./data/ --dataset_name FB15K237_sampled --decoder <decoder_type> -m 0.5 -lr <learning_rate> -L <number_of_RAMP_layers> -d 96 -phi LeakyReLU -rho Identity -psi Identity -s <value_of_s> --aggr <aggregator_type> --seed <random_seed> -e 2000 -b 1
<learning_rate>: 0.0003 (RAMP+TD) or 0.0005 (RAMP+SM)
<decoder_type>: Translational_Distance or Semantic_Matching
<aggregator_type>: mean or sum
<number_of_RAMP_layers>: 1, 2, or 3
<value_of_s>: 10.0, 15.0, or 20.0
<random_seed>: 0, 10, 20, 30, 40, 50, 60, 70, 80, or 90
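The placeholders can be swept in a shell loop. The following is a minimal sketch (assuming bash) that fixes the decoder to Translational_Distance with its learning rate 0.0003 and iterates over the remaining placeholder values listed above; the other commands below can be wrapped in the same way.
for seed in 0 10 20 30 40 50 60 70 80 90; do
  for L in 1 2 3; do
    for s in 10.0 15.0 20.0; do
      for aggr in mean sum; do
        # one run per (seed, L, s, aggr) combination
        python train.py --data_path ./data/ --dataset_name FB15K237_sampled \
          --decoder Translational_Distance -m 0.5 -lr 0.0003 -L "$L" -d 96 \
          -phi LeakyReLU -rho Identity -psi Identity -s "$s" --aggr "$aggr" \
          --seed "$seed" -e 2000 -b 1
      done
    done
  done
done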
python train_txt.py --data_path ./data/ --dataset_name FB15K237_sampled_txt --decoder <decoder_type> -m 0.5 -lr <learning_rate> -L 2 -d <dimension> -phi LeakyReLU -rho Identity -psi Identity -s 15.0 --aggr mean --seed <random_seed> -e 2000 -b 1
<learning_rate>: 0.0002 (RAMP+TD) or 0.00005 (RAMP+SM)
<decoder_type>: Translational_Distance or Semantic_Matching
<dimension>: 64, 96, or 128
<random_seed>: 0, 10, 20, 30, 40, 50, 60, 70, 80, or 90
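For example, one concrete instantiation of the command above, choosing RAMP+TD (decoder Translational_Distance with learning rate 0.0002), dimension 96, and seed 0, is:
python train_txt.py --data_path ./data/ --dataset_name FB15K237_sampled_txt --decoder Translational_Distance -m 0.5 -lr 0.0002 -L 2 -d 96 -phi LeakyReLU -rho Identity -psi Identity -s 15.0 --aggr mean --seed 0 -e 2000 -b 1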
python train.py --data_path ./data/ --dataset_name CoDEx-M_sampled --decoder <decoder_type> -m 0.5 -lr 0.0005 -L <number_of_RAMP_layers> -d 64 -phi LeakyReLU -rho Identity -psi Identity -s <value_of_s> --aggr <aggregator_type> --seed <random_seed> -e 2000 -b 1
<decoder_type>: Translational_Distance or Semantic_Matching
<aggregator_type>: mean or sum
<number_of_RAMP_layers>: 1, 2, or 3
<value_of_s>: 10.0, 15.0, or 20.0
<random_seed>: 0, 10, 20, 30, 40, 50, 60, 70, 80, or 90
python train.py --data_path ./data/ --dataset_name UMLS-43 --decoder <decoder_type> -m 0.75 -lr <learning_rate> -L <number_of_RAMP_layers> -d 48 -phi LeakyReLU -rho Identity -psi Identity -s <value_of_s> --aggr <aggregator_type> --seed <random_seed> -e 2000 -b 1
<learning_rate>: 0.0002 (RAMP+TD) or 0.0005 (RAMP+SM)
<decoder_type>: Translational_Distance or Semantic_Matching
<aggregator_type>: mean or sum
<number_of_RAMP_layers>: 1, 2, or 3
<value_of_s>: 10.0, 12.5, or 15.0
<random_seed>: 0, 10, 20, 30, 40, 50, 60, 70, 80, or 90