Extension and PyTorch version of 'Standalone Neural Ranking Model (SNRM)'

The package of SNRM is distributed for research purpose, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. **/

SNRM has been extended to run on the MS Marco Passage Ranking Dataset. https://github.com/microsoft/MSMARCO-Passage-Ranking This version of SNRM is implemented using PyTorch.

Introduction

SNRM [1] is the first learning to rank model that instead of "re-ranking" a few items (e.g., documents) is able to rank documents from a large collection of items. SNRM is a pairwise neural ranking model implemented for the ad-hoc retrieval task.

The main idea behind SNRM is to learn a high-dimensional sparse representation for each query or document in order to make inverted index construction possible. Then, an inverted index is constructed from the learned sparse representations, which is used for efficient retrieval. Therefore, SNRM does not need a first stage retrieval and can retrieve items (documents) from a large collection.

The original SNRM model [1] is trained using weak supervision [2]. The weak supervision signal was computed using the query likelihood retrieval model. Since the weak supervision data is huge (hundreds of gigabytes), we cannot share the data. If you want to use the code, you should implement your own 'generate_batch' method that returns a batch of pairwise training data (query. document1, document2, label). For inverted index construction, you should also implement your own 'generate_batch' method that simply returns a batch of document ID and their content.

If you find this model useful, you may want to cite the SNRM paper published at CIKM '18 [1].

[1] Hamed Zamani, Mostafa Dehghani, W. Bruce Croft, Erik Learned-Miller, and Jaap Kamps. "From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing", In Proc. of CIKM 2018.

[2] Mostafa Dehghani, Hamed Zamani, Aliaksei Severyn, Jaap Kamps, and W. Bruce Croft. "Neural Ranking Models with Weak Supervision", In Proc. of SIGIR 2017.

Author

This project was implemented by Hamed Zamani of the Center for Intelligent Information Retrieval (CIIR) at the University of Massachusetts Amherst. If you have any comment or question, please do not hesitate to contact the author via [email protected].

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
.idea		.idea
code		code
config		config
data		data
evaluation-tools		evaluation-tools
google-colab		google-colab
index		index
logs		logs
model		model
results		results
tf-log		tf-log
thesis		thesis
.gitignore		.gitignore
HOW_TO_TENSORBOARD.md		HOW_TO_TENSORBOARD.md
LICENSE		LICENSE
README.md		README.md
README_CONDA.MD		README_CONDA.MD
conda_environment.yml		conda_environment.yml
conda_list.txt		conda_list.txt
conda_spec-file.txt		conda_spec-file.txt
requirements_pip.txt		requirements_pip.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Extension and PyTorch version of 'Standalone Neural Ranking Model (SNRM)'

Introduction

Author

About

Contributors 4

Languages

License

Bernhard-Steindl/snrm-extension

Folders and files

Latest commit

History

Repository files navigation

Extension and PyTorch version of 'Standalone Neural Ranking Model (SNRM)'

Introduction

Author

About

Topics

Resources

License

Stars

Watchers

Forks

Contributors 4

Languages