bios

Code for byte-level learning of twitter bios (full training set is in a JSON on my laptop)

Link to the state_dict of a trained model.

model.py - defines the LayerRNN (which was used for training)

data.py - utilties for building byte datasets

train.py - routines used to train the model

profiles.py - used to build the training set.

Usage

sample.py is a script for sampling strings from a trained model file, printing to stdout. See command line help for details. Requires a local copy of a model file (such as that from the link above) plus a model config file and byte values config file; these are available in this repo for the trained model above as model_config.json and byte_values.txt respectively.

Example: python sample.py model_file -N 5 prints 5 samples from the model.

Sampling is quite slow on my laptop - GPU will be used automatically if available.

Name	Name	Last commit message	Last commit date
Latest commit briantimar bytevals Apr 13, 2020 d26b218 · Apr 13, 2020 History 32 Commits
.gitignore	.gitignore	Initial commit	Mar 29, 2020
README.md	README.md	sampling and readme	Apr 13, 2020
auth.py	auth.py	pulling twitter bios	Apr 1, 2020
byte_values.txt	byte_values.txt	bytevals	Apr 13, 2020
data.py	data.py	switch target indicies	Apr 5, 2020
model.py	model.py	NIE RNN	Apr 13, 2020
model_config.json	model_config.json	sampling and readme	Apr 13, 2020
profiles.py	profiles.py	pulling twitter bios	Apr 1, 2020
sample.py	sample.py	sampling and readme	Apr 13, 2020
stats.py	stats.py	bio lengths	Apr 1, 2020
test_data.py	test_data.py	packed tensors optional	Apr 5, 2020
test_model.py	test_model.py	rnn sampling	Apr 5, 2020
train.py	train.py	train log	Apr 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bios

Usage

About

Releases

Packages

Languages

briantimar/bios

Folders and files

Latest commit

History

Repository files navigation

bios

Usage

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages