Introduction: Frodotype - a fantasy flavoured text generator

Frodotype is a web app that generates text that is grammatically correct and logical most of the time. The project fine-tunes the GPT-2 model using the gpt-2-simple Python package. OpenAI offers three of its models for public use, but their size and complexity make them difficult to train on consumer hardware. Therefore, the smallest model (117M parameters) was retrained on a Google Deep Learning VM. The data used to fine-tune the model consists of 102 fantasy novels by 20 authors. This data was used for two reasons:

  1. Fantasy is the genre I read the most.
  2. I own the ebooks used in this project.

Frodotype was built using TensorFlow, Docker, Google Cloud Platform, JavaScript, Chart.js, and Bulma CSS. The bulk of the heavy lifting in Python was done using Max Woolf's gpt-2-simple project.

Gathering the data

The 102 books used were converted from the Amazon Kindle format (.azw) to plain text files. This process included stripping all images, formatting, and hyperlinks. The books were then manually stripped of their tables of contents, appendices, and glossaries. The final text file used to retrain the model was put together using the following commands:

  1. Concatenate the files:

$ for f in *.txt; do (cat "${f}"; echo) >> unprocessed.txt; done

  2. Delete all non-ASCII characters:

$ LC_ALL=C tr -dc '\0-\177' < unprocessed.txt > processed.txt

  3. Remove numbers and dashes from the text:

$ tr -d '0-9-' < processed.txt > final.txt

Additional processing is done in the text-analysis notebook.

Training the model

The model was trained on a Google Deep Learning VM using a Tesla K80 GPU, TensorFlow 1.15, and CUDA 10.0.
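Before kicking off a multi-day run, it is worth confirming that TensorFlow actually sees the GPU. A minimal check under TensorFlow 1.x (the version used here) might look like the snippet below; it is illustrative and not part of the repository:

import tensorflow as tf
from tensorflow.python.client import device_lib

# TF 1.x API: returns True if a CUDA-capable GPU is visible to TensorFlow.
print(tf.test.is_gpu_available())

# List the devices TensorFlow can use; the K80 should appear as a GPU device.
print([d.name for d in device_lib.list_local_devices()])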

The model was retrained using gpt-2-simple, a Python package that eases the process of tweaking hyperparameters. The model was trained for three different lengths. The one used in this app was trained for 45,000 steps, or approximately 90 hours. Two additional models were trained for 25,000 and 80,000 steps. The smaller of the two had a much higher loss value, while the larger had a similar loss that began to increase towards the end of training.
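For reference, a fine-tuning run with gpt-2-simple looks roughly like the sketch below. The step count, model size, and dataset filename follow the description above; the logging, sampling, and checkpoint intervals, as well as the generation prefix, are illustrative assumptions rather than the exact values used for Frodotype:

import gpt_2_simple as gpt2

# Download the smallest GPT-2 model (117M parameters) if not already cached.
gpt2.download_gpt2(model_name="117M")

sess = gpt2.start_tf_sess()

# Fine-tune on the cleaned corpus produced by the preprocessing commands above.
gpt2.finetune(sess,
              dataset="final.txt",
              model_name="117M",
              steps=45000,        # the run used by the app (~90 hours on a K80)
              print_every=100,    # assumed logging interval
              sample_every=1000,  # assumed sampling interval
              save_every=1000)    # assumed checkpoint interval

# Sample from the fine-tuned checkpoint; the prefix is hypothetical.
gpt2.generate(sess, length=200, temperature=0.7, prefix="The wizard")

In a separate process, such as the web app itself, the same checkpoint can be restored with gpt2.load_gpt2(sess) before calling gpt2.generate, so generation does not require retraining.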
