This repository has been archived by the owner on Jan 15, 2024. It is now read-only.
v0.8.2: Bug Fixes
This release covers a few fixes for the bugs reported:
- Fixed argument passing in the `bert/embedding.py` script.
- Updated the `SimVerb3500` dataset URL to the aclweb-hosted version.
- Removed multi-processing in the DataLoader from `bert/pretraining_utils.py`, which could cause a crash when Horovod with MPI is used for training.
- Before MXNet 1.6.0, Gluon `Trainer` assumes a deterministic parameter creation order for distributed training. The attention cell for BERT and transformer had a non-deterministic parameter creation order in v0.8.1 and v0.8.0, which causes divergence during distributed training. This is now fixed.
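
To illustrate why parameter creation order matters in the last fix, here is a minimal toy sketch in plain Python. It is not MXNet or GluonNLP code: it assumes a simplified model in which each worker numbers its parameters by creation order and gradients are aggregated per integer key. The parameter names and the helper functions `assign_keys` and `mismatched_keys` are hypothetical.

```python
# Toy model only (assumption): each worker assigns integer keys to its
# parameters in creation order, and gradients are aggregated per key.

def assign_keys(creation_order):
    """Map integer key -> parameter name, following creation order."""
    return dict(enumerate(creation_order))


def mismatched_keys(worker_a, worker_b):
    """Return the keys under which two workers registered different parameters."""
    keys_a = assign_keys(worker_a)
    keys_b = assign_keys(worker_b)
    return [k for k in keys_a if keys_a[k] != keys_b.get(k)]


# Hypothetical parameter names for an attention cell.
names = ["query.weight", "key.weight", "value.weight"]

# Deterministic creation: both workers build parameters in the same order.
same = mismatched_keys(names, names)

# Non-deterministic creation: the second worker builds them in another order.
shuffled = ["key.weight", "query.weight", "value.weight"]
diff = mismatched_keys(names, shuffled)

print(same)  # empty list: every key refers to the same parameter
print(diff)  # keys whose parameter differs between the two workers
```

In this toy model, any mismatched key would have one worker's gradient for one parameter combined with another worker's gradient for a different parameter, which is one way training could diverge.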
Note that starting with v0.8.2, the default branch of the gluon-nlp GitHub repository will be switched to the latest stable branch instead of the master branch under development.