This example shows how to build the RNN and Transformer word-level language modeling models on the WikiText-2 dataset using Determined's PyTorch API. This example is adapted from this PyTorch Word-Level Language Modeling example.
- model.py: This file defines both models: the RNN and the Transformer.
- data.py: The data loading and preparation code for the model. Unless `use_cache: False` is specified in the experiment configuration file, this code attempts to load a cached version of the preprocessed data. If no cached version is available, it downloads the data, preprocesses it, and caches the result for future use (see the configuration sketch after this list).
- model_def.py: The core code for training. This includes building and compiling the model.
- const.yaml: Train the model on a single GPU with constant hyperparameter values.
- distributed.yaml: Same as `const.yaml`, but trains the model with 8 GPUs (distributed training); see the resources sketch after this list.
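For reference, here is a minimal sketch of how caching could be disabled in the experiment configuration. This assumes data.py reads `use_cache` from the `data` section of the config, which is the conventional place for custom data settings in Determined experiments; the exact location may differ in this example's actual config files.

```yaml
# Hypothetical excerpt from const.yaml; assumes data.py reads
# use_cache from the experiment config's data section.
data:
  use_cache: False  # force a fresh download and preprocessing pass
```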
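Similarly, `distributed.yaml` differs from `const.yaml` mainly in the resources it requests. A sketch of the relevant section, using Determined's standard `slots_per_trial` setting:

```yaml
# Excerpt illustrating distributed training over 8 GPUs.
resources:
  slots_per_trial: 8  # one slot per GPU; Determined distributes training across them
```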
The data used for this example is downloaded from Salesforce Einstein at the start of each run. This is the same source as the original example.
If you have not yet installed Determined, installation instructions can be found under docs/install-admin.html or at https://docs.determined.ai/latest/index.html.
Run the following command: `det -m <master host:port> experiment create -f const.yaml .`. The other configurations can be run by specifying the appropriate configuration file in place of `const.yaml`.
This example defines three models: a Transformer, a recurrent neural network with LSTM cells, and a recurrent neural network with GRU cells. We do not include recurrent neural networks with tanh or ReLU activations due to numerical instability.
To specify which model to use, set the `model_cls` attribute in the appropriate `.yaml` file to one of the following (see the sketch after this list):
- `Transformer`
- `LSTM`
- `GRU`
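As a sketch, assuming `model_cls` lives under the `hyperparameters` section of the experiment config (the usual place for such settings in Determined):

```yaml
# Hypothetical excerpt: select the LSTM variant.
hyperparameters:
  model_cls: LSTM  # one of Transformer, LSTM, GRU
```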
Note: The purpose of this table is to show the word-level language modeling models running in Determined for a set number of epochs, demonstrating the reduction in training time achieved with Determined's distributed training.