StoryClozeTask

This project is part of the Natural Languange Understanding course (2019) at ETH.

Task: Find the best ending for a story (https://arxiv.org/abs/1604.01696).

Team: 25

Name	Email
Ioannis Sachinoglou	[email protected]
Adamos Solomou	[email protected]
Anagnostidis Sotiris	[email protected]
Georgios Vasilakopoulos	[email protected]

Project structure

.
├── data                                
│   ├── glove-embeddings                # 100d glove embeddings 
│   ├── ROCStories                      # Datasets (train, validation, test)
│   ├── incorrect_endings               # negative endings generated from the language model
│   ├── skip-thoughts                   # Skip thoughts embeddings for (train, validation, test)
├── results                             # Predictions for unlabeled test set
├── src                                 # Source files
│   ├── bert                            # Files concerning running BERT classifier for the task.
│   ├── create_skip_thoughts_embeddings # Script to create skip thoughts embeddings.
├── report                              # Report pdf
└── README.md

Getting Started

Prerequisites

Install Python 3.6+
Load modules and create virtual environment (works when running on eth leonhard cluster):
```
source initialize.sh
```
Install requirements and skip thought embeddings. Some experiments require skip thoughts embeddings as specified in the paper Skip-Thought Vectors (https://arxiv.org/abs/1506.06726 ,https://github.com/ryankiros/skip-thoughts). For time saving purposes these have been precomputed and are publicly available at https://polybox.ethz.ch/index.php/s/X3GsRxeIhATdt8J. They files saved have the form of a numpy array with a shape [num_samples, num_sentences, skip_thought_embeddings_size]. For the training set, each story has a total of 5 sentences, while for the validation and test set each story has 6 sentences (corresponding to the two possible endings). To install:
```
run_experiments setup
```

Run experiments

Run run_experiments help to display all available options.

Run run_experiments all to run all available models.

Run run_experiments bert to run BERT classifier.

Results

The predcitions of the best performing model can be found in the results folder. Each line contains a number, 1 or 2, which corresponds the prediction of the model for the correct ending sentence.

Documentation

Report

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

StoryClozeTask

Project structure

Getting Started

Prerequisites

Run experiments

Results

Documentation

About

Releases 1

Packages

Contributors 4

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 166 Commits
data		data
report		report
results		results
src		src
.gitignore		.gitignore
README.md		README.md
initialize.sh		initialize.sh
requirements.txt		requirements.txt
run_experiments		run_experiments

sanagno/StoryClozeTask

Folders and files

Latest commit

History

Repository files navigation

StoryClozeTask

Project structure

Getting Started

Prerequisites

Run experiments

Results

Documentation

About

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Contributors 4

Languages

Packages