Resources

You can find most of my code on Github.

This page links to a few specific resources, each related to one of my Publications.

Reading Comprehension Corpus

As part of our paper “Teaching Machines to Read and Comprehend” we published a tool to rebuild the large reading comprehension corpus described in the paper. You can find this tool on Github: https://github.com/deepmind/rc-data/

Bolinas: Synchronous Hyperedge Replacement Grammars

Together with our 2013 ACL paper on graph parsing with hyperedge replacement grammars, we released a set of tools for doing exactly that. Bolinas can process rooted, directed, cyclic, edge and node-labeled (hyper-)graphs. The software is hosted at ISI: http://www.isi.edu/licensed-sw/bolinas/

Combinatory Categorial Autoencoders

The code for my 2013 ACL paper on compositional semantics using CCG and vectors is hosted on Github: https://github.com/karlmoritz/oxcvsm. This is a new (cleaner) implementation compared to the one used for the paper, and I haven't migrated all models to the new version yet. I will update it over time and also add documentation.

Multilingual Models for Compositional Semantics

The code for my 2014 ACL and ICLR papers on learning multilingual word embeddings and language models is hosted on Github: https://github.com/karlmoritz/bicvm. A cleaner version of this code is in the making and will be published at some point. However, the current version suffices for replicating the results in my papers and can easily be extended with more complex composition functions.

As part of these publications I have published a modified version of the TED corpus (https://wit3.fbk.eu/), which you can find here.