📚 Full list of publications

Here is a link to all of our research publications.

💾 Datasets

As part of the project, we open source some of the datasets that were used in our research.

🔎 Research highlights

DDSP

A library that lets you combine the interpretable structure of classical DSP elements (such as filters, oscillators, reverberation, etc.) with the expressivity of deep learning.

Papers

DDSP: Differentiable Digital Signal Processing

Blog Posts

Colab Notebooks

DDSP Timbre Transfer

GANSynth

A method to synthesize high-fidelity audio with GANs.

Music Transformer

A self-attention-based neural network that can generate music with long-term coherence.

Papers

Blog Posts

Colab Notebooks

Music Transformer

Wave2Midi2Wave

A new process able to transcribe, compose, and synthesize audio waveforms with coherent musical structure on timescales spanning six orders of magnitude (~0.1 ms to ~100 s).

Papers

Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset

Blog Posts

The MAESTRO Dataset and Wave2Midi2Wave

Music VAE

A hierarchical latent vector model for learning long-term structure in music

Papers

Blog Posts

Colab Notebooks

Onsets and Frames

We advance the state of the art in polyphonic piano music transcription by using a deep convolutional and recurrent neural network which is trained to jointly predict onsets and frames.

Papers

Onsets and Frames: Dual-Objective Piano Transcription

Blog Posts

Colab Notebooks

Onsets and Frames

Latent Constraints

A method to condition generation without retraining the model, by post-hoc learning latent constraints, value functions that identify regions in latent space that generate outputs with desired attributes. We can conditionally sample from these regions with gradient-based optimization or amortized actor functions.

Papers

Blog Posts

MidiMe: Personalizing MusicVAE

Colab Notebooks

Latent Constraints

COCONET

An instance of orderlessNADE, Coconet uses deep convolutional neural networks to perform music inpaintings through Gibbs sampling.

Papers

Blog Posts

Performance RNN

An LSTM-based recurrent neural network designed to model polyphonic music with expressive timing and dynamics.

Papers

Learning to Create Piano Performances

Blog Posts

Colab Notebooks

Performance RNN

Sketch RNN

A recurrent neural network (RNN) able to construct stroke-based drawings of common objects. The model is trained on thousands of crude human-drawn images representing hundreds of classes.

Papers

Blog Posts

NSynth

A powerful new WaveNet-style autoencoder model that conditions an autoregressive decoder on temporal codes learned from the raw audio waveform.