4.6 Article

Translating Melody to Chord: Structured and Flexible Harmonization of Melody With Transformer

Journal

IEEE ACCESS
Volume 10, Issue -, Pages 28261-28273

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/ACCESS.2022.3155467

Keywords

Transformers; Hidden Markov models; Music; Adaptation models; Data models; Computational modeling; Context modeling; Music information retrieval; computer generated music; neural networks; self-supervised learning

Ask authors/readers for more resources

Recent deep learning approaches have made remarkable progress in melody harmonization. This paper proposes a Transformer-based architecture that directly maps melody notes to chord sequences, generating structured chords with appropriate rhythms. Experimental results show that the proposed models generate chord sequences that are more structured and diverse than those generated by LSTM-based models.
Recent deep learning approaches for melody harmonization have achieved remarkable performance by overcoming the uneven chord distributions of music data. However, most of these approaches have not attempted to capture an original melodic structure and generate structured chord sequences with appropriate rhythms. Hence, we use a Transformer-based architecture that directly maps lower-level melody notes into a semantic higher-level chord sequence. In particular, we encode the binary piano roll of a melody into a note-based representation. Furthermore, we address the flexible generation of various chords with Transformer expanded with a VAE framework. We propose three Transformer-based melody harmonization models: 1) the standard Transformer-based model for the neural translation of a melody to chords (STHarm); 2) the variational Transformer-based model for learning the global representation of complete music (VTHarm); and 3) the regularized variational Transformer-based model for the controllable generation of chords (rVTHarm). Experimental results demonstrate that the proposed models generate more structured, diverse chord sequences than LSTM-based models.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available