Music Generation Using Deep Learning and Generative AI: A Systematic Review

This paper presents a systematic review of recent advances in music generation using deep learning techniques, categorizing the latest research in the field and identifying key contributions from various approaches. The study examines common data representations in music generation, including raw wa...

Full description

Saved in:
Bibliographic Details
Main Authors: Rohan Mitra, Imran Zualkernan
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10845168/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832576748969000960
author Rohan Mitra
Imran Zualkernan
author_facet Rohan Mitra
Imran Zualkernan
author_sort Rohan Mitra
collection DOAJ
description This paper presents a systematic review of recent advances in music generation using deep learning techniques, categorizing the latest research in the field and identifying key contributions from various approaches. The study examines common data representations in music generation, including raw waveforms, spectrograms, and MIDI, alongside the most prominent deep learning architectures like Generative Adversarial Networks (GANs), Recurrent Neural Networks (RNNs), Variational Autoencoders (VAEs), and Transformer-based models. Through a comparative analysis, the paper highlights the strengths and limitations of these approaches. The findings suggest that GANs with spectrograms and RNNs with MIDI data are particularly effective for generating multi-track music, while autoregressive models like MusicGen and transformer models demonstrate superior performance in capturing long-term dependencies in music generation. Additionally, the paper underscores the emergence of diffusion models, which are gaining popularity for generating high-quality, complex music outputs. The major contribution of this review is the identification of the best-performing models for various music generation tasks and the provision of comprehensive insights into data representation methods, evaluation metrics, and future research directions.
format Article
id doaj-art-3bb77304c7a1460eb9be702fb011b8ba
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-3bb77304c7a1460eb9be702fb011b8ba2025-01-31T00:00:55ZengIEEEIEEE Access2169-35362025-01-0113180791810610.1109/ACCESS.2025.353179810845168Music Generation Using Deep Learning and Generative AI: A Systematic ReviewRohan Mitra0https://orcid.org/0000-0002-8007-5437Imran Zualkernan1https://orcid.org/0000-0002-1048-5633Computer Science and Engineering Department, American University of Sharjah, Sharjah, United Arab EmiratesComputer Science and Engineering Department, American University of Sharjah, Sharjah, United Arab EmiratesThis paper presents a systematic review of recent advances in music generation using deep learning techniques, categorizing the latest research in the field and identifying key contributions from various approaches. The study examines common data representations in music generation, including raw waveforms, spectrograms, and MIDI, alongside the most prominent deep learning architectures like Generative Adversarial Networks (GANs), Recurrent Neural Networks (RNNs), Variational Autoencoders (VAEs), and Transformer-based models. Through a comparative analysis, the paper highlights the strengths and limitations of these approaches. The findings suggest that GANs with spectrograms and RNNs with MIDI data are particularly effective for generating multi-track music, while autoregressive models like MusicGen and transformer models demonstrate superior performance in capturing long-term dependencies in music generation. Additionally, the paper underscores the emergence of diffusion models, which are gaining popularity for generating high-quality, complex music outputs. The major contribution of this review is the identification of the best-performing models for various music generation tasks and the provision of comprehensive insights into data representation methods, evaluation metrics, and future research directions.https://ieeexplore.ieee.org/document/10845168/Music generationsurvey paperGANLSTMVAEspectrograms
spellingShingle Rohan Mitra
Imran Zualkernan
Music Generation Using Deep Learning and Generative AI: A Systematic Review
IEEE Access
Music generation
survey paper
GAN
LSTM
VAE
spectrograms
title Music Generation Using Deep Learning and Generative AI: A Systematic Review
title_full Music Generation Using Deep Learning and Generative AI: A Systematic Review
title_fullStr Music Generation Using Deep Learning and Generative AI: A Systematic Review
title_full_unstemmed Music Generation Using Deep Learning and Generative AI: A Systematic Review
title_short Music Generation Using Deep Learning and Generative AI: A Systematic Review
title_sort music generation using deep learning and generative ai a systematic review
topic Music generation
survey paper
GAN
LSTM
VAE
spectrograms
url https://ieeexplore.ieee.org/document/10845168/
work_keys_str_mv AT rohanmitra musicgenerationusingdeeplearningandgenerativeaiasystematicreview
AT imranzualkernan musicgenerationusingdeeplearningandgenerativeaiasystematicreview