default search action
24th ISMIR 2023: Milan, Italy
- Augusto Sarti, Fabio Antonacci, Mark Sandler, Paolo Bestagini, Simon Dixon, Beici Liang, Gaël Richard, Johan Pauwels:
Proceedings of the 24th International Society for Music Information Retrieval Conference, ISMIR 2023, Milan, Italy, November 5-9, 2023. 2023, ISBN 978-1-7327299-3-3 - Shreyas Nadkarni, Sujoy Roychowdhury, Preeti Rao, Martin Clayton:
Exploring the Correspondence of Melodic Contour With Gesture in Raga Alap Singing. 21-28 - Miguel Pérez Fernández, Holger Kirchhoff, Xavier Serra:
TriAD: Capturing Harmonics With 3D Convolutions. 29-36 - Fabio Morreale, Megha Sharma, I-Chieh Wei:
Data Collection in Music Generation Training Sets: A Critical Analysis. 37-46 - Bob L. T. Sturm, Arthur Flexer:
A Review of Validity and Its Relationship to Music Information Research. 47-55 - Gowriprasad R., Srikrishnan Sridharan, R. Aravind, Hema A. Murthy:
Segmentation and Analysis of Taniavartanam in Carnatic Music Concerts. 56-63 - Changhong Wang, Gaël Richard, Brian McFee:
Transfer Learning and Bias Correction With Pre-Trained Audio Embeddings. 64-70 - Michèle Duguay, Kate Mancey, Johanna Devaney:
Collaborative Song Dataset (CoSoD): An Annotated Dataset of Multi-Artist Collaborations in Popular Music. 71-79 - Michele Newman, Lidia Morris, Jin Ha Lee:
Human-AI Music Creation: Understanding the Perceptions and Experiences of Music Creators for Ethical and Productive Collaboration. 80-88 - Nathan Fradet, Nicolas Gutowski, Fabien Chhel, Jean-Pierre Briot:
Impact of Time and Note Duration Tokenizations on Deep Learning Symbolic Music Modeling. 89-97 - Max Johnson, Mark Gotham:
Musical Micro-Timing for Live Coding. 98-105 - Francisco J. Castellanos, Antonio Javier Gallego, Ichiro Fujinaga:
A Few-Shot Neural Approach for Layout Analysis of Music Score Images. 106-113 - Behzad Haki, Blazej Kotowski, Cheuk Lun Isaac Lee, Sergi Jordà:
TapTamDrum: A Dataset for Dualized Drum Patterns. 114-120 - Andrea Martelloni, Andrew P. McPherson, Mathieu Barthet:
Real-Time Percussive Technique Recognition and Embedding Learning for the Acoustic Guitar. 121-128 - Hiromu Yakura, Masataka Goto:
IteraTTA: An Interface for Exploring Both Text Prompts and Audio Priors in Generating Music With Text-to-Audio Models. 129-137 - Mirco Pezzoli, Raffaele Malvermi, Fabio Antonacci, Augusto Sarti:
Similarity Evaluation of Violin Directivity Patterns for Musical Instrument Retrieval. 138-145 - George Sioros:
Polyrhythmic Modelling of Non-Isochronous and Microtiming Patterns. 146-153 - Shangda Wu, Dingyao Yu, Xu Tan, Maosong Sun:
CLaMP: Contrastive Language-Music Pre-Training for Cross-Modal Symbolic Music Information Retrieval. 157-165 - Luca Marinelli, György Fazekas, Charalampos Saitis:
Gender-Coded Sound: Analysing the Gendering of Music in Toy Commercials via Multi-Task Learning. 166-173 - Li-Yang Tseng, Tzu-Ling Lin, Hong-Han Shuai, Jen-Wei Huang, Wen-Whei Chang:
A Dataset and Baselines for Measuring and Predicting the Music Piece Memorability. 174-181 - Carlos Peñarrubia, Carlos Garrido-Munoz, Jose J. Valero-Mas, Jorge Calvo-Zaragoza:
Efficient Notation Assembly in Optical Music Recognition. 182-189 - Yuting Yang, Zeyu Jin, Connelly Barnes, Adam Finkelstein:
White Box Search Over Audio Synthesizer Parameters. 190-196 - Vincent K. M. Cheung, Lana Okuma, Kazuhisa Shibata, Kosetsu Tsukuda, Masataka Goto, Shinichi Furuya:
Decoding Drums, Instrumentals, Vocals, and Mixed Sources in Music Using Human Brain Activity With fMRI. 197-206 - Liyue Zhang, Xinyu Yang, Yichi Zhang, Jing Luo:
Dual Attention-Based Multi-Scale Feature Fusion Approach for Dynamic Music Emotion Recognition. 207-214 - Keisuke Toyama, Taketo Akama, Yukara Ikemiya, Yuhta Takida, Wei-Hsiang Liao, Yuki Mitsufuji:
Automatic Piano Transcription With Hierarchical Frequency-Time Transformer. 215-222 - Nazif Can Tamer, Yigitcan Özer, Meinard Müller, Xavier Serra:
High-Resolution Violin Transcription Using Weak Labels. 223-230 - Lejun Min, Junyan Jiang, Gus Xia, Jingwei Zhao:
Polyffusion: A Diffusion Model for Polyphonic Score Generation With Internal and External Controls. 231-238 - Claire Arthur, Nathaniel Condit-Schultz:
The Coordinated Corpus of Popular Musics (CoCoPops): A Meta-Corpus of Melodic and Harmonic Transcriptions. 239-246 - Anja Volk, Tinka Veldhuis, Katrien Foubert, Jos De Backer:
Towards Computational Music Analysis for Music Therapy. 247-256 - Luca Comanducci, Fabio Antonacci, Augusto Sarti:
Timbre Transfer Using Image-to-Image Denoising Diffusion Implicit Models. 257-263 - Neha Rajagopalan, Blair Kaneshiro:
Correlation of EEG Responses Reflects Structural Similarity of Choruses in Popular Music. 264-271 - Mark Gotham:
Chromatic Chords in Theory and Practice. 272-278 - Yo-Wei Hsiao, Tzu-Yun Hung, Tsung-Ping Chen, Li Su:
BPS-Motif: A Dataset for Repeated Pattern Discovery of Polyphonic Symbolic Music. 281-288 - Michael Krause, Sebastian Strahl, Meinard Müller:
Weakly Supervised Multi-Pitch Estimation Using Cross-Version Alignment. 289-296 - Patricia Hu, Gerhard Widmer:
The Batik-Plays-Mozart Corpus: Linking Performance to Score to Musicological Annotations. 297-303 - Joan Serrà, Davide Scaini, Santiago Pascual, Daniel Arteaga, Jordi Pons, Jeroen Breebaart, Giulio Cengarle:
Mono-to-Stereo Through Parametric Stereo Generation. 304-310 - Charilaos Papaioannou, Emmanouil Benetos, Alexandros Potamianos:
From West to East: Who Can Understand the Music of the Others Better? 311-318 - Juan C. Martinez-Sevilla, Adrian Rosello, David Rizo, Jorge Calvo-Zaragoza:
On the Performance of Optical Music Recognition in the Absence of Specific Training Data. 319-326 - Martin E. Malandro:
Composer's Assistant: An Interactive Transformer for Multi-Track MIDI Infilling. 327-334 - Ethan Lustig, David Temperley:
The FAV Corpus: An Audio Dataset of Favorite Pieces and Excerpts, With Formal Analyses and Music Theory Descriptors. 335-342 - Le Zhuo, Ruibin Yuan, Jiahao Pan, Yinghao Ma, Yizhi Li, Ge Zhang, Si Liu, Roger B. Dannenberg, Jie Fu, Chenghua Lin, Emmanouil Benetos, Wenhu Chen, Wei Xue, Yike Guo:
LyricWhiz: Robust Multilingual Zero-Shot Lyrics Transcription by Whispering to ChatGPT. 343-351 - Alia Morsi, Kana Tatsumi, Akira Maezawa, Takuya Fujishima, Xavier Serra:
Sounds Out of Pläce? Score-Independent Detection of Conspicuous Mistakes in Piano Performances. 352-358 - Hugo Flores García, Prem Seetharaman, Rithesh Kumar, Bryan Pardo:
VampNet: Music Generation via Masked Acoustic Token Modeling. 359-366 - Yucong Jiang:
Expert and Novice Evaluations of Piano Performances: Criteria for Computer-Aided Feedback. 367-374 - Andres Ferraro, Jaehun Kim, Sergio Oramas, Andreas F. Ehmann, Fabien Gouyon:
Contrastive Learning for Cross-Modal Artist Retrieval. 375-382 - Christoph Finkensiep, Matthieu Haeberle, Friedrich Eisenbrand, Markus Neuwirth, Martin Rohrmeier:
Repetition-Structure Inference With Formal Prototypes. 383-390 - Peter van Kranenburg, Eoin Kearns:
Algorithmic Harmonization of Tonal Melodies Using Weighted Pitch Context Vectors. 391-397 - Kento Watanabe, Masataka Goto:
Text-to-Lyrics Generation With Image-Based Semantics and Reduced Risk of Plagiarism. 398-406 - Seungheon Doh, Keunwoo Choi, Jongpil Lee, Juhan Nam:
LP-MusicCaps: LLM-Based Pseudo Music Captioning. 409-416 - Morgan Buisson, Brian McFee, Slim Essid, Hélène C. Crayencour:
A Repetition-Based Triplet Mining Approach for Music Segmentation. 417-424 - Francesco Foscarin, Daniel Harasim, Gerhard Widmer:
Predicting Music Hierarchies With a Graph-Based Neural Decoder. 425-432 - Johannes Zeitler, Simon Deniffel, Michael Krause, Meinard Müller:
Stabilizing Training With Soft Dynamic Time Warping: A Case Study for Pitch Class Estimation With Weakly Aligned Targets. 433-439 - Danbinaerin Han, Rafael Caro Repetto, Dasaem Jeong:
Finding Tori: Self-Supervised Learning for Analyzing Korean Folk Song. 440-447 - Bernardo Torres, Stefan Lattner, Gaël Richard:
Singer Identity Representation Learning Using Self-Supervised Techniques. 448-456 - Yinghao Ma, Ruibin Yuan, Yizhi Li, Ge Zhang, Chenghua Lin, Xingran Chen, Anton Ragni, Hanzhi Yin, Emmanouil Benetos, Norbert Gyenge, Ruibo Liu, Gus Xia, Roger B. Dannenberg, Yike Guo, Jie Fu:
On the Effectiveness of Speech Self-Supervised Learning for Music. 457-465 - Tian Cheng, Masataka Goto:
Transformer-Based Beat Tracking With Low-Resolution Encoder and High-Resolution Decoder. 466-473 - Vanessa Nina Borsan, Mathieu Giraud, Richard Groult, Thierry Lecroq:
Adding Descriptors to Melodies Improves Pattern Matching: A Study on Slovenian Folk Songs. 474-481 - Karlijn Dinnissen, Christine Bauer:
How Control and Transparency for Users Could Improve Artist Fairness in Music Recommender Systems. 482-491 - Ahyeon Choi, Eunsik Shin, Haesun Joung, Joongseek Lee, Kyogu Lee:
Towards a New Interface for Music Listening: A User Experience Study on YouTube. 492-499 - Xavier Riley, Simon Dixon:
FiloBass: A Dataset and Corpus Based Study of Jazz Basslines. 500-507 - Louis Couturier, Louis Bigo, Florence Levé:
Comparing Texture in Piano Scores. 508-515 - Johannes Hentschel, Andrew McLeod, Yannis Rammos, Martin Rohrmeier:
Introducing DiMCAT for Processing and Analyzing Notated Music on a Very Large Scale. 516-523 - Sehun Kim, Kazuya Takeda, Tomoki Toda:
Sequence-to-Sequence Network Training Methods for Automatic Guitar Transcription With Tokenized Outputs. 524-531 - Alain Riou, Stefan Lattner, Gaëtan Hadjeres, Geoffroy Peeters:
PESTO: Pitch Estimation With Self-Supervised Transposition-Equivariant Objective. 535-544 - Vanessa Nina Borsan, Mathieu Giraud, Richard Groult:
The Games We Play: Exploring the Impact of ISMIR on Musicology. 545-552 - Genís Plaja-Roglans, Marius Miron, Adithi Shankar, Xavier Serra:
Carnatic Singing Voice Separation Using Cold Diffusion on Training Data With Bleeding. 553-560 - Kosetsu Tsukuda, Tomoyasu Nakano, Masahiro Hamasaki, Masataka Goto:
Unveiling the Impact of Musical Factors in Judging a Song on First Listen: Insights From a User Survey. 561-570 - Jan Hajic Jr., Gustavo A. Ballen, Klára Hedvika Mühlová, Hana Vlhová-Wörner:
Towards Building a Phylogeny of Gregorian Chant Melodies. 571-578 - Yiwei Ding, Alexander Lerch:
Audio Embeddings as Teachers for Music Classification. 579-587 - Ilya Borovik, Vladimir Viro:
ScorePerformer: Expressive Piano Performance Rendering With Fine-Grained Control. 588-596 - Emmanouil Karystinaios, Gerhard Widmer:
Roman Numeral Analysis With Graph Neural Networks: Onset-Wise Predictions From Note-Wise Features. 597-604 - Brian Regan, Desislava Hristova, Mariano Beguerisse-Díaz:
Semi-Automated Music Catalog Curation Using Audio and Metadata. 605-611 - Ioannis Petros Samiotis, Christoph Lofi, Alessandro Bozzon:
Crowd's Performance on Temporal Activity Detection of Musical Instruments in Polyphonic Music. 612-618 - Igor Pereira, Felipe Araújo, Filip Korzeniowski, Richard Vogl:
MoisesDB: A Dataset for Source Separation Beyond 4-Stems. 619-626 - Zeng Ren, Wulfram Gerstner, Martin Rohrmeier:
Music as Flow: A Formal Representation of Hierarchical Processes in Music. 627-633 - Silvan David Peter:
Online Symbolic Music Alignment With Offline Reinforcement Learning. 634-641 - Oren Barkan, Shlomi Shvartzman, Noy Uzrad, Moshe Laufer, Almog Elharar, Noam Koenigstein:
Inversynth II: Sound Matching via Self-Supervised Synthesizer-Proxy and Inference-Time Finetuning. 642-648 - Amantur Amatov, Dmitry Lamanov, Maksim Titov, Ivan Vovk, Ilya Makarov, Mikhail A. Kudinov:
A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task. 649-656 - Keren Shao, Ke Chen, Taylor Berg-Kirkpatrick, Shlomo Dubnov:
Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction. 657-663 - Chin-Yun Yu, György Fazekas:
Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables. 667-675 - Qiaoyu Yang, Frank Cwitkowitz, Zhiyao Duan:
Harmonic Analysis With Neural Semi-CRF. 676-683 - Alberto Acquilino, Ninad Puranik, Ichiro Fujinaga, Gary P. Scavone:
A Dataset and Baseline for Automated Assessment of Timbre Quality in Trumpet Sound. 684-691 - Frank Heyen, Quynh Quang Ngo, Michael Sedlmair:
Visual Overviews for Sheet Music Structure. 692-699 - Luís Carvalho, Gerhard Widmer:
Passage Summarization With Recurrent Models for Audio - Sheet Music Retrieval. 700-707 - Pedro Ramoneda, Jose J. Valero-Mas, Dasaem Jeong, Xavier Serra:
Predicting Performance Difficulty From Piano Sheet Music Images. 708-715 - Junghyun Koo, Yunkee Chae, Chang-Bin Jeon, Kyogu Lee:
Self-Refining of Pseudo Labels for Music Source Separation With Noisy Labeled Data. 716-724 - Marcel A. Vélez Vásquez, Mariëlle Baelemans, Jonathan Driedger, Willem H. Zuidema, John Ashley Burgoyne:
Quantifying the Ease of Playing Song Chords on the Guitar. 725-732 - Irmak Bukey, Jason Zhang, T. J. Tsai:
FlexDTW: Dynamic Time Warping With Flexible Boundary Conditions. 733-740 - Alexandre D'Hooge, Louis Bigo, Ken Déguernel:
Modeling Bends in Popular Music Guitar Tablatures. 741-748 - Geoffroy Peeters:
Self-Similarity-Based and Novelty-Based Loss for Music Structure Analysis. 749-756 - Carey Bunks, Tillman Weyde, Simon Dixon, Bruno Di Giorgi:
Modeling Harmonic Similarity for Jazz Using Co-occurrence Vectors and the Membrane Area. 757-764 - Shuqi Dai, Yuxuan Wu, Siqi Chen, Roy Huang, Roger B. Dannenberg:
SingStyle111: A Multilingual Singing Dataset With Style Transfer. 765-773 - Haven Kim, Kento Watanabe, Masataka Goto, Juhan Nam:
A Computational Evaluation Framework for Singable Lyric Translation. 774-781 - Kosetsu Tsukuda, Masahiro Hamasaki, Masataka Goto:
Chorus-Playlist: Exploring the Impact of Listening to Only Choruses in a Playlist. 782-792 - David Lewis, Elisabete Shibata, Andrew Hankinson, Johannes Kepper, Kevin R. Page, Lisa Rosendahl, Mark Saccomano, Christine Siegert:
Supporting Musicological Investigations With Information Retrieval Tools: An Iterative Approach to Data Collection. 795-801 - Federico Simonetta, Ana Llorens, Martín Serrano, Eduardo García-Portugués, Álvaro Torrente:
Optimizing Feature Extraction for Symbolic Music. 802-809 - Mathias Rose Bjare, Stefan Lattner, Gerhard Widmer:
Exploring Sampling Techniques for Generating Melodies With a Transformer Language Model. 810-816 - John Ashley Burgoyne, Janne Spijkervet, David J. Baker:
Measuring the Eurovision Song Contest: A Living Dataset for Real-World MIR. 817-823 - Pablo Alonso-Jiménez, Xavier Serra, Dmitry Bogdanov:
Efficient Supervised Training of Audio Transformers for Music Representation Learning. 824-831 - Michael Krause, Christof Weiß, Meinard Müller:
A Cross-Version Approach to Audio Representation Learning for Orchestral Music. 832-839 - Tomoyasu Nakano, Masataka Goto:
Music Source Separation With MLP Mixing of Time, Frequency, and Channel. 840-847 - Huan Zhang, Emmanouil Karystinaios, Simon Dixon, Gerhard Widmer, Carlos Eduardo Cancino Chacón:
Symbolic Music Representations for Classification Tasks: A Systematic Evaluation. 848-858 - Jacopo de Berardinis, Valentina Anita Carriero, Albert Meroño-Peñuela, Andrea Poltronieri, Valentina Presutti:
The Music Meta Ontology: A Flexible Semantic Model for the Interoperability of Music Metadata. 859-867 - Jeff Miller, Johan Pauwels, Mark Sandler:
Polar Manhattan Displacement: Measuring Tonal Distances Between Chords Based on Intervallic Content. 868-874
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.