


default search action
18th MSR 2021: Madrid, Spain
- 18th IEEE/ACM International Conference on Mining Software Repositories, MSR 2021, Madrid, Spain, May 17-19, 2021. IEEE 2021, ISBN 978-1-7281-8710-5
Technical Papers
- Huy Tu, George Papadimitriou
, Mariam Kiran, Cong Wang, Anirban Mandal
, Ewa Deelman, Tim Menzies
:
Mining Workflows for Anomalous Data Transfers. 1-12 - Egor Spirin
, Egor Bogomolov
, Vladimir Kovalenko, Timofey Bryksin:
PSIMiner: A Tool for Mining Rich Abstract Syntax Trees from Code. 13-17 - Ruchika Malhotra, Ritvik Kapoor, Deepti Aggarwal, Priya Garg:
Comparative Study of Feature Reduction Techniques in Software Change Prediction. 18-28 - Sofonias Yitagesu, Xiaowang Zhang
, Zhiyong Feng, Xiaohong Li, Zhenchang Xing:
Automatic Part-of-Speech Tagging for Security Vulnerability Descriptions. 29-40 - Rolf-Helge Pfeiffer:
Identifying Critical Projects via PageRank and Truck Factor. 41-45 - Md. Abdullah Al Alamin, Sanjay Malakar, Gias Uddin, Sadia Afroz, Tameem Bin Haider, Anindya Iqbal:
An Empirical Study of Developer Discussions on Low-Code Software Development Challenges. 46-57 - Hendrig Sellik, Onno van Paridon, Georgios Gousios, Maurício Aniche:
Learning Off-By-One Mistakes: An Empirical Study. 58-67 - Ahmed Imam, Tapajit Dey
, Alexander Nolte
, Audris Mockus
, James D. Herbsleb:
The Secret Life of Hackathon Code Where does it come from and where does it go? 68-79 - Christoph Gote, Christian Zingg
:
gambit - An Open Source Name Disambiguation Tool for Version Control Systems. 80-84 - Samuel W. Flint
, Jigyasa Chauhan, Robert Dyer
:
Escaping the Time Pit: Pitfalls and Guidelines for Using Time-Based Git Data. 85-96 - Jiayan Pei, Yimin Wu, Zishan Qin, Yao Cong, Jingtao Guan:
Attention-based model for predicting question relatedness on Stack Overflow. 97-107 - Matteo Ciniselli, Nathan Cooper, Luca Pascarella
, Denys Poshyvanyk
, Massimiliano Di Penta, Gabriele Bavota
:
An Empirical Study on the Usage of BERT Models for Code Completion. 108-119 - Quentin Fournier
, Daniel Aloise
, Seyed Vahid Azhari, François Tetreault:
On Improving Deep Learning Trace Analysis with System Call Arguments. 120-130 - Zhen Yu Ding, Claire Le Goues
:
An Empirical Study of OSS-Fuzz Bugs. 131-142 - Jeanderson Cândido, Jan Haesen, Maurício Aniche, Arie van Deursen
:
An Exploratory Study of Log Placement Recommendation in an Enterprise System. 143-154 - Sina Gholamian, Paul A. S. Ward:
On the Naturalness and Localness of Software Logs. 155-166 - Mia Mohammad Imran
, Agnieszka Ciborowska, Kostadin Damevski
:
Automatically Selecting Follow-up Questions for Deficient Bug Reports. 167-178 - Alexandra-Maria Chaniotaki, Tushar Sharma
:
Architecture Smells and Pareto Principle: A Preliminary Empirical Exploration. 190-194 - Zadia Codabux, Melina C. Vidoni
, Fatemeh H. Fard
:
Technical Debt in the Peer-Review Documentation of R Packages: a rOpenSci Case Study. 195-206 - Diego Marcilio, Carlo A. Furia:
How Java Programmers Test Exceptional Behavior. 207-218 - Guillaume Haben
, Sarra Habchi, Mike Papadakis
, Maxime Cordy, Yves Le Traon
:
A Replication Study on the Usability of Code Vocabulary in Predicting Flaky Tests. 219-229 - Golnaz Gharachorlu, Nick Sumner
:
Leveraging Models to Reduce Test Cases in Software Repositories. 230-241 - Jean-Gabriel Young, Amanda Casari
, Katie McLaughlin
, Milo Z. Trujillo
, Laurent Hébert-Dufresne, James P. Bagrow:
Which contributions count? Analysis of attribution in open source. 242-253 - Mahmoud Alfadel, Diego Elias Costa
, Emad Shihab, Mouafak Mkhallalati:
On the Use of Dependabot Security Pull Requests. 254-265 - Aleksandr Khvorov, Roman Vasiliev, George A. Chernishev, Irving Muller Rodrigues, Dmitrij V. Koznov, Nikita Povarov:
S3M: Siamese Stack (Trace) Similarity Measure. 266-270 - Gian Luca Scoccia, Patrizio Migliarini
, Marco Autili
:
Challenges in Developing Desktop Web Apps: a Study of Stack Overflow and GitHub. 271-282 - Saraj Singh Manes, Olga Baysal:
Studying the Change Histories of Stack Overflow and GitHub Snippets. 283-294 - Nikolai Sviridov, Mikhail Evtikhiev, Vladimir Kovalenko:
TNM: A Tool for Mining of Socio-Technical Data from Git Repositories. 295-299 - Ivano Malavolta
, Katerina Chinnappan, Stan Swanborn, Grace A. Lewis
, Patricia Lago:
Mining the ROS ecosystem for Green Architectural Tactics in Robotics and an Empirical Evaluation. 300-311 - Andreas Schuler, Gabriele Kotsis:
Mining API Interactions to Analyze Software Revisions for the Evolution of Energy Consumption. 312-316 - André C. Hora:
Googling for Software Development: What Developers Search For and What They Find. 317-328 - Alexey Svyatkovskiy, Sebastian Lee, Anna Hadjitofi
, Maik Riechert, Juliana Vicente Franco, Miltiadis Allamanis:
Fast and Memory-Efficient Neural Code Completion. 329-340 - Ahmed Zerouali, Camilo Velázquez-Rodríguez
, Coen De Roover
:
Identifying Versions of Libraries used in Stack Overflow Code Snippets. 341-345 - Fabio Santos, Igor Wiese, Bianca Trinkenreich, Igor Steinmacher, Anita Sarma, Marco Aurélio Gerosa:
Can I Solve It? Identifying APIs Required to Complete OSS Tasks. 346-257 - Murali Sridharan, Mika Mäntylä, Leevi Rantala, Maëlick Claes:
Data Balancing Improves Self-Admitted Technical Debt Detection. 358-368 - Chanathip Pornprasit, Chakkrit Tantithamthavorn:
JITLine: A Simpler, Better, Faster, Finer-grained Just-In-Time Defect Prediction. 369-379 - Saikat Mondal, Gias Uddin, Chanchal K. Roy:
Rollback Edit Inconsistencies in Developer Forum. 380-391 - André C. Hora:
What Code Is Deliberately Excluded from Test Coverage and Why? 392-402 - Gianmarco Fucci, Nathan Cassee, Fiorella Zampetti, Nicole Novielli, Alexander Serebrenik, Massimiliano Di Penta:
Waiting around or job half-done? Sentiment in self-admitted technical debt. 403-414 - Maria Papoutsoglou, Johannes Wachs
, Georgia M. Kapitsaki:
Mining DEV for social and technical insights about software development. 415-419 - Timothy Kinsman, Mairieli Santos Wessel, Marco Aurélio Gerosa, Christoph Treude
:
How Do Software Developers Use GitHub Actions to Automate Their Workflows? 420-431 - Jirayus Jiarpakdee, Chakkrit Tantithamthavorn, John C. Grundy:
Practitioners' Perceptions of the Goals and Visual Explanations of Defect Prediction Models. 432-443 - Panyawut Sri-Iesaranusorn
, Raula Gaikovina Kula
, Takashi Ishio
:
Does Code Review Promote Conformance? A Study of OpenStack Patches. 444-448 - Kalvin Eng, Abram Hindle:
Revisiting Dockerfiles in Open Source Software Over Time. 449-459 - Mahfouth Alghamdi, Shinpei Hayashi, Takashi Kobayashi, Christoph Treude
:
Characterising the Knowledge about Primitive Variables in Java Code Comments. 460-470 - Anderson G. Uchôa
, Caio Barbosa, Daniel Coutinho
, Willian Nalepa Oizumi
, Wesley K. G. Assunção
, Silvia Regina Vergilio, Juliana Alves Pereira, Anderson Oliveira, Alessandro F. Garcia:
Predicting Design Impactful Changes in Modern Code Review: A Large-Scale Empirical Study. 471-482 - Michel Albonico, Ivano Malavolta
, Gustavo Pinto, Emitza Guzman, Katerina Chinnappan, Patricia Lago:
Mining Energy-Related Practices in Robotics Software. 483-494
MSR Challenge
- Balázs Mosolygó, Norbert Vándor, Gábor Antal, Péter Hegedüs:
On the Rise and Fall of Simple Stupid Bugs: a Life-Cycle Analysis of SStuBs. 495-499 - Jasmine Latendresse, Rabe Abdalkareem
, Diego Elias Costa
, Emad Shihab:
How Effective is Continuous Integration in Indicating Single-Statement Bugs? 500-504 - Ehsan Mashhadi, Hadi Hemmati:
Applying CodeBERT for Automated Program Repair of Java Simple Bugs. 505-509 - Fernanda Madeiral, Thomas Durieux
:
A large-scale study on human-cloned changes for automated program repair. 510-514 - Wenhan Zhu, Michael W. Godfrey:
Mea culpa: How developers fix their own simple bugs differently from other developers. 515-519 - Arthur V. Kamienski, Luisa Palechor, Cor-Paul Bezemer
, Abram Hindle:
PySStuBs: Characterizing Single-Statement Bugs in Popular Open-Source Python Projects. 520-524 - Anthony Peruma, Christian D. Newman:
On the Distribution of "Simple Stupid Bugs" in Unit Test Files: An Exploratory Study. 525-529 - Jiayi Hua, Haoyu Wang:
On the Effectiveness of Deep Vulnerability Detectors to Simple Stupid Bug Detection. 530-534
MSR Data
- Sebastian Nielebock, Paul Blockhaus, Jacob Krüger, Frank Ortmeier:
AndroidCompass: A Dataset of Android Compatibility Checks in Code Repositories. 535-539 - Misoo Kim
, Youngkyoung Kim
, Eunseok Lee
:
Denchmark: A Bug Benchmark of Deep Learning-related Software. 540-544 - Thomas Durieux
, César Soto-Valero, Benoit Baudry:
Duets: A Dataset of Reproducible Pairs of Java Library-Clients. 545-549 - Luigi Quaranta
, Fabio Calefato, Filippo Lanubile:
KGTorrent: A Dataset of Python Jupyter Notebooks from Kaggle. 550-554 - Mouna Hammoudi, Christoph Mayr-Dorn, Atif Mashkoor, Alexander Egyed:
A Traceability Dataset for Open Source Systems. 555-559 - Ozren Dabic
, Emad Aghajani, Gabriele Bavota
:
Sampling Projects in GitHub for MSR Studies. 560-564 - Nafise Eskandani, Guido Salvaneschi
:
The Wonderless Dataset for Serverless Computing. 565-569 - Wen Li, Xiaoqin Fu, Haipeng Cai
:
AndroCT: Ten Years of App Call Traces in Android. 570-574 - Nikitha Rao, Chetan Bansal, Joe Guan:
Search4Code: Code Search Intent Classification Using Weak Supervision. 575-579 - Ruben Opdebeeck
, Ahmed Zerouali, Coen De Roover
:
Andromeda: A Dataset of Ansible Galaxy Roles and Their Evolution. 580-584 - Amir M. Mir
, Evaldas Latoskinas, Georgios Gousios:
ManyTypes4Py: A Benchmark Python Dataset for Machine Learning-based Type Inference. 585-589 - Tushar Sharma
, Marouane Kessentini:
QScored: A Large Dataset of Code Smells and Quality Metrics. 590-594 - Likang Yin, Zhiyuan Zhang, Qi Xuan, Vladimir Filkov:
Apache Software Foundation Incubator Project Sustainability Dataset. 595-599 - Tyler Wendland, Jingyang Sun, Junayed Mahmud, S. M. Hasan Mansur, Steven Huang, Kevin Moran, Julia Rubin, Mattia Fazzini
:
Andror2: A Dataset of Manually-Reproduced Bug Reports for Android apps. 600-604 - Dheeraj Vagavolu, Vartika Agrahari, Sridhar Chimalakonda, Akhila Sri Manasa Venigalla:
GE526: A Dataset of Open-Source Game Engines. 605-609 - Sahar Badihi, Yi Li, Julia Rubin:
EqBench: A Dataset of Equivalent and Non-equivalent Program Pairs. 610-614
MSR Data Hackathon
- Ahmed Imam, Tapajit Dey
:
Tracking Hackathon Code Creation and Reuse. 615-617 - Elena Lyulina, Mahmoud Jahanshahi
:
Building the Collaboration Graph of Open-Source Software Ecosystem. 618-620 - David Reid, Kalvin Eng, Chris Bogart
, Adam Tutko:
Tracing Vulnerable Code Lineage. 621-623 - James Walden
, Noah Burgin, Kuljit Kaur:
An Exploratory Study of Project Activity Changepoints in Open Source Software Evolution. 624-626 - Mengchen Sam Yong, Lavinia Paganini
, Huilian Sophie Qiu, José Bayoán Santiago Calderón:
The Diversity-Innovation Paradox in Open-Source Software. 627-629

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.