Do Code LLMs Understand Design Patterns?

Code Large Language Models (LLMs) demonstrate great versatility in adapting to various downstream tasks, including code generation and completion, as well as bug detection and fixing. However, Code LLMs often fail to capture existing coding standards, generating code that conflicts with the design patterns required for a given project. As a result, developers must post-process the generated code to adapt it to the project's design norms. In this work, we empirically investigate the biases of Code LLMs in software development. Through carefully designed experiments, we assess the models' understanding of design patterns across three tasks: recognition, comprehension, and generation. Our findings reveal that biases in Code LLMs significantly affect the reliability of downstream tasks.

Publication:

arXiv e-prints

Pub Date:

January 2025

arXiv:

arXiv:2501.04835

Bibcode:

2025arXiv250104835P

Keywords:

Computer Science - Software Engineering;
Computer Science - Artificial Intelligence

E-Print:

Accepted by the LLM4Code workshop at ICSE 2025
