Source Code is a Graph, Not a Sequence: A Cross-Lingual Perspective on Code Clone Detection.
Mohammed Ataaur RahamanJulia IvePublished in: CoRR (2023)
Keyphrases
- source code
- cross lingual
- clone detection
- linux kernel
- software systems
- machine translation
- open source
- code clones
- language modeling
- cross language
- software evolution
- software maintenance
- text classification
- string matching
- software projects
- software reuse
- document clustering
- high level
- news articles
- program understanding
- source files
- transfer learning
- operating system
- probabilistic model
- databases
- language model
- information retrieval systems
- object oriented
- software repositories
- search engine
- information retrieval