Clone Detection via Structural Abstraction

This paper describes the design, implementation, and application of a new algorithm to detect cloned code. It operates on the abstract syntax trees formed by many compilers as an intermediate representation. It extends prior work by identifying clones even when arbitrary subtrees have been changed. On a 16,000-line code corpus, 20-50% of its clones eluded previous methods. The method also identifies cloning in declarations, so it is somewhat more general than conventional procedural abstraction.
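As a rough illustration of the idea of matching syntax trees while tolerating changed subtrees, the sketch below compares two Python statements and replaces every differing subtree with a placeholder. It is not the paper's algorithm: it uses Python's standard `ast` module in place of a compiler's intermediate representation, and the helper names `to_tuple`, `anti_unify`, `parse_stmt`, and `HOLE` are hypothetical, chosen only for this example.

```python
import ast

HOLE = "?"  # hypothetical placeholder for a subtree that differs


def to_tuple(node):
    """Convert an ast node into a nested tuple: (label, child, child, ...)."""
    if isinstance(node, ast.AST):
        children = [to_tuple(value) for _, value in ast.iter_fields(node)]
        return (type(node).__name__, *children)
    if isinstance(node, list):
        return ("list", *[to_tuple(item) for item in node])
    return ("leaf", repr(node))  # identifiers, constants, None, ...


def anti_unify(a, b):
    """Return (pattern, holes): the shared structure of a and b,
    with HOLE substituted wherever the two trees disagree."""
    if a == b:
        return a, 0
    same_shape = (
        isinstance(a, tuple)
        and isinstance(b, tuple)
        and a[0] == b[0]
        and len(a) == len(b)
    )
    if not same_shape:
        return HOLE, 1  # abstract the whole differing subtree
    parts, holes = [a[0]], 0
    for x, y in zip(a[1:], b[1:]):
        part, count = anti_unify(x, y)
        parts.append(part)
        holes += count
    return tuple(parts), holes


def parse_stmt(source):
    """Parse a single statement and return its tuple form."""
    return to_tuple(ast.parse(source).body[0])


# Two fragments that differ only in one operand subtree.
a = parse_stmt("total = price * qty + tax")
b = parse_stmt("total = price * (qty - discount) + tax")
pattern, holes = anti_unify(a, b)
print(holes)
```

In this example the two statements share everything except the right operand of the multiplication, so the resulting pattern contains a single hole. A detector built on this idea would presumably rank candidate pairs by how much structure they share relative to the number of holes; that scoring step is left out here.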

PDF file: tr-2005-104.pdf

Details

Type: TechReport
Number: MSR-TR-2005-104
Pages: 11
Institution: Microsoft Research