  STRUCTURAL COMPARISON OF SOURCE CODE BETWEEN MULTIPLE PROGRAMMING LANGUAGES  by Rolf Biehn  B.S.C, The University of Guelph, 2003  A THESIS SUBMITTED IN PARTIAL FULFILLMENT OF THE REQUIREMENTS FOR THE DEGREE OF  MASTER OF SCIENCE in THE FACULTY OF GRADUATE AND POSTDOCTORAL STUDIES (Computer Science)   THE UNIVERSITY OF BRITISH COLUMBIA (Vancouver)   April 2014   © Rolf Biehn, 2014ii  Abstract  Software developers are often faced with the task of comparing two or more versions of software.  Typical usages of software comparison utilities include: a code-review prior to check-in, tracking down a recently introduced regression, and searching for code-clones in the source code (for future refactoring).  However, most traditional source code comparison tools typically use simple text-to-text comparison (with some simple rule-based comparisons for comments), which has the drawback of showing superfluous differences during comparison. Many projects, for a variety of business reasons, ship products and software development kits (SDKs) using multiple programing languages.   It is desirable to compare amongst languages in order to detect potential errors and understand the meaningful differences between the two codebases.  In some cases, fixes may be implemented in one language, but not in the other.  In this paper, we create a tool called the Software Difference Analyzer Tool (SDAT), a tool capable of comparing Java and CSharp code, to address some of the unique problems associated with cross-language comparison.   Automated testing demonstrated SDAT reduces the number of reported differences by up to 40%.  User testing has shown a 37% increase in speed and 28% increase in accuracy.   iii  Preface  This dissertation is an original intellectual product of the author, Rolf Biehn.  The research conducted as part of section 4.2 has been approved by the University of British Columbia Behavioural Research Ethics Board (ref : H13-02739)   iv  Table of Contents  Abstract .......................................................................................................................................... ii Preface ........................................................................................................................................... iii Table of Contents ......................................................................................................................... iv List of Tables ............................................................................................................................... vii List of Figures ............................................................................................................................. viii Acknowledgements ........................................................................................................................x Dedication ..................................................................................................................................... xi Chapter 1: Introduction ................................................................................................................1 1.1 Related Work .................................................................................................................. 3 1.2 Ontological Matching ..................................................................................................... 4 1.3 ASMOV Algorithm ........................................................................................................ 
4 1.4 Visualizations .................................................................................................................. 5 1.5 Contributions................................................................................................................... 7 Chapter 2: Algorithm ....................................................................................................................8 2.1 Definitions....................................................................................................................... 9 2.2 AST Generation ............................................................................................................ 10 2.2.1 Parser......................................................................................................................... 10 2.2.2 Common Abstract Syntax Tree Format .................................................................... 11 2.3 Comparison ................................................................................................................... 12 2.3.1 Symbol Matching ...................................................................................................... 13 2.3.1.1 Lexical Matcher ................................................................................................ 15 v  2.3.1.2 Structural Matcher ............................................................................................ 16 2.3.1.3 Extensional Matcher ......................................................................................... 16 2.3.1.3.1 Path Matching ............................................................................................. 17 2.3.1.3.2 Statement Kind Similarity ........................................................................... 22 2.3.2 Code Block Comparison ........................................................................................... 24 2.3.2.1 Variable Matching ............................................................................................ 24 2.3.2.2 Node Stream Generation (Matchable Nodes) ................................................... 24 2.3.2.3 Node Stream Comparison ................................................................................. 27 2.3.2.3.1 Standard Matchable Node Comparison....................................................... 27 2.3.2.3.2 Code Symbolic Matching ............................................................................ 27 2.3.2.3.3 Second Round of Diff Algorithm ................................................................ 30 2.4 Rendering ...................................................................................................................... 31 2.4.1 Formatting ................................................................................................................. 32 Chapter 3: Implementation .........................................................................................................34 3.1 AST Generation ............................................................................................................ 34 3.1.1 Parser......................................................................................................................... 34 3.1.2 Common Abstract Syntax Tree Format Considerations ........................................... 35 3.2 Comparison ................................................................................................................... 
35 3.2.1 Diff Algorithm .......................................................................................................... 35 3.3 Visualization ................................................................................................................. 36 3.3.1 Mini-map Scroller ..................................................................................................... 36 3.3.2 The Pop-up Viewer ................................................................................................... 37 Chapter 4: Evaluation .................................................................................................................38 vi  4.1 Automated Results ........................................................................................................ 38 4.1.1 File Matching ............................................................................................................ 38 4.1.2 Comparison ............................................................................................................... 39 4.2 User Testing Results ..................................................................................................... 41 4.2.1 User Feedback ........................................................................................................... 51 4.3 Threats to Validity ........................................................................................................ 52 Chapter 5: Conclusion .................................................................................................................53 Bibliography .................................................................................................................................54 Appendices ....................................................................................................................................56 Appendix A ............................................................................................................................... 56 A.1 Structural Differences and Corresponding SDAT Support ...................................... 56 A.2 Parameters Used in the Implementation ................................................................... 58 Appendix B ............................................................................................................................... 59 B.1 Permission Forms...................................................................................................... 59  vii  List of Tables   Table 1 Path Matching Related Definitions .................................................................................. 17 Table 2 File Matching Results ...................................................................................................... 39 Table 3 SDAT Coverage ............................................................................................................... 40 Table 4 Automation Results .......................................................................................................... 40 Table 5  File Length Information for the Files Used in User Testing........................................... 42 Table 6 Number of Errors per File Including and Excluding "Code Smells" (i.e., Validation, Asserts, etc.). ................................................................................................................................. 
43 Table 7 Average Time .................................................................................................................. 45 Table 8 Group Averages of Time Compared to Overall Averages .............................................. 46 Table 9 Accuracy per File ............................................................................................................. 50 Table 10 Group Averages of FMeasures Compared to Overall Averages ................................... 51 Table 11 Some of the Structural Difference between Java and CSharp and Corresponding SDAT support........................................................................................................................................... 56  viii  List of Figures   Figure 1 Definition of Weighted Calculator ................................................................................... 5 Figure 2 Screen Shot of SDAT ....................................................................................................... 6 Figure 3 Algorithm Overview ......................................................................................................... 9 Figure 4 Examples of Preamble and Postamble ........................................................................... 11 Figure 5 Simplified CAST Hierarchy ........................................................................................... 12 Figure 6 Comparison Process Overview....................................................................................... 13 Figure 7 Lexical, Structural and Extensional Scores are Combined into One Score ................... 14 Figure 8 Mapping Pair Selection from a Combined Score ........................................................... 15 Figure 9 Sample Code ................................................................................................................... 18 Figure 10 CAST Model for Figure 9 “Sample Code” .................................................................. 18 Figure 11 Example of Path Generation for the “return true” Statement of Figure 9 Sample Code”....................................................................................................................................................... 19 Figure 12 Calculation of Statement Type Score Example ............................................................ 23 Figure 13 Code Snippet (top left), CAST Model (top right) and Corresponding Node Stream Generated from the CAST Model (bottom) .................................................................................. 26 Figure 14 Resolvable Symbolic Matching .................................................................................... 28 Figure 15 Unresolvable Symbolic Matching (Inconsistent) ......................................................... 29 Figure 16 Unresolvable Symbolic Matching (Alias) .................................................................... 29 Figure 17 Example of the Second Round of Diff Algorithm ........................................................ 31 Figure 18 Code Before Alignment ................................................................................................ 33 ix  Figure 19 Code After Alignment .................................................................................................. 33 Figure 20 Basic Mini-map Scroller............................................................................................... 
36 Figure 21 A Screen-Shot of the Pop-up Viewer ........................................................................... 37 Figure 22 Total Time per File per User ........................................................................................ 44 Figure 23 The BaseColor Error..................................................................................................... 47 Figure 24 FMeasure Including "Code Smell" Errors .................................................................... 48 Figure 25 FMeasure Excluding "Code Smell" Errors ................................................................... 49  x  Acknowledgements  I would like to thank John Bridal for his advice and assistance.  I would also like to thank my supervisor, Dr. Eric Whohlstadter, for his guidance, patience, wisdom and help in creating this thesis.   xi  Dedication Dedicated to my mother, Mari Biehn, for instilling a lifelong love of learning and my wonderful wife, Yuriko Biehn, who has been my support and encouragement. 1  Chapter 1: Introduction  Software developers are often faced with the task of comparing two or more versions of software.  Typical usages of software comparison utilities include: a code-review prior to check-in, tracking down a recently introduced regression, and searching for code-clones in the source code (for future refactoring).  However, most traditional source code comparison tools typically use simple text-to-text comparison (with some simple rule-based comparisons for comments), which has the drawback of showing superfluous differences during comparison.  Many projects, for a variety of business reasons, ship products and software development kits (SDKs) using multiple programing languages.  Examples of these programming languages may include CSharp, Java, [1], [2], [3], [4].  The author of this study has experience working at a large software company with both Java and CSharp projects.   A typical use case for cross-language comparison is an organization that has multiple version of a project written completely in Java and completely in CSharp.  The organization wishes to keep the two codebases as similar as possible to reduce the amount of testing and maintenance required.  It is desirable for a developer to compare the two projects in order to ensure that a fix has correctly been checked into both projects.  In other cases, one project may work correctly for one work-flow but the second project does not work correctly.  A developer may wish to compare the two projects to gain insights into the difference and resolve the issue.   This paper focuses on describing a tool we have created to address some of the unique problems associated with cross-language comparison.   We propose creating a tool that uses the structure of the code as part of the comparison.  2  This tool offers many advantages over traditional comparison utilities.  Some of the advantages include: • Reduced noise, for example, renamed variables or reordered methods can be safely ignored during comparison; • Cross-language comparisons become possible;  • Better visualizations, such as syntax highlighting, special visualizations for different types of statements and whitespace, an over-view of precisely which files have changed structurally, etc.; and • Better user interaction, such as searching for a method by name, “jumping” to another method from one method call statement, filtering unwanted information, etc. 
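To make the first two advantages concrete, consider the following pair of equivalent methods (a hypothetical illustration; these snippets are not drawn from the projects evaluated later).  A line-based textual diff reports nearly every line as changed because of the renamed variable, the method-name casing and the brace placement, while a comparison based on code structure can match the two methods and report no meaningful difference.

    // Java version (illustrative)
    int sumTo(int limit) {
        int total = 0;
        for (int i = 0; i < limit; i++) {
            total += i;
        }
        return total;
    }

    // CSharp version (illustrative): identical structure, but the accumulator is renamed
    // to m_total, the method name is capitalized and the braces are on their own lines
    int SumTo(int limit)
    {
        int m_total = 0;
        for (int i = 0; i < limit; i++)
        {
            m_total += i;
        }
        return m_total;
    }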
This paper introduces the implementation and evaluation of the Structural Difference Analysis Tool (or SDAT for short).  This new software program improves the efficiency and effectiveness of comparing Java and CSharp files.  The user specifies one CSharp file and one Java file and then the files are compared.  The tool borrows aspects from the ASMOV algorithm (Automated Semantic Matching of Ontologies with Verification) [13] and the standard diff algorithm for comparison.  SDAT also visualizes the structural differences between the two files.   Even though comparing several different types of languages may be required for a fully functioning software comparison tool, we chose to limit the scope to Java and CSharp comparison.  In the current revision of the SDAT tool, we have limited the scope of the SDAT project by focusing on the differences that can be found by comparing the information contained within the two files. Our assumption is that the code structure does not significantly change between the two languages, only the syntax changes.  The approach described in this paper may not perform well  3  for languages which are radically structurally different.  A structural difference between one of the files may indicate the presence of a bug or an area requiring further investigation.  Our hypothesis is that this tool will be more effective in detecting potential errors and differences between two different languages in terms of accuracy and speed than industry standard source code comparison tools.  Chapter 1 of this paper will outline related work and background information.  Chapter 2 describes the algorithm and Chapter 3 provides a discussion related to the implementation of the algorithm.  We discuss the evaluation results in Chapter 4, and the conclusion in Chapter 5.   1.1 Related Work Numerous source comparison programs exist such as Beyond Compare [6], KDiff [7], and WinMerge [8].  However, the comparison algorithms for these programs will usually only consider simple heuristics such as ignoring comments and ignoring white space.  They generally do not use an abstract syntax tree (AST) to compare the source files.  These programs typically rely on the diff algorithm.  The longest common sequence of lines is determined and the lines in between the matched lines are typically considered lines that are missing.  Several programs for XML comparison exist such as HTMLDiff [9], Araxis Merge [10], and Guiffy [11].  However, these programs are not aware of the source code structure and therefore may indicate differences in code when there is not a semantic difference between two source files (such as a variable rename or code comments). Manzoor, Johnson and Nguyen [12] explored using structural comparison as part of source code management.  A case study demonstrated that this approach increases the amount of conflicts that can be resolved automatically and reduces the number of merge errors However  4  they did not explore the issues related to cross-language comparison which requires a different algorithm.  Cross-language comparisons require techniques to resolve differences in language syntax, structure and libraries.  The SDAT tool described in this thesis addresses this cross-language comparison issue.  1.2 Ontological Matching Source code comparison can be thought of as ontological matching relying on the structure of the abstract syntax tree.   
An Ontology, O, consists of "a set of entities that are related by a number of relations" [12].  A class is a representation of a concept within the ontology; this could be the concept of a class-definition object or a method object of an abstract syntax tree.  A literal is a concrete data value such as a number or string.  A datatype defines the values a literal is allowed to have (e.g., "Number" or "Date").  Properties are the relations entities can have between each other.  Properties and datatypes are defined by the grammar of the source code languages.

Ontological matching is the comparison of two ontologies.  Since source code matching can be considered a form of ontology matching, we can leverage existing ontological matching algorithms.

1.3 ASMOV Algorithm

Automated Semantic Matching of Ontologies with Verification [12] is an ontological matching algorithm.  ASMOV uses lexical and structural characteristics to calculate similarity values and then verifies these results semantically.

We chose ASMOV because it is easy to extend and has been proven to perform well.  It was among the top matchers for most categories in the Ontology Alignment Evaluation Initiative 2008 [8], a competition in which ontology matchers from more than 15 universities were evaluated on benchmarks that included bibliographic references.

The ASMOV algorithm combines lexical, element-level and structural-level similarity measures into a single score between entities using a weighted calculator.  The entity matches are chosen greedily (the ones with the highest score) and verified semantically.

Weighted Calculator
Let Fi be a feature score of a certain match in [0..1].
Let Wi be a non-negative weight for Fi.
Then, over all i:
    MatchScore = ( Σi Wi ∗ Fi ) / ( Σi Wi )
Figure 1 Definition of Weighted Calculator

1.4 Visualizations

Once a code comparison is made, the results must be presented to the user in a clear, understandable manner.  We call this visualization.  Papers on source code visualization [14], [15], [16], and [17] have explored issues regarding tree comparison visualizations.  Several programs are available for visualizing source code, such as CodeFlower [18] (which uses a radial glyph system) and aiCall (which uses flow-charts).  Both Voinea et al. [8] and Jones et al. [9] used pixel maps to indicate areas of interest in source code files.  However, one pixel line cannot represent more than one document line, so multiple or large display maps are required in order to represent large sets of data.  Appert and Fekete [10] introduced the idea of orthogonal zooming.  We have extended this idea towards pixel-maps for software comparison.  This component is called the mini-map scroller and is described in more detail in section 3.3.1.

Figure 2 Screen Shot of SDAT
The program differences are visualized side-by-side, with red indicating lines of code which appear similar and blue indicating parts of code that are missing.  The location of the mini-map scroller is also indicated.

1.5 Contributions

This paper outlines a flexible algorithm for comparing Java and CSharp (with possible parameter values), describes a generic abstract syntax object model and techniques for visualizing software comparison in general, and discusses some of the issues related to cross-language comparison (specifically Java to CSharp comparison).  The implementation is validated using both automated and manual testing.
The algorithm/implementation can be further enhanced in order to support future software comparisons.   8  Chapter 2: Algorithm  The fundamentals of the algorithm for this project can be divided into four parts: parser, comparison, renderer and visualization.  The Parser stage consists of abstract syntax tree generation, with the result stored into a unified syntax tree (described in full in 2.2.2).  Unified syntax trees from each of the source files are compared and the results are stored in memory.  Comparison is the analysis phase.  If the comparison phase considers two nodes to be a match, they are called Node Pairs.  Rendering is the process of transforming the unified syntax tree back into source code (also known as Pretty Printing).  This process includes calculating marker text positions to indicate differences amongst the sources.  The visualization process displays the differences and permits user interaction, such as finding differences, highlighting the differences and finding text.  The visualization process is discussed in more detail in section 3.3 of the implementation chapter.  Parser•Transforms source code into language specific AST•"Normalizes" AST into a commonly defined syntax treeComparison•Compare nodes, determine which nodes are exact matches, similar and missing •Stores resultsRenderer•Also called "Pretty Printing" -- transforms the AST back into text form•Arranges text to facilitate Visualization•Marks regions of text as being similar or missingVisualization•Presents differences to user visually•Allows navigation (such as "next difference" and map scrolling) 9  Figure 3 Algorithm Overview We explored using existing comparison algorithms, but we were unable to find algorithms for cross-language comparison.  Creating a new algorithm gives us the greatest degree of control over the parameters and implementation of the algorithm.  2.1 Definitions  The following definitions are used throughout the chapter.  The occurrences will be marked in underline.  Word Definition Array Declaration Syntax The syntax for declaring arrays.  i.e "int x[] = 5" or "int[] x = 5"  BlockNode The syntax node that represents a block e.g. "{ … }" ClassNode The syntax node that represents a class ClassType Meta information about this node Code Block The contents of a block node Code Block Comparison Once a method (and its symbols) have been compared, this comparison will compare the contents of the block nodes  Common Abstract Syntax Tree Format(CAST) A unified Abstract Syntax Tree capable of representing both Java and CSharp nodes. FileNode The Syntax Node that represents a file Exact Node/ Match A node or match which is considered to be exactly the same for all effective purposes. Indexers A CSharp concept that allows users to access objects using parameters.  e.g.  "myCollection[index] = newValue;" Modifiers Attributes of declaration.  E.g. "public", "private", "final", etc. Orphan Node/Match A node or match which has no correspond similar or exact match. Postamble Comments and white space that is semantically associated with a statement occurring after a statement.  Preamble Comments and white space that is semantically associated with a statement occurring before a statement.   Similar Node/Match  A node or match which is determined to be semantically similar Static Blocks Blocks of static code (JAVA only)  10  Word Definition Structural Difference Analysis Tool(SDAT)  The implementation tool of this thesis, capable of comparing JAVA and CSharp files. 
Symbol Matching  Comparison involving the symbols (such as variable declarations, functions, inner classes and enums).   Syntax Node  A node of the abstract syntax tree. Textual Span The start and end of a syntax tree node TypeNode The syntax node that represents a type.  Types can be defined natively (such as int, double, bool), as part of the runtime language (String, Double, Char) or user defined (such as a user defined class) Variable Declaration The declaration of a variable.  E.g. int x = 5; Variable Initializer The expression used to initialize a variable declaration.  int x = <variableInitializer>;   2.2 AST Generation  2.2.1 Parser In the first stages of the parser algorithm, the source code is transformed into a syntax tree.  An abstract syntax tree will give us access to the structural information of a program.  The philosophy of the parsing stage is to retain the maximum amount of information possible.  In addition to retaining information about the textual span of each of the nodes, leading and trailing whitespace and comments are retained, known as preamble and postamble respectively.  The intention of preamble and postamble is to semantically associate whitespace and comments with a statement.  The algorithm for calculating preamble and postamble is described as follows:  any whitespace and comments found after a statement and until a new line character are considered postamble; whitespace and comments before and up to the previous statement’s postamble is considered the statement’s preamble.      11   Figure 4 Examples of Preamble and Postamble Preambles are comments and white space that are semantically associated with a statement occurring before a statement.  Postambles are comments and white space that are semantically associated with a statement occurring after a statement.  2.2.2 Common Abstract Syntax Tree Format Although CSharp and Java are grammatically similar in many respects, there are several grammatical differences between the two languages.  In addition, the two syntax tree generators often generate distinct syntax trees for syntactically equivalent statements.   Some examples of this are: static blocks (Java only), array declaration syntax, properties, and indexers.  This necessitates a strategy to resolve this scenario.  We created an object model generic enough to describe both languages and called it the Common Abstract Syntax Tree (CAST).  Nodes from each language are normalized into a standard representation, generally using the most generic representation possible.  Static blocks, a Java only concept, are treated as functions.  A CSharp property’s getter and setter functions are also treated as functions.  Constructors are also considered functions.  Appendix A.1 details further differences between functions in CSharp and Java, and outlines SDAT’s support of these differences.  12   FileNodeClassNode MethodDeclarationNodeStatementNodeEnumNodePropertyNode-getter : MethodDeclarationNode-setter : MethodDeclarationNodeExpressionStatement etc...BlockNodeStatement Figure 5 Simplified CAST Hierarchy This class diagram indicates the hierarchy to represent a source code file.  2.3 Comparison  Comparison is performed in a top-down order.   The classes are first compared using Symbol Matching, which matches all symbol declarations including method declarations, variable declarations and inner class declarations.  Matched methods and nested classes are then compared.   
If we match two methods, variable declarations between the two matched methods are compared using Symbol Matching and then we perform a second type of comparison called Code Block Comparison to compare the code blocks of the method.  The results of this  13  comparison are stored in a bi-directional hash map-like structure for future use in the algorithm. Figure 6 Comparison Process Overview   2.3.1 Symbol Matching In the first stage of the matching algorithm, classes are compared at a high level.   SDAT compares symbols declared in the class definition such as variable declaration, functions, inner classes and enums.  Symbol matching allows us to determine potential node pairs for method nodes without the need for complex comparisons between all of the methods from one program to all of the methods of another program. The basis for this comparison is the ASMOV algorithm.  Only nodes of similar type are compared, i.e., member variables are not compared to functions and vice versa.  Nodes are compared using lexical, structural and extensional similarity (described in the following sections).  The results from the individual matchers are combined into a single score.  The node pair with the greatest score is removed from the set if its Matched methods are compared one by oneSymbol Matching of classes's symbols (Section 2.3.1)Matched Symbols are used for matching in the next phaseMethod Symbols are compared(Section 2.3.2.1)List of matched and unmatched symbolsCode Block Comparison(Section 2.3.2.2, 2.3.2.3) 14  score exceeds the threshold.  This is repeated until there are no node pairs with scores greater than the threshold.  If a node pair comparison’s structural score is 100%, it is considered an exact match, and otherwise, we consider this match to be a similar match.                      Figure 7 Lexical, Structural and Extensional Scores are Combined into One Score    Main(string[] args) PropertiesTest()  getValue() 0 0 main(String args[]) 0.95 0 testProperties() 0 0.95  Main(string[] args) PropertiesTest() getValue() 0 .45 main(String args[]) 1 .54 testProperties() .54 1  Main(string[] args) PropertiesTest()  getValue() 0 .3 main(String args[]) 1 0 testProperties() 0 .6  Main(string[] args) PropertiesTest() getValue() 0 .36 main(String args[]) .99 .35 testProperties() .35 .91 Lexical  StructuralExtensional Combined   15           Figure 8 Mapping Pair Selection from a Combined Score The mapping pair {main(String args[]),Main(string[] args)} is selected because it has the highest maximum score above the threshold.  The associated rows and columns are removed from the table.  Afterwards, the mapping pair {testProperties(), PropertiesTest()} is selected.  After this step there are no more mapping pairs with a score greater than the threshold, therefore the algorithm is complete.  The semantic validation phase of ASMOV is not used because only semantically valid matches are considered, therefore this step in unnecessary. The heuristic threshold and weighting values used are described in Appendix A.2.  2.3.1.1 Lexical Matcher If an identifier is associated with a node, it is used for lexical matching.  Examples of nodes with identifiers are classes, variable declarations and function declarations.  In the case of a property getter or setter function, the property’s identifier is used.  Constructors and static blocks are considered to have a null identifier so the identifier comparison feature can be used.    
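The greedy selection illustrated in Figure 8 and the weighted calculator of Figure 1 can be sketched in a few lines of Java.  This is a simplified illustration that assumes the combined scores are held in a dense matrix and that ties are broken arbitrarily; the class and method names are hypothetical and do not correspond to SDAT's actual implementation.

    import java.util.HashMap;
    import java.util.HashSet;
    import java.util.Map;
    import java.util.Set;

    class GreedySymbolMatcher {

        // Weighted calculator (Figure 1): sum(Wi * Fi) / sum(Wi).
        static double weightedScore(double[] featureScores, double[] weights) {
            double numerator = 0, denominator = 0;
            for (int i = 0; i < featureScores.length; i++) {
                numerator += weights[i] * featureScores[i];
                denominator += weights[i];
            }
            return denominator == 0 ? 0 : numerator / denominator;
        }

        // Greedy pair selection (Figure 8): repeatedly take the highest combined score
        // above the threshold and remove the corresponding row and column.
        static Map<Integer, Integer> selectPairs(double[][] combined, double threshold) {
            Map<Integer, Integer> mapping = new HashMap<>();
            Set<Integer> usedLeft = new HashSet<>();
            Set<Integer> usedRight = new HashSet<>();
            while (true) {
                int bestLeft = -1, bestRight = -1;
                double best = threshold;
                for (int l = 0; l < combined.length; l++) {
                    if (usedLeft.contains(l)) continue;
                    for (int r = 0; r < combined[l].length; r++) {
                        if (usedRight.contains(r)) continue;
                        if (combined[l][r] > best) {
                            best = combined[l][r];
                            bestLeft = l;
                            bestRight = r;
                        }
                    }
                }
                if (bestLeft < 0) {
                    return mapping;  // no remaining pair exceeds the threshold
                }
                mapping.put(bestLeft, bestRight);
                usedLeft.add(bestLeft);
                usedRight.add(bestRight);
            }
        }
    }

For the combined matrix of Figure 8 and a threshold of, say, 0.5, this sketch would first select {main(String args[]), Main(string[] args)} and then {testProperties(), PropertiesTest()}, matching the walk-through above.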
(Figure 8 matrices: combined scores before and after the first pair is selected and removed)

                         Main(string[] args)   PropertiesTest()
    getValue()           0                     .36
    main(String args[])  .99                   .35
    testProperties()     .35                   .91

                         PropertiesTest()
    getValue()           .36
    testProperties()     .91

Unlike ASMOV, associated comments are not considered, due to our observation that the comments add computational complexity without a significant increase in matching accuracy.

2.3.1.2 Structural Matcher

Structural Matching considers the intrinsic properties of each of the nodes.  Examples of these properties are return type, parameters (both the number of parameters and the type of each parameter) and modifiers; in the case of variables we also consider variable initializers.  A score is given to each of these properties and the scores are combined using a weighted calculator.  Comparison can be done exactly (for example, comparing whether the value of a constant is the same) or more generally, such as type comparison (i.e., float vs. double, double vs. java.lang.Double).

2.3.1.3 Extensional Matcher

Extensional similarity compares relationships amongst other nodes in the same syntax tree.  Two types of extensional matching are described here: path matching and statement kind similarity.

2.3.1.3.1 Path Matching

We define a path and its related definitions as follows:

Table 1 Path Matching Related Definitions
Path Node   A class containing the class type, a key and (optionally) an index.  The key and index are usually defined relative to the parent.
Path        A collection of Path Nodes.  Since the CAST model consists only of member variables and ordered collections, the path information for a node and all of its ancestors is enough to uniquely identify the node.
Key         A property that identifies the path node relative to its parent.  It can be determined by using code reflection on the parent node's field name.  In the case of a FileNode, the file name is used as the key.
Index       In the case of collections, this identifies the node's index number.
ClassType   Meta information about this node.

Figure 9 Sample Code

Figure 10 CAST Model for Figure 9 "Sample Code" (additional attributes have been removed for clarity):
    FileNode (classes : Array<ClassNode>)
      ClassNode (methods : Array<MethodNode>)
        MethodNode (blockNode : BlockNode)
          BlockNode (statements : Array<StatementNode>)
            IfNode (blockNode : BlockNode)
              BlockNode (statements : Array<StatementNode>)
                ReturnStatementNode

Path matching is defined as the comparison of two paths.  Path matching is only done for variable declarations within a method block (since class declarations will all share the same path).  Two paths are compared for similarity and a score is obtained.

The path matching algorithm uses an exponentially decreasing scale and a relevance score to give a locational bias.  We subtract an exponentially decreasing score from 1.0, rather than building up from zero, in order to avoid potential floating point arithmetic errors that may arise when there is a sum of increasing scores.

Path matching gives our algorithm a degree of locational bias.  A symbol node near the top of a method block is more likely to map to a symbol node near the top of the other method block than to a symbol node near the middle that is heavily nested inside several layers of block nodes.
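The definitions in Table 1 can be captured in a small data class.  The sketch below is hypothetical (class and field names are chosen for illustration and need not match the CAST implementation); Figure 11, shown next, lists a concrete path built from such nodes.

    // Hypothetical sketch of a path node as described in Table 1.
    class PathNode {
        final String  classType;  // meta information about the node, e.g. "IfNode"
        final String  key;        // field name in the parent (the file name for a FileNode)
        final Integer index;      // position within the parent collection, or null

        PathNode(String classType, String key, Integer index) {
            this.classType = classType;
            this.key = key;
            this.index = index;
        }
    }

    // A path is simply the list of PathNodes from the FileNode down to the node itself;
    // because the CAST model contains only member variables and ordered collections,
    // this list uniquely identifies the node.
    // Example: java.util.List<PathNode> path = new java.util.ArrayList<>();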
Figure 11 Example of Path Generation for the "return true" Statement of Figure 9 "Sample Code":
    ClassType=FileNode         Key=C:\skool\IfTest_Base.java   Index=null
    ClassType=ClassNode        Key=classes                     Index=0
    ClassType=MethodNode       Key=methods                     Index=0
    ClassType=BlockNode        Key=blockNode                   Index=null
    ClassType=IfNode           Key=statements                  Index=1
    ClassType=BlockNode        Key=blockNode                   Index=null
    ClassType=ReturnStatement  Key=statements                  Index=0
"isEven()" is the method at index 0 of the "methods" collection of its parent node.  The if statement is the statement at index 1 of the "statements" collection of its parent node.

The algorithm for Path Matching is described below:

1:  Let L and R represent the left node and right node.
2:  Let LPath[] and RPath[] represent the path nodes of each path.
3:  Let maxDepth = MaxLength(LPath, RPath)  // the maximum depth of both paths
4:  Let i = path depth of the method declaration + 1
5:  Let factor = 0.5
6:  Let returnValue = 1  // value in [0..1], 1 being a perfect match
7:  Let factorFloor = 0.00001
8:  While (i < maxDepth)
9:     Let pathNodeL and pathNodeR = null
10:    If (i < LPath.length) pathNodeL = LPath[i]
11:    If (i < RPath.length) pathNodeR = RPath[i]
12:    score = CalculateScore(pathNodeL, pathNodeR)
13:    score = score * factor
14:    returnValue -= score
15:    factor = factor / 2
16:    factor = Max(factor, factorFloor)
17:    i = i + 1
18: return Max(0, returnValue)

Lines 1-7:   Perform variable initialization.
Line 8:      Iterate through each of the path segments until the end of the longer path is reached.
Lines 9-11:  Initialize pathNodeL and pathNodeR to the current segment of each path (or null if that path has already ended).
Line 12:     Call CalculateScore (described below) to score the two Path Nodes.  The value is in [0..1], with 0 being a perfect match.
Lines 13-14: Scale the score by the current factor and subtract it from returnValue.
Lines 15-17: Halve the factor, bounded below by factorFloor to avoid rounding errors associated with very small float values, and advance to the next segment.  Halving the factor makes segments near the beginning of the path more important than segments near the end.
Line 18:     Make sure the returnValue is always >= 0 (due to possible floating point rounding issues).

1: CalculateScore(pathNodeL, pathNodeR)
2:    If (pathNodeL == null or pathNodeR == null) return 1
3:    If (pathNodeL.index == null)
4:       If (pathNodeR.index == null) return 0
5:       else return 1
6:    If (pathNodeR.index == null) return 1
7:    diff = Min(PATH_MATCHING_MAX_DIFFERENCE,
8:               ABS(pathNodeL.index - pathNodeR.index))
9:    return diff / PATH_MATCHING_MAX_DIFFERENCE

Line 1:      This function returns a score for two segments in [0..1], 0 being a perfect match.  It considers the difference between the index attributes.
Lines 2-6:   If either of the Path Nodes is null, return 1.  If both indexes are null, return 0 (a perfect match); if only one index is null, return 1.
Lines 7-9:   Calculate the absolute difference between the indexes, capped at PATH_MATCHING_MAX_DIFFERENCE (differences beyond this value are unlikely to matter), and return it as a fraction of that maximum possible difference.

2.3.1.3.2 Statement Kind Similarity

Metadata about the kinds of child statements present under a function is also considered.  If one function contains a large number of method calls while the other does not, it is a good indication that these functions are not a match.
In order to do this, we keep track of a count of each kind of statement (e.g., for-loop count, method-call count, expression-statement count, etc.).  The algorithm for calculating the score is described below.

1:  Let L and R represent the left node and right node.
2:  Let Statements(L) and Statements(R) be the sets of statement kinds appearing in L and R.
3:  Let LCount(s) and RCount(s) be the count of an individual statement kind s in L and R.
4:  Let SLR = Statements(L) ∩ Statements(R)
5:  Let SLonly = Statements(L) - SLR
6:  Let SRonly = Statements(R) - SLR
7:  Let W(weight, score) be a weighted calculator.
8:  For each StmtType ∈ SLR
9:     score = Min(LCount(StmtType), RCount(StmtType)) / Max(LCount(StmtType), RCount(StmtType))
10:    W.add(Max(LCount(StmtType), RCount(StmtType)), score)
11: For each StmtType ∈ SLonly
12:    W.add(LCount(StmtType), 0)
13: For each StmtType ∈ SRonly
14:    W.add(RCount(StmtType), 0)
15: return W.WeightedScore()

Lines 1-7:   Initialize all of the values.
Lines 8-10:  Iterate over the intersection of statement kinds.  The score for each kind is the minimum count divided by the maximum count.  For example, if L has 5 return statements and R has 3 return statements this evaluates to:

    Min(LCount(ReturnStatement), RCount(ReturnStatement)) / Max(LCount(ReturnStatement), RCount(ReturnStatement)) = Min(5, 3) / Max(5, 3) = 3/5

Figure 12 Calculation of Statement Type Score Example
For the left with 5 return statements and the right with 3 return statements the result is 3/5.

We then add this score to the weighted calculator, weighted by the maximum of the two counts for that statement kind.
Lines 11-14: In the remaining loops, the statement kinds that are not contained in the intersection are added to the weighted calculation with a score of zero.

2.3.2 Code Block Comparison

Once functions are matched together, child nodes from the matched functions are compared.  Before performing code block comparison, variable declarations are matched using the symbol matching algorithm; matched variable declarations help align the comparison.

For code block comparison, the abstract syntax tree is flattened into a stream of statement-like tokens and compared using a diff-like algorithm.  The standard diff algorithm has been enhanced to include improved similar matches and conditional symbolic matches (explained further in Section 2.3.2.3.2).

2.3.2.1 Variable Matching

We again use the Symbol Matching Algorithm described in Section 2.3.1, but this time we use the variable declarations of the methods instead of the symbols of the classes.

2.3.2.2 Node Stream Generation (Matchable Nodes)

Now that the objective has been reduced to comparing one method to another, we translate the CAST model into a form that can be used by an enhanced diff algorithm.  The strategy is to flatten the hierarchical CAST structure into a stream of tokens in order to leverage standard diff comparison.

We chose a diff algorithm because it is relatively simple to implement and is already used by other software comparison tools for code comparison.  Tree-based algorithms were also considered.  While tree-based comparisons offer more flexibility by enabling a wider range of comparisons (for example, comparing entire sub-trees allows a list of cascading if statements to be compared more naturally against case statements), they are generally more complicated and computationally more expensive given the larger number of potential comparisons.
These types of comparisons were not in scope for this project, but if a need arises, techniques are available for handling this case such as segmenting the stream and performing tree comparison for this area. To address the short-comings of the flattening approach we use unique terminals in the node stream.  For example, BlockNodes are split into a “StartBlockNode” (for the open brace) and an “EndBlockNode” (for the closing brace).  In general, nodes that typically occur in one line of code are considered “matchable” -- this facilitates side-by-side matching and visualization.  The CAST nodes are traversed in pre-order and every time a matchable node is encountered it is written to the stream.  When a child node is a statement that is not a block node, pseudo StartBlockNode and EndBlockNode are written to the stream before and after the statement.  This allows statements like “if(<condition>) return;” and if(<condition>) { return; }” to compare as exactly equal. Statements and class definitions are always considered matchable nodes.  Conditional expressions of if statements, loops statements and case statements are also considered matchable nodes.      26    ForConditionNodefor (int I = 0; i < 10;)BlockNode{ }FunctionCallNodefoo(i);ForConditionNodeFor (int I = 0; i < 10;)StartBlockNode{FunctionCallNodefoo(i);EndBlockNode}  Figure 13 Code Snippet (top left), CAST Model (top right) and Corresponding Node Stream Generated from the CAST Model (bottom) The BlockNode is split into a StartBlockNode and EndBlockNode in the Node Stream.     The original code The Syntax Tree for this code Generated Node Stream  27  2.3.2.3 Node Stream Comparison A standard diff algorithm is used to compare the two node streams.  The logic for node-to-node comparison is described in 2.3.2.3.1 and 2.3.2.3.2.   In the final stages of the algorithm, unmatched nodes are matched using less stringent comparison standards.   This allows us to find similar matches, highlighting possible sources of error to the end user.  The output from the diff algorithm is a collection of mappings, which indicate matches and orphan nodes.  2.3.2.3.1 Standard Matchable Node Comparison Matchable nodes are compared by examining the type of the node and the properties of the node.  If the node types do not match, or the properties do not match, the nodes do not match.  CSharp Properties and Java setters and getters are handled using custom logic because their structure is different.  Special comparison logic is also needed for TypeNodes and identifiers.  TypeNodes that are similar, such as float, double and Double, can be configured to return similar matches or exact matches.   Identifiers are compared by using Code Symbolic Matching.    2.3.2.3.2 Code Symbolic Matching Often times a symbol will be named differently amongst the two programs because of naming conventions or different library naming support.  (CSharp will often prefix their variables with “m_” and give properties and method names an upper case letter).  Since the naming of a variable does not affect the structure of a program, we wish to establish conditions when two variables are considered the same. There are two cases of code symbolic matching, resolvable and unresolvable.   28  In resolvable symbolic matching, the original definition of the symbol (be it a method declaration, class declaration or variable declaration) can be located within the file.  If either file cannot locate the symbol definition, it is considered unresolvable symbolic matching.  
If the resolved definitions are consistent with the results from the previous (method/class) symbol mapping algorithm, the result is a match.  Otherwise, the result is considered to be a non-match.   Figure 14 Resolvable Symbolic Matching In this example, the variable declaration leftA maps to rightA, leftB maps to rightB and leftC maps to rightC.  The ‘leftA’ variable from the print statement on the left for line#9 resolves to the declaration at line #6.   However, the ‘rightB’ variable (on the right-hand side) from line #10 maps to the declaration on line #7 -- this does not match.  In unresolvable symbolic matching, the symbol cannot be resolved.  This will be the case when the symbol is declared somewhere outside of the file.  In this situation, the symbol is set to be conditionally equal to another symbol and remembered in a set of conditions.  29  After the diff algorithm has completed, conditionally equal matches are re-examined.  In the set of all conditionally equal matches, a conditionally equal match is said to be inconsistent if one of the symbols is also used in another conditionally equal match.  All conditional equal matches involving inconsistent matches are said to be similar matches.  All conditional equal matches with consistent matches are considered to be exactly equal.  This prevents the case where one symbol from one file maps to multiple symbols of the other.    Figure 15 Unresolvable Symbolic Matching (Inconsistent) In this example, Token.LBRACKET cannot map to both Token.LEFT_BRACKET and Token.RIGHT_BRACKET, therefore it is shown as a similar match.   One additional consideration is that a symbol may have a fully qualified name (including the namespace) and a shortened name.  Therefore, we allow a symbol to have exactly one alias as long as the shortened symbol matches the last segment of the fully qualified name.   Figure 16 Unresolvable Symbolic Matching (Alias) In this case, the symbols are matched because “sdat.comparison.resources.Token.LEFT_BRACKET” appears to be an alias of “Token.LEFT_BRACKET”.  30   2.3.2.3.3 Second Round of Diff Algorithm After the first round of the diff algorithm we will be left with a collection of longest matched common subsequences.  However, unmatched nodes on the left and/or right side may exist in between these matched subsequences.  In the case of the remaining nodes existing solely on one side, we mark these nodes as orphans.  When there are both left and right nodes missing, we wish to distinguish orphans and similar nodes.  For the remainder of the nodes in between each of these blocks, the diff algorithm is performed again, only this time using a less strict conditions.  This gives us an opportunity to find nodes that are similar but not exact matches.  Two nodes are considered “matched” in this phase if they are the same statement type and in the case of expressions, the same expression type.  “Matched” nodes from this stage are considered similar nodes by the rest of the algorithm.       31    Figure 17 Example of the Second Round of Diff Algorithm Unmatched nodes from the left hand side and the right hand side are matched together using relaxed matching rules.  The result is a similar match.   2.4 Rendering The process of transforming the CAST nodes back into a human readable form is called rendering.  We chose to render as human readable source code, as opposed to a different structure such as a tree, because programmers are most familiar with this representation of their code.  
It is desirable to perform some formatting and add additional space in order to allow for side by side comparison.  Matching functions and variables are re-ordered so they appear side by side.  32  In a process similar to the code block comparison algorithm described in 2.3.1, the CAST object model is visited in order and certain nodes are transformed into objects capable of rendering themselves called renderable nodes.   Each renderable node has an ability to render in three stages: pre, contents and post.  Renderable nodes render themselves to a render stream, which is a class that keeps track of text and marks ranges of text as being orphaned or similar.  In the pre stage, preamble is rendered.  In the contents stage, the textual representation of the syntax node is rendered.  In the post stage, the postamble is rendered.  Nodes that cannot be rendered using this framework, for example a BlockNode with an open curly brace and a closing curly brace and pre and postamble associated with the braces, are separated into multiple nodes each with pre, contents and post.  Each renderable node also has child renderable nodes.  By visiting the CAST model in order and rendering both pre and contents before visiting children, and then rendering post after visiting children, it is possible to exactly reconstruct the original source code.    2.4.1 Formatting Another component, called a Dual Renderer, is composed of two render classes and is responsible for improving the rendering of both classes.   Matched properties and methods are re-arranged so that they appear side-by-side in order to help present the differences to the user.  In addition, statements that appear as one line on one side, but are multi line on the other side, are arranged as well (see Figure 18 Code Before Alignment” and Figure 19 Code After Alignment”).     33   Figure 18 Code Before Alignment Two files are compared using different formatting rules, which means the differences do not appear on the same line.  Figure 19 Code After Alignment The code is aligned and the differences appear on the same side.  In order to accomplish this, the Dual Render will render the pre and contents of both renders and then ensure both render streams are equalized to the same line by adding empty lines (and adding indent if it is the first child of the parent).  Afterwards, post is rendered and then both render streams are equalized to the same line. In this chapter we provided an overview for the algorithm used in SDAT to compare two files together.  The basics of the algorithm are parsing, comparison, and rendering.   34  Chapter 3: Implementation  The implementation uses a combination of Java and CSharp and is over 33,000 lines of code in total.  Comparison limitations are discussed in Appendix A.1.   3.1   AST Generation 3.1.1 Parser After evaluating several options for Java syntax tree generation, JSourceObjectizer [19] was chosen.  The reasoning behind this decision is JSourceObjectizer provides a strongly typed object model, is open source with a permissive license (BSD), and has an active community.  JSourceObjectizer is based on the ANTLR technology [3] and supports Java version 5.0.  For CSharp syntax generation, Microsoft’s Roslyn Compiler [20] was utilized.  We chose this compiler because it is provided by Microsoft (the primary owner of the CSharp language, so it is expected to be accurate); it has an active community; and it provides several tools for debugging issues (such as visualization of Syntax tree).  
The Roslyn compiler supports version 5.0 of CSharp.    JSourceObjectizer generates an abstract syntax tree while Roslyn generates a concrete syntax tree.  Since JSourceObjectizer creates an abstract syntax tree that may not contain all of the information needed in the further steps (such as comments and whitespace), this information was augmented by examining the token stream and adding additional information.  Using existing AST generators allows us to focus on the comparison logic.   35  3.1.2 Common Abstract Syntax Tree Format Considerations Roslyn is a CSharp application while ANTLR is written entirely in Java.  The CAST structure and comparison logic was written in Java.       One problem we must consider is that the CAST model exists in Java and the CSharp AST exists in CSharp, therefore we must find a way to transform the CSharp AST into the Java CAST byte code.  We address this problem by using the IKVM.NET Byte Code Compiler [21], which is an open source utility which that allows .NET code to interoperate with java byte code.  The CSharp code uses a series of builders to translate the Roslyn Object model into CAST objects using the IKVM interop and then this object model can be serialized and de-serialized using standard Java serialization.  This process saved time by eliminating the need to create custom serialization and deserialization logic between the two languages.    Reflection is used to generate code for the visitor pattern and comparison.  3.2 Comparison  3.2.1 Diff Algorithm Part of the diff algorithm is based on “A generic, Reusable Diff Algorithm in CSharp - II” [22], which resembles the standard Unix diff algorithm.  The code was ported from CSharp to Java.      36  3.3 Visualization 3.3.1 Mini-map Scroller  Figure 20 Basic Mini-map Scroller Basic Mini-map scroller shows the basic interaction mechanics of the Mini-map.    The Mini-map scroller is designed to work with any document, using a “lines and markers” concept.  Markers are user objects consisting of a colour and a line number.  It is recommended, but not required, that these colours match the same colours used in the document.     37   3.3.2 The Pop-up Viewer  Figure 21 A Screen-Shot of the Pop-up Viewer  The Pop-up Viewer box is accessed by clicking the View Pop-up button (pictured in Figure 21 “A Screen-Shot of the The Pop-up Viewer”).  This pop-up box will remain visible until the user clicks on the pop-up button again.  The pop-up view provides a simple visualization of the user’s current view via the sliders.  The Anchor Line is the first line displayed in the Mini-Map.  The map length refers to how many lines the Mini-Map should show at a time and the current line refers to the top line of the document view.  All of these values can be inputted directly into the text boxes or via the sliders.  We changed the slider UI to respond to a single click gesture (i.e., click on the knob, move the mouse left to right, and then click again to commit the value).  We hope that this new gesture makes it easier for the user to select certain areas because no drag clicking is required.  In contrast, drag clicking makes it more difficult to move the mouse due to the additional muscle control required.  The anchor line will update automatically if the Mini-map determines that it cannot fit the current line into the current map view.  The layout of the pop-up widget is east to west.  Another option we considered was laying out the sliders north to south.  
Chapter 4: Evaluation

The program was evaluated using two methods.  In the first method, two open source projects were evaluated using automated results, comparing SDAT against a standard text diff utility in terms of the number of lines of difference.  In the second method, the program was evaluated through user testing on a small subset of these source files.  The projects used for both evaluation methods are ANTLR and IText.  ANTLR is an open source parsing engine available in both CSharp and Java.  IText [2] is a PDF manipulation library also available in both CSharp and Java.

4.1 Automated Results
We performed automated testing to demonstrate the completeness and breadth of the SDAT tool.  The correctness of the automation was verified by a series of sanity checks.  AST nodes that cannot be handled correctly by default throw exceptions.  Each AST is transformed into the CAST model and rendered into text; if the generated text is not the same as the original text, an exception is thrown.  We also performed manual spot checks for several of the files.  First, files from each of the projects were compared and matched.  Then the number of differences reported by SDAT and by a baseline comparison tool were compared.

4.1.1 File Matching
A file name matching heuristic was used to perform the file matching (a sketch of one such heuristic is given after Table 2).  The results from file matching are shown in Table 2 "File Matching Results".  Most of the unmatched files in ANTLR_Java are auto-generated and/or extra testing-related files.  The reason for the large number of unmatched files in ITEXT_CS is that the IText CSharp implementation uses a custom-built encryption library, while the Java implementation uses a third-party library.

Table 2 File Matching Results
Results show a non-trivial overlap of files between the two projects.

Project Type    Unmatched Files    Matched Files
ANTLR_Java      270                107
ANTLR_CS        48                 107
ITEXT_Java      67                 568
ITEXT_CS        1326               568
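The heuristic itself is not spelled out in this chapter; the sketch below shows one plausible form of it, pairing files whose base names match once the directory part and the .java/.cs extension are stripped (for example, ATN.java with ATN.cs).  The class and method names are hypothetical and may not correspond to SDAT's actual implementation.

import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// A minimal sketch of one possible file-name matching heuristic.
final class FileNameMatcher {

    /** Pairs Java and CSharp files whose base names (case-insensitive,
     *  extension removed) are identical. Returns java path -> cs path. */
    static Map<String, String> match(Iterable<String> javaPaths, Iterable<String> csPaths) {
        Map<String, String> csByKey = new HashMap<>();
        for (String path : csPaths) {
            csByKey.put(key(path), path);
        }
        Map<String, String> pairs = new HashMap<>();
        for (String path : javaPaths) {
            String cs = csByKey.get(key(path));
            if (cs != null) {
                pairs.put(path, cs);
            }
        }
        return pairs;
    }

    // Strips the directory part (assuming '/' separators) and the extension.
    private static String key(String path) {
        String name = path.substring(path.lastIndexOf('/') + 1);
        int dot = name.lastIndexOf('.');
        if (dot >= 0) {
            name = name.substring(0, dot);
        }
        return name.toLowerCase(Locale.ROOT);
    }
}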
4.1.2 Comparison
Beyond Compare 3 [6] is a popular text-based source code file comparison utility.  It is the top-ranking software comparison tool on CNET (http://download.cnet.com/), with nearly 400,000 downloads, and is an industry standard for source code comparison.  Beyond Compare 3 will be used as the baseline for the number of differences.  Import statements are ignored by both Beyond Compare 3 and SDAT, as they are not considered structurally different.  (In SDAT, import statements are not compared, whereas Beyond Compare 3 uses a text pre-processor that removes the import statements.)  Beyond Compare 3 has been configured to "Ignore unimportant differences", such as comments and whitespace.

Table 3 "SDAT Coverage" shows that SDAT was not able to compare all of the matched files, but it was able to cover more than 90% of the files in both projects and almost 85% of the total lines of code.  Most of the files not covered by SDAT are excluded because of unsupported features (such as sparse matrix initialization) and bugs in the AST generation (text offsets are reported incorrectly by the AST, which fails the sanity checks).

Table 3 SDAT Coverage
The following table shows that most of the files/code in both projects can be compared by using SDAT.

Project Type            Type         Files     Whitespace (lines)    Code (lines)    Total (lines)
ANTLR (Java & CSharp)   Matched      214       21551                 29597           51148
ANTLR (Java & CSharp)   Comparable   196       19052                 25029           44081
%Diff                                91.59%    88.40%                84.57%          86.18%
ITEXT (Java & CSharp)   Matched      1136      157363                194685          352048
ITEXT (Java & CSharp)   Comparable   1113      152850                188084          340934
%Diff                                97.98%    97.13%                96.61%          96.84%

The number of diff lines reported as different by Beyond Compare 3 and the number of diff lines reported by SDAT are shown in the table below.  The results show that the number of differences SDAT reports is around 40% of the number reported by Beyond Compare, a substantial reduction.  The reason for the larger number of CSharp orphans in ANTLR is that ANTLR uses conditional compiles for different frameworks, and also that the ANTLR CSharp formatting uses more newlines than the ANTLR Java formatting, which inflates the number of orphan lines.

Table 4 Automation Results
This table shows that SDAT significantly reduces the number of differences reported compared to Beyond Compare.

Project Type            Type             CS Orphans    Java Orphans    Similar    Total
ANTLR (Java & CSharp)   Beyond Compare   4640          641             4509       9790
ANTLR (Java & CSharp)   SDAT             2463          815             363        3641
%Diff                                    53.08%        127.15%         8.05%      37.19%
ITEXT (Java & CSharp)   Beyond Compare   7759          7503            51885      67147
ITEXT (Java & CSharp)   SDAT             6580          7458            19814      27272
%Diff                                    84.80%        99.40%          38.19%     40.62%

Clearly, simply reducing the number of reported differences does not by itself validate the tool's ability to reduce the time spent on, or improve the accuracy of, comparing two cross-language source files; this is why user testing was also performed.  However, the automated results demonstrate breadth of coverage and the potential for a large reduction in the number of differences an end user must focus on.

4.2 User Testing Results
We chose six computer programmers (four graduate students and two professional software developers) to participate in the user verification of SDAT.  Participants were asked to simulate a code review, looking for errors in one of the files.  Participants were instructed to report any differences that "seem suspicious" or warrant further investigation (we will refer to these as error predictions).  Proof of an error was not required for a difference to "seem suspicious".  Participants were also instructed to focus on the differences that the tool reports, rather than searching for errors themselves.  Participants were given some time to become familiar with each of the tools before using them.

Three files were chosen from the ANTLR project and three files were chosen from the IText project.  File length ranged from roughly 200 to 850 lines, with each project contributing a small, a medium and a large file.  This information is detailed in Table 5.  The order of file evaluation remained constant throughout the experiment.  The participants were divided into two groups.  Group one evaluated the first three files using SDAT and the remaining three files using Beyond Compare.  Group two evaluated the first three files using Beyond Compare and the remaining three files using SDAT.  Several errors were introduced into the code in order to create more testing points.

Table 5 File Length Information for the Files Used in User Testing
A variety of files of different sizes and complexity were chosen to showcase typical use cases.
File Name          Project    Lines
ATN.cs             ANTLR      325
ATN.java           ANTLR      213
BaseColor.cs       ITEXT      199
BaseColor.java     ITEXT      215
CommonToken.cs     ANTLR      292
CommonToken.java   ANTLR      228
Document.cs        ITEXT      735
Document.java      ITEXT      854
DocWriter.cs       ITEXT      373
DocWriter.java     ITEXT      434
Lexer.cs           ANTLR      610
Lexer.java         ANTLR      430

Some of the reported differences fell into common categories.  For example, in Java, some of the source files throw an IOException that is not thrown in CSharp; we treat all such occurrences as one error prediction.

While some error predictions are logically provable as warranting further investigation (for example, a missing case statement or an off-by-one index error), other error predictions are more difficult to classify.  We call these error predictions "code smells", as they may indicate an error but are not necessarily indicative of a logical error in the program.  Examples of code smell errors that fall into this category are: exception handling, validation (such as asserts), hashcode generation, logging, passing null instead of a default Locale, constants defined in other files (but with the same name and context), and caching/buffering.  Table 6 summarizes the results, both including and excluding code smell errors.

Table 6 Number of Errors per File Including and Excluding "Code Smells" (i.e., Validation, Asserts, etc.)
These numbers include introduced errors and possible existing errors.

File           Errors (Including Code Smells)    Errors (Excluding Code Smells)
ATN            5                                 2
BaseColor      5                                 1
CommonToken    4                                 2
Document       3                                 1
DocWriter      3                                 2
Lexer          8                                 5

It is not possible to establish statistical significance because of the limited number of trials, users and test cases; however, the testing results may indicate certain patterns and trends.  A statistically significant user study is left as future work.

The following charts and tables present the timing results from user testing.  They clearly show that in all cases SDAT requires less time than Beyond Compare 3.  On average, the testers were 37% faster when using SDAT than when using Beyond Compare 3 (a sketch of how these improvement percentages are derived follows Table 7).  In all cases, the average time for SDAT was less than that for Beyond Compare, and in most cases the time spent on a per-user basis is also less.  We have also observed that the group using SDAT has a lower average time than the group using Beyond Compare.

Figure 22 Total Time per File per User
Each bar represents a user's individual time (vertical axis: total time taken, in minutes; horizontal axis: the files ATN, BaseColor, CommonToken, Document, DocWriter and Lexer).  The blue bars represent the results for Beyond Compare, while the red bars represent the results from SDAT.  SDAT outperformed Beyond Compare in almost all cases.

Table 7 Average Time
On average, SDAT users completed their tasks 37% faster.

File Name      Average Time    SDAT Improvement
ATN            6:30
  BC           8:18
  SDAT         4:41            43.58%
BaseColor      5:35
  BC           5:47
  SDAT         5:24            6.72%
CommonToken    2:30
  BC           3:10
  SDAT         1:51            41.40%
Document       7:05
  BC           8:38
  SDAT         5:33            35.69%
DocWriter      5:08
  BC           6:23
  SDAT         3:53            39.13%
Lexer          8:33
  BC           12:03
  SDAT         5:02            58.18%
Grand Total    5:53
Average Improvement            37.48%
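The "SDAT Improvement" column in Table 7 can be reproduced, up to rounding of the displayed m:ss times, by computing the relative time saved against Beyond Compare.  The snippet below is illustrative only; the formula improvement = (tBC - tSDAT) / tBC is our reading of the table rather than something stated explicitly in the text.

// Illustrative check of the "SDAT Improvement" column in Table 7.
// Using the ATN row (8:18 vs 4:41) this yields roughly 43.6%,
// matching the reported 43.58% up to rounding of the displayed times.
final class ImprovementCalc {

    static double improvement(int beyondCompareSeconds, int sdatSeconds) {
        return 100.0 * (beyondCompareSeconds - sdatSeconds) / beyondCompareSeconds;
    }

    public static void main(String[] args) {
        int bc = 8 * 60 + 18;   // 8:18 for Beyond Compare on ATN
        int sdat = 4 * 60 + 41; // 4:41 for SDAT on ATN
        System.out.printf("ATN improvement: %.2f%%%n", improvement(bc, sdat));
    }
}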
Table 8 Group Averages of Time Compared to Overall Averages
Results demonstrate that the group using SDAT always has an improved overall average time.

File Name            Average Time    Difference
ATN                  6:30
  Group 1 (SDAT)     4:41            (38.79%)
  Group 2 (BC)       8:18            (-43.57%)
BaseColor            5:35
  Group 1 (BC)       5:47            (38.79%)
  Group 2 (SDAT)     5:24            (3.40%)
CommonToken          2:30
  Group 1 (BC)       3:10            (-21.05%)
  Group 2 (SDAT)     1:51            (35.14%)
Document             7:05
  Group 1 (BC)       8:38            (-17.95%)
  Group 2 (SDAT)     5:33            (27.63%)
DocWriter            5:08
  Group 1 (SDAT)     3:53            (32.19%)
  Group 2 (BC)       6:23            (-19.58%)
Lexer                8:33
  Group 1 (SDAT)     5:02            (69.87%)
  Group 2 (BC)       12:03           (-29.05%)

We measured accuracy using an FMeasure (a sketch of this computation follows the figures below).  On average, SDAT was 28% more effective.  For nearly all of the files, SDAT was more accurate than Beyond Compare.  The reason for the large discrepancy in accuracy for BaseColor (950%) is likely that there was only one error in BaseColor: only 1 out of 3 Beyond Compare users correctly predicted the error, while 3 out of 3 SDAT users correctly predicted it.  The actual error is shown in Figure 23 "The BaseColor Error".  This may indicate that certain classes of errors are easier to see using SDAT.

Figure 23 The BaseColor Error
The same comparison is shown in both Beyond Compare (above) and SDAT (below).  In this case, color.R is mapped to getBlue(), but clearly this should be getRed() or something similar.  Only 1 out of 3 Beyond Compare users correctly predicted the error, while 3 out of 3 SDAT users predicted the error.

Figure 24 FMeasure Including "Code Smell" Errors
Each bar represents a user's FMeasure for a given file (vertical axis: FMeasure from 0.00 to 1.00; horizontal axis: the files ATN, BaseColor, CommonToken, Document, DocWriter and Lexer).  The blue bars represent the results for Beyond Compare, while the red bars represent the results from SDAT.  SDAT outperformed Beyond Compare in most cases.

Figure 25 FMeasure Excluding "Code Smell" Errors
Each bar represents a user's FMeasure for a given file (vertical axis: FMeasure from 0.00 to 1.00; horizontal axis: the files ATN, BaseColor, CommonToken, Document, DocWriter and Lexer).  The blue bars represent the results for Beyond Compare, while the red bars represent the results from SDAT.  SDAT outperformed Beyond Compare in most cases.
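This excerpt does not give the FMeasure formula explicitly; the sketch below assumes the standard balanced F-measure over a participant's error predictions (precision against the differences the participant flagged, recall against the known errors in the file).  The class name and the example numbers are hypothetical.

// Sketch of the accuracy measure, assuming the standard balanced F-measure
// F1 = 2 * precision * recall / (precision + recall).
final class FMeasure {

    static double fMeasure(int truePositives, int falsePositives, int falseNegatives) {
        if (truePositives == 0) {
            return 0.0;
        }
        double precision = (double) truePositives / (truePositives + falsePositives);
        double recall = (double) truePositives / (truePositives + falseNegatives);
        return 2.0 * precision * recall / (precision + recall);
    }

    public static void main(String[] args) {
        // e.g. a participant flags 4 differences, 3 of which are real errors,
        // and misses 2 further errors (hypothetical numbers).
        System.out.println(fMeasure(3, 1, 2)); // prints approximately 0.67
    }
}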
Table 9 Accuracy per File
The average accuracy of SDAT is better by 28%.

File Name                      Average of Beyond Compare    Average of SDAT    Difference
ATN                            0.808838384                  0.842592593        4%
  Code Smell Errors Excluded   0.822222222                  0.888888889        8%
  Code Smell Errors Included   0.795454545                  0.796296296        0%
BaseColor                      0.20952381                   0.773809524        269%
  Code Smell Errors Excluded   0.095238095                  1                  950%
  Code Smell Errors Included   0.323809524                  0.547619048        69%
CommonToken                    0.429100529                  0.411111111        4%
  Code Smell Errors Excluded   0.428571429                  0.6                40%
  Code Smell Errors Included   0.42962963                   0.222222222        48%
Document                       0.419444444                  0.344444444        18%
  Code Smell Errors Excluded   0.388888889                  0.222222222        43%
  Code Smell Errors Included   0.45                         0.466666667        4%
DocWriter                      0.71468254                   0.805555556        13%
  Code Smell Errors Excluded   0.746031746                  0.777777778        4%
  Code Smell Errors Included   0.683333333                  0.833333333        22%
Lexer                          0.563321766                  0.843945869        50%
  Code Smell Errors Excluded   0.595604396                  0.824074074        38%
  Code Smell Errors Included   0.531039136                  0.863817664        63%
Average                        0.524151912                  0.670243183        28%

Table 10 Group Averages of FMeasures Compared to Overall Averages
We observe that SDAT has better accuracy for 10 of the 12 FMeasures being observed.  Group 1 had a better FMeasure than Group 2 for 8 of the 12 measurements.

File Name                      Average         Group 1 (Difference)       Group 2 (Difference)
ATN                            0.825715488     (SDAT)                     (BC)
  Code Smell Errors Excluded                   0.888888889 (7.65%)        0.822222222 (-0.42%)
  Code Smell Errors Included                   0.796296296 (-3.56%)       0.795454545 (-3.66%)
BaseColor                      0.491666667     (BC)                       (SDAT)
  Code Smell Errors Excluded                   0.095238095 (-80.63%)      1 (103.39%)
  Code Smell Errors Included                   0.323809524 (-34.14%)      0.547619048 (11.38%)
CommonToken                    0.42010582      (BC)                       (SDAT)
  Code Smell Errors Excluded                   0.428571429 (2.02%)        0.6 (42.82%)
  Code Smell Errors Included                   0.42962963 (2.27%)         0.222222222 (-47.10%)
Document                       0.381944444     (BC)                       (SDAT)
  Code Smell Errors Excluded                   0.388888889 (1.82%)        0.222222222 (-41.82%)
  Code Smell Errors Included                   0.45 (17.82%)              0.466666667 (22.18%)
DocWriter                      0.760119048     (SDAT)                     (BC)
  Code Smell Errors Excluded                   0.777777778 (2.32%)        0.746031746 (-1.85%)
  Code Smell Errors Included                   0.833333333 (9.63%)        0.683333333 (-10.10%)
Lexer                          0.703633817     (SDAT)                     (BC)
  Code Smell Errors Excluded                   0.824074074 (17.12%)       0.595604396 (-15.35%)
  Code Smell Errors Included                   0.863817664 (22.77%)       0.531039136 (-24.53%)

4.2.1 User Feedback
Feedback was requested from the participants.  The users reported that the anchor system used by the Mini-map scroller was confusing.  Similarly, the single-click gesture, used instead of the typical drag gesture, was confusing because it is contrary to expected operating system interactions.  There was little desire to zoom in and out while performing the tasks; most users simply scrolled or used the Next Difference button.  Five out of six users felt this tool may be helpful to them for performing this task.  Users indicated a strong preference for interacting with the diff tool using "IDE-like navigation", such as jumping to a given method or showing all referenced variables.

4.3 Threats to Validity
Threats to experimental validity include the small number of files and users, the subjectivity of some of the classifications, the non-random selection of users, possible tester fatigue (some sessions exceeded 40 minutes), and the possibility that users perform differently when they know that accuracy and time are being measured.

Chapter 5: Conclusion

Automated testing demonstrated that SDAT reduces the number of reported differences by up to 40%.
User testing showed a 37% increase in speed and a 28% increase in accuracy versus an industry standard file comparison utility.  We see this as a successful result: with SDAT, users completed the source code comparison both faster and more accurately.  The user experience was generally positive, with 83% of users stating that they "would likely" use this tool in their school or work projects.

The evaluation results also showed that SDAT could be improved.  For example, users requested improvements to the visualization stage, such as navigation by function name, forward and back navigation, an ability to specify matching functions/variables, improved find logic, and in-place editing.

Overall, the SDAT development and implementation was a success, yet there remains much work to be done in developing an optimal enhanced diff tool.  An ability to remember or ignore areas of difference, and to resolve symbols defined in different classes, would increase the usability and accuracy of the tool.  Interesting possibilities for further research include support for additional programming languages, bug fixes, performance improvements, comparing more than one file at a time, and using the tool to automatically detect potential errors without the need for user intervention.

Bibliography

[1]  7-zip, "LZMA SDK," [Online]. Available: http://www.7-zip.org/sdk.html.
[2]  IText, "IText," [Online]. Available: http://itextpdf.com/.
[3]  T. Parr, "ANTLR," [Online]. Available: http://www.antlr.org/.
[4]  "The Spring Framework," [Online]. Available: http://projects.spring.io/spring-framework/.
[5]  Scooter Software, "Beyond Compare 3," [Online]. Available: http://www.scootersoftware.com/.
[6]  J. Eibl, "KDiff 3," [Online]. Available: http://kdiff3.sourceforge.net/.
[7]  Various, "WinMerge," [Online]. Available: http://winmerge.org/.
[8]  ComponentSoftware, [Online]. Available: http://www.componentsoftware.com/products/htmldiff/.
[9]  Araxis, "Araxis Merge," [Online]. Available: http://www.araxis.com/merge/index.en.
[10] GuiffSoftware, [Online]. Available: http://www.guiffy.com/.
[11] D. Dig, K. Manzoor, R. Johnson, and T. N. Nguyen, "Effective Software Merging in the Presence of Object-Oriented Refactorings," IEEE Transactions on Software Engineering, vol. 34, no. 3, pp. 321-335, May/June 2008.
[12] Y. R. Jean-Mary, E. P. Shironoshita, and M. R. Kabuka, "Ontology matching with semantic verification," Journal of Web Semantics, 2009.
[13] F. Chevalier, D. Auber, and A. Telea, "Structural Analysis and Visualization of C++ Code Evolution using Syntax Trees," in 9th International Workshop on Principles of Software Evolution, 2007.
[14] T. Munzner, F. Guimbretière, and S. Tasiran, "TreeJuxtaposer: Scalable Visibility," in SIGGRAPH, 2007.
[15] S. Bauman et al., "Treefoil: Visualization and Pair Wise Comparison of File System Trees".
[16] D. Holten and J. J. van Wijk, "Visual Comparison of Hierarchically Organized Data," in 10th Eurographics/IEEE VGTC Symposium on Visualization, 2008.
[17] "Code Flower," [Online]. Available: http://redotheweb.com/CodeFlower/.
[18] H. S. Developments, "JSourceObjectizer," [Online]. Available: http://www.habelitz.com/index.php/downloads/downloads-jsourceobjectizer.
[19] Microsoft, "Microsoft Roslyn," [Online]. Available: http://msdn.microsoft.com/en-us/vstudio/roslyn.aspx.
[20] IKVM.NET, "IKVM.NET Bytecode Compiler (ikvmc.exe)," [Online]. Available: http://www.ikvm.net/userguide/ikvmc.html.
[21] M. Potter, "A Generic, Reusable Diff Algorithm in C# - II," [Online]. Available: http://www.codeproject.com/Articles/6943/A-Generic-Reusable-Diff-Algorithm-in-C-II.
[22] Microsoft, "C# and Java: Comparing Programming Languages," [Online]. Available: http://msdn.microsoft.com/en-us/library/ms836794.aspx.
[23] Various, 2008. [Online]. Available: http://oaei.ontologymatching.org/2008/.

Appendices

Appendix A

A.1 Structural Differences and Corresponding SDAT Support
The following table outlines differences between Java and CSharp, based on information collected [23], and the corresponding SDAT support.

Table 11 Some of the Structural Differences between Java and CSharp and the Corresponding SDAT Support
The "~" represents an approximate equivalence.

Java | CSharp | SDAT support
N/A | Jagged arrays: int [][]myArray = new int[2][]; | Supported. CAST uses a C#-like model.
N/A | Multiple inheritance | Supported. CAST uses multiple type names for inheritance.
Static block: class Example { static int x; static { x = 5; } } | N/A | Supported. The Java static block is treated as a nameless method.
~ package name | Namespace | Only supported for top-level namespaces.
N/A | Finalizer: ~MyClass | Not supported.
Array declarators: String []a; String a[]; | Array declarator: string[] a; | Supported. SDAT adjusts the comparison logic for equivalence.
Calling the base constructor: public MyClass(int x) { super(x); … } | Calling the base constructor: public MyClass(int x) : base (x) {…} | Supported. SDAT captures Java super statements and stores them as a property of the constructor method; when rendering the code block of the constructor, this property is used.
Generics | Generics | Currently not supported by SDAT (ignored).
Annotations | Annotations | Not supported.
N/A | Operator overloading | Not supported.
~ functions: setXYZ(String x) {…} / String getXYZ() | Properties: public string XYZ { get { return name; } private set { name = value; } } | Supported. Getters and setters are considered functions.
~ functions: setXYZ(int index, int value) / getXYZ(int index) | Indexer: XYZ[index] = value / XYZ[index] | Supported. SDAT performs special symbolic comparisons in this case.
N/A | Preprocessor directives | Not supported. Ignored by SDAT.
N/A | Using statement | Supported.
N/A | Pointers and unsafe code | Not supported.
N/A | Pass by reference: public static void foo(out string param) | Supported.
N/A | Named parameters: foo(x:123) | Not supported.
N/A | Optional parameters: foo(string optional = "optional") | Not supported.
N/A | Struct | Not supported.

A.2 Parameters Used in the Implementation
The parameters were chosen somewhat arbitrarily and then adjusted through experimentation (a sketch of how they can be combined is given below).

Values used for structure comparison:
RETURN_VALUE_WEIGHT = .3
INITIALIZER_WEIGHT = .1
PARAMETER_WEIGHT = .25
MODIFIER_WEIGHT = .05

Combination of scores:
LEXICAL = .15
STRUCTURAL = .65
EXTENSIONAL = .2

Comparison Threshold = .70

Path matching:
PATH_MATCHING_MAX_DIFFERENCE = 9
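One plausible reading of the score-combination parameters above is a weighted linear combination of the lexical, structural and extensional similarity scores, compared against the 0.70 threshold.  The sketch below illustrates this under that assumption; the weights and threshold come from A.2, while the surrounding class and method names are hypothetical.

// Sketch showing how the A.2 parameters could be combined; the weights sum
// to 1.0 and the result is compared against the comparison threshold.
final class ScoreCombiner {

    static final double LEXICAL_WEIGHT       = 0.15;
    static final double STRUCTURAL_WEIGHT    = 0.65;
    static final double EXTENSIONAL_WEIGHT   = 0.20;
    static final double COMPARISON_THRESHOLD = 0.70;

    /** Combines the three per-node similarity scores (each in [0, 1]). */
    static double combine(double lexical, double structural, double extensional) {
        return LEXICAL_WEIGHT * lexical
             + STRUCTURAL_WEIGHT * structural
             + EXTENSIONAL_WEIGHT * extensional;
    }

    /** Two nodes are considered a match when the combined score reaches the threshold. */
    static boolean isMatch(double lexical, double structural, double extensional) {
        return combine(lexical, structural, extensional) >= COMPARISON_THRESHOLD;
    }
}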
Appendix B

B.1 Permission Forms

Project Title: A Comparison Between Structural Comparison and Standard Diff Software Tools

Study Team
Principal Investigator: [redacted]
Co-Investigator(s): Rolf Biehn, UBC graduate student.  [redacted]

This research is being undertaken as part of a graduate-level thesis, the results of which will be made publicly available.  No personal data will be shared publicly.

Study Procedures
In this study, you will be asked to evaluate source code comparison using existing source code comparison tools and a new source code comparison tool.  You will be given a short tutorial on the new source code comparison tool and time to get practice using it.  You will be presented with one file in Java and one file in C# and asked to identify potential sources of errors and identify general differences between the two files.  You will be asked to participate in this process 6 times, using both existing software comparison tools and the new source code comparison tool.  Accuracy and the total time to complete these tasks will be measured.

REIMBURSEMENT:      $10
TIME COMMITMENT:    30 mins to 1 hour
CONFIDENTIALITY:    Your confidentiality will be respected.  Information that discloses your identity will not be released without your consent unless required by law.  (See Behavioural Ethics Guidance Note 8.8.3 for further details on reportable offences.)  Information will be stored on a secured computer.  Subjects will not be identified by name in any reports of the completed study.

Who can you contact if you have questions about the study?
If you have any questions or concerns about what we are asking of you, please contact the study leader or one of the study staff.  The names and telephone numbers are listed at the top of the first page of this form.

                                                       Department of Computer Science
                                                       2366 Main Mall
CONSENT FORM                                           Vancouver, B.C., Canada V6T 1Z4
                                                       tel: [redacted]
                                                       fax: [redacted]

Who can you contact if you have complaints or concerns about the study?
If you have any concerns about your rights as a research subject and/or your experiences while participating in this study, you may contact the Research Subject Information Line in the UBC Office of Research Services at [redacted], or if long distance, e-mail [redacted] or call toll-free [redacted].

XII. PARTICIPANT CONSENT AND SIGNATURE PAGE

Taking part in this study is entirely up to you.  You have the right to refuse to participate in this study.  If you decide to take part, you may choose to pull out of the study at any time without giving a reason and without any negative impact on your employment, class standing, etc.

• Your signature below indicates that you have received a copy of this consent form for your own records.
• Your signature indicates that you consent to participate in this study.

____________________________________________________
Participant Signature (or Parent or Guardian Signature)                    Date

____________________________________________________
Printed Name of the Participant (or Parent or Guardian) signing above
