Document retrieval metrics for program understanding

Harth, Eric ( Haute école de gestion de Genève, HES-SO // Haute Ecole Spécialisée de Suisse Occidentale) ; Dugerdil, Philippe ( Haute école de gestion de Genève, HES-SO // Haute Ecole Spécialisée de Suisse Occidentale)

The need for domain knowledge representation for program comprehension is now widely accepted in the program comprehension community. The so-called "concept assignment problem" represents the challenge to locate domain concepts in the source code of programs. The vast majority of attempts to solve it are based on static source code search for clues to domain concepts. In contrast, our approach is based on dynamic analysis using information retrieval (IR) metrics. First we explain how we modeled the domain concepts and their role in program comprehension. Next we present how some of the popular IR metrics could be adapted to the "concept assignment problem" and the way we implemented the search engine. Then we present our own metric and the performance of these metrics to retrieve domain concepts in source code. The contribution of the paper is to show how the IR metrics could be applied to the "concept assignment problem" when the "documents" to retrieve are domain concepts structured in an ontology.


Keywords:
Conference Type:
full paper
Faculty:
Economie et Services
School:
HEG GE Haute école de gestion de Genève
Institute:
CRAG - Centre de Recherche Appliquée en Gestion
Subject(s):
Informatique
Publisher:
Gandhinagar, India , 4-6 December
Date:
Gandhinagar, India
4-6 December
2015
Pagination:
8 p.
Published in
Proceedings of the 7th Forum for Information Retrieval Evaluation
Numeration (vol. no.):
2015, pp. 8-15
DOI:
Appears in Collection:



 Record created 2016-02-15, last modified 2018-11-09

Fulltext:
Download fulltext
PDF

Rate this document:

Rate this document:
1
2
3
 
(Not yet reviewed)