Data and Supplementary Materials of Our Paper Submitted to QUATIC 2021

Paper title: A Large-Scale Investigation of Local Variable Names in Java Programs: Is Longer Name Better for Broader Scope Variable?
Conference: The 14th International Conference on the Quality of Information and Communications Technology (QUATIC 2021)
Status: published as:
Hirohisa Aman, Sousuke Amasaki, Tomoyuki Yokogawa and Minoru Kawahara,
"A Large-Scale Investigation of Local Variable Names in Java Programs: Is Longer Name Better for Broader Scope Variable?,"
in Paiva A.C.R., Cavalli A.R., Ventura Martins P., Pérez-Castillo R. (eds), Quality of Information and Communications Technology, Communications in Computer and Information Science, vol. 1439, Springer, pp. 489–500, Sept. 2021.
Materials (file and description)
Data source (OSS projects from GitHub)
  1. project_list.csv (73KB): List of OSS projects
  2. hash_list.csv (45KB): List of commit hashes
Variable data
  1. categorized_local_vars.txt.zip (9.7MB): List of all local variables, including each variable's name, type, scope, name category, project ID, and source file path. The category "s_comp" denotes a compound name constructed by merely concatenating two terms.
  2. terms_with_broad_scope.txt (89KB): List of terms used in the variables with broad scopes; the numbers are the appearance counts.
  3. abbreves_with_broad_scope.txt (2KB): List of abbreviated words used in the variables with broad scopes; the numbers are the appearance counts.
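The two count lists can be loaded for a quick look at the most frequent terms. The sketch below is only illustrative: it assumes each line holds one term and one integer count separated by whitespace or a comma, which is not confirmed by this description, so check the actual file layout first. The function name load_counts is hypothetical.

```python
import re
from collections import Counter


def load_counts(lines):
    """Parse lines assumed to hold one term and one integer count each.

    The layout (term and count on the same line, in either order,
    separated by whitespace or a comma) is an assumption, not a
    documented format of the distributed files.
    """
    counts = Counter()
    for line in lines:
        tokens = [t for t in re.split(r"[\s,]+", line.strip()) if t]
        if len(tokens) != 2:
            continue  # skip blank lines and lines that do not fit the assumption
        first, second = tokens
        if second.isdigit():
            counts[first] += int(second)
        elif first.isdigit():
            counts[second] += int(first)
    return counts
```

For example, load_counts(open("terms_with_broad_scope.txt", encoding="utf-8")).most_common(10) would list the ten most frequent terms if the file matches the assumed layout.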
Prepared dictionary
  1. dot_aspell.en.pws.txt (1KB): Aspell user dictionary; rename it to ".aspell.en.pws" before use.
  2. abbreviated_word_dictionary.txt (1KB): Abbreviated word dictionary
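The rename step for the Aspell user dictionary can be scripted. The sketch below copies the distributed file to ".aspell.en.pws" in a target directory. It assumes the user's home directory is where Aspell should find the file, which is the usual default location for a personal word list but is not stated here. The function name install_aspell_dictionary is hypothetical.

```python
import shutil
from pathlib import Path


def install_aspell_dictionary(src, home=None):
    """Copy the distributed word list to <home>/.aspell.en.pws and return that path.

    Placing the file in the home directory is an assumption about the
    local Aspell setup; pass a different directory if yours differs.
    """
    home_dir = Path(home) if home is not None else Path.home()
    dest = home_dir / ".aspell.en.pws"
    if dest.exists():
        # Do not silently overwrite an existing personal dictionary.
        raise FileExistsError(f"{dest} already exists; back it up or merge it first")
    shutil.copyfile(src, dest)
    return dest
```

Calling install_aspell_dictionary("dot_aspell.en.pws.txt") copies the file rather than moving it, so the downloaded original stays intact.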
Java program to extract local variables: JavaVariableScopeExtractor.jar (8MB); see the tool site for details.