Topic Modeling-based Logging Suggestions for Java Software Systems
dc.contributor.author | Akter, Mehenika | |
dc.date.accessioned | 2024-02-14T20:58:13Z | |
dc.date.available | 2024-02-14T20:58:13Z | |
dc.identifier.uri | http://hdl.handle.net/10464/18305 | |
dc.description.abstract | Log statements help software developers and end users get informed about different valuable run-time information while log levels categorize the severity of that information. Researchers have been working extensively on log-related problems for the last two decades. As a result, a good amount of research has been conducted on logging and its practices. However, determining which topics can be logged from a system has a potential to work on. To implement our study, first, we examined the code snippets from some renowned open-source Java language-based projects. We collected the logged methods from nine applications and after preprocessing the methods and extracting our required data, we applied some renowned topic models: Latent Dirichlet Allocation (LDA), Latent Semantic Analysis (LSA), and Non-negative Matrix Factorization (NMF). In the first part of the results, we showed how the topics are related to logging to investigate the alignment between topic modeling and logging. Our dataset, enriched with meaningful words related to method functionality, is subjected to LDA analysis. Results indicate that topics with the highest sum of word probabilities are more likely to be logged. In the second section, we listed the popular topics with their associated words from different systems generated by LDA. In the last part of the results, a comprehensive result was shown by evaluating the performance of the models using coherence scores. We believe that our research will not only be useful for its result and evaluation but also be helpful for future researchers by providing a unique dataset. | en_US |
dc.language.iso | eng | en_US |
dc.publisher | Brock University | en_US |
dc.rights | CC0 1.0 Universal | * |
dc.rights.uri | http://creativecommons.org/publicdomain/zero/1.0/ | * |
dc.subject | Logging | en_US |
dc.subject | Topic Modeling | en_US |
dc.subject | LDA | en_US |
dc.subject | LSA | en_US |
dc.subject | NMF | en_US |
dc.title | Topic Modeling-based Logging Suggestions for Java Software Systems | en_US |
dc.type | Electronic Thesis or Dissertation | en_US |
dc.degree.name | M.Sc. Computer Science | en_US |
dc.degree.level | Masters | en_US |
dc.contributor.department | Department of Computer Science | en_US |
dc.degree.discipline | Faculty of Mathematics and Science | en_US |