Deteksi Duplikasi Metadata File pada Media Penyimpanan menggunakan Metode Latent Semantic Analysis

Erlin Erlin(1*), Boby Hasbul Fikri(2), Susanti Susanti(3), Triyani Arita Fitri(4)

(1) STMIK Amik Riau
(*) Corresponding Author


Metadata files help user find relevant information, provides digital identification, archives and conserves stored files so that they are easily found and reused. The large number of data files on the storage media often makes the user unaware of the duplication and redundancy of the files that have an impact on the waste of storage media space, affecting the speed of a computer in the indexing process, finding or backing up data. This study employ the Latent Semantic Analysis method to detect file duplication and analyze the metadata of various file types in storage media. The findings showed that Latent Semantic Analysis method is able to detect duplicate file metadata in various types of storage media thereby further increasing the usability and speed of access of the data storage media.

Article metrics

Abstract views : 79 | views : 8

Full Text:



Lackey, D. How Much Data Do We Create Every Day? The Mind-Blowing Stats Everyone Should Read. Blazon Online. 2019. url:, tanggal akses 1 Mei 2020

Chowdhury, R. A Look into the Evolution of Storage Devices [1956-2013]. Onextrapixel. 2013. url:, tanggal akses 3 Mei 2020

Sehra, S.S., Singh, J. and Rai, H.S. Using Latent Semantic Analysis to Identify Research Trends in OpenStreetMap. International Journal of Geo-Information. 2017; 6, 195: 1-31.

Ghica, D. R., and Alyahya, K. Latent Semantic Analysis of Game Models Using LSTM. Journal of Logical and Algebraic Methods in Programming, 2019; 106, 39–54.

Sastre, F., Velazquez, A., Sanchez de Leon, L., Montanes, J. L., and Rodrigo, J. Method to Solve Redundant Inverse Problems Based on A Latent Semantic Analysis Approach. Application to An Aerojet Engine. Aerospace Science and Technology, 2020;102, 1-10.

Kim, S., Park, H., and Lee, J. Word2vec-Based Latent Semantic Analysis (W2V-LSA) for Topic Modeling: A Study on Blockchain Technology Trend Analysis. Expert Systems with Applications, 2020;113401.

Santilli, S., Nota, L., & Pilato, G. The Use of Latent Semantic Analysis in the Positive Psychology: A Comparison with Twitter Posts. IEEE 11th International Conference on Semantic Computing (ICSC), 2017; 494-498.

Inrak, P. and Sinthupinyo, S. Applying Latent Semantic Analysis to Classify Emotions in Thai Text. 2nd International Conference on Computer Engineering and Technology. Chengdu. 2010; 6, 450-454

Evangelopoulos, N. Latent Semantic Analysis. WIREs Cognitive Science. 2013. Vol. 4, John Wiley & Sons. Ltd.

Deerwester, S., Dumais, S., Furnas, G., Landauer, T., and Harshman, R. Indexing by Latent Semantic Analysis. Journal of the American Society for Information Science. 1990;41(6), 391–407

Cios, K. J., Pedrycz, W., Swiniarski, R. W., and KurganL, A. L. Data mining: A Knowledge Discovery Approach. 2007. New York, NY: Springer.

Manning, C. D., Raghavan, P., and Schütze, H. An Introduction to Information Retrieval. 2009. New York, NY: Cambridge University Press.

Landauer, T. K. LSA as a Theory of Meaning. In T. Landauer & D. S. Mcnamara (Eds.), Handbook of Latent Semantic Analysis. 2007

Martin, D. I., and Berry, M. W. Mathematical Foundations Behind Latent Semantic Analysis. In L. Small Bear Technical Consulting (Ed.), 2007.

Valle-Lisboa, J. C., and Mizraji, E. The Uncovering of Hidden Structures by Latent Semantic Analysis. Information Sciences, 2007;177(19), 4122–4147.

Evangelopoulos, N., Zhang, X., and Prybutok, V. Latent Semantic Analysis: Five Methodological Recommendations. European Journal of Information Systems, 21(1), 70–86.

Oliver Müller,Theresa Schmiedel,Elena Gorbacheva &Jan vom Brocke. Towards A Typology of Business Process Management Professionals: Identifying Patterns of Competences Through Latent Semantic Analysis. Enterprise Information Systems, 2014; 50-80.

Huh, Y., Kim, J., Yu, K., and Cho, S. (2014). M:N Object Matching between Image and Map Object Data Sets by Means of Latent Semantic Analysis. International Journal of Remote Sensing, 2014; 35(18), 6799–6814.

Ahmad, S. N., and Laroche, M. (2015). How Do Expressed Emotions Affect the Helpfulness of a Product Review? Evidence from Reviews Using Latent Semantic Analysis. International Journal of Electronic Commerce, 2015; 20(1), 76–111.

Wenli, C. (2016). Application Research on Latent Semantic Analysis for Information Retrieval. Eighth International Conference on Measuring Technology and Mechatronics Automation (ICMTMA).2016; 118-121.



  • There are currently no refbacks.

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

This Journal has been listed and indexed in :

Find in a library with WorldCat Find in a library with WorldCat

Copyright of Jurnal Inovtek Polbeng - Seri Informatika (ISSN: 2527-9866)

Creative Commons License
ISI: Inovtek Polbeng Seri Informatikan is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.

Editorial Office :
Pusat Penelitian dan Pengabdian kepada Masyarakat
 Politeknik Negeri Bengkalis 
Jl. Bathin alam, Sungai Alam Bengkalis-Riau 28711 

View My Stats