Extractive Text Summarization of Student Essay Assignment Using Sentence Weight Features and Fuzzy C-Means

I Made Suwija Putra(1*), Yonatan Adiwinata(2), Desy Purnami Singgih Putri(3), Ni Putu Sutramiani(4),


(1) Department of Information Technology, Faculty of Engineering, Udayana University, Bali
(2) Department of Information Technology, Faculty of Engineering, Udayana University, Bali
(3) Graduate School of Department of Electrical Engineering and Computer Science, Kanazawa University
(4) Department of Information Technology, Faculty of Engineering, Udayana University, Bali
(*) Corresponding Author

Abstract


One of the main tasks of a lecturer is to give students an academic assessment in the learning process. The assessment process begins with reading or checking the answers of student assignments that contain a combination of very long sentences such as essay or report assignments. This certainly takes a lot of time to get the primary information contained therein. It is necessary to summarize the answers so that the lecturer does not need to read the whole document but is still able to take the essence of the response to the task. This study proposes the application of summarizing text documents of student essay assignments automatically using the Fuzzy C-Means method with the sentence weighting feature. The sentence weighting feature is used by selecting the sentence with the highest weight in one cluster, helping the system to get the primary information from a document quickly. The results of this study indicate that the system succeeds in summarizing text with an average evaluation of the values of precision, recall, accuracy, and F-measure of 0.52, 0.54, 0.70, and 0.52, respectively.One of the main tasks of a lecturer is to give students an academic assessment in the learning process. The assessment process begins with reading or checking the answers of student assignments that contain a combination of very long sentences such as essay or report assignments. This certainly takes a lot of time to get the primary information contained therein. It is necessary to summarize the answers so that the lecturer does not need to read the whole document but is still able to take the essence of the response to the task. This study proposes the application of summarizing text documents of student essay assignments automatically using the Fuzzy C-Means method with the sentence weighting feature. The sentence weighting feature is used by selecting the sentence with the highest weight in one cluster, helping the system to get the primary information from a document quickly. The results of this study indicate that the system succeeds in summarizing text with an average evaluation of the values of precision, recall, accuracy, and F-measure of 0.52, 0.54, 0.70, and 0.52, respectively.

Keywords


Text Summarization; Essay Assignments; Weight Sentences; Fuzzy C-Means

Article Metrics

Abstract view : 46 times

References


B. A. Manyika J, Chui M, Brown B, Roxburgh C, “Big data: the next frontier for innovation, competition, and productivity,” [MGI] McKinsey Glob. Inst., 2011.

N. O. Finnemann, “E-text,” in Oxford Research Encyclopedia of Literature, Oxford University Press, 2018.

V. Gupta and G. S. Lehal, “A Survey of Text Summarization Extractive techniques,” in Journal of Emerging Technologies in Web Intelligence, Aug. 2010, vol. 2, no. 3, pp. 258–268, doi: 10.4304/jetwi.2.3.258-268.

R. Bharathi, S. C. Shirwaikar, and V. Kharat, “A distributed, scalable parallelization of fuzzy c-means algorithm,” 2016, doi: 10.1109/IBSS.2016.7940196.

V. K. Singh, N. Tiwari, and S. Garg, “Document clustering using K-means, heuristic K-means and fuzzy C-means,” 2011, doi: 10.1109/CICN.2011.62.

B. Adnyana, “Implementasi Algoritma Fuzzy C Means Dan Statistical Region Merging Pada Segmentasi Citra,” Konf. Nas. Sist. Inform. 2015, pp. 9–10, 2015.

K. S. Gilda and S. S. Dixit, “Clustering : Basics , Approaches , Practical View and Applications,” Int. J. Comput. Eng. Appl., vol. X, no. Vii, pp. 19–29, 2012.

W. Wang, C. Wang, X. Cui, and A. Wang, “Fuzzy C-means text clustering with supervised feature selection,” in Proceedings - 5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008, 2008, vol. 1, pp. 57–61, doi: 10.1109/FSKD.2008.161.

M. Irfan, Jumadi, W. B. Zulfikar, and Erik, “Implementation of Fuzzy C-Means algorithm and TF-IDF on English journal summary,” in Proceedings of the 2nd International Conference on Informatics and Computing, ICIC 2017, Feb. 2018, vol. 2018-January, pp. 1–5, doi: 10.1109/IAC.2017.8280646.

R. Al-Hashemi, “Text Summarization Extraction System (TSES) Using Extracted Keywords,” Int. Arab J. e-Technology, vol. 1, no. 4, pp. 164–168, 2010, [Online]. Available: http://www.iajet.org/iajet/iajet_files/vol.1/no.4/Text Summarization Extraction System TSES Using Extracted Keywords_doc.pdf.

D. Wang, S. Zhu, T. Li, and Y. Gong, “Multi-Document Summarization using Sentence-based Topic Models,” 2009. doi: 10.3115/1667583.1667675.

M. R. Muztahid, “Peringkasan Dokumen Bahasa Indonesia Menggunakan Metode K-Means,” 2015.

A. Nenkova and K. McKeown, Automatic summarization, vol. 5, no. 2–3. 2011.

S. Akter, A. S. Asa, M. P. Uddin, M. D. Hossain, S. K. Roy, and M. I. Afjal, “An extractive text summarization technique for Bengali document(s) using K-means clustering algorithm,” 2017, doi: 10.1109/ICIVPR.2017.7890883.

H. A. Robbani, “Sastrawi,” MIT, 2016. .

M. Ozer, “Fuzzy c-means clustering and Internet portals: A case study,” Eur. J. Oper. Res., vol. 164, no. 3 SPEC. ISS., pp. 696–714, 2005, doi: 10.1016/j.ejor.2003.11.015.

M. Afsharizadeh, H. Ebrahimpour-Komleh, and A. Bagheri, “Query-oriented text summarization using sentence extraction technique,” 2018, doi: 10.1109/ICWR.2018.8387248.

Z. Ghahramani, “Probabilistic machine learning and artificial intelligence,” Nature. 2015, doi: 10.1038/nature14541.

C. D. Manning, P. Raghavan, and H. Schutze, Introduction to Information Retrieval. 2008.

I. Yoo and X. Hu, “A comprehensive comparison study of document clustering for a biomedical digital library MEDLINE,” 2006, doi: 10.1145/1141753.1141802.

K. Vimal Kumar and D. Yadav, “An improvised extractive approach to hindi text summarization,” in Advances in Intelligent Systems and Computing, 2015, vol. 339, pp. 291–300, doi: 10.1007/978-81-322-2250-7_28.




DOI: https://doi.org/10.29099/ijair.v5i1.187

________________________________________________________

International Journal Of Artificial Intelligence Research

Organized by: Departemen Teknik Informatika STMIK Dharma Wacana
Published by: STMIK Dharma Wacana
Jl. Kenanga No.03 Mulyojati 16C Metro Barat Kota Metro Lampung
phone. +62725-7850671
Fax. +62725-7850671
Email: info@ijair.id | internationaljournalair@gmail.com | herinurdiyanto@ieee.org 

View IJAIR Statcounter

Creative Commons License
IJAIR is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.