Extractive Text Summarization of Student Essay Assignment Using Sentence Weight Features and Fuzzy C-Means

(1) * I Made Suwija Putra Mail (Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia)
(2) Yonatan Adiwinata Mail (Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia)
(3) Desy Purnami Singgih Putri Mail (Graduate School of Department of Electrical Engineering and Computer Science, Kanazawa University, Japan)
(4) Ni Putu Sutramiani Mail (Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia)
*corresponding author

Abstract


One of the main tasks of a lecturer is to give students an academic assessment in the learning process. The assessment process begins with reading or checking the answers of student assignments that contain a combination of very long sentences such as essay or report assignments. This certainly takes a lot of time to get the primary information contained therein. It is necessary to summarize the answers so that the lecturer does not need to read the whole document but is still able to take the essence of the response to the task. This study proposes the application of summarizing text documents of student essay assignments automatically using the Fuzzy C-Means method with the sentence weighting feature. The sentence weighting feature is used by selecting the sentence with the highest weight in one cluster, helping the system to get the primary information from a document quickly. The results of this study indicate that the system succeeds in summarizing text with an average evaluation of the values of precision, recall, accuracy, and F-measure of 0.52, 0.54, 0.70, and 0.52, respectively.One of the main tasks of a lecturer is to give students an academic assessment in the learning process. The assessment process begins with reading or checking the answers of student assignments that contain a combination of very long sentences such as essay or report assignments. This certainly takes a lot of time to get the primary information contained therein. It is necessary to summarize the answers so that the lecturer does not need to read the whole document but is still able to take the essence of the response to the task. This study proposes the application of summarizing text documents of student essay assignments automatically using the Fuzzy C-Means method with the sentence weighting feature. The sentence weighting feature is used by selecting the sentence with the highest weight in one cluster, helping the system to get the primary information from a document quickly. The results of this study indicate that the system succeeds in summarizing text with an average evaluation of the values of precision, recall, accuracy, and F-measure of 0.52, 0.54, 0.70, and 0.52, respectively.

Keywords


Text Summarization; Essay Assignments; Weight Sentences; Fuzzy C-Means

   

DOI

https://doi.org/10.29099/ijair.v5i1.187
      

Article metrics

10.29099/ijair.v5i1.187 Abstract views : 643 | PDF views : 87

   

Cite

   

Full Text

Download

References


B. A. Manyika J, Chui M, Brown B, Roxburgh C, “Big data: the next frontier for innovation, competition, and productivity,” [MGI] McKinsey Glob. Inst., 2011.

N. O. Finnemann, “E-text,” in Oxford Research Encyclopedia of Literature, Oxford University Press, 2018.

V. Gupta and G. S. Lehal, “A Survey of Text Summarization Extractive techniques,” in Journal of Emerging Technologies in Web Intelligence, Aug. 2010, vol. 2, no. 3, pp. 258–268, doi: 10.4304/jetwi.2.3.258-268.

R. Bharathi, S. C. Shirwaikar, and V. Kharat, “A distributed, scalable parallelization of fuzzy c-means algorithm,” 2016, doi: 10.1109/IBSS.2016.7940196.

V. K. Singh, N. Tiwari, and S. Garg, “Document clustering using K-means, heuristic K-means and fuzzy C-means,” 2011, doi: 10.1109/CICN.2011.62.

B. Adnyana, “Implementasi Algoritma Fuzzy C Means Dan Statistical Region Merging Pada Segmentasi Citra,” Konf. Nas. Sist. Inform. 2015, pp. 9–10, 2015.

K. S. Gilda and S. S. Dixit, “Clustering : Basics , Approaches , Practical View and Applications,” Int. J. Comput. Eng. Appl., vol. X, no. Vii, pp. 19–29, 2012.

W. Wang, C. Wang, X. Cui, and A. Wang, “Fuzzy C-means text clustering with supervised feature selection,” in Proceedings - 5th International Conference on Fuzzy Systems and Knowledge Discovery, FSKD 2008, 2008, vol. 1, pp. 57–61, doi: 10.1109/FSKD.2008.161.

M. Irfan, Jumadi, W. B. Zulfikar, and Erik, “Implementation of Fuzzy C-Means algorithm and TF-IDF on English journal summary,” in Proceedings of the 2nd International Conference on Informatics and Computing, ICIC 2017, Feb. 2018, vol. 2018-January, pp. 1–5, doi: 10.1109/IAC.2017.8280646.

R. Al-Hashemi, “Text Summarization Extraction System (TSES) Using Extracted Keywords,” Int. Arab J. e-Technology, vol. 1, no. 4, pp. 164–168, 2010, [Online]. Available: http://www.iajet.org/iajet/iajet_files/vol.1/no.4/Text Summarization Extraction System TSES Using Extracted Keywords_doc.pdf.

D. Wang, S. Zhu, T. Li, and Y. Gong, “Multi-Document Summarization using Sentence-based Topic Models,” 2009. doi: 10.3115/1667583.1667675.

M. R. Muztahid, “Peringkasan Dokumen Bahasa Indonesia Menggunakan Metode K-Means,” 2015.

A. Nenkova and K. McKeown, Automatic summarization, vol. 5, no. 2–3. 2011.

S. Akter, A. S. Asa, M. P. Uddin, M. D. Hossain, S. K. Roy, and M. I. Afjal, “An extractive text summarization technique for Bengali document(s) using K-means clustering algorithm,” 2017, doi: 10.1109/ICIVPR.2017.7890883.

H. A. Robbani, “Sastrawi,” MIT, 2016. .

M. Ozer, “Fuzzy c-means clustering and Internet portals: A case study,” Eur. J. Oper. Res., vol. 164, no. 3 SPEC. ISS., pp. 696–714, 2005, doi: 10.1016/j.ejor.2003.11.015.

M. Afsharizadeh, H. Ebrahimpour-Komleh, and A. Bagheri, “Query-oriented text summarization using sentence extraction technique,” 2018, doi: 10.1109/ICWR.2018.8387248.

Z. Ghahramani, “Probabilistic machine learning and artificial intelligence,” Nature. 2015, doi: 10.1038/nature14541.

C. D. Manning, P. Raghavan, and H. Schutze, Introduction to Information Retrieval. 2008.

I. Yoo and X. Hu, “A comprehensive comparison study of document clustering for a biomedical digital library MEDLINE,” 2006, doi: 10.1145/1141753.1141802.

K. Vimal Kumar and D. Yadav, “An improvised extractive approach to hindi text summarization,” in Advances in Intelligent Systems and Computing, 2015, vol. 339, pp. 291–300, doi: 10.1007/978-81-322-2250-7_28.




Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.

________________________________________________________

International Journal Of Artificial Intelligence Research

Organized by: Departemen Teknik Informatika STMIK Dharma Wacana
Published by: STMIK Dharma Wacana
Jl. Kenanga No.03 Mulyojati 16C Metro Barat Kota Metro Lampung
phone. +62725-7850671
Fax. +62725-7850671
Email: jurnal.ijair@gmail.com | herinurdiyanto@dharmawacana.ac.id

View IJAIR Statcounter

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.