Extractive Text Summarization of Student Essay Assignment Using Sentence Weight Features and Fuzzy C-Means

I Made Suwija Putra; Yonatan Adiwinata; Desy Purnami Singgih Putri; Ni Putu Sutramiani

doi:10.29099/ijair.v5i1.187



⁽¹⁾* I Made Suwija Putra

(Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia)
⁽²⁾ Yonatan Adiwinata

(Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia)
⁽³⁾ Desy Purnami Singgih Putri

(Graduate School of Department of Electrical Engineering and Computer Science, Kanazawa University, Japan)
⁽⁴⁾ Ni Putu Sutramiani

(Department of Information Technology, Faculty of Engineering, Udayana University, Bali, Indonesia)
*Corresponding author

Abstract

One of a lecturer's main tasks is to assess students academically during the learning process. Assessment begins with reading student assignments, such as essays or reports, that consist of many long sentences. Extracting the primary information from these documents takes considerable time. Summarizing the answers lets the lecturer grasp the essence of each response without reading the whole document. This study proposes an application that automatically summarizes student essay assignments using the Fuzzy C-Means method with sentence weight features. Within each cluster, the sentence with the highest weight is selected for the summary, which helps the system obtain the primary information from a document quickly. The results indicate that the system summarizes text with average precision, recall, accuracy, and F-measure of 0.52, 0.54, 0.70, and 0.52, respectively.
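The clustering-and-selection idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-words vectors, the `sentence_weights` formula, the fuzzifier `m = 2.0`, and all function names are assumptions made for illustration only, and the abstract does not specify which sentence weight features the study actually uses.

```python
import numpy as np

def bag_of_words(sentences):
    """Term-count vectors (an assumed, simplistic sentence representation)."""
    vocab = sorted({w for s in sentences for w in s.lower().split()})
    index = {w: j for j, w in enumerate(vocab)}
    X = np.zeros((len(sentences), len(vocab)))
    for i, s in enumerate(sentences):
        for w in s.lower().split():
            X[i, index[w]] += 1.0
    return X

def sentence_weights(X):
    """Hypothetical weight: mean corpus frequency of the terms in a sentence."""
    term_freq = X.sum(axis=0)
    return (X @ term_freq) / np.maximum(X.sum(axis=1), 1.0)

def fuzzy_c_means(X, n_clusters, m=2.0, max_iter=100, tol=1e-5, seed=0):
    """Standard Fuzzy C-Means. Returns (centers, U), U has shape (n, n_clusters)."""
    rng = np.random.default_rng(seed)
    U = rng.random((X.shape[0], n_clusters))
    U /= U.sum(axis=1, keepdims=True)      # memberships of each sentence sum to 1
    for _ in range(max_iter):
        Um = U ** m
        centers = (Um.T @ X) / Um.sum(axis=0)[:, None]
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        dist = np.fmax(dist, 1e-12)        # guard against division by zero
        inv = dist ** (-2.0 / (m - 1.0))
        U_new = inv / inv.sum(axis=1, keepdims=True)
        converged = np.abs(U_new - U).max() < tol
        U = U_new
        if converged:
            break
    return centers, U

def summarize(sentences, n_clusters):
    """Pick the highest-weight sentence from each non-empty cluster."""
    X = bag_of_words(sentences)
    weights = sentence_weights(X)
    _, U = fuzzy_c_means(X, n_clusters)
    labels = U.argmax(axis=1)              # harden fuzzy memberships
    chosen = []
    for c in range(n_clusters):
        members = np.flatnonzero(labels == c)
        if members.size:
            chosen.append(int(members[np.argmax(weights[members])]))
    return [sentences[i] for i in sorted(chosen)]  # keep original order

# Hypothetical essay sentences, used only to exercise the sketch.
essay = [
    "Clustering groups similar sentences together",
    "Fuzzy clustering lets a sentence belong to several groups",
    "The lecturer reads many long essays",
    "Reading long essays takes the lecturer a lot of time",
]
print(summarize(essay, n_clusters=2))
```

Assigning each sentence to its highest-membership cluster is one simple way to turn fuzzy memberships into a per-cluster selection; other choices, such as weighting the sentence score by membership degree, would fit the same outline.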

Keywords

Text Summarization; Essay Assignments; Weight Sentences; Fuzzy C-Means

DOI

https://doi.org/10.29099/ijair.v5i1.187



This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.


The International Journal of Artificial Intelligence Research

Organized by: Prodi Teknik Informatika Fakultas Teknologi Bisnis dan Sains
Published by: Universitas Dharma Wacana
Jl. Kenanga No. 03 Mulyojati 16C Metro Barat Kota Metro Lampung

Email: jurnal.ijair@gmail.com
