Event Info

In this session, David will talk about his work in computational linguistics and his experience using Python and other tools in his research. The resources he has used and the activities he has undertaken include:

* Breaking up (Indonesian dictionary) text data from Word and Excel so that it can be exported into a database file
* Wordnet (http://www.nltk.org/howto/wordnet.html) in the Natural Language Toolkit (NLTK) for Python (https://www.nltk.org/), which can be used to acquire lexical information
* Indonesian SentiWordnet (https://github.com/neocl/barasa), which was built on SentiWordnet (http://sentiwordnet.isti.cnr.it/), a lexical resource for opinion mining and sentiment analysis

About the speaker: David Moeljadi is a PhD candidate in computational linguistics at the School of Humanities, NTU. He is developing a computer system that can be used for various applications: parsing and generating Indonesian sentences, machine translation, computer-aided language learning, etc. In addition, he is developing a large-scale, freely available semantic dictionary of the Malay languages (currently Malaysian and Indonesian) called Wordnet Bahasa, which can be used in the field of Natural Language Processing (NLP). During his PhD candidature, he helped the Indonesian language agency (Badan Bahasa), under the Ministry of Education and Culture of the Republic of Indonesia, develop a database for the most comprehensive official Indonesian dictionary, the Great Dictionary of the Indonesian Language (Kamus Besar Bahasa Indonesia, or KBBI). He has also been involved in other related projects: building a sentiment analysis system for Indonesian tweets, building a morphological dictionary for Indonesian, and building a corpus.

Registration for this event has closed.