Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 2 of 2
  • Item
    Overview of the second shared task on Indian native language identification (INLI)
    (CEUR-WS ceurws@sunsite.informatik.rwth-aachen.de, 2018) Anand Kumar, M.; Barathi Ganesh, H.; Ajay, S.G.; Padannayil, K.P.
    This overview paper describes the second shared task on Indian Native Language Identification (INLI) that was organized by FIRE 2018. Given a corpus with comments in English from various Facebook newspapers pages, the objective of the task is to identify the native language among the following six Indian languages: Bengali, Hindi, Kannada, Malayalam, Tamil, and Telugu. Altogether, 31 approaches of 14 different teams are evaluated. In this paper, we report the overview of the participant’s systems and the results of second INLI shared task. We have also compared the results of the first INLI shared task conducted with FIRE-2017. © 2018 CEUR-WS. All Rights Reserved.
  • Item
    Indian native language identification - INLI 2018
    (Association for Computing Machinery acmhelp@acm.org, 2018) Anand Kumar, M.; Barathi Ganesh, H.B.; Padannayil, K.P.; Ajay, S.G.
    The growth of digital platforms enables the industries to serve user specific services. Most of the time, the information of the internet users are not explicitly available and it acts as a constrain in developing the personalized applications. There comes the need for author profiling tasks, which intends to predict the internet users characteristics from their texts. Native language Identification is one among the author profiling task, that predicts the authors native language from their texts available in other language. We have proposed Indian Native Language Identification task, where the internet users texts are written in English and participants needs to find, whether the user’s native language is from Tamil, Malayalam, Kannada, Telugu, Bengali and Hindi. The corpus is collected from texts from regional news paper pages available in Facebook by considering the hypothesis that the user belongs to a particular region will read the news from respective regional news paper. © 2018 Association for Computing Machinery.