Faculty Publications
Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736
Publications by NITK Faculty
Browse
4 results
Search Results
Item Movie Box-Office Success Prediction Using Machine Learning(Institute of Electrical and Electronics Engineers Inc., 2022) Velingkar, G.; Varadarajan, R.; Lanka, S.; Anand Kumar, M.Being a multi-billion dollar business, the film industry contributes largely to helping sustain a country's economy. A movie's box office (the revenue generated by the number of tickets sold of a movie) is an essential indicator of the movie's popularity. It varies depending upon several factors, including a production company, genre, budget, reviews, ratings, etc. Predicting an approximate value for a movie's box office based upon the various parameters helps investors with this business make intelligent and informed decisions. Thus, this paper designs a machine learning model that can predict the revenue a film will generate based on the information available before the movie's release. It also provides a model capable of taking in the planned genre, the required revenue, and using the Random Forest Regression model, provides recommended budget, runtime, star power, and expected popularity. © 2022 IEEE.Item Advancements in Financial Document Structure Extraction: Insights from Five Years of FinTOC (2019-2023)(Institute of Electrical and Electronics Engineers Inc., 2023) Kang, J.; Patel, M.M.; Agrawal, A.; Simhadri, S.; Srinivasa, R.; Bellato, S.; Anand Kumar, M.; Tsang, N.D.; El-Haj, M.In this comprehensive paper, we present a detailed overview of the Financial Table Of Content extraction shared task series, FinTOC, conducted over a span of five years from 2019 to 2023. This paper serves as a retrospective analysis of the key developments in the field of financial document structure extraction. The FinTOC series, hosted within the framework of the Financial Narrative Processing (FNP) workshop, has been instrumental in shaping the landscape of Natural Language Processing (NLP) in the financial domain. Our analysis delves into the diverse methodologies proposed by participants across all editions, shedding light on the innovative strategies employed to tackle the intricate challenge of extracting structured information from financial documents. We explore the evolution of techniques, from traditional rule-based approaches to cutting-edge deep learning models, showcasing the dynamic nature of NLP advancements. Furthermore, our study investigates the introduction of multilingual datasets by the organizers, highlighting the importance of cross-lingual analysis in financial document processing. We also examine the contributions made by participants in augmenting the training data with external sources, showcasing the collaborative spirit of the NLP community in enhancing the quality and size of the shared training dataset. © 2023 IEEE.Item Downscaled XCO2 Estimation Using Data Fusion and AI-Based Spatio-Temporal Models(Institute of Electrical and Electronics Engineers Inc., 2024) Pais, S.M.; Bhattacharjee, S.; Anand Kumar, M.; Chen, J.One of the well-known greenhouse gases (GHGs) produced by anthropogenic human activity is carbon dioxide (CO2). Understanding the carbon cycle and how negatively it affects the ecosystem requires analysis of the rise in CO2 concentration. This work aims to map CO2 concentration for the entire surface, making it useful for regional carbon cycle analysis. Here, column-averaged CO2 dry mole fraction, called XCO2, measured by the orbiting carbon observatory-2 (OCO-2) satellite, is used. Because of spectral interference by the clouds and aerosols, there are many missing footprints in the Level-2 swath of OCO-2, making it disruptive to understand any assessment related to the carbon cycle. The objective of this work is to predict 1 km2 XCO2 using data resampling and machine learning models. This work achieves a minimum mean absolute error (MAE) and root mean square error (RMSE) of 0.3990 and 0.8090 ppm, using the monthly models. © 2004-2012 IEEE.Item Forecasting of Fine-Grained SIF of OCO-2 Using Multi-source Data and AI-Based Techniques(Springer, 2025) Pais, S.M.; Bhattacharjee, S.; Anand Kumar, M.The emission of unused energy absorbed from the sunlight by plants and other photosynthetic organisms is known as solar-induced fluorescence (SIF). SIF is a direct proxy for the photosynthetic activity of the plants used to monitor drought, crop yield estimation, ecological processes, and carbon cycles. Comprehending the SIF dynamics beforehand helps gain an understanding of vegetation dynamics, carbon cycle, and crop phenology. This study explores the potential of using Orbiting Carbon Observatory-2 (OCO-2) SIF data for forecasting SIF at regional scales. The research utilizes machine learning models and data fusion to forecast the SIF data, by establishing relationships between observed SIF from past timestamps and the Enhanced Vegetation Index. The lasso regression achieves minimal error of RMSE 0.0355 Wm-2nm-1sr-1 and MAPE of 16.9093% for forecasting monthly SIF data. In contrast, the light gradient boosting machine model (LG) performs well for a larger non-linear dataset, i.e., seasonal models achieving a RMSE of 0.0389 Wm-2nm-1sr-1 and MAPE of 17.4895%, respectively. Karnataka and Maharastra, the two Indian states, are considered as the study areas for this work for a temporal window of 2017–2019. Fine-grained, uniformly distributed SIF forecasting provides valuable insights for understanding vegetation responses to environmental changes, optimizing agricultural practices, and developing climate change mitigation strategies. © Indian Society of Remote Sensing 2025.
