Conference Papers
Permanent URI for this collectionhttps://idr.nitk.ac.in/handle/123456789/28506
Browse
3 results
Search Results
Item A Deep Neural Network Based End to End Model for Joint Height and Age Estimation from Short Duration Speech(Institute of Electrical and Electronics Engineers Inc., 2019) Kalluri, S.B.; Vijayasenan, D.; Ganapathy, S.Automatic height and age prediction of a speaker has a wide variety of applications in speaker profiling, forensics etc. Often in such applications only a few seconds of speech data is available to reliably estimate the speaker parameters. Traditionally, age and height were predicted separately using different estimation algorithms. In this work, we propose a unified DNN architecture to predict both height and age of a speaker for short durations of speech. A novel initialization scheme for the deep neural architecture is introduced, that avoids the requirement for a large training dataset. We evaluate the system on TIMIT dataset where the mean duration of speech segments is around 2.5s. The DNN system is able to improve the age RMSE by at least 0.6 years as compared to a conventional support vector regression system trained on Gaussian Mixture Model mean supervectors. The system achieves an RMSE error of 6.85 and 6.29 cm for male and female height prediction. In case of age estimation, the RMSE errors are 7.60 and 8.63 years for male and female respectively. Analysis of shorter speech segments reveals that even with 1 second speech input the performance degradation is at most 3% compared to the full duration speech files. © 2019 IEEE.Item Semi-supervised Semantic Segmentation for Effusion Cytology Images(Springer Science and Business Media Deutschland GmbH, 2023) Aboobacker, S.; Vijayasenan, D.; Sumam David, S.; Suresh, P.K.; Sreeram, S.Cytopathologists analyse images captured at different magnifications to detect the malignancies in effusions. They identify the malignant cell clusters from the lower magnification, and the identified area is zoomed in to study cell level details in high magnification. The automatic segmentation of low magnification images saves scanning time and storage requirements. This work predicts the malignancy in the effusion cytology images at low magnification levels such as 10 × and 4 ×. However, the biggest challenge is the difficulty in annotating the low magnification images, especially the 4 × data. We extend a semi-supervised learning (SSL) semantic model to train unlabelled 4 × data with the labelled 10 × data. The benign F-score on the predictions of 4 × data using the SSL model is improved 15% compared with the predictions of 4 × data on the semantic 10 × model. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.Item YOLOv5 Model-based Ship Detection in High Resolution SAR Images(Institute of Electrical and Electronics Engineers Inc., 2023) Sapna, S.; Sandhya, S.; Shetty, R.D.; Pais, S.M.; Bhattacharjee, S.Detection of ships in Synthetic Aperture Radar (SAR) images play a crucial role in maritime surveillance, most importantly under complex sea conditions. SAR permits observation in any weather conditions, at all hours of the day and night. At present, the ship detection from SAR images is a notable area of research since it is very difficult to detect the ships in the SAR images using traditional object or target detection algorithms. In this work, a You Only Look Once version 5 (YOLOv5) based ship detection model from SAR images with faster training speed and higher accuracy is implemented and tested. This model achieved a mean average precision (mAP) of 96.2% with a training time of 8.63 hours. This work also provides a comparative analysis with the existing methods for detection of ships in SAR images. The comparison shows that the YOLOv5 based model performs better in terms of both mean average precision and training time when compared to the existing models. © 2023 IEEE.
