Faculty Publications

Permanent URI for this communityhttps://idr.nitk.ac.in/handle/123456789/18736

Publications by NITK Faculty

Browse

Search Results

Now showing 1 - 10 of 59
  • Item
    Hybrid scheme for CFO cancellation in OFDM systems
    (Acta Press, 2013) Savitha, H.M.; Kulkarni, M.
    In this paper, we combine maximum likelihood estimation (MLE) technique with improved sinc power (ISP) pulse shaping to cancel inter-carrier interference caused by carrier frequency offset in coded orthogonal frequency division multiplexing (OFDM) systems, thereby achieving an improved bit error rate (BER) performance as compared to the above two schemes. The BER performance of the OFDM system was checked for ISP pulse shaping alone, MLE technique with low pass filtering, and a hybrid scheme of MLE technique with ISP pulse shaping. It has been shown that, at normalized carrier frequency offset of 0.2 and BER of 10-5, the hybrid scheme with convolutional coding could achieve a BER performance improvement of around 1.78 and 4.83 dB, respectively, as compared to MLE technique with low pass filtering and ISP pulse shaping alone. © International Journal of Modelling and Simulation 2013.
  • Item
    Implementation of comprehensive address generator for digital signal processor
    (2013) Ramesh Kini, R.M.; Sumam David, S.
    The performance of signal-processing algorithms implemented in hardware depends on the efficiency of datapath, memory speed and address computation. Pattern of data access in signal-processing applications is complex and it is desirable to execute the innermost loop of a kernel in a single-clock cycle. This necessitates the generation of typically three addresses per clock: two addresses for data sample/coefficient and one for the storage of processed data. Most of the Reconfigurable Processors, designed for multimedia, focus on mapping the multimedia applications written in a high-level language directly on to the reconfigurable fabric, implying the use of same datapath resources for kernel processing and address generation. This results in inconsistent and non-optimal use of finite datapath resources. Presence of a set of dedicated, efficient Address Generator Units (AGUs) helps in better utilisation of the datapath elements by using them only for kernel operations; and will certainly enhance the performance. This article focuses on the design and application-specific integrated circuit implementation of address generators for complex addressing modes required by multimedia signal-processing kernels. A novel algorithm and hardware for AGU is developed for accessing data and coefficients in a bit-reversed order for fast Fourier transform kernel spanning over log 2 N stages, AGUs for zig-zag-ordered data access for entropy coding after Discrete Cosine Transform (DCT), convolution kernels with stored/streaming data, accessing data for motion estimation using the block-matching technique and other conventional addressing modes. When mapped to hardware, they scale linearly in gate complexity with increase in the size. © 2013 Copyright Taylor and Francis Group, LLC.
  • Item
    Efficient assessment of structural reliability in presence of random and fuzzy uncertainties
    (American Society of Mechanical Engineers (ASME), 2014) Balu, A.S.; Rao, B.N.
    This paper presents an efficient uncertainty analysis for estimating the possibility distribution of structural reliability in presence of mixed uncertain variables. The proposed method involves high dimensional model representation for the limit state function approximation, transformation technique to obtain the contribution of the fuzzy variables to the convolution integral and fast Fourier transform for solving the convolution integral. In this methodology, efforts are required in evaluating conditional responses at a selected input determined by sample points, as compared to full scale simulation methods, thus the computational efficiency is accomplished. The proposed method is applicable for structural reliability estimation involving any number of fuzzy and random variables with any kind of distribution. Copyright © 2014 by ASME.
  • Item
    Segmentation of intra-retinal cysts from optical coherence tomography images using a fully convolutional neural network model
    (Institute of Electrical and Electronics Engineers Inc., 2019) Girish, G.N.; Thakur, B.; Chowdhury, S.R.; Kothari, A.R.; Rajan, J.
    Optical coherence tomography (OCT) is an imaging modality that is used extensively for ophthalmic diagnosis, near-histological visualization, and quantification of retinal abnormalities such as cysts, exudates, retinal layer disorganization, etc. Intra-retinal cysts (IRCs) occur in several macular disorders such as, diabetic macular edema, retinal vascular disorders, age-related macular degeneration, and inflammatory disorders. Automated segmentation of IRCs poses challenges owing to variations in the acquisition system scan intensities, speckle noise, and imaging artifacts. Several segmentation methods have been proposed in the literature for IRC segmentation on vendor-specific OCT images that lack generalizability across imaging systems. In this paper, we propose a fully convolutional network (FCN) model for vendor-independent IRC segmentation. The proposed method counteracts image noise variabilities and trains FCN models on OCT sub-images from the OPTIMA cyst segmentation challenge dataset (with four different vendor-specific images, namely, Cirrus, Nidek, Spectralis, and Topcon). Further, optimal data augmentation and model hyperparametrization are shown to prevent over-fitting for IRC area segmentation. The proposed method is evaluated on the test dataset with a recall/precision rate of 0.66/0.79 across imaging vendors. The Dice correlation coefficient of the proposed method outperforms that of the published algorithms in the OPTIMA cyst segmentation challenge with a Dice rate of 0.71 across the vendors. © 2013 IEEE.
  • Item
    Dense refinement residual network for road extraction from aerial imagery data
    (Institute of Electrical and Electronics Engineers Inc., 2019) Eerapu, K.K.; Ashwath, B.; Lal, S.; Dell’Acqua, F.; Narasimha Dhan, A.V.
    Extraction of roads from high-resolution aerial images with a high degree of accuracy is a prerequisite in various applications. In aerial images, road pixels and background pixels are generally in the ratio of ones-to-tens, which implies a class imbalance problem. Existing semantic segmentation architectures generally do well in road-dominated cases but fail in background-dominated scenarios. This paper proposes a dense refinement residual network (DRR Net) for semantic segmentation of aerial imagery data. The proposed semantic segmentation architecture is composed of multiple DRR modules for the extraction of diversified roads alleviating the class imbalance problem. Each module of the proposed architecture utilizes dense convolutions at various scales only in the encoder for feature learning. Residual connections in each module of the proposed architecture provide the guided learning path by propagating the combined features to subsequent DRR modules. Segmentation maps undergo various levels of refinement based on the number of DRR modules utilized in the architecture. To emphasize more on small object instances, the proposed architecture has been trained with a composite loss function. The qualitative and quantitative results are reported by utilizing the Massachusetts roads dataset. The experimental results report that the proposed architecture provides better results as compared to other recent architectures. © 2019 Institute of Electrical and Electronics Engineers Inc.. All rights reserved.
  • Item
    Automatic detection and localization of Focal Cortical Dysplasia lesions in MRI using fully convolutional neural network
    (Elsevier Ltd, 2019) Bijay Dev, K.M.; Pawan, P.S.; Niyas, S.; Vinayagamani, S.; Kesavadas, C.; Rajan, J.
    Focal cortical dysplasia (FCD) is the leading cause of drug-resistant epilepsy in both children and adults. At present, the only therapeutic approach in patients with drug-resistant epilepsy is surgery. Hence, the quantification of FCD via non-invasive imaging techniques helps physicians to decide on surgical interventions. The properties like non-invasiveness and capability to produce high-resolution images makes magnetic resonance imaging an ideal tool for detecting the FCD to an extent. The FCD lesions vary in size, shape, and location for different patients and make the manual detection time consuming and sensitive to the experience of the observer. Automatic segmentation of FCD lesions is challenging due to the difference in signal strength in images acquired with different machines, noise, and other kinds of distortions such as motion artifacts. Most of the methods proposed in the literature use conventional machine learning and image processing techniques in which their accuracy relies on the trained features. Hence, feature extraction should be done more precisely which requires human expertise. The ability to learn the appropriate features/representations from the training data without any human interventions makes the convolutional neural network (CNN) the suitable method for addressing these drawbacks. As far as we are aware, this work is the first one to use a CNN based model to solve the aforementioned problem using only MRI FLAIR images. We customized the popular U-Net architecture and trained the proposed model from scratch (using MRI images acquired with 1.5T and 3T scanners). FCD detection rate (recall) of the proposed model is 82.5 (33/40 patients detected correctly). © 2019
  • Item
    A Robust Pansharpening Algorithm Based on Convolutional Sparse Coding for Spatial Enhancement
    (Institute of Electrical and Electronics Engineers, 2019) Gogineni, R.; Chaturvedi, A.
    Pansharpening (PS) is a prominent remote sensing image fusion technique. It yields high-resolution multispectral (HRMS) images, which are imperative for the applications, such as recognition and detection. The PS methods based on conventional sparse representation induce blurring effects and are unable to preserve the essential spatial details in the fused outcome. In this article, to overcome these drawbacks, a robust fusion scheme is proposed based on convolutional sparse coding (CSC). The source images are decomposed into its constituent texture and cartoon components. The sparse coefficient maps are acquired from texture components by adapting CSC. Texture components are fused using activity level measurement, whereas averaging mechanism is used to fuse the cartoon components. The HRMS image is reconstructed by combining the fused components in proportion to the gradient information. Impact of number of filters on quality metrics estimation is analyzed. Comprehensive experiments are performed on the images acquired from distinct sensors. The proposed method is evaluated in terms of visual analysis and the quantitative metrics with reduced-scale and full-scale experiments. Extensive evaluations manifest the capability of the proposed method of maintaining the balanced tradeoff and retaining the desired spatial and spectral details. © 2008-2012 IEEE.
  • Item
    An enhanced protein secondary structure prediction using deep learning framework on hybrid profile based features
    (Elsevier Ltd, 2020) Kumar, P.; Bankapur, S.; Patil, N.
    Accurate protein secondary structure prediction (PSSP) is essential to identify structural classes, protein folds, and its tertiary structure. To identify the secondary structure, experimental methods exhibit higher precision with the trade-off of high cost and time. In this study, we propose an effective prediction model which consists of hybrid features of 42-dimensions with the combination of convolutional neural network (CNN) and bidirectional recurrent neural network (BRNN). The proposed model is accessed on four benchmark datasets such as CB6133, CB513, CASP10, and CAP11 using Q3, Q8, and segment overlap (Sov) metrics. The proposed model reported Q3 accuracy of 85.4%, 85.4%, 83.7%, 81.5%, and Q8 accuracy 75.8%, 73.5%, 72.2%, and 70% on CB6133, CB513, CASP10, and CAP11 datasets respectively. The results of the proposed model are improved by a minimum factor of 2.5% and 2.1% in Q3 and Q8 accuracy respectively, as compared to the popular existing models on CB513 dataset. Further, the quality of the Q3 results is validated by structural class prediction and compared with PSI-PRED. The experiment showed that the quality of the Q3 results of the proposed model is higher than that of PSI-PRED. © 2019 Elsevier B.V.
  • Item
    Affective database for e-learning and classroom environments using Indian students’ faces, hand gestures and body postures
    (Elsevier B.V., 2020) Ashwin, T.S.; Guddeti, R.M.R.
    Automatic recognition of the students’ affective states is a challenging task. These affective states are recognized using their facial expressions, hand gestures, and body postures. An intelligent tutoring system and smart classroom environment can be made more personalized using students’ affective state analysis, and it is performed using machine or deep learning techniques. Effective recognition of affective states is mainly dependent on the quality of the database used. But, there exist very few standard databases for the students’ affective state recognition and its analysis that works for both e-learning and classroom environments. In this paper, we propose a new affective database for both the e-learning and classroom environments using the students’ facial expressions, hand gestures, and body postures. The database consists of both posed (acted) and spontaneous (natural) expressions with single and multi-person in a single image frame with more than 4000 manually annotated image frames with object localization. The classification was done manually using the gold standard study for both Ekman's basic emotions and learning-centered emotions, including neutral. The annotators reliably agree when discriminating against the recognized affective states with Cohen's ? = 0.48. The created database is more robust as it considers various image variants such as occlusion, background clutter, pose, illumination, cultural & regional background, intra-class variations, cropped images, multipoint view, and deformations. Further, we analyzed the classification accuracy of our database using a few state-of-the-art machine and deep learning techniques. Experimental results demonstrate that the convolutional neural network based architecture achieved an accuracy of 83% and 76% for detection and classification, respectively. © 2020 Elsevier B.V.
  • Item
    An Enhanced Protein Fold Recognition for Low Similarity Datasets Using Convolutional and Skip-Gram Features with Deep Neural Network
    (Institute of Electrical and Electronics Engineers Inc., 2021) Bankapur, S.; Patil, N.
    The protein fold recognition is one of the important tasks of structural biology, which helps in addressing further challenges like predicting the protein tertiary structures and its functions. Many machine learning works are published to identify the protein folds effectively. However, very few works have reported the fold recognition accuracy above 80% on benchmark datasets. In this study, an effective set of global and local features are extracted from the proposed Convolutional (Conv) and SkipXGram bi-gram (SXGbg) techniques, and the fold recognition is performed using the proposed deep neural network. The performance of the proposed model reported 91.4% fold accuracy on one of the derived low similarity (< 25%) datasets of latest extended version of SCOPe_2.07. The proposed model is further evaluated on three popular and publicly available benchmark datasets such as DD, EDD, and TG and obtained 85.9%, 95.8%, and 88.8% fold accuracies, respectively. This work is first to report fold recognition accuracy above 85% on all the benchmark datasets. The performance of the proposed model has outperformed the best state-of-the-art models by 5% to 23% on DD, 2% to 19% on EDD, and 3% to 30% on TG dataset. © 2002-2011 IEEE.