4.6 Article

Context-aware saliency detection for image retargeting using convolutional neural networks

Journal

MULTIMEDIA TOOLS AND APPLICATIONS
Volume 80, Issue 8, Pages 11917-11941

Publisher

SPRINGER
DOI: 10.1007/s11042-020-10185-0

Keywords

Image retargeting; Human visual system; Semantic segmentation; Convolutional neural networks; Saliency detection

Ask authors/readers for more resources

Image retargeting is a crucial task that requires preserving high-level visual information and low-level features, with different types of images needing different processing methods. The relationship between image retargeting and image saliency detection is significant, with newer approaches considering high-level semantic knowledge. The proposed methods in the study show excellent performance in practice and can be used as a benchmark for further research on retargeting quality assessment.
Image retargeting is the task of making images capable of being displayed on screens with different sizes. This work should be done so that high-level visual information and low-level features such as texture remain as intact as possible to the human visual system. At the same time, the output image may have different dimensions. Thus, simple methods such as scaling and cropping are not adequate for this purpose. In recent years, researchers have tried to improve the existing retargeting methods, and they have introduced new ones. However, a specific method cannot be utilized to retarget all types of images. In other words, different images require different retargeting methods. Image retargeting has a close relationship to image saliency detection, which is a relatively new image processing task. Earlier saliency detection methods were based on local and global but low-level image information. These methods are called bottom-up processes. On the other hand, newer approaches are top-down and mixed methods that consider the high level and semantic knowledge of the image too. In this paper, we introduce the proposed methods in both saliency detection and retargeting. For the saliency detection, the use of image context and semantic segmentation are examined, and a novel mixed bottom-up and top-down saliency detection method is introduced. After saliency detection, a modified version of an existing retargeting technique is utilized for retargeting the images. The results suggest that the proposed image retargeting pipeline has excellent performance compared to other tested methods. Also, the subjective evaluations on the Pascal dataset can be used as a retargeting quality assessment dataset for further research.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Artificial Intelligence

ReDMark: Framework for residual diffusion watermarking based on deep networks

Mahdi Ahmadi, Alireza Norouzi, Nader Karimi, Shadrokh Samavi, Ali Emami

EXPERT SYSTEMS WITH APPLICATIONS (2020)

Article Computer Science, Information Systems

BlessMark: a blind diagnostically-lossless watermarking framework for medical applications based on deep neural networks

Hamidreza Zarrabi, Ali Emami, Pejman Khadivi, Nader Karimi, Shadrokh Samavi

MULTIMEDIA TOOLS AND APPLICATIONS (2020)

Article Engineering, Biomedical

Simplification of neural networks for skin lesion image segmentation using color channel pruning

M. Hajabdollahi, R. Esfandiarpoor, P. Khadivi, S. M. R. Soroushmehr, N. Karimi, S. Samavi

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS (2020)

Article Engineering, Biomedical

Detection of COVID-19 in X-ray images by classification of bag of visual words using neural networks

Zahra Nabizadeh-Shahre-Babak, Nader Karimi, Pejman Khadivi, Roshanak Roshandel, Ali Emami, Shadrokh Samavi

Summary: This paper proposes an approach using the bag of visual words and a neural network classifier to classify X-ray chest images into COVID-19 and non-COVID-19 with high performance. Experimental results show that extracting features with the bag of visual words leads to better classification accuracy compared to state-of-the-art techniques.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2021)

Article Computer Science, Artificial Intelligence

Classification of diabetic retinopathy using unlabeled data and knowledge distillation

Sajjad Abbasi, Mohsen Hajabdollahi, Pejman Khadivi, Nader Karimi, Roshanak Roshandel, Shahram Shirani, Shadrokh Samavi

Summary: Machine Learning and Artificial Intelligence have shown potential as diagnostic tools in the healthcare domain, but their usefulness is hindered by a lack of labeled data. Transfer Learning and Knowledge Distillation are partial solutions to this issue, but still have limitations that need to be considered.

ARTIFICIAL INTELLIGENCE IN MEDICINE (2021)

Article Computer Science, Information Systems

No-reference stereo image quality assessment based on discriminative sparse representation

Maryam Karimi, Mansour Nejati, Pejman Khadivi, Nader Karimi, Shadrokh Samavi

Summary: The study proposed a fast-efficient algorithm for blind quality assessment of stereoscopic images. It utilized supervised dictionary learning to learn discriminative distortion-specific bases from structural features of stereoscopic images. In experiments, the method achieved an overall correlation of 97% with subjective scores on common datasets.

MULTIMEDIA TOOLS AND APPLICATIONS (2021)

Article Engineering, Biomedical

DGAFF: Deep genetic algorithm fitness Formation for EEG Bio-Signal channel selection

Ghazaleh Ghorbanzadeh, Zahra Nabizadeh, Nader Karimi, Pejman Khadivi, Ali Emami, Shadrokh Samavi

Summary: This article presents a channel selection method combining a sequential search method with a genetic algorithm to improve the performance and efficiency of brain-computer interface systems.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2023)

Article Engineering, Biomedical

Brain tumor segmentation by cascaded multiscale multitask learning framework based on feature aggregation

Zahra Sobhaninia, Nader Karimi, Pejman Khadivi, Shadrokh Samavi

Summary: This paper presents an approach that simultaneously segments and classifies brain tumors in MRI images using a framework that contains MRI image enhancement and tumor region detection. A network called Multiscale Cascaded Multitask Network is proposed, which is based on a multitask learning approach containing segmentation and classification tasks. The proposed method achieves high accuracy in both segmentation (96.27 and 95.88 for DCS and mean IoU, respectively) and classification (97.988 accuracy).

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2023)

Article Engineering, Biomedical

Dynamic-Pix2Pix: Medical image segmentation by injecting noise to cGAN for modeling input and target domain joint distributions with limited training data

Mohammadreza Naderi, Nader Karimi, Ali Emami, Shahram Shirani, Shadrokh Samavi

Summary: This study aims to improve the performance of conditional Generative Adversarial Networks (cGANs) in translating images by learning the target domain distribution from limited data with the help of noise input. The proposed method achieves better model generalization and comparable results compared to state-of-the-art methods.

BIOMEDICAL SIGNAL PROCESSING AND CONTROL (2023)

Article Computer Science, Information Systems

Aesthetic-aware image retargeting based on foreground-background separation and PSO optimization

Mohammad Reza Naderi, Mohammad Hossein Givkashi, Nader Karimi, Shahram Shirani, Shadrokh Samavi

Summary: This study proposes an image retargeting method that aims to change the size of images while preserving important content and minimizing distortions. By utilizing techniques such as inpainting, seam carving, super-resolution, and optimization, the proposed method demonstrates commendable performance in improving visual quality.

MULTIMEDIA TOOLS AND APPLICATIONS (2023)

Proceedings Paper Engineering, Electrical & Electronic

Weighted Fuzzy-Based PSNR for Watermark Visual Quality Evaluation

Maedeh Jamali, Nader Karimi, Shadrokh Samavi

Summary: The traditional visual quality evaluation criteria such as PSNR and MSE lack appropriate standards based on the human visual system. A weighted fuzzy-based criterion proposed in this paper aims to find essential parts of an image based on the HVS for more accurate evaluation of image quality. Experimental results demonstrate significant improvements over standard PSNR evaluations.

2021 29TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE) (2021)

No Data Available