4.6 Article

Cross-Modal Guidance Assisted Hierarchical Learning Based Siamese Network for MR Image Denoising

期刊

ELECTRONICS
卷 10, 期 22, 页码 -

出版社

MDPI
DOI: 10.3390/electronics10222855

关键词

cross-modal; guided; denoising; MRI; machine learning; siamese network; deep learning

资金

  1. H2020-MSCA-ITN Marie Sklodowska-Curie Actions, Innovative Training Networks (ITN)-H2020 MSCA ITN [722068]
  2. Marie Curie Actions (MSCA) [722068] Funding Source: Marie Curie Actions (MSCA)

向作者/读者索取更多资源

Cross-modal medical imaging techniques are utilized to enhance the accuracy of medical image analysis tasks. A deep learning-based denoising method called CMGDNet is proposed in this paper to remove Rician noise in T1-weighted MRI using cross-modal image information, achieving significant improvements in SSIM and FSIM values compared to other denoising methods.
Cross-modal medical imaging techniques are predominantly being used in the clinical suite. The ensemble learning methods using cross-modal medical imaging adds reliability to several medical image analysis tasks. Motivated by the performance of deep learning in several medical imaging tasks, a deep learning-based denoising method Cross-Modality Guided Denoising Network CMGDNet for removing Rician noise in T1-weighted (T1-w) Magnetic Resonance Images (MRI) is proposed in this paper. CMGDNet uses a guidance image, which is a cross-modal (T2-w) image of better perceptual quality to guide the model in denoising its noisy T1-w counterpart. This cross-modal combination allows the network to exploit complementary information existing in both images and therefore improve the learning capability of the model. The proposed framework consists of two components: Paired Hierarchical Learning (PHL) module and Cross-Modal Assisted Reconstruction (CMAR) module. PHL module uses Siamese network to extract hierarchical features from dual images, which are then combined in a densely connected manner in the CMAR module to finally reconstruct the image. The impact of using registered guidance data is investigated in removing noise as well as retaining structural similarity with the original image. Several experiments were conducted on two publicly available brain imaging datasets available on the IXI database. The quantitative assessment using Peak Signal to noise ratio (PSNR), Structural Similarity Index (SSIM), and Feature Similarity Index (FSIM) demonstrates that the proposed method exhibits 4.7% and 2.3% gain (average), respectively, in SSIM and FSIM values compared to other state-of-the-art denoising methods that do not integrate cross-modal image information in removing various levels of noise.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Human Inertial Thinking Strategy: A Novel Fuzzy Reasoning Mechanism for IoT-Assisted Visual Monitoring

Shuai Liu, Shuai Wang, Xinyu Liu, Jianhua Dai, Khan Muhammad, Amir H. H. Gandomi, Weiping Ding, Mohammad Hijji, Victor Hugo C. de Albuquerque

Summary: Computer vision, particularly visual monitoring technology, has shown great potential in the complex monitoring environment. This article proposes a fuzzy inference-based monitoring method that utilizes human inertial thinking characteristics to infer the target's location and applies an alternative selection strategy based on thinking set. Experimental results on multiple datasets demonstrate the effectiveness and robustness of the proposed method in IoT-assisted monitoring.

IEEE INTERNET OF THINGS JOURNAL (2023)

Review Computer Science, Theory & Methods

A Comprehensive Review on Vision-Based Violence Detection in Surveillance Videos

Fath U. Min Ullah, Mohammad S. Obaidat, Amin Ullah, Khan Muhammad, Mohammad Hijji, Sung Wook Baik

Summary: Recent advancements in intelligent surveillance systems for video analysis have attracted significant attention in the research community. Automatic violence detection systems using artificial neural networks and machine intelligence are in high demand in heavily crowded areas to ensure safety and security in smart cities. Extensive literature on violence detection has been published, but existing surveys are limited in scope. To address this, we conduct a comprehensive survey and analysis of the literature, examining machine learning strategies, neural network-based analysis, limitations, and datasets. We also discuss evaluation strategies, metrics, and provide recommendations for future research in violence detection.

ACM COMPUTING SURVEYS (2023)

Editorial Material Computer Science, Artificial Intelligence

Editorial: Deep neural networks with cloud computing

Kit Yan Chan, Bilal Abu-Salih, Khan Muhammad, Vasile Palade, Rifai Chai

NEUROCOMPUTING (2023)

Article Mathematics

Automated Fire Extinguishing System Using a Deep Learning Based Framework

Senthil Kumar Jagatheesaperumal, Khan Muhammad, Abdul Khader Jilani Saudagar, Joel J. P. C. Rodrigues

Summary: Fire accidents cause a high number of casualties and manually extinguishing the fire is risky. The development of fire-extinguishing robots with advanced functionalities is ongoing, however, early detection of fire is lacking in most systems. This study introduces a deep learning-based automatic fire extinguishing mechanism utilizing convolutional neural networks for fire detection and human presence in fire locations. Experimental results show that the best combination of neural network parameters is an Adam optimizer with softmax activation and a learning rate of 0.001. The proposed model was tested using a mobile robotic system in automatic and wireless control modes, successfully extinguishing fires.

MATHEMATICS (2023)

Article Engineering, Electrical & Electronic

RBWCI: Robust and Blind Watermarking Framework for Cultural Images

Samrah Mehraj, Subreena Mushtaq, Shabir A. Parah, Kaiser J. Giri, Javaid A. Sheikh, Amir H. Gandomi, Mohammad Hijji, Brij B. Gupta, Khan Muhammad

Summary: Heritage multimedia is a valuable cultural asset that provides insights into earlier generations and their creative approach, lifestyle, and historical ideologies. It is also an important resource for boosting the local economy, sustainable communities, and tourism and business sectors. With the advancements in technology and 5G networks, protecting heritage media from unauthorized consumers is crucial. This study proposes a robust and blind watermarking-framework for cultural images (RBWCI) that uses the discrete cosine transform domain for ownership verification and copyright protection.

IEEE TRANSACTIONS ON CONSUMER ELECTRONICS (2023)

Article Computer Science, Hardware & Architecture

A Reliable Sample Selection Strategy for Weakly Supervised Visual Tracking

Shuai Liu, Xiyu Xu, Yang Zhang, Khan Muhammad, Weina Fu

Summary: This article introduces a reliable sample selection strategy for weakly supervised visual tracking and verifies its importance in improving model performance. Experiments demonstrate that a scientific sample quality assessment method is of great help to data-based weakly supervised learning systems.

IEEE TRANSACTIONS ON RELIABILITY (2023)

Article Computer Science, Information Systems

Dual-Driven Resource Management for Sustainable Computing in the Blockchain-Supported Digital Twin IoT

Dan Wang, Bo Li, Bin Song, Yingjie Liu, Khan Muhammad, Xiaokang Zhou

Summary: In this article, a novel blockchain-supported hierarchical digital twin IoT (HDTIoT) framework is proposed to achieve secure and reliable real-time computation. The framework combines digital twin with edge network and adopts blockchain technology. By utilizing a data and knowledge dual-driven learning solution, the communication and computation efficiency is improved. Experimental results demonstrate the efficiency and reliability of the proposed resource allocation scheme in the HDTIoT system.

IEEE INTERNET OF THINGS JOURNAL (2023)

Review Engineering, Multidisciplinary

A Comprehensive Survey on Deep Facial Expression Recognition: Challenges, Applications, and Future Guidelines

Muhammad Sajjad, Fath U. Min Ullah, Mohib Ullah, Georgia Christodoulou, Faouzi Alaya Cheikh, Mohammad Hijji, Khan Muhammad, Joel J. P. C. Rodrigues

Summary: Facial expression recognition (FER) is a complex research topic with applications in various fields, such as healthcare and security. Computational FER mimics human facial expression coding skills to assist human-computer interaction. This study thoroughly analyzes and surveys the existing literature on FER, highlights the working flow of FER methods, discusses limitations in existing surveys, investigates FER datasets, and comprehensively discusses measures to evaluate FER performance.

ALEXANDRIA ENGINEERING JOURNAL (2023)

Article Engineering, Electrical & Electronic

Visual Appearance and Soft Biometrics Fusion for Person Re-Identification Using Deep Learning

Samee Ullah Khan, Noman Khan, Tanveer Hussain, Khan Muhammad, Mohammad Hijji, Javier Del Ser, Sung Wook Baik

Summary: This article proposes a multi-scale pyramid attention model for person re-identification (P-ReID) that leverages the complementarity between semantic attributes and visual appearance. The proposed model consists of three steps, including individual training of backbone model and appearance/attribute networks, fusion of dual network features, and re-training for P-ReID.

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING (2023)

Article Engineering, Civil

Efficient Fire Segmentation for Internet-of-Things-Assisted Intelligent Transportation Systems

Khan Muhammad, Hayat Ullah, Salman Khan, Mohammad Hijji, Jaime Lloret

Summary: This paper proposes an efficient and lightweight CNN architecture for early fire detection and segmentation. By utilizing depth-wise separable convolution, point-wise group convolution, and a channel shuffling strategy, the model size and computation costs are significantly reduced. Extensive experiments validate the effectiveness and robustness of the proposed method in fire segmentation.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Engineering, Civil

Network Car Hailing Pricing Model Optimization in Edge Computing-Based Intelligent Transportation System

Zheng Wang, Yifeng Wang, Khan Muhammad

Summary: The purpose of this study is to investigate the pricing and deficiency of Network Car Hailing (NCH) Platform in Edge Computing (EC)-based Intelligent Transportation System. This study introduces EC to address the capacity and load balancing issues in the car-hailing platform, and constructs an EC-based online car-hailing resource allocation and pricing optimization model. Experimental results show that as the number of vehicles with computing tasks increases, the amount of resources purchased and the cost of paying also increase, while the utility function of NCH platforms and operators declines. The model constructed in this study can minimize average cost and energy consumption while maintaining low delay, providing reference for intelligent pricing and resource allocation in the later period of intelligent transportation.

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS (2023)

Article Computer Science, Information Systems

Efficient Person Reidentification for IoT-Assisted Cyber-Physical Systems

Samee Ullah Khan, Ijaz Ul Haq, Noman Khan, Amin Ullah, Khan Muhammad, Huiling Chen, Sung Wook Baik, Victor Hugo C. de Albuquerque

Summary: The study proposes a cyber-physical system (CPS)-based person reidentification (P-ReID) framework for smart surveillance. The framework utilizes AI techniques and IoT environments to improve efficiency and overcome challenges in person reidentification. A dual attention dilated network (DADNet) and dual feature fusion method are introduced to enhance the person matching probability. Additionally, diversity orthogonality regularization is imposed on several CNN layers to boost the performance of DADNet. A comprehensive analysis and comparison demonstrate the strength of DADNet in AI-enabled IoT settings.

IEEE INTERNET OF THINGS JOURNAL (2023)

Article Automation & Control Systems

Large-Scale Person Re-Identification for Crowd Monitoring in Emergency

Nayan Kumar Subhashis Behera, Pankaj Kumar Sa, Khan Muhammad, Sambit Bakshi

Summary: Person Re-identification (PRId) is essential for associating photographs/videos of individuals obtained from various occasions or across cameras, especially in emergencies. Part-level features play a crucial role in person retrieval, and using convolutional partition of body parts to learn discriminative features is highlighted in this research. The proposed method of Convolutional Part Refine (CPR) shows competitive performance and addresses the within-part inconsistency issue in partition strategies.

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING (2023)

Article Computer Science, Hardware & Architecture

Communication Technologies for Edge Learning and Inference: A Novel Framework, Open Issues, and Perspectives

Khan Muhammad, Javier Del Ser, Naercio Magaia, Ramon Fonseca, Tanveer Hussain, Amir H. Gandomi, Mahmoud Daneshmand, Victor Hugo C. de Albuquerque

Summary: With the increasing popularity of smart devices and their need for data, edge computing and edge learning have become powerful tools. However, edge learning faces challenges such as latency sensitivity and resource consumption. This study proposes a prioritization framework for video data based on edge learning, which can reduce resource usage. Additionally, communication aspects related to edge learning are critically examined.

IEEE NETWORK (2023)

暂无数据