4.7 Article

Food Volume Estimation Based on Deep Learning View Synthesis from a Single Depth Map

期刊

NUTRIENTS
卷 10, 期 12, 页码 -

出版社

MDPI
DOI: 10.3390/nu10122005

关键词

dietary assessment; volume estimation; mhealth; deep learning; view synthesis; image rendering; 3d reconstruction

资金

  1. Lee Family Scholarship - Bill & Melinda Gates Foundation [OPP1171395]

向作者/读者索取更多资源

An objective dietary assessment system can help users to understand their dietary behavior and enable targeted interventions to address underlying health problems. To accurately quantify dietary intake, measurement of the portion size or food volume is required. For volume estimation, previous research studies mostly focused on using model-based or stereo-based approaches which rely on manual intervention or require users to capture multiple frames from different viewing angles which can be tedious. In this paper, a view synthesis approach based on deep learning is proposed to reconstruct 3D point clouds of food items and estimate the volume from a single depth image. A distinct neural network is designed to use a depth image from one viewing angle to predict another depth image captured from the corresponding opposite viewing angle. The whole 3D point cloud map is then reconstructed by fusing the initial data points with the synthesized points of the object items through the proposed point cloud completion and Iterative Closest Point (ICP) algorithms. Furthermore, a database with depth images of food object items captured from different viewing angles is constructed with image rendering and used to validate the proposed neural network. The methodology is then evaluated by comparing the volume estimated by the synthesized 3D point cloud with the ground truth volume of the object items.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Computer Science, Information Systems

Video Based Cocktail Causal Container for Blood Pressure Classification and Blood Glucose Prediction

Chuanhao Zhang, Emil Jovanov, Hongen Liao, Yuan-Ting Zhang, Benny Lo, Yuan Zhang, Cuntai Guan

Summary: With the development of modern cameras, physiological signals can now be obtained from portable devices like smartphones. In this paper, a framework called cocktail causal container is proposed to fuse multiple physiological representations and reconstruct the correlation between frequency and temporal information for blood pressure and blood glucose classification. The framework utilizes a token feature fusion block and a causal net to extract discriminative features and disentangle latent factors. A pair-wise temporal frequency map is also developed for extracting PPG information. Extensive comparisons using clinical data and PPG-BP benchmark show promising results with low error rate and high precision.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Article Computer Science, Information Systems

MtCLSS: Multi-Task Contrastive Learning for Semi-Supervised Pediatric Sleep Staging

Yamei Li, Shengqiong Luo, Haibo Zhang, Yinkai Zhang, Yuan Zhang, Benny Lo

Summary: The demand for automatic pediatric sleep staging has increased due to the rising incidence and recognition of children's sleep disorders. The existing supervised sleep stage recognition algorithms face challenges such as limited availability of pediatric sleep physicians and data heterogeneity. To address this, we propose a multi-task contrastive learning strategy that combines semi-supervised learning and self-supervised contrastive learning, named MtCLSS. By applying signal-adapted transformations and an extended contrastive loss function, MtCLSS learns task-specific and general features from limited labeled data, improving the robustness of the model for EEG based automatic pediatric sleep staging in limited data scenarios.

IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS (2023)

Article Computer Science, Information Systems

Ecological Cooperative Adaptive Control of Connected Automate Vehicles in Mixed and Power-Heterogeneous Traffic Flow

Xianmin Song, Yingnan Sun, Haitao Li, Bo Liu, Yuxuan Cao

Summary: This paper proposes an Ecological Control Unit-Cooperative Adaptive Control (ECU-CACC) system to reduce energy consumption in mixed and power-heterogeneous traffic flow. A bi-level control framework is designed to improve traffic efficiency and reduce energy consumption. Numerical experiments verify the effectiveness of the system and analyze the energy-saving effect under different vehicle mixing situations.

ELECTRONICS (2023)

Article Medicine, General & Internal

Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation

Peilun Shi, Jianing Qiu, Sai Mu Dalike Abaxi, Hao Wei, Frank P. -W. Lo, Wu Yuan

Summary: Medical image analysis is crucial in clinical diagnosis. This paper evaluates the Segment Anything Model (SAM) on various medical image segmentation benchmarks and finds that while SAM performs well on general domain images, its zero-shot segmentation ability is limited for medical images. Inconsistent performance is observed across different medical domains, with complete failure in segmentation of certain structured targets. However, fine-tuning SAM with a small amount of data leads to significant improvement, showing the potential of achieving accurate medical image segmentation for precision diagnostics.

DIAGNOSTICS (2023)

Article Robotics

Modified Bilateral Active Estimation Model: A Learning-Based Solution to the Time Delay Problem in Robotic Tele-Control

Xuhui Zhou, Ziqi Yang, Yunxiao Ren, Weibang Bai, Benny Lo, Eric M. M. Yeatman

Summary: The presence of computation delay, transmission delay, and mechanical delay in robotic teleoperation systems is a major factor in system degradation. A neural network-based open-loop approach called BAEM has been proposed to compensate for transmission delay by sending predicted trajectories as commands. A modified version of BAEM (m-BAEM) is proposed to explicitly compensate for all three types of delay, and a real-time robotic teleoperation system based on the ROS 2 framework is built to evaluate its performance.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Article Robotics

A Step Towards Conditional Autonomy-Robotic Appendectomy

Ruiyang Zhang, Junhong Chen, Zeyu Wang, Ziqi Yang, Yunxiao Ren, Peilun Shi, James Calo, Kyle Lam, Sanjay Purkayastha, Benny Lo

Summary: Robot-Assisted Minimally Invasive Surgery (RAMIS) has gained popularity worldwide due to its precision, ergonomics, and intuitive control. With advancements in AI and surgical robot technologies, the cognitive load on surgeons can be reduced, while improving the precision and safety of robot operations. However, research is still focused on task autonomy due to operation complexity and limited clinical data. This paper proposes a method for conditional autonomy in robotic appendectomy, utilizing demonstrated data to carry out the procedure semi-automatically.

IEEE ROBOTICS AND AUTOMATION LETTERS (2023)

Article Computer Science, Information Systems

Federated Blockchain Learning at the Edge

James Calo, Benny Lo

Summary: This paper proposes a method using blockchain and federated learning to effectively train neural networks on IoT devices. It addresses issues of data scarcity and privacy concerns, and enables distributed training across multiple devices.

INFORMATION (2023)

Article Nutrition & Dietetics

Evaluation of Acceptability, Functionality, and Validity of a Passive Image-Based Dietary Intake Assessment Method in Adults and Children of Ghanaian and Kenyan Origin Living in London, UK

Modou L. Jobarteh, Megan A. Mccrory, Benny Lo, Konstantinos K. Triantafyllidis, Jianing Qiu, Jennifer P. Griffin, Edward Sazonov, Mingui Sun, Wenyan Jia, Tom Baranowski, Alex K. Anderson, Kathryn Maitland, Gary Frost

Summary: This study validated an objective, passive image-based method for assessing dietary intake in London, UK and demonstrated its potential applicability in low- and middle-income countries (LMICs). The findings showed good agreement between the image-based method and weighed food records, indicating that this method can provide a comparable assessment of nutritional intake.

NUTRIENTS (2023)

Article Automation & Control Systems

Dual Stream Meta Learning for Road Surface Classification and Riding Event Detection on Shared Bikes

Shuo Jiang, Zach Strout, Bin He, Daiyan Peng, Peter B. B. Shull, Benny P. L. Lo

Summary: This article proposes an IoT-based solution that utilizes shared bikes to intelligently detect road surface conditions and riding events for travel efficiency and rider safety in cities. The proposed dual stream meta learning approach solves the reliability problem with different bike types and the self-adaptive problem when classifying new classes without retraining the model. The results demonstrate high accuracy in road surface condition and riding event detection.

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS (2023)

Article Nutrition & Dietetics

I2N: image to nutrients, a sensor guided semi-automated tool for annotation of images for nutrition analysis of eating episodes

Tonmoy Ghosh, Megan A. McCrory, Tyson Marden, Janine Higgins, Alex Kojo Anderson, Christabel Ampong Domfe, Wenyan Jia, Benny Lo, Gary Frost, Matilda Steiner-Asiedu, Tom Baranowski, Mingui Sun, Edward Sazonov

Summary: This paper presents a semi-automatic dietary assessment tool called Image to Nutrients (I2N), which uses wearable sensors to process eating events and food images for nutritional analysis. The tool provides access to multiple food databases and estimates energy intake and nutrient content.

FRONTIERS IN NUTRITION (2023)

Article Gastroenterology & Hepatology

Current Engineering Developments for Robotic Systems in Flexible Endoscopy

Amirhosein Alian, Emilia Zari, Zeyu Wang, Enrico Franco, James P. Avery, Mark Runciman, Benny Lo, Ferdinando Rodriguez y Baena, George Mylonas

Summary: In the past four decades, the incidence of early-onset gastrointestinal cancer has increased. Mass screening colonoscopy is the most effective prevention strategy for early-stage cancer detection, but conventional endoscopy is a painful and technically challenging procedure. To overcome these limitations, technological innovation is needed in colonoscopy.

TECHNIQUES AND INNOVATIONS IN GASTROINTESTINAL ENDOSCOPY (2023)

Article Automation & Control Systems

Egocentric Image Captioning for Privacy-Preserved Passive Dietary Intake Monitoring

Jianing Qiu, Frank P. -W. Lo, Xiao Gu, Modou L. Jobarteh, Wenyan Jia, Tom Baranowski, Matilda Steiner-Asiedu, Alex K. Anderson, Megan A. McCrory, Edward Sazonov, Mingui Sun, Gary Frost, Benny Lo

Summary: Camera-based passive dietary intake monitoring captures eating episodes, recording visual information on food type, volume, and eating behavior. However, no method incorporates these clues to provide a comprehensive dietary context. Privacy is a concern with wearable cameras. This paper proposes a privacy-preserved solution for dietary assessment, using egocentric image captioning to convert images into text descriptions and reduce privacy risks. A dataset is built for egocentric dietary image captioning, and a transformer-based architecture is designed and evaluated for effectiveness.

IEEE TRANSACTIONS ON CYBERNETICS (2023)

Article Computer Science, Information Systems

An Intelligent Vision-Based Nutritional Assessment Method for Handheld Food Items

Frank Po Wen Lo, Yao Guo, Yingnan Sun, Jianing Qiu, Benny Lo

Summary: Dietary assessment is effective for evaluating the dietary intake of patients with diabetes and obesity. However, traditional methods have limitations, so researchers proposed an intelligent nutritional assessment approach using weakly-supervised point cloud completion. This method shows promising results in estimating food volume and can be implemented using wearable and handheld cameras.

IEEE TRANSACTIONS ON MULTIMEDIA (2023)

Proceedings Paper Automation & Control Systems

Generalizable Movement Intention Recognition with Multiple Heterogeneous EEG Datasets

Xiao Gu, Jinpei Han, Guang-Zhong Yang, Benny Lo

Summary: This paper proposes a method for human movement intention recognition using motor imagery electroencephalogram. Two networks are developed to handle the heterogeneity of inter-subject and inter-dataset, and an online knowledge co-distillation framework is used for collaborative learning, achieving better generalization performance.

2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023) (2023)

暂无数据