Publication details - Two-stream vision transformer based multi-label recognition for TCM prescriptions construction

Two-stream vision transformer based multi-label recognition for TCM prescriptions construction

Copyright © 2024. Published by Elsevier Ltd.

Traditional Chinese medicine (TCM) observation diagnosis images (including facial and tongue images) provide essential information about the human body and are of significant importance in clinical diagnosis and treatment. TCM prescriptions, known for their simplicity, non-invasiveness, and low side effects, are widely applied worldwide. Automated herbal prescription construction based on visual diagnosis is valuable both for exploring the correlation between external features and herbal prescriptions and for offering medical services in mobile healthcare systems. To effectively integrate multi-perspective visual diagnosis images and automate prescription construction, this study proposes a multi-herb recommendation framework based on Visual Transformer and multi-label classification. The framework comprises three key components: an image encoder, a label embedding module, and a cross-modal fusion classification module. The image encoder employs a dual-stream Visual Transformer to learn dependencies between different regions of the input images, capturing both local and global features. The label embedding module uses Graph Convolutional Networks to capture associations between herbal labels. Finally, two Multi-Modal Factorized Bilinear modules fuse the cross-modal vectors, yielding an end-to-end multi-label image-herb recommendation model. In experiments with real facial and tongue images, the model generates prescription data closely resembling real samples, with a precision of 50.06 %, a recall of 48.33 %, and an F1-score of 49.18 %. This study validates the feasibility of automated herbal prescription construction from the perspective of visual diagnosis and provides insights for constructing herbal prescriptions automatically from additional physical information.
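
The three reported metrics can be cross-checked against each other. The sketch below assumes that the F1-score is the harmonic mean of the reported precision and recall. The abstract does not state which averaging scheme was used for the multi-label evaluation, so this is a consistency check and not a description of the authors' evaluation code. The function name `f1_score` is chosen here for illustration.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both given in the same unit)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when both metrics are zero
    return 2 * precision * recall / (precision + recall)


# Values in percent, taken from the abstract.
reported_precision = 50.06
reported_recall = 48.33

f1 = f1_score(reported_precision, reported_recall)
print(f"F1 = {f1:.2f} %")  # prints "F1 = 49.18 %"
```

Under this assumption the recomputed value matches the reported F1-score of 49.18 %.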

Media type:	E-article

Year of publication:	2024
Published:	2024

Contained in:	Go to the full record - volume:170
Contained in:	Computers in biology and medicine - 170(2024), 20 Feb., page 107920

Language:	English

Contributors:	Zhao, Zijuan [author]; Qiang, Yan [author]; Yang, Fenghao [author]; Hou, Xiao [author]; Zhao, Juanjuan [author]; Song, Kai [author]

Links:	Full text

Subjects:	Facial and tongue images; Graph convolutional network; Journal Article; Multi-label image recognition; Prescriptions construction; Visual transformer

Notes:	Date Completed 28.02.2024; Date Revised 28.02.2024; published: Print-Electronic; Citation Status MEDLINE

doi:	10.1016/j.compbiomed.2024.107920

PPN (catalog ID):	NLM367349639

Internal format


LEADER	01000caa a22002652 4500
001	NLM367349639
003	DE-627
005	20240229170720.0
007	cr uuu---uuuuu
008	240121s2024 xx \|\|\|\|\|o 00\| \|\|eng c
024	7		\|a 10.1016/j.compbiomed.2024.107920 \|2 doi
028	5	2	\|a pubmed24n1310.xml
035			\|a (DE-627)NLM367349639
035			\|a (NLM)38244474
035			\|a (PII)S0010-4825(24)00004-0
040			\|a DE-627 \|b ger \|c DE-627 \|e rakwb
041			\|a eng
100	1		\|a Zhao, Zijuan \|e verfasserin \|4 aut
245	1	0	\|a Two-stream vision transformer based multi-label recognition for TCM prescriptions construction
264		1	\|c 2024
336			\|a Text \|b txt \|2 rdacontent
337			\|a Computermedien \|b c \|2 rdamedia
338			\|a Online-Ressource \|b cr \|2 rdacarrier
500			\|a Date Completed 28.02.2024
500			\|a Date Revised 28.02.2024
500			\|a published: Print-Electronic
500			\|a Citation Status MEDLINE
520			\|a Copyright © 2024. Published by Elsevier Ltd.
520			\|a Traditional Chinese medicine (TCM) observation diagnosis images (including facial and tongue images) provide essential information about the human body and are of significant importance in clinical diagnosis and treatment. TCM prescriptions, known for their simplicity, non-invasiveness, and low side effects, are widely applied worldwide. Automated herbal prescription construction based on visual diagnosis is valuable both for exploring the correlation between external features and herbal prescriptions and for offering medical services in mobile healthcare systems. To effectively integrate multi-perspective visual diagnosis images and automate prescription construction, this study proposes a multi-herb recommendation framework based on Visual Transformer and multi-label classification. The framework comprises three key components: an image encoder, a label embedding module, and a cross-modal fusion classification module. The image encoder employs a dual-stream Visual Transformer to learn dependencies between different regions of the input images, capturing both local and global features. The label embedding module uses Graph Convolutional Networks to capture associations between herbal labels. Finally, two Multi-Modal Factorized Bilinear modules fuse the cross-modal vectors, yielding an end-to-end multi-label image-herb recommendation model. In experiments with real facial and tongue images, the model generates prescription data closely resembling real samples, with a precision of 50.06 %, a recall of 48.33 %, and an F1-score of 49.18 %. This study validates the feasibility of automated herbal prescription construction from the perspective of visual diagnosis and provides insights for constructing herbal prescriptions automatically from additional physical information.
650		4	\|a Journal Article
650		4	\|a Facial and tongue images
650		4	\|a Graph convolutional network
650		4	\|a Multi-label image recognition
650		4	\|a Prescriptions construction
650		4	\|a Visual transformer
700	1		\|a Qiang, Yan \|e verfasserin \|4 aut
700	1		\|a Yang, Fenghao \|e verfasserin \|4 aut
700	1		\|a Hou, Xiao \|e verfasserin \|4 aut
700	1		\|a Zhao, Juanjuan \|e verfasserin \|4 aut
700	1		\|a Song, Kai \|e verfasserin \|4 aut
773	0	8	\|i Enthalten in \|t Computers in biology and medicine \|d 1970 \|g 170(2024) vom: 20. Feb., Seite 107920 \|w (DE-627)NLM000382272 \|x 1879-0534 \|7 nnns
773	1	8	\|g volume:170 \|g year:2024 \|g day:20 \|g month:02 \|g pages:107920
856	4	0	\|u http://dx.doi.org/10.1016/j.compbiomed.2024.107920 \|3 Volltext
912			\|a GBV_USEFLAG_A
912			\|a GBV_NLM
951			\|a AR
952			\|d 170 \|j 2024 \|b 20 \|c 02 \|h 107920
