Using Vision Transformers for Classifying Surgical Tools in Computer Aided Surgeries
Abstract: Automated laparoscopic video analysis is essential for assisting surgeons during computer-aided medical procedures, yet it remains challenging due to complex surgical scenes and limited annotated data. Most existing methods for classifying surgical tools in laparoscopic surgeries rely on conventional deep learning architectures such as convolutional and recurrent neural networks. This paper explores the use of pure self-attention based models, namely Vision Transformers, for classifying both single-label (SL) and multi-label (ML) frames in laparoscopic surgeries. The proposed SL and ML models were comprehensively evaluated on the Cholec80 surgical workflow dataset using 5-fold cross-validation. Experimental results showed excellent classification performance, with a mean average precision (mAP) of 95.8% that outperforms the conventional deep learning multi-label models developed in previous studies. Our results open new avenues for further research on the use of deep transformer models for surgical tool detection in modern operating theaters.
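As a concrete illustration of the kind of multi-label pipeline the abstract describes, the sketch below fine-tunes a pretrained Vision Transformer with a sigmoid head over the seven tool classes annotated in Cholec80 and scores it with mean average precision. This is a minimal sketch, not the authors' implementation: the backbone (vit_base_patch16_224 from timm), optimizer, learning rate, and the helper names train_step and evaluate_map are illustrative assumptions.

```python
# A minimal sketch, assuming a timm ViT backbone; not the paper's exact setup.
import torch
import torch.nn as nn
import timm
from sklearn.metrics import average_precision_score

NUM_TOOLS = 7  # Cholec80 annotates seven surgical tools per frame

# Pretrained ViT backbone; timm swaps in a fresh 7-logit classification head.
model = timm.create_model("vit_base_patch16_224", pretrained=True,
                          num_classes=NUM_TOOLS)

# Multi-label (ML) setting: each tool is an independent binary target, so the
# loss is per-class binary cross-entropy on the logits rather than softmax.
criterion = nn.BCEWithLogitsLoss()
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

def train_step(frames: torch.Tensor, labels: torch.Tensor) -> float:
    """One optimization step. frames: (B, 3, 224, 224); labels: (B, 7) multi-hot."""
    model.train()
    optimizer.zero_grad()
    loss = criterion(model(frames), labels.float())
    loss.backward()
    optimizer.step()
    return loss.item()

@torch.no_grad()
def evaluate_map(frames: torch.Tensor, labels: torch.Tensor) -> float:
    """Mean average precision (mAP): per-tool average precision, macro-averaged."""
    model.eval()
    probs = torch.sigmoid(model(frames)).cpu().numpy()
    return average_precision_score(labels.cpu().numpy(), probs, average="macro")
```

For the single-label (SL) case, the same backbone would instead end in a softmax head trained with cross-entropy, since exactly one class applies per frame.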
- Location: Deutsche Nationalbibliothek Frankfurt am Main
- Extent: Online resource
- Language: English
- Bibliographic citation: Current Directions in Biomedical Engineering, vol. 10, no. 4 (2024), pp. 232-235
- Creator
- DOI: 10.1515/cdbme-2024-2056
- URN: urn:nbn:de:101:1-2412181802205.276932104545
- Rights: Open Access; access to the object is unrestricted.
- Data provider: Deutsche Nationalbibliothek
- Associated: El Moaqet, Hisham; Janini, Rami; Abdulbaki Alshirbaji, Tamer; Aldeen Jalal, Nour; Möller, Knut