SDTP-ViT: Vision Transformers based obstruction classification in telecommunication pipes

Calvin D'Couto, Karthick Thiyagarajan, Raphael Falque, Sarath Kodagoda

Research output: Chapter in Book / Conference PaperConference Paperpeer-review

1 Citation (Scopus)

Abstract

Telecommunication pipes play a critical role in maintaining global communication infrastructure. These pipes are of small diameter, and obstructions within them can lead to significant service disruptions. Existing inspection methods are predominantly manual, labor-intensive, and time-consuming. To address these challenges, we are developing a robotic solution equipped with cameras and deep-learning algorithms for autonomous obstruction detection. However, there is currently no publicly available dataset for obstruction classification in telecommunication pipe environments. In this context, this paper presents the development of the SDTP-ViT model, a Vision Transformer (ViT)-based framework for obstruction classification in small-diameter telecommunication pipes (SDTP). We introduce the SDTP dataset, comprising 69,710 images categorized into five classes as a benchmark dataset. We use this dataset to train and evaluate the SDTP-ViT model with other state-of-the-art defect detection algorithms. The results show the SDTP-ViT model has performances comparable with the state-of-the-art with significant improvement in running time.

Original languageEnglish
Title of host publicationProceedings of the 20th IEEE Conference on Industrial Electronics and Applications (ICIEA), 3-6 August 2025, Yantai, China
Place of PublicationU.S.
PublisherIEEE
Number of pages6
ISBN (Electronic)9798331524036
DOIs
Publication statusPublished - 2025
Event20th IEEE Conference on Industrial Electronics and Applications, ICIEA 2025 - Yantai, China
Duration: 3 Aug 20256 Aug 2025

Conference

Conference20th IEEE Conference on Industrial Electronics and Applications, ICIEA 2025
Country/TerritoryChina
CityYantai
Period3/08/256/08/25

Keywords

  • Field Robotics
  • Obstruction Dataset
  • Telecommunication pipe
  • Vision Transformers

Fingerprint

Dive into the research topics of 'SDTP-ViT: Vision Transformers based obstruction classification in telecommunication pipes'. Together they form a unique fingerprint.

Cite this