About me

My name is Nakul Poudel, and I am a third-year PhD student at the Chester F. Carlson Center for Imaging Science at the Rochester Institute of Technology (RIT), Rochester, New York, USA. I conduct my research under the supervision of Dr. Cristian A. Linte in the Biomedical Modeling, Visualization, and Image-Guided Navigation (BiMVisGN) Lab. My research focuses on deep learning methods for medical imaging applications.

Prior to starting my PhD, I obtained my Bachelor’s degree in Computer Engineering from Sagarmatha Engineering College, affiliated with Tribhuvan University, Nepal. Outside of my research, I enjoy traveling, exploring new places, hiking, and experiencing different cultures.

Research

My research focuses on developing advanced deep learning methodologies for medical imaging and surgical data analysis. I am particularly interested in designing robust, generalizable, and clinically meaningful computational models that bridge the gap between artificial intelligence and real-world surgical applications.

During my PhD, I have worked on several foundational problems in medical artificial intelligence, including image and surface registration for aligning preoperative and intraoperative data, organ surface reconstruction and completion from sparse, partial, and noisy observations, and segmentation and detection of anatomical structures and surgical instruments. A significant aspect of my research involves addressing real-world challenges such as non-canonical poses, partial visibility, noise, and domain shifts between imaging modalities.

Beyond solving individual tasks, my broader research vision is to move toward general-purpose surgical AI systems capable of understanding and reasoning about the entire surgical workflow. Modern operating rooms generate rich visual and geometric data streams, yet most existing approaches remain narrowly specialized for single tasks. I am interested in developing unified surgical vision models that can simultaneously interpret surgical scenes, detect and classify tools, recognize tool–tissue interactions, identify surgical phases, recognize procedural steps, assess surgical skill, and detect potential errors. Such models would enable a more holistic understanding of surgical procedures rather than treating each task in isolation.

Currently, I am exploring surgical vision foundation models designed to learn shared representations across multiple tasks and domains. By integrating large-scale pretraining, multi-task learning, and vision–language modeling strategies, I aim to design a single unified architecture capable of generalizing across diverse surgical scenarios. My long-term goal is to contribute to the development of intelligent, context-aware surgical assistance systems that enhance intraoperative decision-making, improve safety, and ultimately advance patient outcomes.

News

[Apr 2026] Our paper “Evaluating Large Vision–language Models for Surgical Tool Detection” has been accepted for presentation at the 48th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) held in Toronto, Canada.
[Dec 2025] Our paper “Assessing Learning-Based Reconstructed Liver Surfaces from Partial Point Clouds for Improving Pre- to Intra-Operative 3D–3D Registration” has been published in Wiley’s IET Healthcare Technology Letters journal.
[Aug 2025] Accepted to present my research titled “Assessing Learning-Based Reconstructed Liver Surfaces from Partial Point Clouds for Improving Pre- to Intra-Operative 3D–3D Registration” at the AE-CAI Workshop at the Medical Image Computing and Computer-Assisted Intervention (MICCAI) 2025 in Daejeon, South Korea.
[July 2025] Delivered an oral presentation of our paper, “Toward Patient-Specific Partial Point Cloud to Surface Completion for Pre- to Intra-Operative Registration in Image-Guided Liver Interventions”, at the Medical Image Understanding and Analysis (MIUA) 2025 held at the University of Leeds, Leeds, UK.
[May 2025] Excited to share that our paper, “Toward Patient-Specific Partial Point Cloud to Surface Completion for Pre- to Intra-Operative Registration in Image-Guided Liver Interventions”, has been accepted at the Medical Image Understanding and Analysis (MIUA) Annual Conference!
[Feb 2025] Presented our paper as a poster presentation at SPIE Medical Imaging 2025, held in San Diego, California, USA.
[Oct 2024] Our paper titled “Evaluation of Intraoperative Patient-Specific Methods for Point Cloud Completion for Minimally Invasive Liver Interventions” has been accepted to SPIE Medical Imaging 2025.
[May 2024] Appointed as Treasurer of the Nepalese Student Association at the Rochester Institute of Technology.
[Aug 2023] Started my PhD in Imaging Science at the Rochester Institute of Technology, Rochester, United States.
[July 2022] Graduated with a Bachelor’s degree in Computer Engineering from Sagarmatha Engineering College, affiliated with Tribhuvan University, Nepal.
[April 2022] Selected for the Erasmus+ program to study a semester at UPCT Universidad Politécnica de Cartagena.
[June 2020] Our team “Matrix” secured First Runner-Up position in the Hack for Good Online Hackathon organized by Sagarmatha Engineering College, Kathmandu, Nepal.