The work couples a VLM backbone with a dual-phase training recipe (SFT → RFT) to turn language prompts into robust tracking and motor commands for a continuum endoscope.
A highlight was a meeting with Prof. Ken Goldberg, especially meaningful since Prof. Ren previously served as a postdoc at UC Berkeley, keeping our CUHK-UCB connection strong.
Huge thanks to collaborators and everyone who stopped by the poster!
Authors: Chi Kit Ng*, Long Bai*, Guankun Wang*, Yupeng Wang, Huxin Gao, Kun Yuan, Chenhan Jin, Tieyong Zeng, Hongliang Ren
Affiliations: CUHK, TUM