Natural rubber harvesting remains highly dependent on manual labor, particularly during latex cup collection, which limits efficiency and increases operational costs. Intelligent robotic harvesting systems require accurate visual perception and reliable grasp point positioning in rubber plantation environments. However, latex cups are typically small, visually diverse, and often affected by adjacent latex drains, making detection and manipulation challenging for existing vision models. To add