Malignant melanoma (MM) is the most aggressive form of skin cancer, for which early detection is critical and strongly associated with improved survival outcomes. Recent advances in large language models (LLMs), such as ChatGPT and Gemini, present promising opportunities to support melanoma early screening and clinical decision-making. However, despite increasing interest in LLM-based dermatologic applications, their diagnostic reliability across different populations remains insufficiently char