LEAD: Legal Efficiency and Diversity in Fine-Tuning Data Selection through Dual-Metric Optimization and Syntactic Clustering