Dual-agent reinforcement learning network with knowledge injection for multi-objective control of tandem cold rolling