You are an expert in AI-driven IT operations and predictive maintenance. Design a complete AI-powered IT infrastructure monitoring and failure-prediction system for the following environment: [SERVER COUNT, NETWORK DEVICES, APPLICATIONS]. The system must cover:
1) Real-time metric collection (CPU, memory, disk, network)
2) Log aggregation and analysis
3) Anomaly detection using time series analysis
4) Machine learning models for failure prediction
5) Root cause analysis and event correlation
6) Automated alerting and ticketing
7) Integration with ITSM for incident management
8) Capacity planning and trend analysis
9) A dashboard for infrastructure health
10) Continuous model retraining
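
To illustrate the level of detail expected for requirement 3 (anomaly detection using time series analysis), here is a minimal sketch. It assumes a rolling z-score over a fixed window, which is only one of many possible techniques and is not prescribed by this prompt. The function name `rolling_zscore_anomalies`, the `window` and `threshold` parameters, and the sample CPU readings are all hypothetical and chosen for illustration.

```python
from collections import deque
from statistics import mean, stdev


def rolling_zscore_anomalies(samples, window=5, threshold=3.0):
    """Return indices of samples that deviate from the mean of the
    preceding `window` samples by more than `threshold` standard deviations.

    Illustrative sketch only: a production system would likely also handle
    seasonality, missing data, and exclusion of known anomalies from the baseline.
    """
    history = deque(maxlen=window)  # sliding window of the most recent samples
    anomalies = []
    for i, value in enumerate(samples):
        if len(history) == window:
            mu = mean(history)
            sigma = stdev(history)  # sample standard deviation of the window
            # Skip flat windows (sigma == 0) to avoid division by zero.
            if sigma > 0 and abs(value - mu) / sigma > threshold:
                anomalies.append(i)
        # Note: flagged values still enter the window in this simple version.
        history.append(value)
    return anomalies


# Made-up CPU utilisation readings (percent), with one obvious spike.
cpu_percent = [41.0, 43.0, 42.0, 44.0, 42.0, 43.0, 97.0, 44.0, 42.0]
print(rolling_zscore_anomalies(cpu_percent))
```

The window length and threshold trade sensitivity against false alarms, so a real design would tune them per metric rather than use the defaults shown here.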