Valtixposted 3 days ago
Full-time • Mid Level
RTP, NC
Publishing Industries

About the position

AI/ML Engineer - Cloud Operations. Who You'll Work With We are the Cloud Operations team within Cisco IT, driving the development and management of Infrastructure capabilities that support Cisco's Engineering and business functions worldwide. Our mission is to build scalable, efficient, and cutting-edge infrastructure powering the next generation of AI solutions. By using automation, advanced hardware, and AI-optimized frameworks, we ensure seamless integration, reliable performance, and future-ready services through continuous innovation and emerging technologies. The team culture is dynamic and collaborative, where creative problem-solving, exploring new ideas, and pushing boundaries are celebrated. Who You Are You are an innovative and skilled AI Engineer to join our Cloud Operations team. This role involves applying artificial intelligence and machine learning techniques to optimize cloud infrastructure, automate routine operations, enhance performance monitoring, and improve system resilience. The ideal candidate has experience in cloud platforms (AWS, Azure, or GCP, OpenStack and VMWare), DevOps practices, and AI/ML development. An excellent collaborator who can partner, lead, guide, and communicate advanced technical concepts. A hardworking and passionate engineer comfortable working in high-pressure, large-scale enterprise environments.

Responsibilities

  • Design and implement AI Agents to optimize cloud resource allocation, auto-scaling, and performance tuning.
  • Develop predictive models for failure detection, incident management, and system health monitoring.
  • Automate operational workflows using machine learning and intelligent scripting.
  • Integrate AI-driven insights with existing cloud monitoring tools.
  • Collaborate with DevOps and SRE teams to deploy, monitor, and improve ML models in production environments.
  • Conduct anomaly detection for security, cost optimization, and performance analytics.
  • Continuously evaluate emerging AI technologies and tools for operational improvements.
  • Maintain documentation and best practices for AI/ML integration in cloud systems.

Requirements

  • Bachelor's or equivalent experience or Master's degree in Computer Science, Data Science, or related technical field.
  • Proven ability building and deploying ML models, with at least 2 years focused on infrastructure or cloud operations.
  • Solid knowledge of hybrid cloud technologies (AWS, GCP, OpenStack, Kubernetes).
  • Experience with Python, Jupyter, and ML libraries such as PyTorch, TensorFlow, or scikit-learn.
  • Familiarity with cloud-native monitoring, logging, and automation tools (e.g., Terraform, Ansible, Prometheus, Splunk, AppDynamics).
  • Comfortable working with streaming data, APIs, and telemetry systems.
  • Strong communication and multi-functional collaboration skills.
  • Experience with Agile and DevOps operating models, including project tracking tools (e.g., Jira), Git (any Version Control systems), and CI/CD systems (e.g., GitLab, GitHub Actions, Jenkins).
  • Proficient in general-purpose programming languages (Python, GoLang, Bash and/or C/C++) and development platforms and technologies.

Nice-to-haves

  • Deep understanding of operating systems and experience with Cisco technologies (UCS, Nexus, Thousand Eyes).
  • Established record of leading technical initiatives, delivering results, and a commitment to fostering a supportive work environment.
  • Hard-working, dedicated to providing quality support for your customers.

Benefits

  • Quality medical, dental and vision insurance.
  • 401(k) plan with a Cisco matching contribution.
  • Short and long-term disability coverage.
  • Basic life insurance.
  • Numerous wellbeing offerings.
  • Up to twelve paid holidays per calendar year, including one floating holiday.
  • Vacation time off policy with flexible limits for exempt employees.
  • Sick time off policy with 80 hours provided on hire date and annually.
  • Paid time away for critical or emergency issues.
  • Additional paid time to volunteer and give back to the community.
© 2024 Teal Labs, Inc
Privacy PolicyTerms of Service