IT Operations Engineer

Diya is seeking skilled Operations Specialists to join our expanding Operations Team for our key partner in Canada — leading logistics provider.

These specialists will play a crucial role in providing operational support, embedding directly into development teams to ensure the seamless functionality, reliability, and scalability of the systems. Unlike traditional DevOps roles, this position focuses on monitoring, troubleshooting, and coordinating with development teams without requiring coding expertise.

System Monitoring and Alerting:

  • Proactively monitor system performance, availability, and reliability across Azure cloud services, applications, and APIs.
  • Set up and maintain alerts using Azure Monitor, Application Insights, and Splunk to identify anomalies in real time.

Incident Management:

  • Serve as the first point of contact for escalated issues from ServiceDesk (L1, L2) and SecurityDesk teams.
  • Collaborate with development teams to troubleshoot and resolve issues related to infrastructure, workflows, or application performance.

Operational Oversight:

  • Maintain a comprehensive understanding of the applications supported by each team.
  • Provide feedback to development teams on operational bottlenecks and recommend improvements to monitoring and workflows.
  • Collaboration with Development Teams:
  • Act as an embedded operations expert within each team, ensuring alignment between operations and development efforts.
  • Support cross-team initiatives to ensure new features and services are operationally ready.

Documentation and Reporting:

  • Maintain up-to-date documentation of systems, workflows, and incident responses.
  • Generate periodic reports on operational metrics, such as system uptime, incident resolution times, and performance trends.

Required Skills and Experience:

  • Strong expertise in monitoring and alerting tools (e.g., Splunk, Azure Monitor, Application Insights).
  • Familiarity with cloud platforms such as Azure (preferred) or AWS.
  • Experience working with microservices architectures and understanding of RESTful APIs.
  • Knowledge of incident management processes and tools such as Azure DevOps, Jira, or similar.
  • Excellent problem-solving skills and ability to diagnose and resolve system issues efficiently.
  • Strong collaboration and communication skills to work effectively with cross-functional teams.

Nice to Have

  • Exposure to messaging systems (e.g., Service Bus).
  • Familiarity with CI/CD pipelines (e.g., Azure DevOps, GitHub Actions).
  • Basic understanding of database management systems (SQL Server, or similar).
  • Comfortable work environment, remote work.
  • Competitive salary.
  • Paid vacation and sick leave.
  • Healthcare insurance and gym.
  • Training and useful experience.
  • Work in a young and friendly team.
  • Career opportunities and professional growth.

You’re sharing your CV with Diya