Job Summary
We are seeking a Senior Platform Engineer to design, build, and maintain enterprise software platforms that support critical business and technology operations.
This role involves configuring and customizing platform environments, developing workflows and integrations, and ensuring platform stability, performance, and security.
You will collaborate with cross-functional teams to gather requirements, architect scalable solutions, automate processes, and troubleshoot technical issues.
Your contributions will help streamline organizational operations, improve service delivery, and enhance productivity across the enterprise.
Key Responsibilities
- Develop monitoring solutions for IT operations systems, infrastructure, and applications across cloud and on-premises environments.
- Conduct business and technical analysis and participate in architectural discussions with stakeholders.
- Maintain and enhance CI/CD processes and pipelines.
- Enable automated monitoring and alerting capabilities across systems.
- Build ingest pipelines, visualizations, and dashboards for structured and unstructured data.
- Develop triggered alerts for on-screen notifications and send events to ITSM and event management platforms.
- Manage day-to-day request and incident tickets as needed.
- Collaborate with stakeholders to gather requirements, design solutions, and ensure platform scalability, resiliency, and efficiency.
- Create system guidelines, documentation, and training materials.
- Evaluate and respond to emerging requirements and evolving technologies.
- Manage IT and business unit projects, including acquisitions, divestitures, and migrations.
Required Qualifications
- Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent experience.
- 5–7 years of related work experience.
- Ability to design and enforce monitoring standards with tools such as AppDynamics, Elastic Stack, CloudWatch, and Site24x7.
- Experience engineering and scaling distributed telemetry pipelines (Elastic ingestion, data normalization, dashboards).
- Expertise in alert normalization, enrichment, and correlation patterns.
- Experience with Open Integration Hub, webhook integrations, and API-based event ingestion.
- Understanding of BigPanda incident lifecycle and automated routing to ServiceNow.
- Strong understanding of logs, metrics, traces, and observability practices (APM, RUM, synthetic monitoring).
- Ability to configure and tune AI-driven workflows (incident analysis, similarity scoring, change risk scoring).
- Familiarity with vector database concepts, enrichment pipelines, and generative AI guardrails.
- Knowledge of SSO, OAuth, API Gateway, and secure data flows.
- Strong AWS experience (Lambda, S3, API Gateway, CloudWatch, IAM).
- Ability to analyze telemetry and proactively identify patterns or anomalies.
- Proficiency in AI prompt engineering and experience working with large language models.
- Proven experience as a Platform Engineer or similar role (M365, AWS, Azure).
- Strong understanding of cloud technologies, DevOps processes, and automation.
- Experience with CI/CD tools such as Jenkins or Azure Pipelines.
- Hands-on experience with scripting and automation tools (PowerShell, Graph API, etc.).
Preferred Qualifications
- Experience with BigPanda and Biggy AI.
- Expertise with Elastic and its extended capabilities.
- Familiarity with monitoring/logging tools that integrate with ServiceNow.
- Knowledge of platform engineering security best practices.
- Cloud platform certifications (GCP, AWS, Azure, M365).
- Ability to mentor team members.
Certifications
- Certifications in GCP, AWS, Azure, or M365 (preferred).
Working Conditions
- On-call support may be required.
- Hybrid/office work environment.
- Minimal travel required.