Responsible for the operations, monitoring, and management of the Splunk infrastructure and services
Investigate, diagnose, and remediate NOC incidents
Manage NOC incidents lifecycle in ServiceNow
Lead incident triage efforts in collaboration with development teams
Develop, enhance, and maintain the NOC playbooks
Responsible for the continuous Improvement of application monitoring and process automation
Collect Evidence for compliance audits
Assist in SOC investigations if needed
Proactive and self-motivated with a keen sense of ownership and accountability.
Overseeing and resolving infrastructure, application, and database issues in a large-scale AWS environment.
Technical excellence. Use continuous delivery, testing, and security standard methodologies.
Operational excellence. Make decisions based on numbers rather than assumptions. If an issue arises, you strive to be alerted before our customers notice.
Keeping calm and carrying on. Capable of brainstorming product outages, skilled in identifying performance bottlenecks, spotting anomalous system behavior, and determining root cause of incidents.
Commit to automation. Passionately embrace and master modern technologies to help automate routine tasks and free up time for innovation. You will be working with a variety of languages used in systems programming like Go, Python, Terraform etc.
Must-Have Qualifications
Experience in operational roles within Network Operations Center (NOC) or a Security Operations Center (SOC)
Experience with Splunk deployment, configuration, operations, and troubleshooting (infrastructure and services)
Experience developing Splunk dashboards
Experience working with ServiceNow incidents, vulnerability management and change management
Experience creating ServiceNow dashboards
Experience with infrastructure as code tools (Terraform, Cloud Formation or other)
Experience deploying production cloud networking and infrastructure solutions while adhering to industry-standard DevOps principles.
Experience handling SaaS and/or On-prem applications for a large customer base.
Experience with one or more of the public cloud providers e.g., AWS, Azure or GCP, preferably AWS
Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes).
Experience with configuration management tools (e.g., Ansible, Puppet, Chef).
Familiarity with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI/CD).
5+ years of relevant industry experience with bachelor’s degree in computer science, computer engineering, or equivalent work experience.
Knowledge of Linux and bash scripting.
Good to Have:
Experience working within federal environments such as FedRAMP and DoD IL5