hireejobs
Hyderabad Jobs
Banglore Jobs
Chennai Jobs
Delhi Jobs
Ahmedabad Jobs
Mumbai Jobs
Pune Jobs
Vijayawada Jobs
Gurgaon Jobs
Noida Jobs
Oil & Gas Jobs
Banking Jobs
Construction Jobs
Top Management Jobs
IT - Software Jobs
Medical Healthcare Jobs
Purchase / Logistics Jobs
Sales
Ajax Jobs
Designing Jobs
ASP .NET Jobs
Java Jobs
MySQL Jobs
Sap hr Jobs
Software Testing Jobs
Html Jobs
IT Jobs
Logistics Jobs
Customer Service Jobs
Airport Jobs
Banking Jobs
Driver Jobs
Part Time Jobs
Civil Engineering Jobs
Accountant Jobs
Safety Officer Jobs
Nursing Jobs
Civil Engineering Jobs
Hospitality Jobs
Part Time Jobs
Security Jobs
Finance Jobs
Marketing Jobs
Shipping Jobs
Real Estate Jobs
Telecom Jobs

Associate Site Reliability Engineer mUI/Ca_553

5.00 to 10.00 Years   Hyderabad   22 Apr, 2022
Job LocationHyderabad
EducationNot Mentioned
SalaryNot Disclosed
IndustryBanking / Financial Services
Functional AreaNetwork / System Administration
EmploymentTypeFull-time

Job Description

    As a JPMorgan Chase & Co. Site Reliability Engineer (SRE), you will combine software and systems to help us build a world-class engineering function. Working with your team, you ll focus on improving our production applications and systems to creatively solve operations problems. Much of our support and software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.Our culture of diversity, intellectual curiosity and problem solving is essential to our success. We bring people together with a wide variety of backgrounds, experiences and perspectives. We support teamwork, thinking big and taking risks in a blame-free environment. We promote self-direction to work on relevant projects, while building an environment that provides the support and mentorship needed to learn and grow. We are excited to see what you will bring to our team.This role requires a wide variety of strengths and capabilities, including:
    • Strong understanding of SRE concepts e.g., SLO, SLIs & Error Budget
    • Background as a software developer with experience in cloud native, distributed application design and implementation
    • Deep understanding of multi-platform environments and the ability to assess, design and evaluate large systems.
    • Strong communication skill, ownership, debugging and trouble shooting skills
    • Significant experience building and/or configuring metrics, distributed tracing, and logging systems, preferably using CNCF technologies and standards
    • Proficient in one or more technology domains, may be a cross-domain expert, able to solve complex and mission critical problems within a business or across the firm
    • Working knowledge of infrastructure components such as routers, load balancers, cloud products, container systems, compute, storage and networks
    Responsibilities:
    • Responsible for how code is deployed, configured, and monitored, as well as the availability, latency, change management, emergency response, and capacity management of services already in / going to production.
    • Design, code, test and deliver software to automate manual operational work, develop self-service, auto-detection and healing
    • Develop software for reliability and scale, ensuring minimal refactoring or changes
    • Define, monitor and defend SLOs
    • Deploying closed-loop remediation continuous testing and remediation to fix problems in pre-production before software is released to production.
    • Build custom tooling from scratch to meet specific needs in the incident management workflow.
    • Complex incident resolution across public cloud, private cloud, 3rd party, and on-premise tech.
    • Leverage Chaos Engineering to find and prevent future problems and to confirm fixes from past incidents function as intended.
    • Focus on end-user experiences and partner with development teams to implement changes to increase uptime and performance based on empirical evidence.
    • Troubleshoot priority incidents, facilitate blameless post-incident evaluations and ensure permanent closure of incidents
    • Identify application patterns and analytics in support of better service level objectives
    • Design performance tests, identify bottlenecks and opportunities for optimization and capacity demands, and present solutions for continuous improvements
    • Design best in class monitoring frameworks to accomplish end-to-end flow monitoring and noiseless alerting
    • Design automated software and product upgrades, change management and release management solutions
    Skills/Qualifications
    • Bachelor s degree or equivalent experience in a software engineering discipline
    • 5+ years of SRE or System Engineering experience.
    • Expert in at least one technology stack designing, coding, testing, delivering software e.g., Java, Python, C++, Go, etc.
    • Deep knowledge of Internet protocols and web services technologies e.g., HTTP, DNS, TCP/UDP, SOAP, JSON, Apache, Tomcat and REST
    • Experience working with containers e.g., Docker, Kubernetes, Cloud Foundry, etc.
    • Experience in working with automation tools e.g., Ansible, Puppet, Selenium etc.
    • In-Depth OS Experience e.g., RHEL, Ubuntu, Windows Server with strong debugging, troubleshooting, and problem-solving skills
    • Testing and build automation with a continuous integration/continuous delivery (CI/CD) pipeline e.g., Travis CI, Maven, Gradle, Groovy, Git, Terraform, Jenkins etc.
    • Experience deploying and managing services on modern platforms e.g., AWS, GCP, Azure.
    • Strong experience in using industry standard monitoring tools e.g., AppDynamics, Dynatrace, APICA, Splunk, ELK, FluentD, Prometheus, Kibana, Elasticsearch, Grafana, Nagios, Datadog, New Relic, etc.
    • Advanced understanding of application monitoring stack (Logs, Events Metrics & Alerts) and ability to visualize and setup end-to-end observability
    • Certified in one or more cloud technology e.g., AWS, Azure, GCP or RedHat is a big plus
    ,

Keyskills :
javaacademicsacpalgorithmsandroidnew relicweb servicesthinking bigprivate cloudcloud foundryservice levelwindows serverproblem solvingflow monitoring

Associate Site Reliability Engineer mUI/Ca_553 Related Jobs

© 2019 Hireejobs All Rights Reserved