4

Software Engineer III - SRE- Digital Platform Services

405190-Hire Us
Full-time
On-site
Houston, Texas, United States
$383,082.75 - $728,210.86 USD yearly
Description

We have an exciting and rewarding opportunity for you to take your software engineering career to the next level.Β 


As a Software Engineer III at JPMorgan Chase within the Corporate Investment Bank, Digital Platform Services team, you serve as a seasoned member of an agile team to design and deliver trusted market-leading technology products in a secure, stable, and scalable way. You are responsible for carrying out critical technology solutions across multiple technical areas within various business functions in support of the firm’s business objectives.


Job responsibilities



  • Executes software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems

  • Implement Monitoring Solutions: Deploy Splunk, Dynatrace, and Datadog for real-time tracking of system performance and health.

  • Enforce NFRs: Define and maintain Non-Functional Requirements to ensure systems meet performance, reliability, and scalability standards.

  • Conduct PRRs: Regularly perform Performance and Reliability Reviews to identify improvement areas for system reliability.

  • Integrate Expertise: Seamlessly integrate various tools and systems via Integrate for efficient visual exchange and automation.

  • Visualize Data with Grafana: Create real-time, insightful dashboards using Grafana for monitoring key system metrics.

  • Independent Troubleshooting: Diagnose and resolve system issues independently with minimal oversight.

  • Automation and Scripting: Develop automation scripts to streamline operations and reduce manual intervention.

  • Capacity Planning: Analyze trends to forecast system demand and plan capacity accordingly.

  • Incident Management: Lead incident response efforts to minimize downtime and impact on business operations.

  • Continuous Improvement: Continuously evaluate and improve SRE practices and tools for enhanced system reliability and efficiency.


Required qualifications, capabilities, and skills



  • Formal training or certification on site reliability engineering concepts and 3+ years applied experience

  • Proficiency in Python/Java programming language.

  • Eagerness to learn and participate in learning opportunities to enhance day-to-day effectiveness

  • Experience in setting up monitoring dashboards from observability tools with Splunk, dynatrace, grafana, datadog.

  • Familiarity with containers and common server OS such as Linux and Windows.

  • Understanding of software, applications, and technical processes within disciplines like cloud and AI.

  • Experience with continuous integration tools like Jenkins, Bitbucket , or Terraform.


Preferred qualifications, capabilities, and skills



  • Knowledge of site reliability principles, practices

  • Familiarity with modern front-end technologies

  • Exposure to cloud technologies

  • Experience with cloud infrastructure maintenance, Preferably AWS .

  • Experience with network technologies.

  • Proven ability to work in a large, collaborative team environment.


#LI-ID1