Job Description:
Job title: Performance Engineer
Location: 100% remote (USA). You must currently reside in the US.
Type: 18-month contract
Pay: $45-$55/hour (commensurate with experience)
Required skill set: Proven hands-on experience (scripting and execution) with LoadRunner, JMeter, and Eggplant (Eggplant is a must!), plus experience with monitoring tools such as Dynatrace. Java programming skills are also required.
About
We are hiring for our Site Reliability Engineering (SRE) team. This team will be responsible for planning, designing, and rolling out proactive resiliency practices that protect customer journeys from disruption and avoid re-engineering costs through the early detection of existing and emerging resiliency threats. The successful candidate should have experience debugging incidents and partnering with teams to improve the reliability and performance of systems and services.
What will your job look like?
Execute benchmark, load, stress, endurance, and other non-functional performance tests.
Monitor application logs to determine system behavior. Analyze CPU utilization, memory usage, network usage, garbage collection, and database parameters and reports to verify application performance.
Generate performance graphs, session reports and other related documentation required for validation and analysis.
Identify memory leaks, connection issues, and bottlenecks in the application.
Design, develop, and execute chaos engineering experiments that uncover system vulnerabilities which could lead to outages and adversely impact the business.
Automate chaos experiments with chaos engineering tools (Chaos Monkey, Gremlin, etc.) so that they run continuously, and enhance observability through SRE dashboards that provide real-time performance monitoring and help identify issues quickly.
Collaborate with operations and application engineering teams to design and execute production game day scenarios that will help enhance emergency response processes.
Develop requirements to enhance observability of performance visuals, implement telemetry controls, and consult on self-healing capabilities for identified and prioritized failure scenarios.
Requirements
2-5 years of hands-on experience in performance testing and performance engineering
Expert knowledge of and strong experience with tools such as LoadRunner, JMeter, and Eggplant, and an understanding of APM solutions such as Grafana, AppDynamics, and Dynatrace
Experience with microservice architecture, Docker, Kubernetes, Jenkins, AWS, Azure, Jira, Confluence, and application monitoring tools
Strong expertise in monitoring and analyzing application logs, database reports, and system metrics such as CPU utilization, memory usage, network usage, garbage collection, and database parameters
Strong expertise in identifying potential performance issues and providing recommendations to improve performance
Experience with chaos engineering and related tools (Chaos Monkey, Gremlin, etc.) is an advantage
Programming experience with Java or Python is good to have
Strong technical, analytical, problem solving, and communication skills
Excellent verbal, written, interpersonal, organizational and time management skills
Ability to communicate effectively to both business and technical audiences
Ability to orchestrate and drive complex strategies and solutions