Systems Development Engineer III, Annapurna Labs Infrastructure
Company: Annapurna Labs (U.S.) Inc.
Location: Pflugerville
Posted on: April 17, 2024
|
|
Job Description:
Annapurna Labs, our organization within AWS, is responsible for
building innovation in silicon and software for AWS customers. With
development centers in the U.S. and Israel, Annapurna is at the
forefront of innovation by combining cloud scale with the world's
most talented engineers. Our team covers multiple disciplines
including silicon engineering, hardware design and verification,
software, and operations. Because of our teams' breadth of talent,
we've been able to improve AWS cloud infrastructure in networking
and security with products such as AWS Nitro, Enhanced Network
Adapter (ENA), and Elastic Fabric Adapter (EFA), in compute with
AWS Graviton and F1 EC2 Instances, in machine learning with AWS
Neuron, Inferentia and Trainium ML Accelerators, and in storage
with scalable NVMe.As part of Annapurna Labs team, you'll have the
opportunity to invent the next generation of cloud computing
infrastructure. You'll experience what it's like to work in a
fast-paced, innovative, and start-up like environment filled with
some of the brightest minds in the industry. The work we do is not
only cutting-edge and internet-scale but also deeply important to
our customers. We design and build every component of our hardware
and software to come together into products that our customers use
for accelerated computing: either Machine Learning acceleration, or
FPGA acceleration. We get our hands dirty, from creating our own
silicon, ensuring our hardware is functional and healthy, and
managing the full lifecycle of our systems at the huge scale and
complexity of AWS. If you want a career that makes an impact,
allows you to invent, and have first-hand visibility into how your
implementations delight customers, then we have a role for you. If
you're interested in being on a team that is "building a complete
product" from inception to delighted customers, Annapurna is a
fantastic choice.Join us in creating the most advanced Machine
Learning Accelerators in the world!Key job responsibilitiesAs a
technical leader of the Cloud-Scale Machine Learning Acceleration
Infrastructure team you'll be responsible for architecting and
leading development of the infrastructure used by our engineering
teams. Our customers, the engineering teams, building
hardware/software running in our data centers which are custom
designed machine learning products: AWS Inferentia2 and
Trainium.You will need to lead across teams to develop and execute
in-depth infrastructure development plans that enables the
engineering development of the Machine Learning Acceleration
product family. You will dive deep to solve critical infrastructure
issues involving networking, high performance compute clusters,
infrastructure automation of hardware/software/firmware testing,
and ASIC/EDA development. You will execute and scale the next
generation of cloud infrastructure based on cloud frameworks and
AWS services. You will own design reviews for infrastructure
development and partner with AWS service teams and vendors. You
will influence within your team, your customers and AWS service
teams to help drive and develop the technical implementation for
overall system designs. You will identify and implement process
improvements which improve your team's agility and operations,
including improvements to design, automation, development, test or
operations. You will define new mechanisms that execute system
health monitoring, diagnostics, repair, and automation. You will
develop, document and update operational runbooks as you
participate in on-call rotations. A day in the lifeEach day you
will work with the best engineers in the industry to develop
Machine Learning Accelerators. On-site in Austin, Texas, you will
be apart of the team that develops custom silicon and you will own
the infrastructure that enables this innovation. Take a look inside
our labs to see what you will learn at Annapurna Labs:
https://www.aboutamazon.com/news/aws/take-a-look-inside-the-lab-where-aws-makes-custom-chipshttps://youtu.be/rViVFrQg4HkWe
are open to hiring candidates to work out of one of the following
locations:Austin, TX, USA
BASIC QUALIFICATIONS- 5+ years of programming with at least one
modern language such as C++, C#, Java, Python, Golang, PowerShell,
Ruby experience- 3+ years of non-internship professional software
development experience- 5+ years of designing or architecting
(design patterns, reliability and scaling) of new and existing
systems experience- 5+ years of deploying and operating in a
Linux/Unix environment experience- 3+ years of systems design,
software development, operations, automation, and process
improvement experience- Experience leading the design, build and
deployment of complex and performant (reliable and scalable)
software solutions in production- 3+ years of systems development
in an IT or data center environment experience- Experience with
debugging complex issues with HW/SW, networking and storage
systems- Experience with operations of large scale infrastructure
deployments including improving operational excellence
PREFERRED QUALIFICATIONS- Knowledge of engineering practices and
patterns for the full software/hardware/networks development life
cycle, including coding standards, code reviews, source control
management, build processes, testing, certification, and livesite
operations- Experience taking a leading role in building complex
software or computing infrastructure that has been successfully
delivered to customers- Experience writing technical documents,
project plans and progress reports to leadership and to
stakeholders- Experience with AWS Cloud Infrastructure deployments
using CDK- Experience with IT security
software/tools/standardsAmazon is committed to a diverse and
inclusive workplace. Amazon is an equal opportunity employer and
does not discriminate on the basis of race, national origin,
gender, gender identity, sexual orientation, protected veteran
status, disability, age, or other legally protected status. For
individuals with disabilities who would like to request an
accommodation, please visit
https://www.amazon.jobs/en/disability/us.
Keywords: Annapurna Labs (U.S.) Inc., Bryan , Systems Development Engineer III, Annapurna Labs Infrastructure, Healthcare , Pflugerville, Texas
Click
here to apply!
|