Jobs / Ama***

Sr GenAI Infra Specialist SA, AWS WWSO Startup

Ama*** · New York, NY, United States
Visa sponsorship details are locked. Unlock company name and apply link with .
New York, NY, United States153,600-207,800 USD/yearlyRemote
Remuneration
153,600-207,800 USD/yearly
Location
New York, NY, United States
Visa sponsorship
Sponsors visa

Job summary

DESCRIPTION Do you want to help define the future of technology on AWS Generative AI as part of the Specialist Solutions Architect team in the Go-To-Market (GTM) Startup team? Are you passionate about AI infrastructure and helping customers understand the complexities of training and serving large-scale models?

Benefits

Of AWS infrastructure to ML engineers, platform engineers, and C-Level executiveAbout the teamAbout AWSDiverse ExperiencesAWS values diverse experiences.Even if you do not meet all of the preferredLearn more about ourAt https://amazon.jobs/en/USA, NY, New York - 169,000.00 - 228,600.00 USD annuallyUSA, VA, Herndon - 153,600.00 - 207,800.00 USD annually

Qualifications

  • and complexity (GPU, Trainium, networking), while also providing deep expertise in optimization of models and techniques for both inference serving and distributed training at scale.
  • AWS Specialist Solutions Architects (SSAs) are technologists with deep domain-specific expertise, able to address advanced concepts and feature designs.
  • As part of the AWS sales organization, SSAs work with customers who have complex challenges that require expert-level knowledge to solve.
  • SSAs craft scalable, flexible, and resilient technical architectures that address those challenges.
  • Key job
  • and trade-offs including GPU/Trainium selection, cluster topology, storage, networking (EFA), and cost optimization for training and inference
  • Provide deep technical guidance on training optimization distributed training strategies, framework selection (PyTorch, JAX, NeMo), SageMaker HyperPod, Slurm/PCS integration, checkpointing, and data pipeline design
  • Guide customers on GPU and accelerator profiling identifying bottlenecks (compute, memory, I/O), optimizing utilization, and tuning system-level performance
  • Help customers understand and apply model optimization techniques fine-tuning approaches (LoRA, QLoRA, full fine-tuning), RLHF/DPO, knowledge distillation, and efficient serving techniques (vLLM, TensorRT-LLM, Triton)
  • Develop demos, proof-of-concepts, reference architectures, and benchmarks that demonstrate AWS infrastructure value proposition for GenAI workloads
  • Collaborate with product teams (EC2, Trainium/Inferentia, SageMaker, EKS, PCS, EC2) to shape product vision, prioritize features, and represent the voice of the customer
  • Work with account teams, research scientists, ISVs, framework communities, and model providers to drive implementations and accelerate innovation

Responsibilities

  • You will be part of the core Specialist Organization focused on Startup Customers GenAI and Go-to-Market (GTM) team, focused on AI infrastructure for model training and inference optimization.
  • This role sits at the intersection of AI infrastructure architecture and model optimization — you will help customers understand hardware
  • Advise customers on AI infrastructure

Skills

CommunicationLeadership

Degrees

Associate

Industry

EnergyInsurance

Company size

EnterpriseSmbStartup