Where is this Machine Learning Research Engineer job located?

This position is located in Seattle.

Machine Learning Research Engineer

Nuance Labs

Seattle | Full-time | Mid Level | On-site

Job Description

Responsibilities

Operationalize Research: Collaborate with researchers to move models from experimental checkpoints to production-ready systems. Establish patterns for large-scale training, rapid experimentation, and deployment of new architectures.

Optimize Model Performance: Profile and improve model inference for latency and throughput using quantization, pruning, distillation, and architectural refinements to ensure viable unit economics.

Model Acceleration: Apply optimization techniques (TensorRT, ONNX, vLLM) to accelerate multimodal models including video diffusion, LLMs, and speech models.

Design Data Pipelines: Design and implement efficient pipelines for video data ingestion, preprocessing, and training at petabyte scale using tools like Dagster and Ray.

Evaluate and Iterate: Build evaluation frameworks to measure model quality, establish benchmarks, and guide continuous improvement of model capabilities.

Requirements

Production ML: Experience deploying ML models to production. Understanding of common failure modes and how to address them (resource contention, OOMs, batch optimization).

Deep Learning Experience: Strong knowledge of PyTorch and modern ML architectures. Experience training and optimizing large models (transformers, diffusion models, or similar).

Systems Proficiency: Comfortable working with GPUs, debugging CUDA issues, and profiling model workloads to identify compute or memory bottlenecks.

Data Engineering: Experience building scalable data pipelines for high-bandwidth media processing and training workflows.

Preferred Experience

Experience with video or audio models in research or production settings.

Familiarity with low-level optimization (CUDA kernels, Triton, custom operators).

Knowledge of real-time ML systems and latency-critical inference.

Prior work with model compression techniques (quantization, distillation, pruning).

Posted 3 weeks ago
