Saviynt logo

Principal Software Engineer, AI Platform Engineering

SaviyntEl Segundo, CA

$240,000 - $260,000 / year

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.1

Reclaim your time by letting our AI handle the grunt work of job searching.

We continuously scan millions of openings to find your top matches.

pay-wall

Overview

Schedule
Full-time
Career level
Senior-level
Remote
On-site
Compensation
$240,000-$260,000/year
Benefits
Paid Vacation
Career Development

Job Description

ABOUT SAVIYNT

Saviynt is a leader in identity security, delivering an AI-powered platform that governs and secures access to applications, data, and business processes for global enterprises and government institutions. Built for the AI era, Saviynt helps organizations move faster - securely and compliantly.

ABOUT THE ROLE

You set the architectural direction for how training data flows, evolves, and is governed across the AI Platform. You define the standards ML engineers and scientists build on, and ensure every training signal is tenant-isolated, PII-free, and traceable from source to model.

WHAT YOU'LL OWN

  • AI Data Lake on GCS: bucket layout, raw → silver → gold tier separation, CMEK encryption, lifecycle rules

  • Batch pipelines: Spark on Dataproc for TB-scale feature backfills, Iceberg compaction, and daily S3→GCS incremental sync

  • Streaming pipelines: Apache Beam on Dataflow for sub-5-min CDC ingestion with exactly-once semantics and PII assertion gates

  • Schema registry: Avro / Protobuf schema versioning, compatibility modes, and migration playbooks for safe schema evolution

  • Orchestration: Flyte as primary DAG layer - task authoring standards, domain isolation, retry policies, DataCatalog memoization; evaluate Kubeflow Pipelines where relevant

  • Multi-tenancy: strict per-tenant GCS prefix isolation, quota policies, and cross-tenant contamination validation

  • Data Anonymizer and Data Labeler microservices: strip PII and attach ML labels before signals leave each customer environment

  • Feature store: Feast offline (GCS Parquet) and online (Redis) with point-in-time correctness and

  • Vector database: operate Pgvector (Cloud SQL) for POC and Qdrant on GKE for production-scale embedding storage; design index strategies (IVFFlat, HNSW) and manage ANN query latency SLAs

  • RAG data pipeline: build embedding generation pipelines that chunk, encode, and upsert document embeddings into the vector store; own the data refresh cadence and staleness SLAs for retrieval context

  • Service APIs: expose data platform services (feature serving, embedding upsert, schema validation) over HTTPS with mTLS and gRPC where low-latency streaming is required

  • Synthetic data pipelines for dev/staging where real customer data is not permitted

  • Data quality gates: Great Expectations / dbt checks as Flyte tasks, blocking on schema and PII-absence failures

YOU'LL THRIVE HERE IF YOU HAVE

  • 8+ years of data engineering at production scale across multiple companies

  • Demonstrated principal impact: platform standards you defined adopted org-wide, or major cross-team pipeline/schema migrations you led

  • Data lake ownership (essential): you have designed and operated a production data lake end-to-end - storage layout, partitioning strategy, tiered retention (hot/warm/cold), table format (Iceberg or Delta Lake), compaction, and access control; not just consumed one

  • Deep Spark (PySpark / Scala): executor tuning, shuffle diagnosis, Iceberg table maintenance

  • Hands-on Beam / Dataflow: windowing, exactly-once, side inputs, autoscaling

  • Schema registry experience: Protobuf / Avro compatibility rules, breaking-change migrations in production

  • Orchestration at scale: Flyte, Kubeflow Pipelines, Airflow, or Prefect - operated in production, ideally benchmarked two

  • Multi-tenant data architecture: per-tenant isolation as a hard requirement, not a post-hoc concern

  • Feature store operations: Feast or Tecton, point-in-time joins, online/offline consistency

  • Vector databases: Pgvector or Qdrant in production - index tuning, ANN search, embedding upsert pipelines

  • RAG data fundamentals: chunking strategies, embedding model selection, retrieval quality evaluation, and context freshness management

  • API transport: gRPC and HTTPS/mTLS for service-to-service communication; comfortable defining proto contracts and managing certificate lifecycle

  • Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience or equivalent military experience

NICE TO HAVE

  • Differential privacy or k-anonymity for ML training datasets

  • Open source contributions: Feast, Great Expectations, Apache Beam, or dbt

  • Familiarity with IAM / access governance data: entitlements, provisioning events, access graphs

  • Iceberg or Delta Lake at petabyte scale

WHY JOIN SAVIYNT

  • Work on a large-scale, Kubernetes-based SaaS platform

  • Solve challenging cloud and reliability problems at scale

  • Collaborate with strong engineers in a reliability-focused culture

  • Competitive compensation, benefits, and growth opportunities

SECURITY & COMPLIANCE

This role requires adherence to Saviynt's information security and privacy policies, including annual security training.

$240,000 - $260,000 a year

We offer you a competitive total rewards package, learning and tremendous opportunities to grow and advance in your career. At Saviynt, it is not typical for an individual to be hired at or near the top of the range for their role and final compensation decisions are dependent on many factors including but are not limited to location; skill sets; experience and training; licensure and certifications; and other relevant business and organizational needs. A reasonable estimate of the current range is $240,000 - $260,000 annually.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.

pay-wall

FAQs About Principal Software Engineer, AI Platform Engineering Jobs at Saviynt

What is the work location for this position at Saviynt?
This job at Saviynt is located in El Segundo, CA, according to the details provided by the employer. Some roles may also include multiple work locations depending on the requirement.
What pay range can candidates expect for this role at Saviynt?
Candidates can expect a pay range of $240,000 and $260,000 per year.
What employment applies to this position at Saviynt?
Saviynt lists this role as a Full-time position.
What experience level is required for this role at Saviynt?
Saviynt is looking for a candidate with "Senior-level" experience level.
What benefits are offered by Saviynt for this role?
Saviynt offers following benefits: Paid Vacation and Career Development for this position. Actual benefits may vary depending on the employer's policies and employment terms.
What is the process to apply for this position at Saviynt?
You can apply for this role at Saviynt either through Sonara's automated application system, which helps you submit applications 10X faster with minimal effort, or by applying manually using the direct link on the job page.