Johnson & Johnson logo

Sr Pr Eng Data Engineering

Johnson & JohnsonCambridge, MA

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.1

Reclaim your time by letting our AI handle the grunt work of job searching.

We continuously scan millions of openings to find your top matches.

pay-wall

Job Description

At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow, and profoundly impact health for humanity. Learn more at https://www.jnj.com

Job Function:

Data Analytics & Computational Sciences

Job Sub Function:

Data Engineering

Job Category:

Scientific/Technology

All Job Posting Locations:

Cambridge, Massachusetts, United States of America, Spring House, Pennsylvania, United States of America

Job Description:

Data Lake Engineer and Solution Architect, R&D Therapeutics Discovery

At Johnson & Johnson, we believe health is everything. Our strength in healthcare innovation empowers us to build a world where complex diseases are prevented, treated, and cured, where treatments are smarter and less invasive, and solutions are personal. Through our expertise in Innovative Medicine and MedTech, we are uniquely positioned to innovate across the full spectrum of healthcare solutions today to deliver the breakthroughs of tomorrow and profoundly impact health for humanity. Learn more at https://www.jnj.com.

About Innovative Medicine

Our expertise in Innovative Medicine is informed and inspired by patients, whose insights fuel our science-based advancements. Visionaries like you work on teams that save lives by developing the medicines of tomorrow.

Join us in developing treatments, finding cures, and pioneering the path from lab to life while championing patients every step of the way.

Learn more at https://www.jnj.com/innovative-medicine

We are searching for the best talent for Data Lake Engineer and Solution Architect, R&D Therapeutics Discovery in Spring House, PA or Beerse, Belgium.

The Data Lake Engineer and Solution Architect is responsible for designing, optimizing, and operationalizing the data lake to serve high-dimensional biology teams, including High-Content Imaging, High-Throughput Transcriptomics, High-Throughput Proteomics among others. The candidate will optimize data models for high-dimensional biology data teams, make high-dimensional data AI/ML ready, tune storage and query performance for large-scale combined analyses across high-dimensional modalities, and deliver a standardized API for programmatic access.

What You'll Do

  • Design scalable data models and optimize schemas for high-dimensional biological data.

  • Architect and tune data lakes for performance and cost efficiency.

  • Develop standardized APIs and SDKs for secure, streamlined data access.

  • Collaborate with scientific teams and vendors to deliver platform capabilities.

  • Maintain documentation and train users on best practices.

  • Implement governance, security, and compliance frameworks.

What We're Looking For

  • Degree in Computer Science, Data Engineering, Bioinformatics, or related field; advanced degree (MS/PhD) preferred.

  • 7+ years in data/platform engineering, including 3+ years with data lakes.

  • Experience with biological data (omics, imaging) and analytic workflows.

  • Hands-on expertise with Snowflake, SQL at scale, and cloud platforms.

  • Strong programming and scripting skills (Python, SQL), and pipeline orchestration tools.

  • Proven ability to design APIs and communicate technical trade-offs effectively.

Core Expertise

  • Data modeling and schema optimization.

  • Performance tuning for data lakes and queries.

  • API development and secure data access.

  • Governance, lineage, and metadata management.

  • Cloud-based data platforms and orchestration tools.

  • Programming in Python and SQL.

Preferred Qualifications

  • Familiarity with ML infrastructure and feature stores.

  • Advanced Snowflake optimization and cost-control strategies.

  • Knowledge of data catalog tools and metadata standards.

  • Experience with containerization and CI/CD for data pipelines.

  • Background in omics or high-dimensional imaging pipelines.

Required Skills:

Preferred Skills:

Advanced Analytics, Agility Jumps, Consulting, Continuous Improvement, Critical Thinking, Data Engineering, Data Governance, Data Modeling, Data Privacy Standards, Data Science, Execution Focus, Hybrid Clouds, Mentorship, Tactical Planning, Technical Credibility, Technical Development, Technical Writing, Technologically Savvy

The anticipated base pay range for this position is :

.

Additional Description for Pay Transparency:

.

Automate your job search with Sonara.

Submit 10x as many applications with less effort than one manual application.

pay-wall