Senior AI Agent & Evaluations Engineer

Vacatia, Portland, OR

Job Description

Join Vacatia and Help Build the Future of AI-Powered Vacation Ownership

Location: Portland, OR (Hybrid – Three Days In Office). Remote considered for exceptional candidates.

About Vacatia

Vacatia is building the future of vacation ownership. We operate in a fragmented, operationally complex industry where AI has the potential to fundamentally transform how decisions are made, how customers are supported, and how businesses scale.

We're developing AI agents that sit at the center of critical business workflows—helping owners, supporting operations, surfacing insights, and automating decisions that historically required significant human effort. These agents interact with real customers and influence real business outcomes, making reliability, safety, and performance essential.

We're looking for a hands-on Senior AI Agent & Evals Engineer to own the intelligence layer behind these systems. You'll be responsible for designing agent behavior, building evaluation frameworks, creating guardrails, and continuously improving agent performance as our AI footprint expands across the organization.

If you're passionate about prompt engineering, agent reliability, and creating measurable AI systems that solve meaningful business problems, we'd love to meet you.

Why You'll Love Working at Vacatia

Build the Future of Applied AI: Design and improve AI agents that directly impact customer experiences, operational efficiency, and business outcomes across our organization.

Work on Problems That Matter: Your work will influence real-world decisions involving customer communications, mortgage outcomes, rental operations, and owner experiences.

Own the Intelligence Layer: Take full ownership of prompt design, agent behavior, evaluation systems, guardrails, and continuous performance improvement.

Measure What Matters: Build sophisticated evaluation frameworks, golden datasets, and automated scoring systems that ensure our agents continually improve.

Partner Across the Business: Collaborate closely with engineers, operators, and subject matter experts to transform business knowledge into scalable AI systems.

Join a Small Team with Outsized Impact: Work alongside experienced engineers and leaders who believe AI can create meaningful competitive advantages in a traditionally underserved industry.

Your Impact

Design, refine, and optimize prompts, tool definitions, routing logic, and decision-making behavior across Vacatia's AI agent ecosystem
Build and maintain evaluation frameworks, golden datasets, grading systems, and regression testing pipelines that measure agent quality and reliability
Develop guardrails and safe-failure mechanisms that ensure agents operate responsibly in customer-facing and financially sensitive workflows
Monitor production performance, investigate failures, identify edge cases, and continuously improve agent outcomes through data-driven iteration
Partner with business stakeholders to translate policies, operational requirements, and domain expertise into measurable agent behavior
Collaborate with engineering teams to define context requirements, tool contracts, and integration specifications that support agent success
Create scalable frameworks and reusable patterns for deploying AI agents across new business workflows and use cases
Establish best practices for prompt engineering, evaluation methodologies, observability, and agent operations

What You Bring

Proven experience shipping and owning production AI agents or LLM-powered systems beyond proof-of-concept environments
Deep expertise in prompt engineering, including system prompts, tool usage, context management, output constraints, and agent behavior design
Hands-on experience building evaluation frameworks using golden datasets, scoring rubrics, LLM-as-judge methodologies, and regression testing
Strong familiarity with modern AI development tools such as Claude Code, Codex, or similar coding agents
Experience with agent observability and evaluation platforms such as LangSmith, Langfuse, Arize, Galileo, or comparable solutions
Ability to distinguish prompt issues from data, tooling, model, or evaluation failures and systematically improve agent performance
Strong written and verbal communication skills with the ability to work effectively across engineering and business teams
Demonstrated ownership mindset with a passion for building reliable, measurable, and continuously improving AI systems

Strongly Preferred

Experience building agents that process communication-based workflows including emails, support tickets, chat interactions, or transcripts
Experience with multiple agent frameworks and a practical understanding of their tradeoffs
Familiarity with the evolving LLM landscape and model selection strategies
Experience designing and implementing end-to-end evaluation pipelines and agent operations workflows
Production experience with online evaluation systems and automated scoring of live traffic

Nice to Have

Experience integrating AI systems with Salesforce, AWS Connect, or customer engagement platforms
Background in customer-facing industries where accuracy, compliance, and communication quality are critical
Contributions to open-source projects, technical writing, or public thought leadership in AI, prompt engineering, or agent development

Join Us

Join us at the forefront of applied AI innovation. If you're excited about building intelligent systems that solve complex business problems, improving agent behavior through rigorous evaluation, and helping shape the future of vacation ownership, we'd love to hear from you.

At Vacatia, you'll have the opportunity to build AI solutions that matter, work alongside talented teammates, and create technology that drives real business impact.

FAQs About Senior AI Agent & Evaluations Engineer Jobs at Vacatia

What is the work location for this position at Vacatia?

This job at Vacatia is located in Portland, OR, according to the details provided by the employer. The posting describes it as a hybrid role with three days in the office and notes that remote work is considered for exceptional candidates.

What pay range can candidates expect for this role at Vacatia?

The employer has not shared pay details for this role.

What employment type applies to this position at Vacatia?

The employer has not provided this information. This may be discussed during the hiring process.
