💎 Get Full Access to 4,000+ Remote Jobs • Starting at $4/week→ View Plans

Data Scientist - AI Response Monitoring

Datagrid

Posted 11/24/2025Senior Level

Full-time

Technology

Data Science

Machine Learning

AI Evaluation

Production ML Systems

Quality Metrics

⭐ Join thousands of remote professionals with full access • From $4/week

Job Description

About Datagrid Forget everything you know about AI assistants. At Datagrid, we’re building AI agents that actually do the work. We’re a team of passionate, hard-working builders, thinkers, and problem-solvers who are genuinely excited about what we do. Our mission is to supercharge the workday by turning complex data and tedious workflows into simple, automated actions. It’s an incredibly exciting time to join us—we’re growing fast, expanding our platform’s capabilities, and partnering with enterprise customers who want to 10x their teams’ output. We thrive on collaboration and are looking for people who are ready to make a tangible impact. If you want to be part of a team that’s not just talking about the future of AI but actively creating it, you’ve come to the right place.

Our Values At Datagrid, our values guide how we work, build, and grow together. Act with Purpose: Everything we do is tied to our mission. You’ll see the impact of your work as we move quickly to solve meaningful problems for our customers. Own the Outcome: We believe in true ownership. You’ll take responsibility for your projects and see them through to success—empowered to make decisions that drive real results. Clarity without Ego: We value honesty, transparency, and trust. You can expect and provide direct feedback in an environment where candor sharpens our ideas and strengthens our team. Creativity with Purpose: Innovation is central to our culture. Your creative thinking will be valued and directed toward solving real-world challenges and creating lasting impact.

About the role:

This is an opportunity to build the core infrastructure that powers the next generation of AI agents. Our agents must ingest, enrich, and vectorize massive, multi-modal dataset from over 100 customer sources. The core challenge is twofold: how do we do this in a way that is radically cost-efficient, while still allowing Agents to deliver thorough responses in seconds. As a Data Scientist, you will be a key player in solving this problem. You will be responsible for building, managing, and optimizing the entire lifecycle of our agent platform, from the large-scale data ingestion pipelines to the core systems that allow agents to reason and act. You will help build the engine that allows our agents to be not just smart, but also incredibly fast and secure for thousands of enterprise users.

What you’ll do:

As a Data Scientist focused on AI Agent Monitoring & Evaluation, you will be the guardian of quality for our AI agents that serve customers across 100+ integrated systems. You will build the infrastructure that ensures our agents deliver accurate, reliable, and safe responses before our customers experience issues. This role sits at the critical intersection of data science, customer empathy, and production system reliability—where your evaluations directly impact the trust customers place in our AI-powered platform. You'll take ownership of sophisticated evaluation frameworks incorporating LLM-as-a-judge methodologies, create real-time monitoring dashboards + alerts that surface quality degradation before it impacts users, and leverage multi-modal analysis to ensure our visual language models perform flawlessly. This is an opportunity to define what "agent quality" means at scale while working with cutting-edge observability tools like Arize Phoenix and vector databases like Milvus.

What you'll have:

Required Qualifications 3–7+ years of experience in data science, machine learning, or AI evaluation roles with demonstrated expertise in production ML systems Track record of translating customer feedback and business priorities into automated quality metrics and evaluation criteria Strong knowledge of dashboarding and alerting best practices (Preferably in Looker + GCP) Direct experience building, deploying, and monitoring ML/GenAI workflows in production Direct experience convincing leaders to allocate millions of dollars in investments

Bonus Qualifications Solid understanding of vector databases and embedding-based search/monitoring, especially for multi-modal and LLM-based systems (Milvus, Pinecone, Weaviate, etc.). Strong understanding of VectorDBs, LLMs, and Agentic Evaluation. Familiarity with state-of-the-art observability tools for production AI/ML (Arize Phoenix, MLflow, etc.). Experience with LLM evaluation and benchmarking frameworks (LLM-as-a-judge, agentic evaluation, prompt engineering, etc.).

Salary & Benefits Salary Range: $125,000 - $185,000 Generous equity compensation Flexible vacation/time-off policy All U.S. federal holidays observed, plus an additional company-wide Week of Rest in December Competitive benefits package - 100% premium coverage for employees and generous coverage for dependents Work-from-home stipend to support your ideal setup 401(k) plan The base pay range target for the role seniority described in this job description is between $125,000 - $185,000. Final offer amounts depend on multiple factors such as candidate experience and expertise, geographic location, total compensation, and market data. In addition to cash pay, full-time regular positions are eligible for equity, 401(k), health benefits, and other benefits; some of these benefits may be available for part-time or temporary positions.

💼 Want More Jobs Like This?

Get similar opportunities delivered to your inbox. Free, no account needed!

Similar Jobs You Might Like

Design Architect

AppViewX

RemoteNot specifiedabout 4 hours ago

Full-time

Technical Architectural Design

Customer Success

PKI

DNS

Active Directory

Senior Python Backend Developer

Flexxy Recruitment Solutions

Not specifiedabout 5 hours ago

Full-time

Python

Backend Development

RESTful APIs

Django

Flask

Game Release Specialist Job - iGaming - Remote

SmartRecruitment.com

Not specifiedabout 5 hours ago

Full-time

Game Release Coordination

Attention to Detail

Multitasking

Organizational Skills

Casino Game Mechanics

Junior Software Developer

Salvo Software

Not specifiedabout 5 hours ago

Full-time

Python

XML

Git

Data Parsing

Want to see all 36,570 jobs?

You're currently viewing 1 out of 36,570 available remote opportunities

🔒 36,569 more jobs are waiting for you

Unlock All Jobs

Access every remote opportunity

Advanced Filters

Find your perfect match faster

Daily Updates

New opportunities every day

Save & Alerts

Never miss an opportunity

Weekly

Perfect for quick searches

POPULAR

Monthly

$12

Best for active job seekers

Yearly

$48

Save 67% • Best value

Unlock All 36570 Jobs

Join thousands of remote workers who found their dream job

Frequently Asked Questions

What's included in premium access?

Premium members get unlimited access to all remote job listings, advanced search filters, job alerts, and the ability to save favorite jobs.

Can I cancel anytime?

Yes! You can cancel your subscription at any time from your account settings. You'll continue to have access until the end of your billing period.

Do you offer refunds?

We offer a 7-day money-back guarantee on all plans. If you're not satisfied, contact us within 7 days for a full refund.

Is my payment secure?

Absolutely! We use Stripe for payment processing, which is trusted by millions of businesses worldwide. We never store your payment information.