skillreveal

    Senior AI Data Scientist/Engineer

    skillreveal
    Posted 11/25/2025Senior Level
    Full-time
    Technology
    Data Science
    LLMs
    RAG Architecture
    SQL Databases
    Embedding Generation

    ⭐ Join thousands of remote professionals with full access • From $4/week

    Job Description

    We're currently looking for a Senior AI Data Scientist / Engineer to join our team for a full-time position (remotely in Ukraine or in Lviv's office).

    About the Customer and the Project:

    Our customer is the world's largest DNA network, based in the USA. This presents a unique opportunity to work with more than 60 billion digitized global historical records, 100 million family trees, and 18+ million people in their growing database. Our customers help people discover their family stories and gain actionable insights about their health and wellness.

    About the team:

    You will join the AI Content team, a dynamic group at the forefront of Document Understanding. You'll play a vital role in developing innovative AI models that extract and organize text and image information from billions of historical and genealogical records, enabling customers to discover, share, and connect with their family history. As a member of the team, you will work with KB (Knowledge Base) and RAG (Retrieval Augmented Generation) implementations, integrating architectures leveraging SQL-structured databases along with vector databases supporting semantic search and retrieval applications. You will work with a dedicated mentor from the data science team, as well as engineering teams, to train, optimize, and deploy models that promote product development, customer success, and content creation across our project.

    What you will do:

    Configure structured and vector databases: Align and sync database schemas across structured and vector databases Curate and organize content collection metadata: Prepare and format provided content collection metadata to be compatible with defined database schemas

    • Ingest content collection metadata: Ingest collection metadata from provided sources into a structured SQL database.
    • Embeddings generation: Help develop a tool/script to generate embeddings from the structured data to populate the vector database.
    • Iterative improvement: Iterate on adjusting the database schema, indexes, embeddings, etc., to support various queries and use cases for analyzing the ingested content collection metadata

    Collaborate on Cloud Deployment: Partner closely with ML Ops and Data Science Engineers to seamlessly deploy datasets, truth sets, models, and pipelines for training and inference in cloud environments. Communicate Insights Effectively: Clearly and confidently present your findings, deliverables, and proposed solutions to technical and non-technical audiences, including teams, stakeholders, and executives.

    Requirements:

    5+ years of experience in Data Science Strong hands-on commercial experience with LLMs in production, RAG architecture, and agentic systems

    • Expertise with data collection, organization, curation, and formatting to populate SQL databases.
    • Experience with SQL databases, including adjusting schemas and indices to optimize for efficient queries.
    • Familiar with embedding generation and use of vector databases for semantic search and retrieval.
    • Strong proficiency and experience with Python and relevant tools and libraries
    • Practical experience with cloud platform AWS (e.g. Amazon SageMaker, EC2, S3, AWS Lambda).
    • English: Upper-intermediate at least (both spoken and written)

    It will be a plus:

    • Knowledge and experience with cloud platforms and related AI/ML services such as Google GCP Gemini API, Vertex AI, Azure, etc.
    • Strong knowledge and experience with LightLLM
    • Commercial experience with Terraform or CloudFormation
    • Experience with agentic web scraping tools

    What You'll Gain

    • Mentorship & Growth: Learn from experienced Data Scientists while tackling meaningful, real-world AI projects, expanding your knowledge and professional network within a collaborative culture.
    • Collaboration & Impact: Work alongside top industry professionals and help shape the tools that bring family history to life for millions of users.
    • Innovation & Purpose: Join a team at the forefront of applying AI to historical data where every model you build helps preserve human stories.
    • What do we offer our new colleague?
    • Competitive compensation (based on market data, but also depending on the technical level of the candidate)
    • Flexible work schedule

    3 health packages to choos frome

    • Annual paid vacation and state holiday celebration
    • Free English classes (online)

    Individual approach to professional growth Lack of bureaucracy and micromanagement Modern, comfortable office facilities (a barbecue zone, kitchens, lounge rooms, coffee machines, etc.) Foreign business trips (after the war) On-site parking lot and charge station for Electric Cars Corporate gifts, celebrations, and fun activities Sports activities: ping-pong, soccer, work-out

    Suppose you have a passion for solving challenging problems, building scalable, robust systems, love working with the latest technologies in a fast-paced, flexible environment, and are excited about the prospect of having a significant impact on products with more than 3 million paying subscribers. In that case, we want to talk to you! ;-)

    💼 Want More Jobs Like This?

    Get similar opportunities delivered to your inbox. Free, no account needed!

    Similar Jobs You Might Like

    ML Engineer - Document Intelligence & Applied GenAI

    PandaDoc
    Not specifiedabout 2 hours ago
    Full-time
    Machine Learning
    Document Intelligence
    GenAI
    Model Development
    Evaluation Frameworks

    ML Engineer - Document Intelligence & Applied GenAI

    PandaDoc
    RemoteNot specifiedabout 2 hours ago
    Full-time
    Machine Learning
    Document Intelligence
    GenAI
    Model Development
    Evaluation Frameworks

    ML Engineer - Document Intelligence & Applied GenAI

    PandaDoc
    Not specifiedabout 2 hours ago
    Full-time
    Machine Learning
    Document Intelligence
    GenAI
    Model Development
    Evaluation Frameworks

    DevOps Engineer 1

    Stellar Health
    Not specifiedabout 3 hours ago
    Full-time
    DevOps
    SRE
    Software Development
    Linux
    Git

    Want to see all 35,300 jobs?

    You're currently viewing 1 out of 35,300 available remote opportunities

    🔒 35,299 more jobs are waiting for you

    Unlock All Jobs

    Access every remote opportunity

    Advanced Filters

    Find your perfect match faster

    Daily Updates

    New opportunities every day

    Save & Alerts

    Never miss an opportunity

    Weekly
    $4
    Perfect for quick searches
    POPULAR
    Monthly
    $12
    Best for active job seekers
    Yearly
    $48
    Save 67% • Best value
    Unlock All 35300 Jobs

    Join thousands of remote workers who found their dream job

    Frequently Asked Questions

    What's included in premium access?

    Premium members get unlimited access to all remote job listings, advanced search filters, job alerts, and the ability to save favorite jobs.

    Can I cancel anytime?

    Yes! You can cancel your subscription at any time from your account settings. You'll continue to have access until the end of your billing period.

    Do you offer refunds?

    We offer a 7-day money-back guarantee on all plans. If you're not satisfied, contact us within 7 days for a full refund.

    Is my payment secure?

    Absolutely! We use Stripe for payment processing, which is trusted by millions of businesses worldwide. We never store your payment information.