AI Agent Evaluation Analyst (Freelance)

$30/hour

Mindrift

Overview

This opportunity is for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency.

At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we’re looking for

We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

  • Analysts, researchers, or consultants with strong critical thinking skills
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig
  • People open to a part-time and non-permanent opportunity
About the project

We’re looking for QA analysts to evaluate autonomous AI agents in a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. The project is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve excelled in consulting, CHGK (the quiz game "What? Where? When?"), Olympiads, case solving, or systems thinking, you might be a great fit.

What you’ll be doing
  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism
  • Identifying inconsistencies, missing assumptions, or unclear decision points
  • Helping define clear expected behaviors (gold standards) for AI agents
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage
How to get started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements
  • Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
  • Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
  • Familiarity with structured data formats: can read (but not necessarily write) JSON/YAML
  • Ability to assess scenarios holistically: what’s missing, what’s unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings
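
To make the requirements above concrete, here is a minimal sketch of the kind of check a reviewer performs when reading a scenario in JSON. The scenario, its field names (`policy`, `request`, `expected_behavior`, `refund_window_days`, `days_since_purchase`), and the `review_scenario` helper are all hypothetical and are not taken from any Mindrift project; they only illustrate spotting a missing field or a contradiction.

```python
import json

# Hypothetical evaluation scenario; every field name here is an assumption.
scenario_text = """
{
  "id": "refund-001",
  "policy": {"refund_window_days": 30},
  "request": {"days_since_purchase": 45},
  "expected_behavior": "issue_refund"
}
"""


def review_scenario(scenario):
    """Return a list of plain-language findings for one scenario."""
    findings = []

    # Completeness: are the parts a reviewer needs actually present?
    for key in ("policy", "request", "expected_behavior"):
        if key not in scenario:
            findings.append(f"missing field: {key}")

    # Consistency: does the expected behavior agree with the stated policy?
    window = scenario.get("policy", {}).get("refund_window_days")
    days = scenario.get("request", {}).get("days_since_purchase")
    expected = scenario.get("expected_behavior")
    if window is not None and days is not None:
        if days > window and expected == "issue_refund":
            findings.append("expected behavior contradicts the refund window")

    return findings


findings = review_scenario(json.loads(scenario_text))
for finding in findings:
    print(finding)
```

In this made-up scenario the purchase is 45 days old while the policy allows refunds for 30 days, so the expected behavior cannot be right as written. No coding is required for the role; the same contradiction can be spotted just by reading the JSON.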

We also value applicants who have:

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
  • Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
  • Exposure to LLMs, prompt engineering, or AI-generated content
  • Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
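
As a rough illustration of the scoring terms in the last item, the sketch below computes precision and coverage for one agent run. It assumes one common reading of the terms: precision is the share of the agent's actions that were expected, and coverage is the share of expected (gold-standard) actions the agent performed. The action names and the `precision_and_coverage` helper are hypothetical; the posting does not define how the project actually scores agents.

```python
def precision_and_coverage(agent_actions, gold_actions):
    """Compare an agent's actions with the expected (gold) actions."""
    agent, gold = set(agent_actions), set(gold_actions)
    matched = agent & gold

    # Precision: share of the agent's actions that were expected.
    precision = len(matched) / len(agent) if agent else 0.0
    # Coverage: share of the expected actions the agent performed.
    coverage = len(matched) / len(gold) if gold else 0.0
    return precision, coverage


# Hypothetical gold standard and agent trace for a refund request.
gold = ["verify_identity", "check_refund_policy", "issue_refund"]
agent = ["verify_identity", "issue_refund", "send_survey", "close_ticket"]

precision, coverage = precision_and_coverage(agent, gold)
print(f"precision={precision:.2f} coverage={coverage:.2f}")
```

Here two of the agent's four actions were expected, and two of the three expected actions were performed. This sketch ignores action order and repeated actions, which a real evaluation framework may or may not take into account.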
Benefits
  • Get paid for your expertise, with rates that can go up to $30/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise

Job posted 12 days ago