Crea un profilo in modo da poter essere trovato dalle aziende, ottenere offerte di lavoro più adatte alle tue esigenze e candidarti più velocemente.
  • Cerca lavoro
  • Preferiti
  • Crea CV
    Novità
  • Stipendi
  • Iscrizioni

AI Agent Evaluation Analyst (Freelance)

30 $/ora

Mindrift

1 day ago Be among the first 25 applicants Overview

This opportunity is for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.

What We Do

The Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.

Who we’re looking for

We’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?

This is a flexible, project-based opportunity well-suited for:

  • Analysts, researchers, or consultants with strong critical thinking skills
  • Students (senior undergrads / grad students) looking for an intellectually interesting gig
  • People open to a part-time and non-permanent opportunity
About the project

We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.

You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve excelled in consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.

What you’ll be doing
  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism
  • Identifying inconsistencies, missing assumptions, or unclear decision points
  • Helping define clear expected behaviors (gold standards) for AI agents
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage
How to get started

Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.

Requirements
  • Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
  • Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
  • Familiarity with structured data formats: can read, not necessarily write JSON/YAML
  • Ability to assess scenarios holistically: what’s missing, what’s unrealistic, what might break?
  • Good communication and clear writing (in English) to document your findings

We also value applicants who have:

  • Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
  • Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
  • Exposure to LLMs, prompt engineering, or AI-generated content
  • Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
  • Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
Benefits
  • Get paid for your expertise, with rates that can go up to $30/hour depending on your skills, experience, and project needs
  • Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
  • Participate in an advanced AI project and gain valuable experience to enhance your portfolio
  • Influence how future AI models understand and communicate in your field of expertise

Referrals increase your chances of interviewing at Mindrift by 2x

#J-18808-Ljbffr
Offerta di lavoro pubblicata 2 giorni fa
Offerte di lavoro simili
  •  ...We are looking for a senior freelance IT Business Analyst to work permanently with Synextya (approx. 100hrs/month). You will be in charge of analysing complex digital infrastructures, mapping business processes and needs, supporting the technical team in structuring tasks... 
    Libero professionista
    Lavoro ibrido
    Remoto

    Synextya

    Bologna
    2 giorni fa
  • Description Il Ruolo Business Enablement Specialist – Sales & Client Management La risorsa intraprenderà un interessante cammino professionale all’interno della divisione Enti Pubblici presso la sede di Bologna. La figura supporterà il Team nell’attività di...
    Consigliato

    Willis Towers Watson

    Bologna
    4 giorni fa
  •  ...Informatica Srl, con sede a Turbigo, è alla ricerca di un/a Business Analyst E-commerce. Il candidato ideale ha esperienza nell'analisi dei...  ...di metodologie Agile. Si offre un contratto di collaborazione freelance con rate giornalieri tra 230-240€. Il lavoro è completamente... 
    Libero professionista
    Remoto

    Gecal Informatica Srl

    Bologna
    3 giorni fa
  •  ...ISA Digital Consulting is searching for a Business Analyst Freelance to join their remote working team. The ideal candidate should have a Bachelor's degree in fields like Computer Science, IT, Mathematics, or Business. With over 5 years of experience in Business Analysis... 
    Libero professionista
    Remoto

    ISA Digital Consulting

    Bologna
    3 giorni fa
  •  ...informatica attiva dal 1985, è alla ricerca di uno/una BUSINESS ANALYST E-COMMERCE .Competenze tecniche Esperienza come Business...  ...: per questa posizione offriamo un contratto di COLLABORAZIONE FREELANCE (Rate giornaliero 230-240€) L\'attività è in FULLREMOTE Requisiti... 
    Libero professionista
    Remoto
    Orario flessibile

    Gecal Informatica Srl

    Bologna
    3 giorni fa
  •  ...GRUPPO CAPGEMINICapgemini è un partner globale per la trasformazione tecnologica e di business delle aziende, che sfrutta la potenza dell’AI per offrire valore ai propri clienti. Immaginiamo il futuro delle organizzazioni e lo trasformiamo in realtà grazie all’AI, alla... 
    Impiego permanente
    Lavoro ibrido
    Remoto
    Orario flessibile

    Capgemini

    Bologna
    2 giorni fa
  • A leading consultancy firm in Bologna seeks a Junior AI Analyst. The role involves contributing to innovative projects using AI and advanced analytics for various sectors. Ideal candidates are recent graduates with a Master's degree in quantitative fields and a passion... 
    Lavoro ibrido
    Remoto

    Prometeia

    Bologna
    4 giorni fa
  •  ...Junior AI Analyst – Corporate & Government Are you curious and ready to take on a new career...  ...approaches such as Generative AI and agent‑based systems) in areas such as...  ...empowerment of people. PEOPLE PROGRAM Our evaluation system is based on the full enhancement... 
    Disponibilità immediata
    Remoto
    Orario flessibile

    Prometeia

    Bologna
    4 giorni fa
  •  ...empower the people that will power the future. From a simple swipe to life-changing medicines, from push notifications to generative AI. We design, manufacture, and service the products and solutions that keep the world connected. With $6.9 billion in sales, a strong customer... 

    Vertiv Co

    Bologna
    2 giorni fa
  •  ...life-changing medicines, from push notifications to generative AI. We design, manufacture, and service the products and solutions...  ...our people. We are currently seeking a Technical Sales Senior Analyst to join our dynamic team in Castel Guelfo (BO), Italy ! Deliver... 
    Temporaneo
    Disponibilità immediata

    Vertiv Co

    Bologna
    5 giorni fa
  • Consulente Report Benchmarking | Progetto CTE Descrizione della posizione ALMACUBE SRL è soggetto partner nell’ambito del progetto CASA DELLE TECNOLOGIE EMERGENTI – CTE COBO CUP F39I22001840004 PSC MISE 2014-2020 con Capofila il Comune di Bologna in partenariato ...
    Smart working
    Contratto con partita IVA

    AlmaCube Srl - The Business Incubator of the University of B...

    Bologna
    2 giorni fa
  • Un'azienda innovativa in Emilia-Romagna cerca un AI Agent Designer per progettare agenti AI testuali e vocali. Questa posizione richiede esperienza con LLM, tecniche di prompt engineering e strumenti di workflow automation. Il candidato ideale avrà un background in Linguistica... 
    Smart working
    Lavoro ibrido
    Orario flessibile

    Heres

    Bologna
    4 giorni fa
  •  ...Turing is looking for a Corporate Law & M&A Expert to work on AI projects based in L'Aquila, Abruzzo. In this role, you will evaluate AI-generated legal scenarios, focusing on U.S. corporate governance and transactions. Strong reasoning skills in corporate law are essential... 
    Remoto
    Orario flessibile

    Turing

    Bologna
    3 giorni fa
  • 35 $/ora

     ...Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is...  ...Involves Generate prompts that challenge AI; Evaluate AI-generated solutions for correctness,... 
    Libero professionista
    Paga oraria
    Temporaneo
    Part-time
    Impiego permanente

    Mindrift

    Bologna
    2 giorni fa
  •  ...Turing is looking for a Remote Business Analyst fluent in English and Italian to conduct research, analyze data, and improve large language models. This role requires strong analytical skills and independence for remote work. You will create scenarios to train models... 
    Remoto
    40 h/sett.
    Orario flessibile

    Turing

    Bologna
    3 giorni fa
  •  ...lavoro flessibile e inclusivo Opportunità di crescita in un ambiente giovane e dinamico Siamo aperti a valutare anche risorse freelance in partita iva La selezione è rivolta a candidati ambosessi (L. 903/77). I dati personali verranno trattati secondo il... 
    Libero professionista
    Contratto con partita IVA
    Orario flessibile

    it's prodigy

    Bologna
    2 giorni fa
  •  ...di vita dei progetti, dalla raccolta dei requisiti alla formazione degli utenti finali. Offriamo un contratto di collaborazione freelance e siamo aperti a candidature di ogni orientamento o espressione di genere. Si richiede l'invio di curricula che soddisfano i requisiti... 
    Libero professionista
    Remoto

    Gruppo Gecal Informatica - Altair Systems

    Bologna
    3 giorni fa
  •  ...filtration solutions manufacturer, is offering an exciting Data Analyst – Competitive Pricing Internship at their headquarters in...  ...analytical tools to streamline reporting and decision-making. Evaluating product features on a 0–10 scale to assess market positioning.... 
    Tempo pieno
    Stage/Tirocinio

    Workinvirtual

    Zola Predosa (BO)
    3 giorni fa
  • Il candidato si unirà alla Business Line Marketing Services di CRIF per il mercato B2B ed opererà in un contesto internazionale. Si occuperà principalmente di: • Definire le esigenze aziendali e analisi funzionale dei prodotti e servizi offerti dalla linea di business...

    CRIF S.p.A.

    Bologna
    3 giorni fa
  • Azienda di prodotto, 150 dipendenti Castel Maggiore Azienda Realtà operante nel settore manifatturiero metalmeccanico, specializzata nella progettazione e produzione di componenti elastici e particolari metallici. Offerta Analisi dei dati relativi agli interventi...
    Lavoro ibrido

    Michael Page International Italia S.r.l.

    Bologna
    2 giorni fa
  • Per nuova progettualità, stiamo ricercando un Analista Funzionale IT Senior, con esperienza in ambito Bancario per attività legate ai processi di Pagamento, SDD, SCT, messaggi interbancari ISO 20022, tipo PACS, PAIN, CAMT. PROFILO RICERCATO Esperienza ...
    Libero professionista
    Tempo pieno
    Lavoro ibrido
    Disponibilità immediata

    Jobtome

    Bologna
    5 giorni fa
  • Iaawg is seeking a Client & Financial Data Services Specialist in Bologna to act as a key contact for business and IT teams. The role involves managing operational activities and ensuring the accuracy of data services related to financial instruments. A Master's degree...

    Iaawg

    Bologna
    5 giorni fa
  • Language Matters Recruitment Consultants Ltd sta cercando un Consulente Finanziario con italiano da remoto. Il candidato ideale ha esperienza nella gestione di account con alto profilo patrimoniale e competenze nel settore degli investimenti. Il lavoro è completamente...
    Libero professionista
    Remoto

    Language Matters Recruitment Consultants Ltd

    Bologna
    3 giorni fa
  •  ...A leading digital luxury group based in Bologna is seeking a BI Data Analyst specializing in Power BI. This role focuses on reporting, dashboards, and data modeling, with an emphasis on leveraging an existing semantic layer built on a Tabular cube. The ideal candidate... 
    Lavoro ibrido

    Yoox Group

    Bologna
    4 giorni fa
  • Ti piacerebbe dare uno slancio alla tua carriera? Vuoi contribuire a progetti innovativi in una realtà leader del settore IT come Capgemini? Cogli l’opportunità, unisciti alla squadra, intraprendi il tuo viaggio. Per il potenziamento della practice Insights & Data...

    Capgemini

    Bologna
    4 giorni fa
  • 50.000 € - 55.000 €

     ...Wyser S.r.l. A Socio Unico sta cercando un Senior AI Agent Developer in Emilia-Romagna, Bologna. Il candidato ideale avrà almeno 2 anni di esperienza nello sviluppo di agenti intelligenti e dovrà progettare e implementare soluzioni AI integrate nell’ecosistema Microsoft... 
    Remoto

    Wyser S.r.l. A Socio Unico

    Bologna
    23 ore fa
  • RESPONSIBILITIES Analysis and design of use cases and processes based on business requirements and related presentation to IT/Business. Detailed technical/functional analysis based on the designed use cases. Integration analysis with external systems. Creation...

    Iaawg

    Bologna
    5 giorni fa
  • Iaawg is seeking candidates for a position focused on the analysis and design of business processes and use cases. Responsibilities include detailed analysis based on business requirements, integration with external systems, and supporting the project manager. The ideal...

    Iaawg

    Bologna
    5 giorni fa
  • Descrizione annuncio Intersport Italia , leader internazionale nel settore articoli sportivi, ricerca una figura di Business Developer Specialist itinerante sul territorio italiano con base a Bologna. La risorsa supporterà le attività di sviluppo e ampliamento...

    Cisalfa Sport

    Bologna
    5 giorni fa
  • Iaawg is seeking a professional in Bologna / Milano to analyze and design business processes based on requirements. Candidates should have a Master's degree in relevant fields and 1+ years of consulting experience. Successful applicants will possess strong analytical...

    Iaawg

    Bologna
    5 giorni fa