AI Agent Evaluation Analyst (Freelance)
30 $/oraMindrift
1 day ago Be among the first 25 applicants Overview
This opportunity is for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
What We DoThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
Who we’re looking forWe’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?
This is a flexible, project-based opportunity well-suited for:
- Analysts, researchers, or consultants with strong critical thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part-time and non-permanent opportunity
We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.
You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve excelled in consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.
What you’ll be doing- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.
Requirements- Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: can read, not necessarily write JSON/YAML
- Ability to assess scenarios holistically: what’s missing, what’s unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings
We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI-generated content
- Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
- Get paid for your expertise, with rates that can go up to $30/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise
Referrals increase your chances of interviewing at Mindrift by 2x
#J-18808-Ljbffr- Un'azienda nel settore software cerca un Backend Developer Freelance con esperienza in .NET per sviluppare soluzioni che integrano strumenti di Intelligenza Artificiale. Sono richiesti almeno 3 anni di esperienza, competenze in ASP.Net MVC e Core, e conoscenze di SQL. Il...Libero professionistaLavoro ibridoRemoto
- ...Corporate Law & M&A Expert to join their team in cutting-edge AI projects. The role involves evaluating AI responses to complex corporate legal scenarios,... ...corporate governance and securities regulations. This freelance opportunity offers flexible hours and competitive...Libero professionistaRemotoOrario flessibile
- In SDG hai l’opportunità di diventare un esperto di Big Data & Analytics edi entrare a far parte di una delle società più all’avanguardia d’Europa in questo ambito. Potrai partecipare a diversi progetti nazionali e internazionali lavorando in team giovani e dinamici....ConsigliatoSmart workingRemoto
- Un'azienda innovativa nel settore della consulenza offre un'opportunità unica per diventare esperto in Big Data e Analytics. Unisciti a un team giovane e dinamico, partecipa a progetti internazionali e costruisci il tuo percorso professionale personalizzato. Con un forte...ConsigliatoRemoto
- ...MAS Management Network è una società di consulenza manageriale e sta ricercando un "Process and data junior analyst" per nostre attività a Firenze. La figura ideale è un neolaureato in ingegneria gestionale che abbia già maturato una breve esperienza nel settore manifatturiero...Consigliato
- Cosa farai concretamente? All’interno del Team Basilea 2, comprendi la normativa regolamentare in ambito di definizione del Default e della Forbearance. Ti occuperai della sua applicazione tecnica all’interno del motore di calcolo ad alta complessità dedicato alle transizioni...Impiego permanenteOrario flessibile
- ...Gruppo Gecal Informatica - Altair Systems cerca un Business Analyst esperto in e-commerce per lavoro full remote. La posizione richiede... ...User Story su Jira. Offriamo un contratto di collaborazione freelance con un compenso giornaliero compreso tra 230 e 240€. La candidatura...Libero professionistaRemoto
- ...ISA Digital Consulting is searching for a Business Analyst Freelance to join their remote working team. The ideal candidate should have a Bachelor's degree in fields like Computer Science, IT, Mathematics, or Business. With over 5 years of experience in Business Analysis...Libero professionistaRemoto
30 $/ora
...shape the future of AI. What We Do The Mindrift... ...and structured evaluation scenarios for LLM‑based agents. Create test cases that... ...behavior to compare agent actions against.... ...and scoring logic to evaluate agent actions. Analyze... ...Flexible, remote, freelance project that fits...Libero professionistaPart-timeRemotoOrario flessibile- 'Going where we have not gone before'. For us, this means breaking new ground, exploring new ways to expand the distribution and to enlarge our sales force. And why? Our Mission is to make RedBull – the brand, the product, and the content – available to everyone, anywhere...Tempo pieno
- ...trasformazione digitale, portandole a un livello superiore. Potresti essere TU la persona che cerchiamo, se… Sei un Backend Developer Freelance con esperienza di sviluppo in .NET e vuoi dedicarti allo sviluppo di soluzioni che integrano strumenti di Intelligenza Artificiale...Libero professionistaLavoro ibridoRemoto
- ...informatica attiva dal 1985, è alla ricerca di uno/una BUSINESS ANALYST E-COMMERCE .Competenze tecniche Esperienza come Business... ...: per questa posizione offriamo un contratto di COLLABORAZIONE FREELANCE (Rate giornaliero 230-240€) L\'attività è in FULLREMOTE Requisiti...Libero professionistaRemotoOrario flessibile
- Una società di consulenza manageriale cerca un 'Process and data junior analyst' a Firenze. Il candidato ideale è un neolaureato in Ingegneria Gestionale con esperienza nel settore manifatturiero, preferibilmente nel Fashion/Luxury. È richiesta una buona dimestichezza con...
- Findomestic Banca S.p.A cerca un professionista per gestire la normativa sul Default e la Forbearance, nel contesto della sua applicazione tecnica. La figura sarà coinvolta in analisi e reportistica riguardo la rischiosità del portafoglio, contribuendo a importanti progetti...Orario flessibile
- Un'azienda innovativa offre l'opportunità di diventare un esperto in Big Data & Analytics. Unisciti a un team giovane e dinamico per lavorare su progetti nazionali e internazionali. Avrai accesso a un percorso di carriera personalizzato con formazione continua e opportunità...Smart workingRemoto
- Software Partner Italia è alla ricerca di un Business Intelligence da inserire in un team di progetto presso i clienti a Firenze. Il candidato ideale avrà esperienza nella gestione di progetti, conoscenze in Business Object, Datastage, Oracle e competenze in sviluppo SQL...Tempo pienoImpiego permanenteDisponibilità immediata
- Prima di consultare le posizioni aperte… Se sei alla ricerca di una nuova opportunità lavorativa nel mondo ICT, troverai nelle posizioni aperte la descrizione del ruolo e i requisiti che stiamo cercando. Vogliamo però esser certi per prima cosa che tu conosca le competenze...Smart workingTempo pienoStage/TirocinioContratto con partita IVARemotoLavoro da casa
- ...Hunters Group è alla ricerca di Agenti di Vendita Indipendenti in Toscana per promuovere una piattaforma di Intelligenza Artificiale destinata ai professionisti legali. Questo ruolo chiave prevede lo sviluppo commerciale, l'acquisizione di nuovi clienti e la gestione...Contratto con partita IVA
- A leading sports technology company is seeking a Football Tracking Systems Technician. The role involves preparing and monitoring tracking systems during matches in Firenze, Italy. Ideal candidates are proactive problem solvers with excellent organizational skills, capable...Libero professionistaOrario flessibile
- A leading sports technology company is seeking a Football Tracking Systems Technician for onsite positions in Firenze, Italy. Responsibilities include preparing and monitoring tracking systems during matches. Candidates should possess strong analytical, organisational,...Libero professionistaOrario flessibile
- Who are we? SupportYourApp is a global Intelligent Support-as-a-Service leader, partnering with tech companies and industry leaders like MasterCard, Calm and MacPaw in 30+ countries since 2010 to deliver secure customer and technical support. We operate globally, supporting...Libero professionistaRemotoOrario fisso
- Overview CGM Consulting ricerca Analista programmatore JavaEE su FIRENZE. Requirements Oracle JPA, EJB Web services REST AngularJS, Javascript SQL, PL/SQL HMTL, CSS Responsibilities Relazionarsi correttamente con il cliente e con il proprio...Tempo pienoImpiego permanente
- CGM Consulting S.r.l. ricerca un Analista programmatore JavaEE da inserire nel team di Firenze. La figura avrà il compito di relazionarsi con i clienti e lavorare in autonomia per risolvere eventuali problematiche durante il processo di sviluppo. È richiesta Laurea ...Tempo pienoImpiego permanente
- ...Turing is looking for a Corporate Law & M&A Expert to work on AI projects based in L'Aquila, Abruzzo. In this role, you will evaluate AI-generated legal scenarios, focusing on U.S. corporate governance and transactions. Strong reasoning skills in corporate law are essential...RemotoOrario flessibile
35 $/ora
...Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is... ...Involves Generate prompts that challenge AI; Evaluate AI-generated solutions for correctness,...Libero professionistaPaga orariaTemporaneoPart-timeImpiego permanente- ...Turing is looking for a Remote Business Analyst fluent in English and Italian to conduct research, analyze data, and improve large language models. This role requires strong analytical skills and independence for remote work. You will create scenarios to train models...RemotoOrario flessibile40 h/sett.
30 $/ora
An innovative AI project firm in Milan seeks QAs for autonomous AI agents. This flexible, project-based role requires strong analytical thinking, attention to detail... ...skills in English. Ideal candidates include analysts or students eager to contribute to AI validation efforts...RemotoOrario flessibile33 $/ora
...Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is... ...problems reflecting professional practice Evaluate AI solutions for correctness, assumptions, and...Libero professionistaTemporaneoPart-timeImpiego permanente- ...di vita dei progetti, dalla raccolta dei requisiti alla formazione degli utenti finali. Offriamo un contratto di collaborazione freelance e siamo aperti a candidature di ogni orientamento o espressione di genere. Si richiede l'invio di curricula che soddisfano i requisiti...Libero professionistaRemoto
15 $/ora
A leading technology company is seeking an AI Quality Analyst in Lombardia, Italy, to evaluate a new personalization feature for Gemini. This role involves designing conversational prompts and assessing AI responses for quality and relevance. Candidates must be proficient...Paga orariaRemotoOrario flessibile40 h/sett.
