AI Agent Evaluation Analyst (Freelance)
30 $/oraMindrift
1 day ago Be among the first 25 applicants Overview
This opportunity is for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
What We DoThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
Who we’re looking forWe’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?
This is a flexible, project-based opportunity well-suited for:
- Analysts, researchers, or consultants with strong critical thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part-time and non-permanent opportunity
We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.
You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve excelled in consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.
What you’ll be doing- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.
Requirements- Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: can read, not necessarily write JSON/YAML
- Ability to assess scenarios holistically: what’s missing, what’s unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings
We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI-generated content
- Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
- Get paid for your expertise, with rates that can go up to $30/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise
Referrals increase your chances of interviewing at Mindrift by 2x
#J-18808-Ljbffr30 $/ora
...shape the future of AI. What We Do The... ...are tested and evaluated? This is a flexible... ...-suited for: Analysts, researchers, or... ...for autonomous AI agents for a new project... ...policy logic, and agent evaluation frameworks... ...flexible, remote, freelance project that fits...Libero professionistaPart-timeImpiego permanenteRemotoOrario flessibile- ...ISA Digital Consulting is searching for a Business Analyst Freelance to join their remote working team. The ideal candidate should have a Bachelor's degree in fields like Computer Science, IT, Mathematics, or Business. With over 5 years of experience in Business Analysis...Libero professionistaRemoto
- A leading global payment services company is seeking a Sales Operations Analyst based in Rome, Italy. This role involves generating revenue, developing relationships with agents, and facilitating sales processes. The ideal candidate has a BA/BS degree and 2+ years of sales...ConsigliatoLavoro ibrido
- ...the world's fastest-growing AI companies accelerating the advancement... ...leverage AI to be a better analyst. You would spend time... ...is highly desirable. Perks of Freelancing With Turing: Work in a fully... ...needs. Contractor assignment/freelancer (no medical/paid leave)- Commitments...Libero professionistaRemoto40 h/sett.
- ...Corporate Law & M&A Expert to join their team in cutting-edge AI projects. The role involves evaluating AI responses to complex corporate legal scenarios,... ...corporate governance and securities regulations. This freelance opportunity offers flexible hours and competitive...Libero professionistaRemotoOrario flessibile
12 $/ora
...and data transformation company powering AI systems worldwide. We run one of the largest... ...As an Ads Quality Rater , you will evaluate and rate online advertisements to help improve... ...Type: Independent Contractor / Freelance / Self-Employed Project Duration: Long...Libero professionistaSecondo lavoroPart-timeContratto con partita IVADisponibilità immediataRemotoLavoro da casaOrario flessibile20 h/sett.- ...A leading AI-focused company is looking for detail-oriented linguists with native Italian fluency... ...multimedia content, with a focus on tasks like prompt evaluation and video understanding. This position is remote and freelance, allowing you to work from anywhere. Ideal...Libero professionistaRemoto
30 $/ora
...shape the future of AI. What We Do The Mindrift... ...and structured evaluation scenarios for LLM‑based agents. Create test cases that... ...behavior to compare agent actions against.... ...and scoring logic to evaluate agent actions. Analyze... ...Flexible, remote, freelance project that fits...Libero professionistaPart-timeRemotoOrario flessibile- ...Accelerare le tue competenze : Grazie a corsi e programmi di sviluppo orientati al futuro, utilizzando tecnologie smart e applicazioni AI che ti liberano dalle attività ripetitive: da Copilot365 al tool proprietario EYQ, passando per PowerBI Allargare i tuoi orizzonti...Smart workingLavoro ibridoDisponibilità immediataOrario flessibile
- Ernst & Young Advisory Services Sdn Bhd cerca un Analista Funzionale per progetti di trasformazione digitale. Il candidato ideale deve avere una laurea in materie STEM e esperienza in società di consulenza. Avrai l'opportunità di lavorare in un ambiente internazionale ...Lavoro ibrido
- ...for over 25 years in ICT consultancy, specializing in projects with a high IT technological content, is looking for a Business Analyst Freelance. Skills & Experience A level of education which corresponds to completed university studies of at least three (3)...Libero professionistaTempo pienoRemoto
- ...mondo più sostenibile ed inclusivo.## **IL TUO PROFILO**Come Soc Analyst presso Capgemini ti occuperai di:* Analisi del contesto delle... ...tecnologica e di business delle aziende, che sfrutta la potenza dell’AI per offrire valore ai propri clienti. Immaginiamo il futuro...Impiego permanenteRemotoTurniOrario flessibile
1.000 €/mese
...a noi? Scopri nel concreto che cosa fa un/una Junior Business Analyst in EY Farai parte della Practice Technology Strategy Transformation... ...al futuro, utilizzando tecnologie smart e applicazioni AI che ti liberano dalle attività ripetitive: da Copilot365 al tool...Smart workingStage/TirocinioLavoro ibrido- ...patient deserves access to treatment options. Massive Bio is an AI-powered precision medicine platform transforming how cancer patients... ...style or business development approach. Profiles with senior freelance CRA experience, or other physician-facing experience in oncology...Libero professionistaRemoto
- ...Ernst & Young cerca un/una Junior Business Analyst per unirsi al team di Technology Strategy Transformation a Roma. Questa figura si occuperà di analizzare i trend nei pagamenti digitali, supportare i progetti e collaborare con team internazionali. Offriamo opportunità...Stage/Tirocinio
- ...skill growth, great benefits, and a team that wants you to grow and succeed. Overview We are seeking a visionary SAP Business AI Architect to drive the strategy, design, and implementation of AI capabilities across our SAP eco-system. This candidate will possess...Remoto
29 $/ora
Mindrift is looking for specialists to design original computational physics problems that emulate real research workflows. Responsibilities include developing Python-based tasks and ensuring problems are computationally intensive.The ideal candidate should have a degree...Libero professionistaPaga oraria29 $/ora
Mindrift is seeking specialists to design computational physics problems that simulate real research workflows. Candidates will employ their expertise to create problems requiring Python programming and ensure computational intensity. An ideal candidate has a degree in...Libero professionistaPaga oraria- ...Are you passionate about AI and skilled at analyzing both text and multimedia content... ...projects, including tasks such as prompt evaluation, video content understanding, text review... ..., and more. This is a remote, freelance opportunity . Who we're looking for...Libero professionistaRemoto
- ...RealAdvisor, azienda innovativa della PropTech, ricerca un Sales Executive freelance per il mercato italiano. In questo ruolo, gestirai progetti di vendita consapevole con l'obiettivo di aiutare i professionisti del settore immobiliare. Se hai più di 4 anni di esperienza...Libero professionistaRemoto
- ...complessi di trasformazione digitale in ambito pubblico e privato, in contesti nazionali e internazionali. Siamo alla ricerca di un AI Architect che contribuisca alla progettazione, sviluppo e implementazione di una piattaforma esistente di ricerca e matching...Libero professionistaLavoro ibridoRemoto
30 $/ora
A technology platform for AI projects is seeking contributors with a degree in Mathematics for project-based work. Ideal candidates... ...for numerical validation. Tasks involve designing math problems, evaluating AI solutions, and validating results. Contributors can earn up...Libero professionistaPaga orariaOrario flessibile- A leading tech company in Rome is seeking a skilled developer to build, test, and deploy AI agents on their platform. The ideal candidate will have 2-6 years of software development experience and a strong understanding of APIs and networking. This role involves collaboration...
- ...Obsidian is seeking a Private Equity Expert based in Italy to participate in a research project with a leading AI lab. The role requires at least 2 years of experience in private equity, with proficiency in skills such as financial modeling and market sizing. This is...10 h/sett.
35.000 € - 45.000 €
...e ambientali, l'implementazione di modelli di scenario analysis e l'automazione dei processi di reportistica Integrare soluzioni AI-driven nei workflow di valutazione del rischio climatico e ambientale, con conoscenza operativa dei principali modelli linguistici di...Smart workingLavoro ibrido- ...As Business Analyst & Controlling, you will contribute to the achievement of business objectives by supporting planning processes, monitoring financial and operational performance, and providing accurate analysis to guide decision-making. The role ensures reliable reporting...
- Frontiere cerca un professionista esperto in Data Analytics e Data Science per trasformare dati complessi in insight strategici. Sarai responsabile della gestione end-to-end delle iniziative analitiche, producendo dashboard e collaborando con stakeholder di business. ...Lavoro ibrido
- The Food and Agriculture Organization of the United Nations is looking for a Food Standards Officer (Data Management) in Rome, Italy. This position involves collecting and analyzing critical data related to food safety and quality, and updating important databases to aid...
17 $/ora
...curious people from around the world with freelance online tasks that train and improve... ...Annotators connects individuals with Generative AI projects from leading tech innovators.... ...projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses...Libero professionistaPaga orariaPart-timeRemoto100.000 €
## Retail Credit Risk AnalystApplylocations: Rometime type: Full timeposted on: Posted Todayjob requisition id: JR\_10039932**At Ayvens, progress starts with you.**Our ambitions to shape the future of sustainable mobility are powered by our talent. Join us, and get better...Libero professionistaLungo termineOrario flessibile

