AI Agent Evaluation Analyst (Freelance)
30 $/oraMindrift
1 day ago Be among the first 25 applicants Overview
This opportunity is for candidates currently residing in the specified country. Your location may affect eligibility and rates. Please submit your resume in English and indicate your level of English proficiency. At Mindrift, innovation meets opportunity. We believe in using the power of collective human intelligence to ethically shape the future of AI.
What We DoThe Mindrift platform, launched and powered by Toloka, connects domain experts with cutting-edge AI projects from innovative tech clients. Our mission is to unlock the potential of GenAI by tapping into real-world expertise from across the globe.
Who we’re looking forWe’re looking for curious and intellectually proactive contributors, the kind of person who double-checks assumptions and plays devil’s advocate. Are you comfortable with ambiguity and complexity? Does an async, remote, flexible opportunity sound exciting? Would you like to learn how modern AI systems are tested and evaluated?
This is a flexible, project-based opportunity well-suited for:
- Analysts, researchers, or consultants with strong critical thinking skills
- Students (senior undergrads / grad students) looking for an intellectually interesting gig
- People open to a part-time and non-permanent opportunity
We’re on the hunt for QAs for autonomous AI agents for a new project focused on validating and improving complex task structures, policy logic, and agent evaluation frameworks. Throughout the project, you’ll balance quality assurance, research, and logical problem-solving. This project opportunity is ideal for people who enjoy looking at systems holistically and thinking through scenarios, implications, and edge cases.
You do not need a coding background, but you must be curious, intellectually rigorous, and capable of evaluating the soundness and consistency of complex setups. If you’ve excelled in consulting, CHGK, Olympiads, case solving, or systems thinking, you might be a great fit.
What you’ll be doing- Reviewing evaluation tasks and scenarios for logic, completeness, and realism
- Identifying inconsistencies, missing assumptions, or unclear decision points
- Helping define clear expected behaviors (gold standards) for AI agents
- Annotating cause-effect relationships, reasoning paths, and plausible alternatives
- Thinking through complex systems and policies as a human would to ensure agents are tested properly
- Working closely with QA, writers, or developers to suggest refinements or edge case coverage
Apply to this post, qualify, and get the chance to contribute to a project aligned with your skills, on your own schedule. Shape the future of AI while building tools that benefit everyone.
Requirements- Excellent analytical thinking: can reason about complex systems, scenarios, and logical implications
- Strong attention to detail: can spot contradictions, ambiguities, and vague requirements
- Familiarity with structured data formats: can read, not necessarily write JSON/YAML
- Ability to assess scenarios holistically: what’s missing, what’s unrealistic, what might break?
- Good communication and clear writing (in English) to document your findings
We also value applicants who have:
- Experience with policy evaluation, logic puzzles, case studies, or structured scenario design
- Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research
- Exposure to LLMs, prompt engineering, or AI-generated content
- Familiarity with QA or test-case thinking (edge cases, failure modes, "what could go wrong")
- Some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.)
- Get paid for your expertise, with rates that can go up to $30/hour depending on your skills, experience, and project needs
- Take part in a flexible, remote, freelance project that fits around your primary professional or academic commitments
- Participate in an advanced AI project and gain valuable experience to enhance your portfolio
- Influence how future AI models understand and communicate in your field of expertise
Referrals increase your chances of interviewing at Mindrift by 2x
#J-18808-Ljbffr- ...We are looking for a senior freelance IT Business Analyst to work permanently with Synextya (approx. 100hrs/month). You will be in charge of analysing complex digital infrastructures, mapping business processes and needs, supporting the technical team in structuring tasks...Libero professionistaLavoro ibridoRemoto
- Description Il Ruolo Business Enablement Specialist – Sales & Client Management La risorsa intraprenderà un interessante cammino professionale all’interno della divisione Enti Pubblici presso la sede di Bologna. La figura supporterà il Team nell’attività di...Consigliato
- ...Informatica Srl, con sede a Turbigo, è alla ricerca di un/a Business Analyst E-commerce. Il candidato ideale ha esperienza nell'analisi dei... ...di metodologie Agile. Si offre un contratto di collaborazione freelance con rate giornalieri tra 230-240€. Il lavoro è completamente...Libero professionistaRemoto
- ...ISA Digital Consulting is searching for a Business Analyst Freelance to join their remote working team. The ideal candidate should have a Bachelor's degree in fields like Computer Science, IT, Mathematics, or Business. With over 5 years of experience in Business Analysis...Libero professionistaRemoto
- ...informatica attiva dal 1985, è alla ricerca di uno/una BUSINESS ANALYST E-COMMERCE .Competenze tecniche Esperienza come Business... ...: per questa posizione offriamo un contratto di COLLABORAZIONE FREELANCE (Rate giornaliero 230-240€) L\'attività è in FULLREMOTE Requisiti...Libero professionistaRemotoOrario flessibile
- ...GRUPPO CAPGEMINICapgemini è un partner globale per la trasformazione tecnologica e di business delle aziende, che sfrutta la potenza dell’AI per offrire valore ai propri clienti. Immaginiamo il futuro delle organizzazioni e lo trasformiamo in realtà grazie all’AI, alla...Impiego permanenteLavoro ibridoRemotoOrario flessibile
- A leading consultancy firm in Bologna seeks a Junior AI Analyst. The role involves contributing to innovative projects using AI and advanced analytics for various sectors. Ideal candidates are recent graduates with a Master's degree in quantitative fields and a passion...Lavoro ibridoRemoto
- ...Junior AI Analyst – Corporate & Government Are you curious and ready to take on a new career... ...approaches such as Generative AI and agent‑based systems) in areas such as... ...empowerment of people. PEOPLE PROGRAM Our evaluation system is based on the full enhancement...Disponibilità immediataRemotoOrario flessibile
- ...empower the people that will power the future. From a simple swipe to life-changing medicines, from push notifications to generative AI. We design, manufacture, and service the products and solutions that keep the world connected. With $6.9 billion in sales, a strong customer...
- ...life-changing medicines, from push notifications to generative AI. We design, manufacture, and service the products and solutions... ...our people. We are currently seeking a Technical Sales Senior Analyst to join our dynamic team in Castel Guelfo (BO), Italy ! Deliver...TemporaneoDisponibilità immediata
- Consulente Report Benchmarking | Progetto CTE Descrizione della posizione ALMACUBE SRL è soggetto partner nell’ambito del progetto CASA DELLE TECNOLOGIE EMERGENTI – CTE COBO CUP F39I22001840004 PSC MISE 2014-2020 con Capofila il Comune di Bologna in partenariato ...Smart workingContratto con partita IVA
- Un'azienda innovativa in Emilia-Romagna cerca un AI Agent Designer per progettare agenti AI testuali e vocali. Questa posizione richiede esperienza con LLM, tecniche di prompt engineering e strumenti di workflow automation. Il candidato ideale avrà un background in Linguistica...Smart workingLavoro ibridoOrario flessibile
- ...Turing is looking for a Corporate Law & M&A Expert to work on AI projects based in L'Aquila, Abruzzo. In this role, you will evaluate AI-generated legal scenarios, focusing on U.S. corporate governance and transactions. Strong reasoning skills in corporate law are essential...RemotoOrario flessibile
35 $/ora
...Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is... ...Involves Generate prompts that challenge AI; Evaluate AI-generated solutions for correctness,...Libero professionistaPaga orariaTemporaneoPart-timeImpiego permanente- ...Turing is looking for a Remote Business Analyst fluent in English and Italian to conduct research, analyze data, and improve large language models. This role requires strong analytical skills and independence for remote work. You will create scenarios to train models...Remoto40 h/sett.Orario flessibile
- ...lavoro flessibile e inclusivo Opportunità di crescita in un ambiente giovane e dinamico Siamo aperti a valutare anche risorse freelance in partita iva La selezione è rivolta a candidati ambosessi (L. 903/77). I dati personali verranno trattati secondo il...Libero professionistaContratto con partita IVAOrario flessibile
- ...di vita dei progetti, dalla raccolta dei requisiti alla formazione degli utenti finali. Offriamo un contratto di collaborazione freelance e siamo aperti a candidature di ogni orientamento o espressione di genere. Si richiede l'invio di curricula che soddisfano i requisiti...Libero professionistaRemoto
- ...filtration solutions manufacturer, is offering an exciting Data Analyst – Competitive Pricing Internship at their headquarters in... ...analytical tools to streamline reporting and decision-making. Evaluating product features on a 0–10 scale to assess market positioning....Tempo pienoStage/Tirocinio
- Il candidato si unirà alla Business Line Marketing Services di CRIF per il mercato B2B ed opererà in un contesto internazionale. Si occuperà principalmente di: • Definire le esigenze aziendali e analisi funzionale dei prodotti e servizi offerti dalla linea di business...
- Azienda di prodotto, 150 dipendenti Castel Maggiore Azienda Realtà operante nel settore manifatturiero metalmeccanico, specializzata nella progettazione e produzione di componenti elastici e particolari metallici. Offerta Analisi dei dati relativi agli interventi...Lavoro ibrido
- Per nuova progettualità, stiamo ricercando un Analista Funzionale IT Senior, con esperienza in ambito Bancario per attività legate ai processi di Pagamento, SDD, SCT, messaggi interbancari ISO 20022, tipo PACS, PAIN, CAMT. PROFILO RICERCATO Esperienza ...Libero professionistaTempo pienoLavoro ibridoDisponibilità immediata
- Iaawg is seeking a Client & Financial Data Services Specialist in Bologna to act as a key contact for business and IT teams. The role involves managing operational activities and ensuring the accuracy of data services related to financial instruments. A Master's degree...
- Language Matters Recruitment Consultants Ltd sta cercando un Consulente Finanziario con italiano da remoto. Il candidato ideale ha esperienza nella gestione di account con alto profilo patrimoniale e competenze nel settore degli investimenti. Il lavoro è completamente...Libero professionistaRemoto
- ...A leading digital luxury group based in Bologna is seeking a BI Data Analyst specializing in Power BI. This role focuses on reporting, dashboards, and data modeling, with an emphasis on leveraging an existing semantic layer built on a Tabular cube. The ideal candidate...Lavoro ibrido
- Ti piacerebbe dare uno slancio alla tua carriera? Vuoi contribuire a progetti innovativi in una realtà leader del settore IT come Capgemini? Cogli l’opportunità, unisciti alla squadra, intraprendi il tuo viaggio. Per il potenziamento della practice Insights & Data...
50.000 € - 55.000 €
...Wyser S.r.l. A Socio Unico sta cercando un Senior AI Agent Developer in Emilia-Romagna, Bologna. Il candidato ideale avrà almeno 2 anni di esperienza nello sviluppo di agenti intelligenti e dovrà progettare e implementare soluzioni AI integrate nell’ecosistema Microsoft...Remoto- RESPONSIBILITIES Analysis and design of use cases and processes based on business requirements and related presentation to IT/Business. Detailed technical/functional analysis based on the designed use cases. Integration analysis with external systems. Creation...
- Iaawg is seeking candidates for a position focused on the analysis and design of business processes and use cases. Responsibilities include detailed analysis based on business requirements, integration with external systems, and supporting the project manager. The ideal...
- Descrizione annuncio Intersport Italia , leader internazionale nel settore articoli sportivi, ricerca una figura di Business Developer Specialist itinerante sul territorio italiano con base a Bologna. La risorsa supporterà le attività di sviluppo e ampliamento...
- Iaawg is seeking a professional in Bologna / Milano to analyze and design business processes based on requirements. Candidates should have a Master's degree in relevant fields and 1+ years of consulting experience. Successful applicants will possess strong analytical...
