Experience
Founding AI Engineer
Oct. 2025 - PresentBuilding an agentic OS for team productivity. Responsible for human-to-agent interactions, designing user-agent interactions while inferring recurring tasks to enable workflow structuring and automation.
Founding AI Engineer / AI Researcher
Mar. 2024 - PresentBuilding a multi-agent framework enabling software development in minutes. Lead AI engineer responsible for 90% of AI backend architecture and implementation in a 7-person startup. Participating in cloud infrastructure, MLOps, and product architecture alongside core AI development.
Lead AI engineer (90% of AI backend) building multi-agent framework in a 7-person startup.
- Designed unified LLM client supporting 8+ providers (OpenAI, Anthropic, Gemini, Groq, Baseten, local via vLLM, ...), enabling vendor-agnostic AI integration
- Developed innovative function calling techniques beyond standard MCP with function dependencies, full OpenAPI specs support, and stateful tools for less tokens and API calls [presentation]
- Built flexible declarative state-machine based agents with human-in-the-loop workflows
- Implemented robust one-shot learning and distillation "blueprint" mechanism for agent behavior replication
- Created knowledge extraction framework from user interactions and agent trial-and-error for continuous learning
- Improving SotA techniques for codebase representation with Knowledge Graph representation, unifying LSP, AST and additional contextual/metadata into a single graph (using Memgraph - alternative to Neo4j - for building, and LLM to Cypher for querying)
- Architected advanced RAG system with context engineering and rescoring techniques
- Unified LLM client (8+ providers), innovative function calling, declarative state-machine agents with human-in-the-loop
- Knowledge Graph codebase representation (Memgraph, Cypher); advanced RAG with context engineering
Member of the Community of Experts
Jan. 2025 - PresentProviding expertise in AI, notably LLM, and associated risks for European consortium dedicated to conversational AI. Assess technical proposals to ensure innovative value while mitigating ethical bias. Project funded by Horizon Europe, the EU's key funding programme for research and innovation.
AI/LLM expertise for EU Horizon-funded European consortium on conversational AI and ethical bias.
AI & Data Science Consultant
2017 - PresentFounding partner of specialized AI consulting firm serving startups and enterprises.
Founding partner of AI consulting firm. 4+ delivered projects across vision, clinical AI, NLP, financial AI.
- Image Recognition (CDD services, 2025): Designed and developed pipeline to train and fine-tune vision models for passport detection and reading, optimized for on-device inference on mid-range smartphones
- Clinical AI (InovIntell, 2024-present): Synthetic data generation for clinical studies with counter-factual analysis using Bayesian Networks, GANs, VAEs, and Neural ODEs
- Enterprise Search (Undisclosed, 2023-2024): Built semantic search engine and conversational SQL agent, designed SaaS architecture including cost estimation
- Financial AI (2023-present): Predictive modeling and recommendation system for M&A analysis using tripartite graph embeddings
- SEO AI (Watussi, 2017): Conceived and implemented NLP tool for Search Engine Optimization using Gensim and NLTK
- Successfully delivered 4+ AI projects with expertise in SBERT, LLM integration, Text-to-SQL, PostgreSQL, FastAPI, Docker, and Kubernetes
- Clinical AI (InovIntell): Synthetic data for clinical studies — Bayesian Networks, GANs, Neural ODEs
- Financial AI: M&A predictive modeling with tripartite graph embeddings
Head of Data and AI / Technical Lead
Apr. 2022 - Mar. 2024Led data science and AI team building SaaS platform for synthetic data generation and automated data quality assessment. Managed team of 6 engineers, shaped core IP, and drove product roadmap from research to production. Responsible for building the team, leading implementation effort, and answering fundamental research questions at fast pace.
Led team of 6 for synthetic data SaaS. Co-author of ydata-profiling (10k+ GitHub stars). New generative models (Normalizing Flow, Bayesian networks). TB-scale with Dask.
- Co-author and maintainer of ydata-profiling library (10k+ GitHub stars), one of the leading Python tools for automated data profiling
- Researched and developed new generative models for synthetic data: scalable Normalizing Flow for tabular data, Bayesian network time series synthesis, SQL database generator with schema integrity, and longitudinal data augmentation (see Patent section)
- Built distributed processing systems using Dask, scaling models to process TB-scale datasets for enterprise clients
- Integrated LLM capabilities for table-to-text generation, enhancing synthetic data quality and interpretability
Senior AVP of Data Science
Dec. 2020 - Apr. 2022Led global financial crime detection models serving 200M+ users processing billions of transactions monthly. Managed complete model governance lifecycle from data quality to production monitoring, following whole governance lifecycle process of global models.
Led global financial crime models (200M+ users). Name Screening model: 70% false positive reduction. Automated reporting: 90%+ time savings (20 FTE/year).
- Designed and deployed main global Name Screening model used across HSBC group, reducing false positives by 70% while maintaining <2.0% false negative rate
- Automated data science reporting pipeline, achieving 90%+ time savings equivalent to 20 FTE/year in operational efficiency
- Developed controllable ML framework using rule injection, guaranteeing fixed model risk profiles for regulatory compliance
- Created multiobjective optimization methodology for model risk assessment, adopted as standard across compliance analytics
- Served as scientific advisor and peer reviewer while improving internal model governance lifecycle including scientific culture, code governance, and engineering risks
Senior Software Engineer & Data Scientist
May 2015 - Dec. 2020Architecture and engineering roles spanning cloud platforms, AI systems, and research collaborations. Responsible for flagship products and innovative AI platform development.
Cloud Pak for Data (1000+ enterprise clients). AI platforms, semantic search. Supervised 10+ master students.
- Engineering on IBM Cloud Pak for Data flagship platform for Multi/Private/Hybrid Cloud serving 1000+ enterprise clients with a focus on monitoring, HA, predictive maintenance and D&R capabilities
- Built AI platforms for financial process automation (IBM Cobee) and code quality prediction (IBM Code Quality Center)
- Developed semantic search engine for legal documents and NLP/Topic Modeling services for enterprise applications
- Led monitoring solutions for GPFS and Red Hat OpenShift in hybrid cloud environments
- Served as Machine Learning and Data Science trainer, managed university collaborations, and supervised 10+ master students on AI research projects with conference presentations and lectures
Research Collaborations
2012 - 2014Early career research collaborations across academic environments focusing on optimization and machine learning.
Evolutionary planners, parallel metaheuristics, ParadisEO C++ framework. Inria Saclay & Nord-Europe.
- Research Collaboration (Inria Saclay, 2013-2014): Evolutionary planner Divide-and-Evolve, MultiZenoTravel solver development, online parameter adaptation with Marc Schoenauer, TAO team
- Research Collaboration (Inria Nord-Europe, 2012-2013): Parallel Island Model for Metaheuristics, genetic algorithms, stochastic optimization with Clive Canape, DOLPHIN team
- Software Engineer Internship (Inria Nord-Europe, 2012): ParadisEO C++ framework development, shared memory parallelism module (SMP), continuous integration
Education
Ph.D. in Machine Learning
2016 - 2021Thesis: "End-to-end approach to classification in unstructured spaces with application to judicial decisions"
M.Sc. Applied Mathematics & Computer Science
2009 - 2015French Engineer Diploma with specializations in Decision Aids (AI, Optimization, Intelligent Systems), Mathematical Methods for Data Science (Big Data, Machine Learning, Statistics), and Modeling & Numerical Simulations (PDEs, HPC, Probabilities)
M.Sc. High Performance Computing and AI
2013 - 2014Specialized master's program in high-performance computing and artificial intelligence
Patents & Intellectual Property
- US20240256946A1 (2024): Data synthesis with enhanced scalability using plurality of data synthesizers
- US11328715B2 (2021): Cooperative platform tasks - automatic assignment using machine learning algorithms
- US11263113B2 (2021): Code quality prediction - cloud application for automatic issue detection using RL and rule-based learning
- US11226889B2 (2021): Regression prediction - software development regression prediction using ML techniques
Selected Publications
Full publication list (20+ papers) available at: quemy.info/research
Intrinsic Green's Learning: Supervised Learning on Manifolds via Inverse PDE (2026)
A. Quemy - AI & PDE Workshop @ ICLR 2026
ydata-profiling: Accelerating Data-Centric AI with high-quality data (2023)
F. Clemente, G. Martins Ribeiro, A. Quemy, et al. - Neurocomputing
Artificial Intelligence & Fair Trial Rights (2023)
H. Molbæk-Steensig, A. Quemy - Oxford Press book chapter
Binary Classification in Unstructured Space With Hypergraph Case-Based Reasoning (2019)
A. Quemy - Information Systems
A large reproducible benchmark on text classification for the legal domain (2023)
A. Quemy, R. Wrembel, et al. - Information Systems
Teaching & Supervision
MLOps Lectures (Winter 2023)
Poznan University of Technology · Guest lecturer on machine learning operations and deployment
Theoretical Machine Learning (2018-2019)
IBM Krakow Software Lab · Internal training on advanced ML theory and applications
Master Thesis Supervision
- Rodrigo Tuna (2023-2024): Fairness Aware Semi-Synthetic Graph Generation
- Pawel Mroz (2018-2019): Graph-based linear explanation for supervised machine learning models
Research Internship Supervision
Supervised 8+ research internships on topics including hyperparameter optimization, ECHR decision prediction, regression prediction, and semantic partitioning algorithms
Professional Recognition
- Technical Program Committee Member: Fuzzy Information and Engineering (Taylor & Francis), Computing (Springer), Data & Knowledge Engineering (Springer), Expert Systems With Applications (Elsevier)
- Open Source Impact: Maintainer of multiple open-source projects including ParadisEO, Descarwin, ZenoSolver, and ECHR-OD
- Research Impact: 20+ peer-reviewed publications in international journals and conferences
- Industry Recognition: 4 granted US patents and 1 pending application in AI/ML domain