Professional Summary

AI Engineer and Researcher with Ph.D. in Machine Learning, specializing in LLMs, multi-agent systems, and cloud-native AI solutions. Proven track record building production AI platforms at scale, from financial crime detection models processing billions of transactions to robust multi-agent frameworks. My Ph.D. dissertation focused on sequences of hypergraphs as a novel approach to represent judicial cases and enable reasoning with explainability and counter-factual analysis at the core. Dual academic-industrial background with 4 patents, 15+ publications, and expertise (+10 years of experience) in both research innovation and commercial product development. Looking to work for innovative environments.

Experience

Founding AI Engineer / AI Researcher

Mar. 2024 - Present
Proofs.io · Cracow, Poland

Building a multi-agent framework enabling software development in minutes. Lead AI engineer responsible for 90% of AI backend architecture and implementation in a 7-person startup. Participating in cloud infrastructure, MLOps, and product architecture alongside core AI development.

  • Designed unified LLM client supporting 8+ providers (OpenAI, Anthropic, Gemini, Groq, Baseten, local via vLLM, ...), enabling vendor-agnostic AI integration
  • Developed innovative function calling techniques beyond standard MCP with function dependencies, full OpenAPI specs support, and stateful tools for less tokens and API calls
  • Built flexible declarative state-machine based agents with human-in-the-loop workflows
  • Implemented robust one-shot learning and distillation "blueprint" mechanism for agent behavior replication
  • Created knowledge extraction framework from user interactions and agent trial-and-error for continuous learning
  • Improving SotA techniques for codebase representation with Knowledge Graph representation, unifying LSP, AST and additional contextual/metadata into a single graph (using Memgraph)
  • Architected advanced RAG system with context engineering and rescoring techniques

Member of the Community of Experts

Jan. 2025 - Present
Eloquence AI (EU Horizon Project) · Remote

Providing expertise in AI, notably LLM, and associated risks for European consortium dedicated to conversational AI. Assess technical proposals to ensure innovative value while mitigating ethical bias. Project funded by Horizon Europe, the EU's key funding programme for research and innovation.

AI & Data Science Consultant

2017 - Present
Hother LLC (Independent Consulting) · Cracow, Poland · 🌐 hother.io

Through my LLC, I accept some consulting missions related to AI, Data Science and Software Engineering.

  • Clinical AI (InovIntell, 2024-present): Synthetic data generation for clinical studies with counter-factual analysis using Bayesian Networks, GANs, VAEs, and Neural ODEs
  • Enterprise Search (Undisclosed, 2023-2024): Built semantic search engine and conversational SQL agent, designed SaaS architecture including cost estimation
  • Financial AI (Undisclosed, 2023): Predictive modeling and recommendation system for M&A analysis using tripartite graph embeddings
  • SEO AI (Watussi, 2017): Conceived and implemented NLP tool for Search Engine Optimization using Gensim and NLTK
  • Successfully delivered 4+ AI projects with expertise in SBERT, LLM integration, Text-to-SQL, PostgreSQL, FastAPI, Docker, and Kubernetes

Head of Data and AI / Technical Lead

Apr. 2022 - Mar. 2024
YData · Cracow, Poland

Led data science and AI team building SaaS platform for synthetic data generation and automated data quality assessment. Managed team of 6 engineers, shaped core IP, and drove product roadmap from research to production. Responsible for building the team, leading implementation effort, and answering fundamental research questions at fast pace.

  • Co-author and maintainer of ydata-profiling library (10k+ GitHub stars), one of the leading Python tools for automated data profiling
  • Researched and developed new generative models for synthetic data: scalable Normalizing Flow for tabular data, Bayesian network time series synthesis, SQL database generator with schema integrity, and longitudinal data augmentation (see Patent section)
  • Built distributed processing systems using Dask, scaling models to process TB-scale datasets for enterprise clients
  • Integrated LLM capabilities for table-to-text generation, enhancing synthetic data quality and interpretability

Senior AVP of Data Science

Dec. 2020 - Apr. 2022
HSBC (Financial Crimes Threat Mitigation) · Cracow, Poland

Led global financial crime detection models serving 200M+ users processing billions of transactions monthly. Managed complete model governance lifecycle from data quality to production monitoring.

  • Designed and deployed main global Name Screening model used across HSBC group, reducing false positives by 70% while maintaining <2.0% false negative rate
  • Automated data science reporting pipeline, achieving 90%+ time savings equivalent to 20 FTE/year in operational efficiency
  • Developed controllable ML framework using rule injection, guaranteeing fixed model risk profiles for regulatory compliance
  • Created multiobjective optimization methodology for model risk assessment, adopted as standard across compliance analytics
  • Served as scientific advisor and peer reviewer while improving internal model governance lifecycle including scientific culture, code governance, and engineering risks

Senior Software Engineer & Data Scientist

May 2015 - Dec. 2020
IBM · Cracow, Poland

Architecture and engineering roles spanning cloud platforms, AI systems, and research collaborations. Responsible for flagship products and innovative AI platform development.

  • Engineering on IBM Cloud Pak for Data flagship platform for Multi/Private/Hybrid Cloud serving 1000+ enterprise clients with a focus on monitoring, HA, predictive maintenance and D&R capabilities
  • Built AI platforms for financial process automation (IBM Cobee) and code quality prediction (IBM Code Quality Center)
  • Developed semantic search engine for legal documents and NLP/Topic Modeling services for enterprise applications
  • Led monitoring solutions for GPFS and Red Hat OpenShift in hybrid cloud environments
  • Served as Machine Learning and Data Science trainer, managed university collaborations, and supervised 10+ master students on AI research projects with conference presentations and lectures

Research Collaborations

2012 - 2014
Inria (multiple collaborations) · France

Early career research collaborations across academic environments focusing on optimization and machine learning.

  • Research Collaboration (Inria Saclay, 2013-2014): Evolutionary planner Divide-and-Evolve, MultiZenoTravel solver development, online parameter adaptation with Marc Schoenauer, TAO team
  • Research Collaboration (Inria Nord-Europe, 2012-2013): Parallel Island Model for Metaheuristics, genetic algorithms, stochastic optimization with Clive Canape, DOLPHIN team
  • Software Engineer Internship (Inria Nord-Europe, 2012): ParadisEO C++ framework development, shared memory parallelism module (SMP), continuous integration

Education

Ph.D. in Machine Learning

2016 - 2021
Poznan University of Technology · Poland

Thesis: "End-to-end approach to classification in unstructured spaces with application to judicial decisions"

M.Sc. Applied Mathematics & Computer Science

2009 - 2015
National Institute of Applied Sciences (INSA) · Rouen, France

French Engineer Diploma with specializations in Decision Aids (AI, Optimization, Intelligent Systems), Mathematical Methods for Data Science (Big Data, Machine Learning, Statistics), and Modeling & Numerical Simulations (PDEs, HPC, Probabilities)

M.Sc. High Performance Computing and AI

2013 - 2014
Cracow University of Technology · Poland

Specialized master's program in high-performance computing and artificial intelligence

Patents & Intellectual Property

  • US20240256946A1 (2024): Data synthesis with enhanced scalability using plurality of data synthesizers
  • US11328715B2 (2021): Cooperative platform tasks - automatic assignment using machine learning algorithms
  • US11263113B2 (2021): Code quality prediction - cloud application for automatic issue detection using RL and rule-based learning
  • US11226889B2 (2021): Regression prediction - software development regression prediction using ML techniques

Selected Publications

Full publication list (15+ papers) available at: quemy.info/research


ydata-profiling: Accelerating Data-Centric AI with high-quality data (2023)

F. Clemente, G. Martins Ribeiro, A. Quemy, et al. - Neurocomputing

Artificial Intelligence & Fair Trial Rights (2023)

H. Molbæk-Steensig, A. Quemy - Oxford Press book chapter

Binary Classification in Unstructured Space With Hypergraph Case-Based Reasoning (2019)

A. Quemy - Information Systems

A large reproducible benchmark on text classification for the legal domain (2023)

A. Quemy, R. Wrembel, et al. - Information Systems

Teaching & Supervision

MLOps Lectures (Winter 2023)

Poznan University of Technology · Guest lecturer on machine learning operations and deployment

Theoretical Machine Learning (2018-2019)

IBM Krakow Software Lab · Internal training on advanced ML theory and applications

Master Thesis Supervision

  • Rodrigo Tuna (2023-2024): Fairness Aware Semi-Synthetic Graph Generation
  • Pawel Mroz (2018-2019): Graph-based linear explanation for supervised machine learning models

Research Internship Supervision

Supervised 8+ research internships on topics including hyperparameter optimization, ECHR decision prediction, regression prediction, and semantic partitioning algorithms

Professional Recognition

  • Technical Program Committee Member: Fuzzy Information and Engineering (Taylor & Francis), Computing (Springer), Data & Knowledge Engineering (Springer), Expert Systems With Applications (Elsevier)
  • Conference Reviewer: Data Analytics solutions for Real-Life Applications (DARLI-AP) workshop collocated with EDBT/ICDT Joint Conference
  • Open Source Impact: Maintainer of multiple open-source projects including ParadisEO, Descarwin, ZenoSolver, and ECHR-OD
  • Research Impact: 15+ peer-reviewed publications in international journals and conferences
  • Industry Recognition: 4 granted US patents and 1 pending application in AI/ML domain