Erfan Miahi

ML/RL Research Engineer, ex-founder, focused on decentralized AI and LLMs

erfan-miahi

Toronto, Canada

Joined November 2024

Network

9.2K connections

NSMTMC

HAFFSA

AWMWZW

MMMTEA

MSADSL

APRM

KMDG

Summary

Erfan Miahi is a highly active and accomplished Machine Learning Research Engineer with a deep specialization in Reinforcement Learning (RL) and its applications. He possesses over seven years of hands-on experience in building scalable AI/SaaS systems and reliable ML pipelines, collaborating with prominent researchers from institutions like Google DeepMind and the University of Alberta's RLAI Lab. github+2

He is an entrepreneurial technical founder, currently serving as Technical Founder at Qelix, where he is developing a platform for training custom AI models using RL. Previously, he was a Technical Co-founder at Intract and a Founder in Residence at Antler. His startup experience is complemented by his role as a Machine Learning Research Engineer and Consultant at DeepR Analytics, where he developed large-scale ML systems for automated trading. thenetwork

Erfan is a prolific researcher, with a Google Scholar h-index of 5 and i10-index of 5, indicating significant contributions to the fields of Reinforcement Learning, Machine Learning, and Optimization. His academic work includes a Master's thesis on feature generalization in deep RL, and notable papers on topics such as neural architecture search for medical image assessment and scalable evolutionary optimization. google+2

His current work at Covenant AI and the development of projects like 'grail' showcases a strong focus on decentralized AI and large language models (LLMs). He is actively involved in implementing advanced techniques like GRPO for verifiable post-training and scaling RL finetuning on Bittensor, and has published a paper (CCPS) on calibrating LLM confidence. nitter+2

Beyond his technical expertise, Erfan Miahi is dedicated to mentorship, offering a free program to guide students interested in AI. He also maintains a personal interest in philosophy and actively participates in extreme sports, including parkour and skateboarding, reflecting a dynamic and multifaceted personality. github+3

Work

Education

Projects

Writing

Calibrating LLM Confidence by Probing Perturbed Representation Stability (CCPS)

January 1, 2025

Introduces CCPS, a novel method for more accurately estimating Large Language Model (LLM) confidence by analyzing internal representational stability. It applies targeted adversarial perturbations to final hidden states, extracts features, and uses a lightweight classifier to predict answer correctness. The paper demonstrates significant improvements in calibration metrics and accuracy over existing methods.

arxiv.org

Investigating the properties of neural network representations in reinforcement learning

January 1, 2024

A research paper (pre-print on arXiv) exploring neural network representations within reinforcement learning.

erfanmhi.github.io

Feature Generalization in Deep Reinforcement Learning: An Investigation into Representation Properties

January 1, 2022

Master's thesis investigating the connection between properties and generalization performance of representations learned by deep reinforcement learning algorithms. It empirically examines representations good at generalization and proposes novel hypotheses regarding the impact of auxiliary tasks in end-to-end training.

ualberta.scholaris.ca

Genetic Neural Architecture Search for automatic assessment of human sperm images

January 1, 2022

Published in Expert Systems and Applications, this paper details the use of Genetic Neural Architecture Search for the automatic assessment of human sperm images.

erfanmhi.github.io

Scalable Transfer Evolutionary Optimization: Coping with Big Task Instances

January 1, 2022

Published in IEEE Transactions on Cybernetics, this work addresses scalable transfer evolutionary optimization for handling large task instances.

erfanmhi.github.io

Effect of deep transfer and multi-task learning on sperm abnormality detection

January 1, 2021

Published in Computers in Biology and Medicine, this paper explores the impact of deep transfer and multi-task learning on detecting sperm abnormalities.

erfanmhi.github.io

Hobbies

Engages in various extreme sports including parkour, skateboarding, skydiving, workouts, climbing, and cycling. nitter+2

Expresses creativity through AI art, with dedicated Instagram pages for his work. instagram

Has an interest in philosophy, often mentioning it in his online presence. nitter

Notes

AFAllan Fischsaid they've seen

EMErfan Miahi

build or research AI technologies in a way that stood out to them.

Nov 19, 2025

Nathan Benaich

Founder at Spinout.fyi

Alexander Norman

Board Member at Planned

sri vishnu

Hussain Shahbaz

Head of AI & Innovation at Troon Technologies

Devi Parikh

Co-Founder at Yutori

Deep Gandhi

Machine Learning Engineer at Transformer Lab

Erfan Miahi

Network

Summary

Work

Technical FounderQelix2024–Present

Machine Learning Research EngineerCovenant AI2025–Present

Technical Co-founderIntract2023–2025

Founder in ResidenceAntler2025–2025

Machine Learning Research EngineerDeepR Analytics2023–2024

Assistant AI ConsultantAlberta Machine Intelligence Institute (Amii)2021–2023

Private TutorSelf-employed2021–2023

ConsultantDeepR Analytics2022–2023

Research AssociateUniversity of Alberta2022–2023

Graduate Research AssistantUniversity of Alberta2021–2022

ResearcherGuilan NLP Group2019–2021

CMPUT 296 (Basics of Machine Learning) Teaching AssistantUniversity of Alberta2021–2021

Vice Chairman of the Modeling and Artificial Intelligence Committee in Brain and Cognition SocietyUniversity of Guilan2018–2019

Computational Intelligence Teaching AssistantUniversity of Guilan2018–2019

Algorithm Design Teaching AssistantUniversity of Guilan2019–2019

Software DeveloperNilasoft2015–2017

Advanced Programming Teaching AssistantUniversity of Guilan2016–2017

Principles of Computer & Programming Teaching AssistantUniversity of Guilan2016–2017

Data Structures Teaching AssistantUniversity of Guilan2017–2017

Principles of Data Mining Teaching AssistantUniversity of Guilan2017–2017

Education

Master's degree, Computing ScienceUniversity of Alberta2021–2022

Bachelor Degree, Computer EngineeringGuilan University2015–2019

High School, Math and PhysicsNational Organization for Development of Exceptional Talents (Sampad)2011–2015

Middle School, General StudiesNational Organization for Development of Exceptional Talents (Sampad)2008–2011

Projects

GVFs in the Real World: Making Predictions Online for Water Treatment

Algorithmic Trading Strategies with Deep Reinforcement Learning

grail (Verifiable Post-Training for LLMs)

q Evaluation Harness

Writing

Calibrating LLM Confidence by Probing Perturbed Representation Stability (CCPS)

Investigating the properties of neural network representations in reinforcement learning

Feature Generalization in Deep Reinforcement Learning: An Investigation into Representation Properties

Genetic Neural Architecture Search for automatic assessment of human sperm images

Scalable Transfer Evolutionary Optimization: Coping with Big Task Instances

Effect of deep transfer and multi-task learning on sperm abnormality detection

Hobbies

Engages in various extreme sports including parkour, skateboarding, skydiving, workouts, climbing, and cycling. nitter+2

Expresses creativity through AI art, with dedicated Instagram pages for his work. instagram

Has an interest in philosophy, often mentioning it in his online presence. nitter

Notes

Similar profiles

Nathan Benaich

Alexander Norman

sri vishnu

Hussain Shahbaz

Devi Parikh

Deep Gandhi

Nathan Benaich

Alexander Norman

sri vishnu

Hussain Shahbaz

Devi Parikh

Deep Gandhi