Profile banner
Erfan Miahi

Erfan Miahi

ML/RL Research Engineer, ex-founder, focused on decentralized AI and LLMs

erfan-miahi
Toronto, Canada
Joined November 2024

Network

9.2K connections
πŸŽ“
UAlberta AI Faculty
πŸ›οΈ
Guilan University Academics
πŸ’‘
AI ML Startup Founders
πŸ‡¨πŸ‡¦
Huawei Canada AI Researchers
🧠
Amii ML Scientists Educators
βš™οΈ
RL Core Tech
πŸ“Š
DeepR

Summary

Erfan Miahi is a highly active and accomplished Machine Learning Research Engineer with a deep specialization in Reinforcement Learning (RL) and its applications. He possesses over seven years of hands-on experience in building scalable AI/SaaS systems and reliable ML pipelines, collaborating with prominent researchers from institutions like Google DeepMind and the University of Alberta's RLAI Lab. github+2
He is an entrepreneurial technical founder, currently serving as Technical Founder at Qelix, where he is developing a platform for training custom AI models using RL. Previously, he was a Technical Co-founder at Intract and a Founder in Residence at Antler. His startup experience is complemented by his role as a Machine Learning Research Engineer and Consultant at DeepR Analytics, where he developed large-scale ML systems for automated trading. thenetwork
Erfan is a prolific researcher, with a Google Scholar h-index of 5 and i10-index of 5, indicating significant contributions to the fields of Reinforcement Learning, Machine Learning, and Optimization. His academic work includes a Master's thesis on feature generalization in deep RL, and notable papers on topics such as neural architecture search for medical image assessment and scalable evolutionary optimization. google+2
His current work at Covenant AI and the development of projects like 'grail' showcases a strong focus on decentralized AI and large language models (LLMs). He is actively involved in implementing advanced techniques like GRPO for verifiable post-training and scaling RL finetuning on Bittensor, and has published a paper (CCPS) on calibrating LLM confidence. nitter+2
Beyond his technical expertise, Erfan Miahi is dedicated to mentorship, offering a free program to guide students interested in AI. He also maintains a personal interest in philosophy and actively participates in extreme sports, including parkour and skateboarding, reflecting a dynamic and multifaceted personality. github+3

Work

Education

Projects

Writing

Calibrating LLM Confidence by Probing Perturbed Representation Stability (CCPS)

January 1, 2025

Introduces CCPS, a novel method for more accurately estimating Large Language Model (LLM) confidence by analyzing internal representational stability. It applies targeted adversarial perturbations to final hidden states, extracts features, and uses a lightweight classifier to predict answer correctness. The paper demonstrates significant improvements in calibration metrics and accuracy over existing methods.

Favicon imagearxiv.org

Investigating the properties of neural network representations in reinforcement learning

January 1, 2024

A research paper (pre-print on arXiv) exploring neural network representations within reinforcement learning.

Favicon imageerfanmhi.github.io

Feature Generalization in Deep Reinforcement Learning: An Investigation into Representation Properties

January 1, 2022

Master's thesis investigating the connection between properties and generalization performance of representations learned by deep reinforcement learning algorithms. It empirically examines representations good at generalization and proposes novel hypotheses regarding the impact of auxiliary tasks in end-to-end training.

Favicon imageualberta.scholaris.ca

Genetic Neural Architecture Search for automatic assessment of human sperm images

January 1, 2022

Published in Expert Systems and Applications, this paper details the use of Genetic Neural Architecture Search for the automatic assessment of human sperm images.

Favicon imageerfanmhi.github.io

Scalable Transfer Evolutionary Optimization: Coping with Big Task Instances

January 1, 2022

Published in IEEE Transactions on Cybernetics, this work addresses scalable transfer evolutionary optimization for handling large task instances.

Favicon imageerfanmhi.github.io

Effect of deep transfer and multi-task learning on sperm abnormality detection

January 1, 2021

Published in Computers in Biology and Medicine, this paper explores the impact of deep transfer and multi-task learning on detecting sperm abnormalities.

Favicon imageerfanmhi.github.io

Hobbies

Engages in various extreme sports including parkour, skateboarding, skydiving, workouts, climbing, and cycling. nitter+2

Expresses creativity through AI art, with dedicated Instagram pages for his work. instagram

Has an interest in philosophy, often mentioning it in his online presence. nitter

Notes

AFAllan Fischsaid they've seen
EMErfan Miahi
build or research AI technologies in a way that stood out to them.

Nov 19, 2025