Profile banner
Sankalp Garg

Sankalp Garg

Co-founder at Optexity, Machine Learning Engineer, and researcher

San Francisco Bay Area, California
Joined August 2025

Network

3.4K connections
🧠
AI/ML Research & Leadership
🎓
IIT Alumni in Tech & Finance
🍎
Apple AI/ML Engineers
💻
Big Tech Software Engineers (IIT)
🇮🇳
IIT Delhi Alumni & Business Leaders
🤖
Big Tech ML Engineers & Researchers
💡
Startup Founders & Investors

Summary

Sankalp Garg is an entrepreneur and co-founder of Optexity, a company focused on leveraging AI for browser automation. Optexity aims to transform UI tasks into APIs by recording workflows and generating AI-powered automation scripts, offering a solution for reliable and cost-effective automation. optexity+1
His professional experience includes significant roles as a Machine Learning Engineer at Apple and an Applied Scientist Intern at Amazon Web Services (AWS), demonstrating his expertise in applied machine learning within large tech companies. He also worked as a Quantitative Strategist at Quadeye, gaining experience in a different technical domain. rocketreach+1
Sankalp has a strong academic background in Artificial Intelligence and Machine Learning, holding a Master of Science from Carnegie Mellon University and a Bachelor of Technology in Electrical and Electronics Engineering from the Indian Institute of Technology, Delhi. His research contributions are evident through his publications on topics such as neural transfer for RDDL planning, generalized neural policies for relational MDPs, and improved finetuning of zero-shot vision models. google+2
Beyond his primary employment, he has engaged in various research internships and assistantships at institutions like Microsoft, the Indian Institute of Technology, Delhi (including the Neuromechanics Research group), and the University of Southern California, as well as a visiting researcher position at the National University of Singapore. This broad experience highlights his continuous engagement with advanced research and academic environments.

Work

Education

Writing

System and method for finuting of zero-shot vision models

January 1, 2025

A method discloses receiving a plurality of input images, receiving text prompts, generating a visual matrix utilizing the images and an image encoder, generating a text matrix utilizing a text encoder, multiplying the text matrix and the visual matrix to generate an image-text similarity matrix that assigns a numerical value indicating similarities between each of encoded visual descriptors and each of the encoded images, wherein similarities are indicated by entries of the image-text similarity matrix having numerical values that determine a loss function associated with the image-text similarity matrix, identify a gradient of the loss function with respect to parameters associated with the image encoder and parameters associated with the text encoder, utilizing the gradient, update parameters associated with the image encoder or the text encoder, and outputting final updated parameters associated with either the text encoder or image encoder of the machine learning network.

Favicon imagescholar.google.com

Finetune like you pretrain: Improved finetuning of zero-shot vision models

January 1, 2023

This work shows that a natural and simple approach of mimicking contrastive pretraining consistently outperforms alternative finetuning approaches, and establishes the proposed method of contrastive finetuning as a simple and intuitive state-of-the-art for supervised finetuned of image-text models like CLIP.

Favicon imagescholar.google.com

Symbolic Network: Generalized Neural Policies for Relational MDPs

January 1, 2020

Presents SymNet, the first neural approach for solving Relational MDPs expressed in RDDL, demonstrating that SymNet policies are significantly better than random and sometimes more effective than training state-of-the-art deep reactive policies from scratch.

Favicon imagescholar.google.com

Temporal Attribute Prediction via Joint Modeling of Multi-Relational Structure Evolution

January 1, 2020

This paper proposes a new framework, DArtNet, which learns a static embedding for every node in the graph as well as a dynamic embedding which is dependent on the dynamic attribute value (time-series) and captures the information from the neighborhood by taking a relation specific mean and encodes the history information using RNN.

Favicon imagescholar.google.com

Size Independent Neural Transfer for RDDL Planning

January 1, 2019

This work presents the first method for neural transfer of RDDL MDPs that can transfer across problems of different sizes and has superior learning curves over training from scratch.

Favicon imagescholar.google.com

An accelerometer based fall detection system using deep neural network

January 1, 2019

A Deep Neural Network (DNN) for fall detection is proposed and proved to be independent of filtering operations suggesting the approach to be useful in noisy environment.

Favicon imagescholar.google.com

Transfer of Deep Reactive Policies for MDP Planning

January 1, 2018

This paper presents the first domain-independent transfer algorithm for MDP planning domains expressed in an RDDL representation, which exploits the symbolic state configuration and transition function of the domain to learn a shared embedding space for states and state-action pairs for all problem instances of a domain.

Favicon imagescholar.google.com