One-Shot Learning

Examples of Training and testing the Siamese network for one-shot learning

Details

This project was done as part of the course CS685A: Data Mining, in the Fall ‘18 term at IIT Kanpur under Prof. Arnab Bhattacharya, Department of Computer Science and Engineering, IIT Kanpur.

Brief summary of the work done in the project

The process of learning good features for machine learning applications can be very computationally expensive and may prove difficult in cases where little data is available. A prototypical example of this is the one-shot learning setting, in which we must correctly make predictions given only a single example of each new class. In this paper, we firstly explore Siamese Networks, a unique twin-network setting which is capable of checking how similar any two inputs are. The observed shortcomings of the model inspire us to explore a more robust model, Matching Networks, a neural network which uses recent advances in attention and memory that enable rapid learning. We then extend the models to the MNIST and ImageNet datasets in addition to the standard Omniglot dataset they are trained on to check their robustness and achieve promising results.

Mrinaal Dogra
Mrinaal Dogra
Lead Engineer

My research interests include artificial intelligence, human-machine interaction and robotics