Anoop Kumar

Statistics and Machine Learning Lab, IISc Bangalore, KA - 560012 ยท

anoopkumar@iisc.ac.in

I am a Computer Science graduate student at CSA Dept, Indian Institute of Science , Bangalore. My primary area of work include Natural Language Processing and Graph Neural Networks. I am also interested in solving deep learning problems in Few-Shot scenarios. In my free time, I love to play badminton.
Link to my resume.


Education

Indian Institute of Science, Bangalore

Master of Technology
Computer Science and Engineering

GPA: 7.9*

Major Courses: Deep Learning For NLP, Machine Learning, Practical Data Science, Computational Methods of Optimization, Linear Algebra & Probability, Algorithms, Compilers

August 2019 - June 2021

GCET, Greater Noida

BACHELOR OF TECHNOLOGY

Percentage: 82.26

Major Courses: Operating Systems, Computer Organisation, DBMS, Computer Networks, Data Structures & Algorithms, Engineering Mathematics

August 2014 - June 2018

Bharatiya Vidya Bhavan's KDK Vidya Mandir, Renukoot

All India Senior School Certificate Examination
Science

Percentage: 89.0

April 2013 - March 2014

Bharatiya Vidya Bhavan's KDK Vidya Mandir, Renukoot

All India Secondary School Examination
Science

GPA: 10.0

April 2011 - March 2012

Experience

Data Science Intern

Myntra Designs, Bangalore

Developed a Machine Learning pipeline to generate Personalized User Recommendations.
- Prepared User session dataset from raw data
- Performed required Exploratory Data Analysis
- Trained embedding models to capture the contextual representation of the products & Users
- Ensemble version of MetaProd2Vec gave a 10 % improvement in mean cosine similarity.

April 2020 - May 2020

Programming Officer

Oil and Natural Gas Corporation, Mumbai

Responsible for the administration and efficient functioning of Mumbai High Data Center, development and maintenance of applications, and providing support to geologists and geophysicists.

Dec 2018 - July 2019

Recent Projects

Advance Chatbots

Lab Project with British Telecom

- Working on the System Act Prediction and Natural Language Generation (NLG) modules.
- Developed a GPT based transformer model for Response Generation in Few-shot scenario.
- The model achieved BLEU score of 63% and Slot Error Rate of 3.1%.

Ongoing

Brain Networks

Lab Project

- Modelled human brain as a network using fMRI data, to detect Alzheimer.
- Built a Hierarchical GCN based model, achieved 83% accuracy on ADNI data, beats kernel-based methods.
- Working on scalable multiplex embedding techniques to incorporate multi-task fMRI.

Ongoing

Aspect-Based Sentiment Analysis (ABSA)

DLNLP Course Project

- Developed a NLP model to predict aspects and its polarity from restaurant reviews.
- Constructed dummy sentences from the aspect to convert ABSA to a sentence-pair classification task.
- Fine-Tuned the pre-trained model from BERT to achieved 85% F1 score on Test data.

Nov 2020

Sentiment Analysis on Code-Mixed Text

DLNLP Course Project

- Developed a NLP model for sentiment Analysis on Code-Mixed Hinglish Dataset (Semeval 2020)
- Used an ensemble architecture of convolutional neural net (CNN) and self-attention based LSTM.
- Code-Mixed data is handled using XML-R, achieved 68.5% F1 score on Test data.

Oct 2020

Few-Shot Link Prediction in Knowledge Graphs

Machine Learning Course Project

- Made key modifications to the existing MetaR model for few-shot link prediction
- The new model uses a semantic scoring function and pre-trained model entity embeddings
- The model improves the MRR from 26.1% to 31.5% on NELL-One dataset in 5-shot setting

June-July 2020

Prototypical networks for few shot image classification

Machine Learning Course Project

- Built a deep metric learning based Few-shot classification model
- Primarily a CNN based model implemented using Pytorch. Dataset used: Omniglot.
- The model improves the classification accuracy from 85% to 87.6% in 1-shot setting.

June 2020

Loop Interchange Tool in MLIR

Compilers Course Project

- Developed a loop interchange tool in LLVM to speed up matrix operations
- The tool optimizes for locality (both spatial & temporal) and parallelism for multi-cores.
- Driven by an analytical cost model, implemented on Affine dialect in MLIR.

June-July 2020

Clang Support for Nested Functions in C

Compilers Course Project

- Built a tool which allows the programmer to write nested functions in C.
- The tool performs source to source transformations in Clang(C/C++ Compiler).
- Based on Clang LibTooling and AST Matchers, converts nested label statements into closures.

June 2020

Skills

Tools and Technology
  • Deep Learning - NLP, CNNs, Graph Neural Networks, Knowledge Graphs
  • Machine Learning - Gradient Boosted Trees, Clustering & Classification algorithms
  • Frameworks/ Libraries - Pytorch, Tensorflow, Gensim, Pandas, Numpy, ScikitLearn
  • BigData - Map Reduce, Hadoop(Basic), Pig
  • Compilers - LLVM, Clang, MLIR


Achievements & Certifications

  • Secured position among top 150 teams from a total of 23K teams in Flipkart GRiD 2.0, 2020.
  • Secured AIR 50 in GATE 2018 with a percentile of 99.95
  • Among top 10% of test takers on the Triplebyte assessment for Machine Learning
  • Certification in Machine Learning course from Coursera
  • Certification in Hadoop Development