Computer science graduate with a passion for software engineering, likes to take initiative to tackle hard challenges , with experience building data processing systems. Seeking opportunities where I can leverage my skills and experience and contribute to the organization in a collaborative and learning environment.
-
Experience
Graduate Research Assistant, Arizona State University August 2018 – October 2019
• MS Thesis: Developed a generalized classification system using BERT a language model to identify personal health experience mention in tweets across different health domains. Ranked 2nd out of 19 submissions in the SMM4H workshop at ACL 2019 conference. Improved the system to get state-of-the-art accuracy of 87 percent.
• Improved F1 score by 50 percent using pretraining and transfer learning to classify Adverse Drug Reactions (ADR) in Tweets. Achieved an increase of 22 percent in the accuracy by data augmentation and BERT modification to perform Named Entity Extraction (NER) of ADR spans in tweets.
Research Assistant, Arizona State University December 2018 – April 2019
• Designed a machine learning system for a research project to study the Impact of Impression Management on Academic Program Reputations on social media. This research has been accepted at a top Educational Research conference AERA 2020.
• Developed a data processing pipeline for data collection, preprocessing, and feature engineering.
• Created four different machine learning (SVM)models with accuracies of 83 to 91 percent.
Graduate Teaching Assistant, Arizona State University August 2018 – December 2018
• Delivered range of teaching and assessment activities including programming labs directed towards delivery of defined coursework for CSE 110 Principles of programming in Java.
• Managed 6 labs for around 120 students and helped them with the course related queries, assignments and projects.
-
Projects
Image recognition as a Service on AWS February 2018 – April 2018
• Developed an elastic application on AWS for image recognition using a deep learning model. Implemented a PHP application to handle the end-user interaction.
• Developed Java programs to interact with and provide cross interaction between different AWS resources like EC2, S3, SQS, and CloudWatch to manage the data flow.
• Reduced time overhead by 120 seconds by implementing an efficient load balancing algorithm to automatically scale in and out on demand.
Geospatial Hotspot Analysis February 2018 – April 2018
• Implemented a Hadoop cluster on AWS to execute geospatial queries like Range, Join, KNN using SparkSQL.
• Performed Operational tests by varying load and cluster configurations.
Project Management as a Service August 2018 – December 2018
• Designed an elastic application on the google cloud platform to manage projects.
• Implemented REST APIs to connect python HTTP server with NodeJS backend using Express.
Real-time Scalable Dashboard March 2020 – April 2020
• Developed three different subsystems – a crawler using Selenium, NLP system using SparkML and a Dashboard using Flask and Dash. Implemented python script to place the data collected from the crawler into a Redis queue for NLP processing.
• Containerized the subsystems using Docker and deployed on a Kubernetes cluster.