Shreyash Hisariya

Software Developer

Stony Brook University | Visa | NIT Allahabad

About me

I am currently pursuing Masters of Science in Computer Science from Stony Brook University in New York, United States.

As part of my thesis project, I am currently working in SBU CV Lab on building a assistive robotic arm to make self feeding task possible for people who cant physically feed themselves.

I have three years of industry experience working in Visa Inc. as a Software Developer. My interests include Distributed Systems, Machine Learning and Computer Vision.

Education

January 2020 - Present

Stony Brook University, New York, US

GPA : 3.84/4.00
Relevant Courses : Machine Learning, Introduction to Computer Vision, Probability & Statistics for Data Science
Teaching Assistant : CSE 114 : Introduction to Object Oriented Programming under Prof. Praveen Tripathi.

August 2013 - May 2017

Motilal Nehru National Institute of Technology, Allahabad, India

GPA : 8.25/10
Relevant Courses : Data Structures, Object Oriented Modeling, Databases, Software Engineering, Operating Systems & Networking
Undergraduate Research : Worked under supervision of Prof. Rupesh Dewang on Sentiment Analysis to classify product reviews collected from twitter.

Work Experience

2017 - 2021

Visa Inc. - India (Senior Software Developer)

  • Data Lake Controller| Java, Spring Boot, Kafka, Docker, Airflow

    - Designed micro-services for Data Lake Controller which provided a metadata driven implementation to manage components of Visa Data Platform automating 68% of ETL pipelines.
    - Implemented services to auto-generate data pipelines and deploy (avg. time of 2 secs) on distributed schedulers.
    - Delivered scheduler and checkpoint micro-services to deploy and run data pipelines (700+) on apache airflow. The service allows a user to author, schedule and monitor the data flow and view the dataset lineages.
    - Integrated Data Lake with ML Platform allowing data scientists to build, deploy and monitor ML models efficiently.

  • Data Quality Platform| Java, Spring Boot, MySQL, HDFS, ETL, Grafana

    - Developed a data management and analytics platform for automating data quality checks across 28 heterogeneous data environments.
    - Scaled it to perform more than 50,000 checks across completeness, consistency, and timeliness dimensions of data.
    - Spring Boot application allowing users to configure rules, monitor them and generate alerts as part of ETL pipelines.
    - Cron based scheduler to execute data quality checks based on the frequency of checks.

  • Root Cause Analysis| Python, Time series analysis, Cross-correlation

    - Formulated a root cause analysis framework to correlate event trends and detect root cause of a quality issue with 72% accuracy.
    - Built an event repository of multiple internal and external events affecting a dataset.
    - Built a correlation engine to correlate events with an anomaly using a probabilistic model.
    - Filed a US Patent for the same - PCT/US2018/066540

May 2016 - August 2016

Visa Inc. - India (Summer Intern)

  • Forecasting and Anomaly Detection| Java, Spring Boot, Holt Winters, Predictive Analysis, Outlier Detection

    - Implemented anomaly detection framework using Holt Winter's triple exponential smoothing algorithm.
    - Spring Boot application to run the detection algorithm on event of new transaction. Data was consumed from HBase and anomalies were written to mysql for root cause analysis and alerts.
    - Weekly, monthly, yearly, holidays and special events as trends for better accuracy. It helped to reduce the percentage of false alerts by 62%.
    - Generalized the framework to incorporate other forecasting models like Facebook's Prophet, Moving Average models.

Skills

Languages : Java, C++, Python, JavaScript/CSS/HTML

Technologies/Frameworks : Tensorflow, PyTorch, OpenCV, Spring Boot, Spring data and security, Apache Airflow, Apache Kafka, Solr (Apache Lucene),Hadoop, Kubernetes, ROS (Robot operating system)

Design : Micro-services architecture, Data intensive applications, Event based and metadata managed systems, MVC, REST

Interests : Distributed Systems, Machine Learning, Data Platform, Predictive Analytics, Computer Vision

Graduate Coursework

CSE 512 : Machine Learning

CSE 527 : Introduction to Computer Vision

CSE 544 : Probability and Statistics for Data Science

Online Courses

Coursera : Neural Networks and Deep Learning

Coursera : Improving Deep Neural Networks

CSE 231n : Convolutional Neural Networks for Visual Recognition (Stanford)