• MS in computer science from University of Cincinnati ( Thesis on Data mining ) advised by Professor Raj Bhatnagar.
  • Passionate about developments in data science research
  • Working on creating large scale ETL operations on Big Data using Apache Spark, Apache Hive/Apache Tez.
  • Working experience on scheduling applications using Apache Airflow.
  • Experience creating near realtime data processing flows on Apache Kafka or AWS Kinesis.
  • Experience provisioning infrastructure using Terraform.
  • Working experience on Java and related technologies like Spring MVC, Spring Social, Spring security etc. Have developed RESTful webservices from scratch.
  • Have developed front end application based on Angular Js and related technologies like Grunt, Jasmine, Protractor, karma etc.
  • Working knowledge of nodeJs server and routing frameworks like express.
  • Have experience working with Apache Camel, JBoss Fuse, Spring Framework, OSGI services, Apache Active MQ, Maven.
  • Familiar with Eclipse, IntelliJ, Webstorm,Netbeans , XML Spy , Matlab, SVN, Git and Perforce.

Experience

Nordstrom Inc

Senior Engineer
  • Working for Machine Learning Pipeline Team, where we solve ML needs of other teams across the organization.
  • Work primarily involves plumbing massive amounts of data from different sources filesystems ( HDFS, S3 ) , Streams ( Kinesis, Kafka ) , Data warehouses ( Redshift ) into a data-lake solution ( currently S3 )
  • During this process ETLs are run to preprocess the data for different ML needs. The ETL is primarily done using Apache Spark, however other tools like Apache NiFi , AWS Lambda are also used
  • Lot of focus is provided on robustness of the pipeline from error-handling to monitoring to auditing pieces.
  • Views are created using Apache Hive/ Presto on data stored in data-lake.
  • Feature vectors are extracted from datalake and a ML model is chosen based off the Usecase. Model fitting and serving is done for realtime customer requests.
  • All infrastructure is provisioned using Terraform , and Apache airflow acts as the dag scheduler that glues all the pieces together.

Nordstrom Inc

Engineer 2
  • Part of the customer profile services ( CPS ) team, where we source data and expose public facing APIs for actual end customers on Nordstrom.com.
  • Responsible for setting up a spark cluster and running spark jobs to move over large datasets ( around 0.5 B records ) to operational datastore (DynamoDB) in a lossless timely fashion introducing fault tolerance and failure retry mechanisms.
  • Responsible for setting up consumers to consume data from AWS Kinesis streams, curate and persist them in near real time fashion.
  • Responsible for creating a Rest Based Public API involving token based authentication and authorization using Amazon API Gateway, Amazon Cognito and AWS lambda.
  • Set up health monitoring for all the involved systems using AWS lambda and integrated pagerduty and slack using amazon SNS.
  • Architecting enterprise applications using serverless technologies leading to cost optimizations, better monitoring and auditing and championing separation of concerns.
  • Setup artillery scripts to performance test APIs and publish results to grafana dashboard.

CDK Global

Software Engineer 2
  • Part of audience management team where we gauge user intent and provide a seamless personalized experience. Most of my experience has been on architecting several data driven business intelligence reports that visualize KPIs and provide actionable insights and correlations to customers.
  • Full stack developer responsible for creation of several Java/Spring based micro services following practice of domain driven design. Architected NodeJs based webapps that use Angular 1.X as the JS framework for developing single page applications.
  • Developed several big data pipelines, involving both batch and streaming based processing. Depending on the volume of data we use Kafka and Rabbit MQ message brokers for our messaging needs. Have been playing around lately with Apache Spark and Akka for our stream processing needs.
  • Mentored junior developers and interns. Provided technical guidance to offshore teams and worked as scrum master.
  • Part of Docker tribe where we help teams across company run their applications on docker containers and run containers on CoreOS grid using Consul for service discovery.

CDK Global

Software Engineer 1
  • Part of social media management team, where we provide a consolidated social media management platform to more than 26,000 car dealerships. Most of my work included aggregating social media feeds from different channels using several exposed REST APIs.
  • Full stack developer responsible for creating several RESTful web services using J2EE/Spring and being one of the key contributors towards creation of a company-wide Node Angular web application stack.

UPS

Software Engineer Intern
  • Developed applications for integration middleware platform where modular applications were being developed as part of the OSGI architecture.
  • Worked on JBOSS Fuse ESB, Camel Mediation Router, Apache Active MQ, Restful Web Services and XSLT Transforms.
  • Collaborated with multiple teams to agree on contracts and created Interface agreement documents.
  • As one of the first members to join the development team, took a lot of initiatives to kick start projects and later helped other members to get on board.

University of Cincinnati

Teaching Assistant
  • Mentored freshman class for Matlab and basic engineering.
  • Guided senior students with their design projects.

Open Source Libraries Authored

Core Skillset

Area Skills
Big Data Tools Apache Spark, Apache Hive, AWS EMR, AWS Lambda, Spark Streaming, AWS Kinesis, Apache Kafka
Backend Skills J2EE, SPRING BOOT, SPRING DATA, SPRING BATCH, SPRING SOCIAL, SPRING XD, SCALA, APACHE CAMEL, REDHAT-FUSE, C, C++ APACHE SPARK , AKKA STREAMS NODEJS, EXPRESS JS, GRUNT JS
Frontend Skills ANGULARJS 1.X, JQUERY, JAVASCRIPT, UI BOOTSTRAP,BOOTSTRAP
DATABASES MY SQL, POSTGRE SQL,GREENPLUM, ORACLE,MONGO DB
AWS Services AMAZON S3,AMAZON DYNAMODB,AMAZON ELASTICSEARCH SERVICE, AMAZON EC2, AMAZON BEANSTALK,AMAZON LAMBDA, AMAZON SQS, AMAZON SNS, AMAZON ROUTE53, AMAZON API GATEWAY, AMAZON KINESIS
DATA ANALYTICS TOOLS MATLAB, ORANGE, TABLEAU
DEVOP TOOLS DOCKER, KUBERNETES, COREOS, CONSUL, NGINX, ATLASSIAN BAMBOO, JENKINS, ELK STACK, TELEGRAF, GRAFANA
VERSION CONTROL GIT, STASH/BIT BUCKET, PERFORCE, SVN
MESSAGE BROKERS RABBIT MQ, ACTIVE MQ, KAFKA, AMAZON KINESIS

Personal

  • Sports enthusiast, regularly play tennis, racquetball and ping pong. Have won numerous ping pong competitions.
  • Active member of a nonprofit organization VIBHA Seattle, that aims improve lives of poor children by spreading awareness and providing tools necessary for educating them.
  • Appreciate diverse culture, customs and languages, in fact I can speak 5 different languages.


Site designed using Jekyll and Columnal.