
I'm a graduate student in Information Systems at California State University, Los Angeles (CSU-LA).

Data-driven continuous improvement professional with a focus on Big Data Analytics & Data Visualizations.



Prog. Languages

  • Python and Java
  • C and C++

Hadoop

  • HDFS and MapReduce
  • Pig and Hive

Web Languages

  • HTML with CSS
  • Javascript and J2EE

Data Base

  • MongoDB
  • MySQL

Tools

  • Eclipse and XCode
  • Tableau and iPython Notebook

Nov 2016 - Present

Channel Advisor

Software Developer

  • Designed and developed Platform Intelligence, a system that helped account managers increase sales by monitoring platform metrics such as Available SKU Count, Listings, and Listing Errors.
  • Extracted data scraped from different seller sites to identify potential customers who sell across different categories and geographical locations.
  • Collaborated with data warehouse developers to meet business user needs, access data effectively, optimize data storage, and maintain data integrity.
  • Developed ETL pipelines to export data from and import data into Salesforce.
  • Environment: Java, Apex, SQL Server, Salesforce, Jitterbit, Agile, JIRA, JSON
June 2016 - Nov 2016

Prime Health Care

Software Engineer

  • Extracted data from SQL Server into HDFS using Sqoop incremental loads to populate Hive tables, and developed an Oozie workflow to schedule and orchestrate the ETL process.
  • Wrote Java MapReduce jobs and Hive queries to transform raw data from several data sources into baseline data.
  • Developed a machine learning model in AzureML Studio that predicts the likelihood of a patient being readmitted to the same hospital.
  • Published data visualization dashboards such as In-Patient Trends, Census Portal, and ED using Qlik Sense.
  • Environment: Java, MapReduce, Hive, Sqoop, Oozie, HDFS, SQL Server, HDInsight, AzureML Studio, Qlik Sense
June 2015 - May 2016

Harman International

Software Engineer Intern (Big Data Analytics)

  • Analyzed 50 TB (with a replication factor of 3) of highly unstructured and semi-structured data from connected cars for the “Aha Analytics” portal, which had a major impact at CES 2016
  • Developed an analytics service that enables automotive OEMs to gain valuable insight into Vehicle Usage, Application Usage, Bluetooth Usage, Network Coverage, and Driving Pattern for the Head Unit Analytics module, which showed a 27 percent CAGR increase in revenue
  • Wrote MapReduce jobs, Pig scripts, and Hive queries to process data from the connected cars and store it in the database, improving performance by 20%
  • Delivered 10x impact from various data visualization reports involving models such as Sunburst, Sankey, Heat Map, and Tree Map, using the JavaScript API, SQL, and Tableau dashboards
  • Collaborated with data warehouse developers to meet business user needs, access data effectively, optimize Tableau dashboards, and maintain data integrity
  • Environment: MapReduce, Pig, Hive, Java, Eclipse, JavaScript, Tableau, SQL, MySQL Workbench, Redshift, EMR, EC2, Jenkins, Maven, Tomcat, JIRA, Confluence, SVN
Nov 2011 – July 2013

TATA Consultancy Services

Software Engineer (Java)

  • Increased performance by 38% for a Barclays investment bank application that uses Java and Oracle to manage incoming trade allocations and onboard new clients into the system.
  • Enhanced code quality, reducing overall bugs by 40% within the first six months, and developed and delivered 100% of required features.
  • Developed procedures and processes for code reviews, system documentation, database design, and report generation, and implemented software development techniques.
  • Built and maintained SQL scripts, indexes, and complex queries for data analysis and extraction.
  • Designed another application consisting of a series of components that process cleared trades for Japanese markets. It is built with Spring (Spring Core and DAO), the components exchange information as messages over JMS, and Maven is the build tool.
  • Developed a REST web service application that reads user input from a UI and queries an Oracle Coherence database to generate a series of trade summary and position reports.
  • Environment: Java, Spring, JMS, Maven, Agile, Eclipse, HTML, JSP, JavaScript, SQL, MySQL Workbench, Git
Sep 2013 - Dec 2015

California State University Los Angeles

Master of Science in Information Systems

  • Big Data Science with Hadoop
  • Data Analysis and Visualization
  • Database Systems
  • Business Intelligence
  • Information Systems
  • Data and Text Mining
  • Software Engineering
  • Cloud Computing
Sep 2007 - May 2011

Anna University, Chennai, India

Bachelor of Science in Computer Science

  • Data Mining and Warehousing
  • Machine Learning
  • Data Structures
  • Algorithms
  • Operating Systems
  • Theory of Computation
  • Databases
  • Compiler Design
  • Internet Programming

 

Hotel Inspection Prediction System (Documentation: hotelinspection.info  Code: GitHub)
• Analyzed a Trip Advisor dataset of customer reviews and ratings to predict which hotels have a higher likelihood of inspection, using Hadoop, MapReduce, Mahout, and MongoDB.

 

Implementing the Generalized Sequential Pattern algorithm with multiple minimum supports (MS-GSP) (Code: GitHub)
• Predicted customers’ purchasing behavior by mining sequential patterns and identified the most frequently purchased item in a particular store, using Java.

 

Python Data Analysis (Code: GitHub)
• 2012 election analysis, stock market analysis, and Titanic survival analysis using NumPy, pandas, Seaborn, and Matplotlib.

 

Text Mining, Web Scraping and Sentiment Analysis with Twitter Data (Code: GitHub)
• Cleaned the tweets, plotted and colored a word cloud, created a term-document matrix to list the most frequent terms, and performed a comparative sentiment analysis of 4 pharma companies, using R.

 

Simulated E-commerce Portal (Documentation: PDF  Code: GitHub)
• Built a system to efficiently deliver a product to a customer by identifying the nearest retailer who has the product in stock. The system facilitates interaction between the distributed objects (customer, retailer, supplier, and warehouse) using Apache Axis, Java, Amazon EC2, the Google Maps API, and MongoDB.

 

Book Edu (Documentation: PDF  Code: GitHub)
• Designed a web bookstore with modules for updating user profiles and passwords, searching for books and authors, and adding, updating, or removing books in the cart, using Java, HTML, CSS, JavaScript, and MySQL.

 

Mobile Technology


Music Production


Screen play writing


Travel


Health and Fitness


Sports


May 5th, 2016

Binary Search in a Rotated Array using Merge Sort

A rotated array is a combination of two sorted arrays, so let’s first find the point where the array is rotated. Then let’s split the input array into two sorted arrays. If the search key falls within the left array, we run binary search on the left array; otherwise we run it on the right array.  … read more
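
The idea in this excerpt can be sketched in Python roughly as follows. This is only an illustration, not the code from the full post: the function names `find_pivot` and `search_rotated` are hypothetical, and the sketch assumes the array holds distinct values that were sorted in ascending order before rotation.

```python
from bisect import bisect_left


def find_pivot(nums):
    """Return the index of the smallest element, i.e. where the rotation starts.

    Assumes nums is non-empty and contains distinct values.
    """
    lo, hi = 0, len(nums) - 1
    while lo < hi:
        mid = (lo + hi) // 2
        if nums[mid] > nums[hi]:
            # The smallest element lies strictly to the right of mid.
            lo = mid + 1
        else:
            # The smallest element is at mid or to its left.
            hi = mid
    return lo


def search_rotated(nums, key):
    """Return the index of key in the rotated array nums, or -1 if absent."""
    if not nums:
        return -1
    pivot = find_pivot(nums)
    # nums[:pivot] and nums[pivot:] are the two sorted arrays.
    if pivot > 0 and nums[0] <= key <= nums[pivot - 1]:
        lo, hi = 0, pivot            # key can only be in the left array
    else:
        lo, hi = pivot, len(nums)    # otherwise search the right array
    i = bisect_left(nums, key, lo, hi)  # plain binary search on one half
    return i if i < hi and nums[i] == key else -1
```

For example, `search_rotated([4, 5, 6, 7, 0, 1, 2], 6)` looks only at the left array `[4, 5, 6, 7]`, while a key of `1` is searched in the right array `[0, 1, 2]`. Both the pivot search and the final lookup are logarithmic in the array length.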

Apr 29th, 2016

SQL Query to generate Hierarchical Data Sets in the form of Sun Burst Chart in Tableau

SELECT “vw_bluetooth_failure_by_device_by_profile”.”device_manufacturer” AS “Parent Category Label”, “vw_bluetooth_failure_by_device_by_profile”.”device_manufacturer” AS “Product Category”, 1 AS “Level”, 1 AS “Path”, SUM(“vw_bluetooth_failure_by_device_by_profile”.”failure_count” ) AS “Failure Count” FROM “public”.”vw_bluetooth_failure_by_device_by_profile” “vw_bluetooth_failure_by_device_by_profile” GROUP BY “vw_bluetooth_failure_by_device_by_profile”. ”device_manufacturer” UNION   SELECT “vw_bluetooth_failure_by_device_by_profile”.”device_model” as [Parent Category Label], “vw_bluetooth_failure_by_device_by_profile”.”device_manufacturer” & ” > ” & “vw_bluetooth_failure_by_device_by_profile”.”device_model”  as [Product Category], 2 AS [Level], 1 AS [Path], SUM(“vw_bluetooth_failure_by_device_by_profile”.”failure_count” ) AS… read more

Apr 23rd, 2016

Sessionizer

/***********
UUID       TIMESTAMP
uuid1      2016-01-01 21:10:05.123
uuid1      2016-01-01 21:12:05.123
uuid2      2016-01-01 21:14:12.433
uuid1      2016-01-01 21:12:25.123
uuid3      2016-01-01 21:14:12.433
uuid2      2016-01-01 21:18:12.433
uuid1      2016-01-01 22:22:25.123
Write an algorithm that generates a list of user sessions. e.g. the above input… read more
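
One way such a sessionizer could look is sketched below in Python. The excerpt is cut off before it defines what counts as a session, so the rule used here is purely an assumption: a user's session ends after 30 minutes of inactivity. The name `sessionize`, the `SESSION_GAP` constant, and the output shape of `(uuid, session_start, session_end)` tuples are likewise hypothetical and may differ from the full post.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# ASSUMPTION: the excerpt does not state the session rule; a 30-minute
# inactivity gap is used here purely for illustration.
SESSION_GAP = timedelta(minutes=30)
TIMESTAMP_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def sessionize(events, gap=SESSION_GAP):
    """Group (uuid, timestamp_string) events into per-user sessions.

    Returns a sorted list of (uuid, session_start, session_end) tuples.
    """
    by_user = defaultdict(list)
    for uuid, ts in events:
        by_user[uuid].append(datetime.strptime(ts, TIMESTAMP_FORMAT))

    sessions = []
    for uuid, stamps in by_user.items():
        stamps.sort()  # input rows are not guaranteed to be in time order
        start = prev = stamps[0]
        for ts in stamps[1:]:
            if ts - prev > gap:
                # Inactivity exceeded the gap: close the current session.
                sessions.append((uuid, start, prev))
                start = ts
            prev = ts
        sessions.append((uuid, start, prev))
    return sorted(sessions)


events = [
    ("uuid1", "2016-01-01 21:10:05.123"),
    ("uuid1", "2016-01-01 21:12:05.123"),
    ("uuid2", "2016-01-01 21:14:12.433"),
    ("uuid1", "2016-01-01 21:12:25.123"),
    ("uuid3", "2016-01-01 21:14:12.433"),
    ("uuid2", "2016-01-01 21:18:12.433"),
    ("uuid1", "2016-01-01 22:22:25.123"),
]

for uuid, start, end in sessionize(events):
    print(uuid, start, end)
```

Under the assumed 30-minute rule, the sample rows yield two sessions for uuid1 (the 22:22 event starts a new one) and one session each for uuid2 and uuid3. Changing `gap` changes the grouping, so the result depends entirely on that assumption.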
