EMC GreenPlum Jobs in Delhi, NCR and Gurgaon

Apply to 11+ EMC GreenPlum Jobs in Delhi, NCR and Gurgaon on CutShort.io. Explore the latest EMC GreenPlum Job opportunities across top companies like Google, Amazon & Adobe.

Data Engineer

consulting & implementation services in the area of Oil & Gas, Mining and Manufacturing Industry

Agency job

via Jobdost by Sathish Kumar

Ahmedabad, Hyderabad, Pune, Delhi

5 - 7 yrs

₹18L - ₹25L / yr

AWS Lambda

AWS Simple Notification Service (SNS)

AWS Simple Queuing Service (SQS)

Python

PySpark

+9 more

Data Engineer

Required skill set: AWS GLUE, AWS LAMBDA, AWS SNS/SQS, AWS ATHENA, SPARK, SNOWFLAKE, PYTHON

Mandatory Requirements 

Experience in AWS Glue
Experience in Apache Parquet 
Proficient in AWS S3 and data lake 
Knowledge of Snowflake
Understanding of file-based ingestion best practices.
Scripting language - Python & pyspark

CORE RESPONSIBILITIES

Create and manage cloud resources in AWS 
Data ingestion from different data sources that expose data using different technologies, such as RDBMS, REST HTTP APIs, flat files, streams, and time-series data based on various proprietary systems. Implement data ingestion and processing with the help of Big Data technologies
Data processing/transformation using various technologies such as Spark and Cloud Services. You will need to understand your part of business logic and implement it using the language supported by the base data platform 
Develop automated data quality checks to make sure the right data enters the platform and to verify the results of the calculations
Develop an infrastructure to collect, transform, combine and publish/distribute customer data.
Define process improvement opportunities to optimize data collection, insights and displays.
Ensure data and results are accessible, scalable, efficient, accurate, complete and flexible 
Identify and interpret trends and patterns from complex data sets 
Construct a framework utilizing data visualization tools and techniques to present consolidated analytical and actionable results to relevant stakeholders. 
Key participant in regular Scrum ceremonies with the agile teams  
Proficient at developing queries, writing reports and presenting findings 
Mentor junior members and bring best industry practices

 QUALIFICATIONS

5-7+ years’ experience as a data engineer in consumer finance or an equivalent industry (consumer loans, collections, servicing, optional product, and insurance sales)
Strong background in math, statistics, computer science, data science or related discipline
Advanced knowledge of one of the following languages: Java, Scala, Python, C#
Production experience with: HDFS, YARN, Hive, Spark, Kafka, Oozie / Airflow, Amazon Web Services (AWS), Docker / Kubernetes, Snowflake  
Proficient with:
Data mining/programming tools (e.g. SAS, SQL, R, Python)
Database technologies (e.g. PostgreSQL, Redshift, Snowflake, and Greenplum)
Data visualization (e.g. Tableau, Looker, MicroStrategy)
Comfortable learning about and deploying new technologies and tools. 
Organizational skills and the ability to handle multiple projects and priorities simultaneously and meet established deadlines. 
Good written and oral communication skills and ability to present results to non-technical audiences 
Knowledge of business intelligence and analytical tools, technologies and techniques.

Familiarity and experience in the following is a plus: 

AWS certification
Spark Streaming 
Kafka Streaming / Kafka Connect 
ELK Stack 
Cassandra / MongoDB 
CI/CD: Jenkins, GitLab, Jira, Confluence, and other related tools

Senior Data Engineering Role - Google Cloud Platform with Spark

A LEADING US BASED MNC

Agency job

via Zeal Consultants by Zeal Consultants

Bengaluru (Bangalore), Hyderabad, Delhi, Gurugram

5 - 10 yrs

₹14L - ₹15L / yr

Google Cloud Platform (GCP)

Spark

PySpark

Apache Spark

"DATA STREAMING"

Data Engineering : Senior Engineer / Manager

As Senior Engineer/Manager in Data Engineering, you will translate client requirements into technical design and implement components for data engineering solutions. You will use a deep understanding of data integration and big data design principles to create custom solutions or implement package solutions. You will independently drive design discussions to ensure the necessary health of the overall solution.

Must Have skills :

1. GCP

2. Spark streaming : Live data streaming experience is desired.

3. Any 1 coding language: Java/Python/Scala

Skills & Experience :

- Overall experience of MINIMUM 5+ years with Minimum 4 years of relevant experience in Big Data technologies

- Hands-on experience with the Hadoop stack - HDFS, Sqoop, Kafka, Pulsar, NiFi, Spark, Spark Streaming, Flink, Storm, Hive, Oozie, Airflow and other components required in building an end-to-end data pipeline. Working knowledge of real-time data pipelines is an added advantage.

- Strong experience in at least one of the programming languages Java, Scala, Python. Java preferred

- Hands-on working knowledge of NoSQL and MPP data platforms like HBase, MongoDB, Cassandra, AWS Redshift, Azure SQLDW, GCP BigQuery etc.

- Well versed in, and working knowledge of, data platform related services on GCP

- Bachelor's degree and 6 to 12 years of work experience, or any combination of education, training and/or experience that demonstrates the ability to perform the duties of the position

Your Impact :

- Data Ingestion, Integration and Transformation

- Data Storage and Computation Frameworks, Performance Optimizations

- Analytics & Visualizations

- Infrastructure & Cloud Computing

- Data Management Platforms

- Build functionality for data ingestion from multiple heterogeneous sources in batch & real-time

- Build functionality for data analytics, search and aggregation

Snowflake Developer

Top IT MNC

Agency job

via People First Consultants by Jayaraj E

Chennai, Bengaluru (Bangalore), Kochi (Cochin), Coimbatore, Hyderabad, Pune, Kolkata, Noida, Gurugram, Mumbai

5 - 13 yrs

₹8L - ₹20L / yr

Snow flake schema

Python

snowflake

Greetings,

We are looking for a Snowflake developer for one of our premium clients for their pan-India locations.

Data Scientist

Client is a Machine Learning company based in New Delhi.

Agency job

via Jobdost by Sathish Kumar

NCR (Delhi | Gurgaon | Noida)

2 - 6 yrs

₹10L - ₹25L / yr

Data Science

R Programming

Python

Machine Learning (ML)

Entity Framework

+2 more

Job Responsibilities

Design machine learning systems
Research and implement appropriate ML algorithms and tools
Develop machine learning applications according to requirements
Select appropriate datasets and data representation methods
Run machine learning tests and experiments
Perform statistical analysis and fine-tuning using test results
Train and retrain systems when necessary

Requirements for the Job

Bachelor’s/Master's/PhD in Computer Science, Mathematics, Statistics or equivalent field and must have a minimum of 2 years of overall experience in tier one colleges

Minimum 1 year of experience working as a Data Scientist in deploying ML at scale in production
Experience in machine learning techniques (e.g. NLP, Computer Vision, BERT, LSTM, etc.) and frameworks (e.g. TensorFlow, PyTorch, Scikit-learn, etc.)

Working knowledge of deploying Python systems (using Flask, TensorFlow Serving)
Previous experience in the following areas will be preferred:
Natural Language Processing (NLP) - Using LSTM and BERT; chatbots or dialogue systems, machine translation, comprehension of text, text summarization.
Computer Vision - Deep Neural Networks/CNNs for object detection and image classification, transfer learning pipeline and object detection/instance segmentation (Mask R-CNN, YOLO, SSD).

Bigdata Professional

at HCL Technologies

3 recruiters

Agency job

via Saiva System by Sunny Kumar

Delhi, Gurugram, Noida, Ghaziabad, Faridabad, Bengaluru (Bangalore), Hyderabad, Chennai, Pune, Mumbai, Kolkata

5 - 10 yrs

₹5L - ₹20L / yr

PySpark

Data engineering

Big Data

Hadoop

Spark

+2 more

Experience - 5+ years
Skills - Spark and Scala along with Azure
Location - Pan India

Looking for someone with Big Data experience along with Azure

MongoDB Administrator

at Getkart Pvt Ltd

1 recruiter

Posted by Pooja Jha

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

3 - 5 yrs

₹5L - ₹12L / yr

MongoDB

NOSQL Databases

Designing, building, and automating the MongoDB Architecture for open-source MongoDB.
Good understanding of DB schema design, performance, tuning and capacity planning.
The ideal candidate has worked with modern open-source MongoDB platforms, cloud deployment models, and test-driven development in a fast-paced agile environment.
In-depth understanding of data management (e.g. permissions, recovery, security, and monitoring).
Operational experience with MongoDB.
Data modelling.
Operational experience with indexes.
Good understanding of MongoDB replica sets, the oplog, and journals.
Provide advice and support to other development resources interacting with MongoDB.
Troubleshoot any problems that may come up with the database environments.
Skilled in performance tuning and optimization using native monitoring and troubleshooting tools.
Provide guidance in the creation and modification of standards and procedures.
Experience working with cloud database services is a plus.
Experience working in an Agile Scrum environment.
Experience working with aggregation in MongoDB.
Strong communication and documentation skills and technology awareness.

Data Scientist

A content consumption and discovery app which provides news

Agency job

via Jobdost by Mamatha A

Noida

2 - 5 yrs

₹30L - ₹40L / yr

Data Science

Deep Learning

R Programming

Python

Data Scientist

Requirements

● B.Tech/Masters in Mathematics, Statistics, Computer Science or another quantitative field
● 2-3+ years of work experience in ML domain (2-5 years experience)
● Hands-on coding experience in Python
● Experience in machine learning techniques such as Regression, Classification, Predictive modeling, Clustering, Deep Learning stack, NLP
● Working knowledge of TensorFlow/PyTorch

Optional Add-ons-

● Experience with distributed computing frameworks: Map/Reduce, Hadoop, Spark etc.
● Experience with databases: MongoDB

Head- Data Science

Fintech Pioneer | GGN

Agency job

via Unnati by Astha Bharadwaj

NCR (Delhi | Gurgaon | Noida)

8 - 13 yrs

₹60L - ₹70L / yr

Data Science

Data Scientist

Python

SQL

Machine Learning (ML)

+4 more

Join a leading MCommerce company, set your career on a flight towards success and growth.

Our client is one of the oldest fintech companies that is taking banking and financial services to all the customers through their online platform. Having served over 50 million customers in the last 15 years, it is enabling over 7mn banking transactions each month, with a network of nearly 2 lac merchants. Using its vast network of merchant outlets, the platform is reaching the lower and mid-income groups who deal in cash, for them to be able to remit money across the country digitally. It now plans to take its unique digital financial solutions to developing markets across the globe. As pioneers of mobile-based payment services in India, they empower Retailers, Individuals and Businesses to have an online presence and earn or save a little extra through the transactions.

As a Head - Data Science, you will be part of the leadership team and will be expected to manage ambiguity & help the Founders & other leaders in building the roadmap forward for the business.

You will be expected to adopt an "iron sharpens iron" attitude where you will focus on making everyone and every data-driven process better, blend people leadership/ management skills, use predictive modelling and analytics expertise, cloud computing skills and operational know-how.

What you will do:

Working closely with business stakeholders to define, strategize and execute crucial business problem statements which lie at the core of improving current and future data-backed product offerings.
Building and refining underwriting models for extending credit to sellers and API Partners in collaboration with the lending team
Conceiving, planning and prioritizing data projects and managing timelines
Building analytical systems and predictive models as a part of the agile ecosystem
Testing performance of data-driven products participating in sprint-wise feature releases
Managing a team of data scientists and data engineers to develop, train and test predictive models
Managing collaboration with internal and external stakeholders
Building data-centric culture from within, partnering with every team, learning deeply about business, working with highly experienced, sharp and insanely ambitious colleagues

What you need to have:

B.Tech/ M.Tech/ MS/ PhD in Data Science / Computer Science, Statistics, Mathematics & Computation with a demonstrated skill-set in leading an Analytics and Data Science team from IIT, BITS Pilani, ISI
8+ years working in the Data Science and analytics domain, with 3+ years of experience in leading a data science team, understanding which projects to prioritize and how the team strategy aligns with the organization's mission
Deep understanding of credit risk landscape; should have built or maintained underwriting models for unsecured lending products
Should have handled a leadership team in a tech startup preferably a fintech/ lending/ credit risk startup.
We value entrepreneurial spirit: if you have had the experience of starting your own venture, that is an added advantage.
Strategic thinker with agility and endurance
Aware of the latest industry trends in Data Science and Analytics with respect to Fintech, Digital Transformations and Credit-lending domain
Excellent communication skills, which are key to managing multiple stakeholders such as the leadership team, product teams, and existing and new investors.
Cloud Computing, Python, SQL, ML algorithms, Analytics and a problem-solving mindset
Knowledge and demonstrated skill-sets in AWS

Data Scientist

leading pharmacy provider

Agency job

via Econolytics by Jyotsna Econolytics

Noida, NCR (Delhi | Gurgaon | Noida)

4 - 10 yrs

₹18L - ₹24L / yr

Data Science

R Programming

Python

Algorithms

Predictive modelling

Job Description:

• Help build a Data Science team which will be engaged in researching, designing, implementing, and deploying full-stack scalable data analytics vision and machine learning solutions to challenge various business issues.
• Modelling complex algorithms, discovering insights and identifying business opportunities through the use of algorithmic, statistical, visualization, and mining techniques
• Translating business requirements into quick prototypes and enabling the development of big data capabilities driving business outcomes
• Responsible for data governance and defining data collection and collation guidelines.
• Must be able to advise, guide and train other junior data engineers in their job.

Must Have:

• 4+ years of experience in a leadership role as a Data Scientist
• Preferably from the retail, manufacturing or healthcare industry (not mandatory)
• Willing to work from scratch and build up a team of Data Scientists
• Open to taking up challenges with end-to-end ownership
• Confident, with excellent communication skills, and a good decision maker

Jr. Data Scientist

at Vital

2 recruiters

Posted by Shreeya Bajaj

Delhi

0.5 - 2 yrs

₹4.2L - ₹5.4L / yr

Data Science

Data Scientist

R Programming

RStudio

Python

6+ months of proven experience as a Data Scientist or Data Analyst
Understanding of machine-learning and operations research
Extensive knowledge of R, SQL and Excel
Analytical mind and business acumen
Strong Statistical understanding
Problem-solving aptitude
BSc/BA in Computer Science, Engineering or relevant field; graduate degree in Data Science or other quantitative field is preferred

Computer Vision Scientist - Machine Learning

at FarmGuide

1 recruiter

Posted by Anupam Arya

NCR (Delhi | Gurgaon | Noida)

0 - 8 yrs

₹7L - ₹14L / yr

Computer Security

Image processing

OpenCV

Python

Rational ClearCase

+8 more

FarmGuide is a data driven tech startup aiming towards digitizing the periodic processes in place and bringing information symmetry in agriculture supply chain through transparent, dynamic & interactive software solutions. We, at FarmGuide (https://angel.co/farmguide) help Government in relevant and efficient policy making by ensuring seamless flow of information between stakeholders.

Job Description :

We are looking for individuals who want to help us design cutting edge scalable products to meet our rapidly growing business. We are building out the data science team and looking to hire across levels.

- Solving complex problems in the agri-tech sector, which are long-standing open problems at the national level.
- Applying computer vision techniques to satellite imagery to deduce artefacts of interest.
- Applying various machine learning techniques to digitize existing physical corpus of knowledge in the sector.

Key Responsibilities :

- Develop computer vision algorithms for production use on satellite and aerial imagery
- Implement models and data pipelines to analyse terabytes of data.
- Deploy built models in production environment.
- Develop tools to assess algorithm accuracy
- Implement algorithms at scale in the commercial cloud

Skills Required :

- B.Tech/ M.Tech in CS or other related fields such as EE or MCA from IIT/NIT/BITS but not compulsory.
- Demonstrable interest in Machine Learning and Computer Vision, such as coursework, open-source contribution, etc.
- Experience with digital image processing techniques
- Familiarity/Experience with geospatial, planetary, or astronomical datasets is valuable
- Experience in writing algorithms to manipulate geospatial data
- Hands-on knowledge of GDAL or open-source GIS tools is a plus
- Familiarity with cloud systems (AWS/Google Cloud) and cloud infrastructure is a plus
- Experience with high performance or large scale computing infrastructure might be helpful
- Coding ability in R or Python.
- Self-directed team player who thrives in a continually changing environment

What is on offer :

- High impact role in a young start up with colleagues from IITs and other Tier 1 colleges
- Chance to work on the cutting edge of ML (yes, we do train Neural Nets on GPUs)
- Lots of freedom in terms of the work you do and how you do it
- Flexible timings
- Best start-up salary in industry with additional tax benefits
