Amazon EMR Jobs in Delhi, NCR and Gurgaon

Apply to 11+ Amazon EMR Jobs in Delhi, NCR and Gurgaon on CutShort.io. Explore the latest Amazon EMR Job opportunities across top companies like Google, Amazon & Adobe.

Python developer

at codersbrain

1 recruiter

Posted by Tanuj Uppal

Delhi

4 - 8 yrs

₹2L - ₹15L / yr

Spark

Hadoop

Big Data

Data engineering

PySpark

+5 more

Mandatory: Hands-on experience in Python and PySpark.

Build PySpark applications using Spark DataFrames in Python, working in Jupyter Notebook and PyCharm (IDE).

Worked on optimizing Spark jobs that process huge volumes of data.

Hands-on experience with version control tools like Git.

Worked on Amazon’s analytics services like Amazon EMR, Lambda functions, etc.

Worked on Amazon’s compute services like AWS Lambda and Amazon EC2, Amazon’s storage services like S3, and a few other services like SNS.

Experience with or knowledge of bash/shell scripting will be a plus.

Experience working with fixed-width, delimited, and multi-record file formats.

Hands-on experience with tools like Jenkins to build, test, and deploy applications.

Awareness of DevOps concepts and the ability to work in an automated release pipeline environment.

Excellent debugging skills.

Lead Computer Vision Engineer

at An AI based company

Agency job

via Qrata by Prajakta Kulkarni

Gurugram, Delhi, Noida, Ghaziabad, Faridabad

5 - 10 yrs

₹25L - ₹70L / yr

Computer Vision

OpenCV

Python

TensorFlow

PyTorch

Job Title : Lead Computer Vision Engineer
Location : Gurgaon

About the company:
The company is changing the way cataloging is done across the globe. Our vision is to empower the smallest of sellers, situated in the farthest of corners, to create superior product images and videos without any external professional help. Imagine 30M+ merchants shooting product images or videos on their smartphones, then choosing filters for Amazon, Asos, Airbnb, Doordash, etc. to instantly compose high-quality, "tuned-in" product visuals. The company has built the world’s leading image-editing AI software to capture and process beautiful product images for online selling. We are also fortunate and proud to be backed by some of the biggest names in the investment community, including Accel Partners, AngelList, and prominent founders and internet company operators, who believe there is a more intelligent and efficient way of doing digital production than how the world currently operates.

Job Description :
- We are looking for a seasoned Computer Vision Engineer with AI/ML/CV and Deep Learning skills to play a senior leadership role in our Product & Technology Research Team.
- You will lead a team of CV researchers to build models that automatically transform millions of raw e-commerce, automobile, food, and real-estate images into processed final images.
- You will be responsible for researching the latest advances in the field of computer vision, designing the solution architecture for our offerings, and leading the Computer Vision teams to build the core algorithmic models and deploy them on cloud infrastructure.
- You will work with the Data team to ensure your data pipelines are well set up and models are constantly trained and updated.
- You will work alongside the Product team to ensure that AI capabilities are built as democratized tools that enable internal as well as external stakeholders to innovate on top of them and make our customers successful.
- You will work closely with the Product & Engineering teams to convert the models into beautiful products that will be used by thousands of businesses every day to transform their images and videos.

Job Requirements:
- Minimum 3 years of work experience in Computer Vision, with 5-10 years of work experience overall
- BS/MS/PhD degree in Computer Science, Engineering, or a related subject from an Ivy League institute
- Exposure to Deep Learning techniques and TensorFlow/PyTorch
- Prior expertise in building image processing applications using GANs, CNNs, and diffusion models
- Expertise with image processing Python libraries like OpenCV
- Good hands-on experience with Python and the Flask or Django framework
- Authored publications at peer-reviewed AI conferences (e.g. NeurIPS, CVPR, ICML, ICLR, ICCV, ACL)
- Prior experience managing teams and building large-scale AI/CV projects is a big plus
- Great interpersonal and communication skills
- Critical thinking and problem-solving skills

Senior Data Scientist

at Chegg India Private Limited

1 video

1 recruiter

Posted by Naveen Ghiya

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

4 - 9 yrs

Best in industry

Machine Learning (ML)

Data Science

Natural Language Processing (NLP)

Computer Vision

recommendation algorithm

+4 more

Senior Data Scientist

Your goal: To improve the education process and improve the student experience through data.

The organization: Data Science for Learning Services. Data Science and Machine Learning are core to Chegg. As a Student Hub, we want to ensure that students discover the full breadth of learning solutions we have to offer and get full value from their learning time with us. To create the most relevant and engaging interactions, we are solving a multitude of machine learning problems so that we can better model student behavior, link various types of content, optimize workflows, and provide a personalized experience.

The Role: Senior Data Scientist

As a Senior Data Scientist, you will focus on conducting research and development in NLP and ML. You will be responsible for writing production-quality code for data product solutions at Chegg. You will lead the identification and implementation of key projects for data processing and knowledge discovery.

Responsibilities:

• Translate product requirements into AI/ML/NLP solutions

• Be able to think out of the box and be able to design novel solutions for the problem at hand

• Write production-quality code

• Be able to design data and annotation collection strategies

• Identify key evaluation metrics and release requirements for data products

• Integrate new data and design workflows

• Innovate, share, and educate team members and community

Requirements:

• Working experience in machine learning, NLP, recommendation systems, experimentation, or related fields, with a specialization in NLP

• Working experience with large language models that cater to multiple tasks such as text generation, Q&A, summarization, and translation is highly preferred

• Knowledge of MLOps and deployment pipelines is a must

• Expertise in supervised, unsupervised, and reinforcement learning algorithms

• Strong programming skills in Python

• Top data wrangling skills using SQL or NoSQL queries

• Experience using containers to deploy real-time prediction services

• Passion for using technology to help students

• Excellent communication skills

• Good team player and a self-starter

• Outstanding analytical and problem-solving skills

• Experience working with ML pipeline products such as AWS SageMaker, Google ML, or Databricks is a plus

Why do we exist?

Students are working harder than ever before to stabilize their future. Our recent research study, State of the Student, shows that nearly 3 out of 4 students are working to support themselves through college and 1 in 3 students feel pressure to spend more than they can afford. We founded our business on providing affordable textbook rental options to address these issues. Since then, we’ve expanded our offerings to supplement many facets of higher educational learning through Chegg Study, Chegg Math, Chegg Writing, Chegg Internships, Thinkful Online Learning, and more, to support students beyond their college experience. These offerings lower financial concerns for students by modernizing their learning experience. We exist so students everywhere have a smarter, faster, more affordable way to student.

Video Shorts

Life at Chegg: https://jobs.chegg.com/Video-Shorts-Chegg-Services

Certified Great Place to Work!: http://reviews.greatplacetowork.com/chegg

Chegg India: http://www.cheggindia.com/

Chegg Israel: http://insider.geektime.co.il/organizations/chegg

Thinkful (a Chegg Online Learning Service): https://www.thinkful.com/about/#careers

Chegg out our culture and benefits!

http://www.chegg.com/jobs/benefits

https://www.youtube.com/watch?v=YYHnkwiD7Oo

http://techblog.chegg.com/

Chegg is an equal-opportunity employer

Data Scientist

at A content consumption and discovery app which provides news

Agency job

via Jobdost by Mamatha A

Noida

2 - 5 yrs

₹30L - ₹40L / yr

Data Science

Deep Learning

R Programming

Python

Data Scientist

Requirements

● B.Tech/Masters in Mathematics, Statistics, Computer Science, or another quantitative field
● 2-3+ years of work experience in the ML domain (2-5 years of experience)
● Hands-on coding experience in Python
● Experience in machine learning techniques such as Regression, Classification, Predictive modeling, Clustering, Deep Learning stack, NLP
● Working knowledge of TensorFlow/PyTorch

Optional Add-ons:

● Experience with distributed computing frameworks: MapReduce, Hadoop, Spark, etc.
● Experience with databases: MongoDB

Head- Data Science

at Fintech Pioneer | GGN

Agency job

via Unnati by Astha Bharadwaj

NCR (Delhi | Gurgaon | Noida)

8 - 13 yrs

₹60L - ₹70L / yr

Data Science

Data Scientist

Python

SQL

Machine Learning (ML)

+4 more

Join a leading MCommerce company, set your career on a flight towards success and growth.

Our client is one of the oldest fintech companies, taking banking and financial services to all customers through its online platform. Having served over 50 million customers in the last 15 years, it enables over 7 million banking transactions each month through a network of nearly 2 lakh merchants. Using its vast network of merchant outlets, the platform reaches the lower- and mid-income groups who deal in cash, enabling them to remit money across the country digitally. It now plans to take its unique digital financial solutions to developing markets across the globe. As pioneers of mobile-based payment services in India, they empower retailers, individuals, and businesses to have an online presence and earn or save a little extra through their transactions.

As a Head - Data Science, you will be part of the leadership team and will be expected to manage ambiguity & help the Founders & other leaders in building the roadmap forward for the business.

You will be expected to adopt an "iron sharpens iron" attitude, focusing on making everyone and every data-driven process better by blending people leadership and management skills with predictive modelling and analytics expertise, cloud computing skills, and operational know-how.

What you will do:

Working closely with business stakeholders to define, strategize, and execute crucial business problem statements that lie at the core of improving current and future data-backed product offerings
Building and refining underwriting models for extending credit to sellers and API partners in collaboration with the lending team
Conceiving, planning, and prioritizing data projects and managing timelines
Building analytical systems and predictive models as a part of the agile ecosystem
Testing the performance of data-driven products and participating in sprint-wise feature releases
Managing a team of data scientists and data engineers to develop, train, and test predictive models
Managing collaboration with internal and external stakeholders
Building a data-centric culture from within, partnering with every team, learning deeply about the business, and working with highly experienced, sharp, and insanely ambitious colleagues

What you need to have:

B.Tech/M.Tech/MS/PhD in Data Science, Computer Science, Statistics, or Mathematics & Computation from IIT, BITS Pilani, or ISI, with a demonstrated skill set in leading an Analytics and Data Science team
8+ years working in the Data Science and analytics domain, with 3+ years of experience leading a data science team, including understanding which projects to prioritize and how the team strategy aligns with the organization's mission
Deep understanding of the credit risk landscape; should have built or maintained underwriting models for unsecured lending products
Should have handled a leadership team in a tech startup, preferably a fintech, lending, or credit risk startup
We value entrepreneurial spirit: experience starting your own venture is an added advantage
Strategic thinker with agility and endurance
Aware of the latest industry trends in Data Science and Analytics with respect to the fintech, digital transformation, and credit-lending domains
Excellent command of communication, which is key to managing multiple stakeholders such as the leadership team, product teams, and existing and new investors
Cloud computing, Python, SQL, ML algorithms, analytics, and a problem-solving mindset
Knowledge and demonstrated skill set in AWS

Jr Computer Vision Engineer

at Orboai

4 recruiters

Posted by Hardika Bhansali

Noida, Mumbai

1 - 3 yrs

₹6L - ₹15L / yr

TensorFlow

OpenCV

OCR

PyTorch

Keras

+10 more

Who Are We

Orbo is a research-oriented company with expertise in computer vision and artificial intelligence. At its core, Orbo is a comprehensive platform built on an AI-based visual enhancement stack, so companies can find a product suited to their needs in which deep-learning-powered technology automatically improves their imagery.

ORBO's solutions are helping the BFSI, beauty and personal care digital transformation, and e-commerce image retouching industries in multiple ways.

WHY US

Join a top AI company
Grow with your best companions
Continuous pursuit of excellence, equality, respect
Competitive compensation and benefits

You'll be a part of the core team and will be working directly with the founders in building and iterating upon the core products that make cameras intelligent and images more informative.

To learn more about how we work, please check out

https://www.orbo.ai/.

Description:

We are looking for a computer vision engineer to lead our team in developing a factory floor analytics SaaS product. This would be a fast-paced role and the person will get an opportunity to develop an industrial grade solution from concept to deployment.

Responsibilities:

Research and develop computer vision solutions for industries (BFSI, beauty and personal care, e-commerce, defence, etc.)
Lead a team of ML engineers in developing an industrial AI product from scratch
Set up an end-to-end deep learning pipeline for data ingestion, preparation, model training, validation, and deployment
Tune the models to achieve high accuracy rates and minimum latency
Deploy the developed computer vision models on edge devices after optimization to meet customer requirements

Requirements:

Bachelor’s degree
Understanding of the depth and breadth of computer vision and deep learning algorithms
Experience in taking an AI product from scratch to commercial deployment
Experience in image enhancement, object detection, image segmentation, and image classification algorithms
Experience in deployment with OpenVINO, ONNX Runtime, and TensorRT
Experience in deploying computer vision solutions on edge devices such as Intel Movidius and Nvidia Jetson
Experience with machine/deep learning frameworks like TensorFlow and PyTorch
Proficient understanding of code versioning tools, such as Git

Our perfect candidate is someone that:

is proactive and an independent problem solver
is a constant learner. We are a fast growing start-up. We want you to grow with us!
is a team player and good communicator

What We Offer:

You will have fun working with a fast-paced team on a product that can impact the business model of E-commerce and BFSI industries. As the team is small, you will easily be able to see a direct impact of what you build on our customers (Trust us - it is extremely fulfilling!)
You will be in charge of what you build and be an integral part of the product development process
Technical and financial growth!

Data Scientist

at leading pharmacy provider

Agency job

via Econolytics by Jyotsna Econolytics

Noida, NCR (Delhi | Gurgaon | Noida)

4 - 10 yrs

₹18L - ₹24L / yr

Data Science

Python

R Programming

Algorithms

Predictive modelling

Job Description:

• Help build a Data Science team that will research, design, implement, and deploy full-stack, scalable data analytics, vision, and machine learning solutions to address various business issues.
• Model complex algorithms, discover insights, and identify business opportunities through the use of algorithmic, statistical, visualization, and mining techniques.
• Translate business requirements into quick prototypes and enable the development of big data capabilities that drive business outcomes.
• Responsible for data governance and for defining data collection and collation guidelines.
• Must be able to advise, guide, and train junior data engineers in their jobs.

Must Have:

• 4+ years of experience in a leadership role as a Data Scientist
• Preferably from the retail, manufacturing, or healthcare industry (not mandatory)
• Willing to work from scratch and build up a team of Data Scientists
• Open to taking up challenges with end-to-end ownership
• Confident, with excellent communication skills, and a good decision maker

Data Steward

at Infogain

Agency job

via Technogen India PvtLtd by RAHUL BATTA

NCR (Delhi | Gurgaon | Noida), Bengaluru (Bangalore), Mumbai, Pune

7 - 8 yrs

₹15L - ₹16L / yr

Data steward

MDM

Tamr

Reltio

Data engineering

+7 more

Data Steward :

The Data Steward will collaborate and work closely with the group's software engineering and business divisions. The Data Steward has overall accountability for the group's or division's data and reporting posture by responsibly managing data assets, data lineage, and data access, and by supporting sound data analysis. This role requires a focus on data strategy, execution, and support for projects, programs, application enhancements, and production data fixes. The Data Steward makes well-thought-out decisions on complex or ambiguous data issues and establishes the data stewardship and information management strategy and direction for the group. The role communicates effectively with individuals at various levels of the technical and business communities. This individual will become part of the corporate Data Quality and Data Management/entity resolution team supporting various systems across the board.

Primary Responsibilities:

Responsible for data quality and data accuracy across all group/division delivery initiatives.
Responsible for data analysis, data profiling, data modeling, and data mapping capabilities.
Responsible for reviewing and governing data queries and DML.
Accountable for the assessment, delivery, quality, accuracy, and tracking of any production data fixes.
Accountable for the performance, quality, and alignment to requirements for all data query design and development.
Responsible for defining standards and best practices for data analysis, modeling, and queries.
Responsible for understanding end-to-end data flows and identifying data dependencies in support of delivery, release, and change management.
Responsible for the development and maintenance of an enterprise data dictionary that is aligned to data assets and the business glossary for the group.
Responsible for the definition and maintenance of the group's data landscape, including overlays with the technology landscape, end-to-end data flow/transformations, and data lineage.
Responsible for rationalizing the group's reporting posture through the definition and maintenance of a reporting strategy and roadmap.
Partners with the data governance team to ensure data solutions adhere to the organization’s data principles and guidelines.
Owns group's data assets including reports, data warehouse, etc.
Understand customer business use cases and be able to translate them to technical specifications and vision on how to implement a solution.
Accountable for defining the performance tuning needs for all group data assets and managing the implementation of those requirements within the context of group initiatives as well as steady-state production.
Partners with others in test data management and masking strategies and the creation of a reusable test data repository.
Responsible for solving data-related issues and communicating resolutions with other solution domains.
Actively and consistently support all efforts to simplify and enhance the Clinical Trial Prediction use cases.
Apply knowledge in analytic and statistical algorithms to help customers explore methods to improve their business.
Contribute toward analytical research projects through all stages including concept formulation, determination of appropriate statistical methodology, data manipulation, research evaluation, and final research report.
Visualize and report data findings creatively in a variety of visual formats that appropriately provide insight to the stakeholders.
Achieve defined project goals within customer deadlines; proactively communicate status and escalate issues as needed.

Additional Responsibilities:

Strong understanding of the Software Development Life Cycle (SDLC) with Agile Methodologies
Knowledge and understanding of industry-standard/best practices requirements gathering methodologies.
Knowledge and understanding of Information Technology systems and software development.
Experience with data modeling and test data management tools.
Experience in data integration projects.
Good problem-solving and decision-making skills.
Good communication skills within the team, site, and with the customer

Knowledge, Skills and Abilities

Technical expertise in data architecture principles and design aspects of various DBMS and reporting concepts.
Solid understanding of key DBMS platforms like SQL Server, Azure SQL
Results-oriented, diligent, and works with a sense of urgency. Assertive, responsible for his/her own work (self-directed), have a strong affinity for defining work in deliverables, and be willing to commit to deadlines.
Experience in MDM tools like MS DQ, SAS DM Studio, Tamr, Profisee, Reltio etc.
Experience in Report and Dashboard development
Statistical and Machine Learning models
Python (sklearn, numpy, pandas, gensim)

Nice to Have:

1 year of ETL experience
Natural Language Processing
Neural networks and Deep learning
Experience with the Keras, TensorFlow, spaCy, NLTK, and LightGBM Python libraries

Interaction : Frequently interacts with subordinate supervisors.

Education : Bachelor’s degree, preferably in Computer Science, B.E or other quantitative field related to the area of assignment. Professional certification related to the area of assignment may be required

Experience : 7 years of Pharmaceutical /Biotech/life sciences experience, 5 years of Clinical Trials experience and knowledge, Excellent Documentation, Communication, and Presentation Skills including PowerPoint

Data Steward :

Primary Responsibilities:

Responsible for data quality and data accuracy across all group/division delivery initiatives.
Responsible for data analysis, data profiling, data modeling, and data mapping capabilities.
Responsible for reviewing and governing data queries and DML.
Accountable for the assessment, delivery, quality, accuracy, and tracking of any production data fixes.
Accountable for the performance, quality, and alignment to requirements for all data query design and development.
Responsible for defining standards and best practices for data analysis, modeling, and queries.
Responsible for understanding end-to-end data flows and identifying data dependencies in support of delivery, release, and change management.
Responsible for the development and maintenance of an enterprise data dictionary that is aligned to data assets and the business glossary for the group responsible for the definition and maintenance of the group's data landscape including overlays with the technology landscape, end-to-end data flow/transformations, and data lineage.
Responsible for rationalizing the group's reporting posture through the definition and maintenance of a reporting strategy and roadmap.
Partners with the data governance team to ensure data solutions adhere to the organization’s data principles and guidelines.
Owns group's data assets including reports, data warehouse, etc.
Understand customer business use cases and be able to translate them to technical specifications and vision on how to implement a solution.
Accountable for defining the performance tuning needs for all group data assets and managing the implementation of those requirements within the context of group initiatives as well as steady-state production.
Partners with others in test data management and masking strategies and the creation of a reusable test data repository.
Responsible for solving data-related issues and communicating resolutions with other solution domains.
Actively and consistently support all efforts to simplify and enhance the Clinical Trial Prediction use cases.
Apply knowledge in analytic and statistical algorithms to help customers explore methods to improve their business.
Contribute toward analytical research projects through all stages including concept formulation, determination of appropriate statistical methodology, data manipulation, research evaluation, and final research report.
Visualize and report data findings creatively in a variety of visual formats that appropriately provide insight to the stakeholders.
Achieve defined project goals within customer deadlines; proactively communicate status and escalate issues as needed.

Additional Responsibilities:

Strong understanding of the Software Development Life Cycle (SDLC) with Agile Methodologies
Knowledge and understanding of industry-standard/best practices requirements gathering methodologies.
Knowledge and understanding of Information Technology systems and software development.
Experience with data modeling and test data management tools.
Experience in data integration projects.
Good problem-solving and decision-making skills.
Good communication skills within the team, site, and with the customer

Knowledge, Skills and Abilities

Technical expertise in data architecture principles and design aspects of various DBMS and reporting concepts.
Solid understanding of key DBMS platforms like SQL Server, Azure SQL
Results-oriented, diligent, and works with a sense of urgency. Assertive, responsible for his/her own work (self-directed), have a strong affinity for defining work in deliverables, and be willing to commit to deadlines.
Experience in MDM tools like MS DQ, SAS DM Studio, Tamr, Profisee, Reltio etc.
Experience in Report and Dashboard development
Statistical and Machine Learning models
Python (sklearn, numpy, pandas, gensim)
Nice to Have:
1yr of ETL experience
Natural Language Processing
Neural networks and Deep learning
Experience in the Keras, TensorFlow, spaCy, NLTK, and LightGBM Python libraries

Interaction : Frequently interacts with subordinate supervisors.

Sr Data Engineer

at Infogain

Agency job

via Technogen India PvtLtd by RAHUL BATTA

Bengaluru (Bangalore), Pune, Noida, NCR (Delhi | Gurgaon | Noida)

7 - 10 yrs

₹20L - ₹25L / yr

Data engineering

Python

SQL

Spark

PySpark

+10 more

Sr. Data Engineer:

Core Skills – Data Engineering, Big Data, Pyspark, Spark SQL and Python

Candidate with prior Palantir Cloud Foundry OR Clinical Trial Data Model background is preferred

Major accountabilities:

Responsible for Data Engineering, Foundry Data Pipeline Creation, Foundry Analysis & Reporting, Slate Application development, re-usable code development & management and Integrating Internal or External System with Foundry for data ingestion with high quality.
Has a good understanding of the Foundry Platform landscape and its capabilities.
Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
Defines company data assets (data models) and the PySpark and Spark SQL jobs that populate those data models.
Designs data integrations and data quality framework.
Design & Implement integration with Internal, External Systems, F1 AWS platform using Foundry Data Connector or Magritte Agent
Collaborates with data scientists, data analysts, and technology teams to document and leverage their understanding of the Foundry integration with different data sources.
Actively participates in agile work practices.
Coordinates with the Quality Engineer to ensure that all quality controls, naming conventions, and best practices have been followed.

Desired Candidate Profile :

Strong data engineering background
Experience with Clinical Data Model is preferred
Experience in

SQL Server, Postgres, Cassandra, Hadoop, and Spark for distributed data storage and parallel computing
Java and Groovy for our back-end applications and data integration tools
Python for data processing and analysis
Cloud infrastructure based on AWS EC2 and S3

7+ years IT experience, 2+ years’ experience in Palantir Foundry Platform, 4+ years’ experience in Big Data platform
5+ years of Python and Pyspark development experience
Strong troubleshooting and problem solving skills
BTech or master's degree in computer science or a related technical field
Experience designing, building, and maintaining big data pipelines systems
Hands-on experience on Palantir Foundry Platform and Foundry custom Apps development
Able to design and implement data integration between Palantir Foundry and external Apps based on Foundry data connector framework
Hands-on in programming languages primarily Python, R, Java, Unix shell scripts
Hands-on experience in AWS / Azure cloud platform and stack
Strong in API based architecture and concept, able to do quick PoC using API integration and development
Knowledge of machine learning and AI
Skill and comfort working in a rapidly changing environment with dynamic objectives and iteration with users.

Demonstrated ability to continuously learn, work independently, and make decisions with minimal supervision

Data Analyst

at The Smart Cube

1 recruiter

Posted by Jasmine Batra

Remote, Noida, NCR (Delhi | Gurgaon | Noida)

2 - 5 yrs

₹2L - ₹5L / yr

R Programming

Advanced analytics

Python

Marketing analytics

Act as a lead analyst on various data analytics projects aiding strategic decision making for Fortune 500 / FTSE 100 companies, Blue Chip Consulting Firms and Global Financial Services companies
Understand the client objectives, and work with the PL to design the analytical solution/framework. Be able to translate the client objectives / analytical plan into clear deliverables with associated priorities and constraints
Collect/Organize/Prepare/Manage data for the analysis and conduct quality checks
Use and implement basic and advanced statistical techniques like frequencies, cross-tabs, correlation, Regression, Decision Trees, Cluster Analysis, etc. to identify key actionable insights from the data
Develop complete sections of final client report in Power Point. Identify trends and evaluate insights in terms of logic and reasoning, and be able to succinctly present them in terms of an executive summary/taglines
Conduct sanity checks of the analysis output based on reasoning and common sense, and be able to do a rigorous self QC, as well as of the work assigned to analysts to ensure an error free output
Aid in decision making related to client management, and also be able to take client calls relatively independently
Support the project leads in managing small teams of 2-3 analysts, independently set targets and communicate to team members
Discuss queries/certain sections of deliverable report over client calls or video conferences

Technical Skills:

Hands on experience of one or more statistical tools such as SAS, R and Python
Working knowledge or experience in using SQL Server (or other RDBMS tools) would be an advantage

Work Experience:

2-4 years of relevant experience in Marketing Analytics / MR.
Experience in managing, cleaning and analysis of large datasets using statistical packages like SAS, R, Python, etc.
Experience in data management using SQL queries on tools like Access/ SQL Server

Senior Software Engineer

at LimeTray

5 recruiters

Posted by tanika monga

NCR (Delhi | Gurgaon | Noida)

4 - 6 yrs

₹15L - ₹18L / yr

Machine Learning (ML)

Python

Cassandra

MySQL

Apache Kafka

+2 more

Requirements:

Minimum 4-years work experience in building, managing and maintaining Analytics applications
B.Tech/BE in CS/IT from Tier 1/2 Institutes
Strong Fundamentals of Data Structures and Algorithms
Good analytical & problem-solving skills
Strong hands-on experience in Python
In depth Knowledge of queueing systems (Kafka/ActiveMQ/RabbitMQ)
Experience in building Data pipelines & Real time Analytics Systems
Experience in SQL (MYSQL) & NoSQL (Mongo/Cassandra) databases is a plus
Understanding of Service Oriented Architecture
Delivered high-quality work with a significant contribution
Expert in git, unit tests, technical documentation and other development best practices
Experience in Handling small teams
