Apache Flume Jobs in Delhi, NCR and Gurgaon

Apply to 11+ Apache Flume Jobs in Delhi, NCR and Gurgaon on CutShort.io. Explore the latest Apache Flume Job opportunities across top companies like Google, Amazon & Adobe.

Big Data Evangelist

at UpX Academy

2 recruiters

Posted by Suchit Majumdar

Noida, Hyderabad, NCR (Delhi | Gurgaon | Noida)

2 - 6 yrs

₹4L - ₹12L / yr

Spark

Hadoop

MongoDB

Python

Scala

+3 more

Looking for a technically sound, excellent trainer in big data technologies. Get an opportunity to become well known in the industry and gain visibility. Host regular sessions on big data technologies and get paid to learn.

Lead Data Analytics

at Semi Stealth Mode startup in Delhi

Agency job

via Qrata by Blessy Fernandes

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

3 - 6 yrs

₹35L - ₹40L / yr

Data Analytics

Python

Data Visualization

SQL

A Delhi NCR based Applied AI & Consumer Tech company tackling one of the largest unsolved consumer internet problems of our time. We are a motley crew of smart, passionate and nice people who believe you can build a high performing company with a culture of respect aka a sports team with a heart aka a caring meritocracy.

Our illustrious angels include unicorn founders, serial entrepreneurs with exits, tech & consumer industry stalwarts and investment professionals/bankers.

We are hiring for our founding team (in Delhi NCR only, no remote) that will take the product from prototype to a landing! Opportunity for disproportionate non-linear impact, learning and wealth creation in a classic 0-1 with a Silicon Valley caliber founding team.

Key Responsibilities:

1. Data Strategy and Vision:

· Develop and drive the company's data analytics strategy, aligning it with overall business goals.

· Define the vision for data analytics, outlining clear objectives and key results (OKRs) to measure success.

2. Data Analysis and Interpretation:

· Oversee the analysis of complex datasets to extract valuable insights, trends, and patterns.

· Utilize statistical methods and data visualization techniques to present findings in a clear and compelling manner to both technical and non-technical stakeholders.

3. Data Infrastructure and Tools:

· Evaluate, select, and implement advanced analytics tools and platforms to enhance data processing and analysis capabilities.

· Collaborate with IT teams to ensure a robust and scalable data infrastructure, including data storage, retrieval, and security protocols.

4. Collaboration and Stakeholder Management:

· Collaborate cross-functionally with teams such as marketing, sales, and product development to identify opportunities for data-driven optimizations.

· Act as a liaison between technical and non-technical teams, ensuring effective communication of data insights and recommendations.

5. Performance Measurement:

· Establish key performance indicators (KPIs) and metrics to measure the impact of data analytics initiatives on business outcomes.

· Continuously assess and improve the accuracy and relevance of analytical models and methodologies.

Qualifications:

Bachelor's or Master's degree in Data Science, Statistics, Computer Science, or related field.
Proven experience (5+ years) in data analytics, with a focus on leading analytics teams and driving strategic initiatives.
Proficiency in data analysis tools such as Python, R, SQL, and advanced knowledge of data visualization tools.
Strong understanding of statistical methods, machine learning algorithms, and predictive modelling techniques.
Excellent communication skills, both written and verbal, to effectively convey complex findings to diverse audiences.

Data Scientist

at Series B funded product startup

Agency job

via Qrata by Blessy Fernandes

Delhi

2 - 5 yrs

₹8L - ₹14L / yr

Data Science

Machine Learning (ML)

Python

Java

Job Title - Data Scientist

Job Duties

Data Scientist responsibilities include planning projects and building analytics models.
You should have a strong problem-solving ability and a knack for statistical analysis.
If you're also able to align our data products with our business goals, we'd like to meet you. Your ultimate goal will be to help improve our products and business decisions by making the most out of our data.

Responsibilities

Own end-to-end business problems and metrics, build and implement ML solutions using cutting-edge technology.

Create scalable solutions to business problems using statistical techniques, machine learning, and NLP.

Design, experiment with, and evaluate highly innovative models for predictive learning.

Work closely with software engineering teams to drive real-time model experiments, implementations, and new feature creations

Establish scalable, efficient, and automated processes for large-scale data analysis, model development, deployment, experimentation, and evaluation.

Research and implement novel machine learning and statistical approaches.

Requirements

2-5 years of experience in data science.

In-depth understanding of modern machine learning techniques and their mathematical underpinnings.

Demonstrated ability to build PoCs for complex, ambiguous problems and scale them up.

Strong programming skills (Python, Java)

High proficiency in at least one of the following broad areas: machine learning, statistical modelling/inference, information retrieval, data mining, NLP

Experience with SQL and NoSQL databases

Strong organizational and leadership skills

Excellent communication skills

Data Scientist

at Fintech lead

Agency job

via The Hub by Sridevi Viswanathan

Gurugram, Noida

3 - 8 yrs

₹5L - ₹15L / yr

Natural Language Processing (NLP)

BERT

Machine Learning (ML)

Data Science

Python

+1 more

Who we are looking for

· A Natural Language Processing (NLP) expert with strong computer science fundamentals and experience in working with deep learning frameworks. You will be working at the cutting edge of NLP and Machine Learning.

Roles and Responsibilities

· Work as part of a distributed team to research, build and deploy Machine Learning models for NLP.

· Mentor and coach other team members

· Evaluate the performance of NLP models and ideate on how they can be improved

· Support internal and external NLP-facing APIs

· Keep up to date on current research around NLP, Machine Learning and Deep Learning

Mandatory Requirements

· Any bachelor's degree, with at least 2 years of demonstrated experience as a Data Scientist.

Behavioural Skills

· Strong analytical and problem-solving capabilities.

· Proven ability to multi-task and deliver results within tight time frames

· Must have strong verbal and written communication skills

· Strong listening skills and eagerness to learn

· Strong attention to detail and the ability to work efficiently in a team as well as individually

Technical Skills

Hands-on experience with

· NLP

· Deep Learning

· Machine Learning

· Python

· BERT

Preferred Requirements

· Experience in Computer Vision is preferred

Role: Data Scientist

Industry Type: Banking

Department: Data Science & Analytics

Employment Type: Full Time, Permanent

Role Category: Data Science & Machine Learning

Python developer

at codersbrain

1 recruiter

Posted by Tanuj Uppal

Delhi

4 - 8 yrs

₹2L - ₹15L / yr

Spark

Hadoop

Big Data

Data engineering

PySpark

+5 more

Mandatory - Hands-on experience in Python and PySpark.

Build PySpark applications using Spark DataFrames in Python, working in Jupyter Notebook and the PyCharm IDE.

Experience optimizing Spark jobs that process huge volumes of data.

Hands-on experience with version control tools like Git.

Experience with Amazon's analytics services such as Amazon EMR, Lambda functions, etc.

Experience with Amazon's compute services such as AWS Lambda and Amazon EC2, Amazon's storage services such as S3, and a few other services such as SNS.

Experience with or knowledge of bash/shell scripting will be a plus.

Experience working with fixed-width, delimited, and multi-record file formats, etc.

Hands-on experience with tools like Jenkins to build, test and deploy applications.

Awareness of DevOps concepts and the ability to work in an automated release pipeline environment.

Excellent debugging skills.

Data Scientist

at Information Solution Provider Company

Agency job

via Jobdost by Sathish Kumar

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

3 - 7 yrs

₹10L - ₹15L / yr

SQL

Hadoop

Spark

Machine Learning (ML)

Data Science

+3 more

Job Description:

The data science team is responsible for solving business problems with complex data. Data complexity can be characterized in terms of volume, dimensionality and multiple touchpoints/sources. We understand the data, ask fundamental, first-principles questions, and apply our analytical and machine learning skills to solve the problem in the best way possible.

Our ideal candidate

The role would be a client facing one, hence good communication skills are a must.

The candidate should have the ability to communicate complex models and analysis in a clear and precise manner.

The candidate would be responsible for:

Comprehending business problems properly - what to predict, how to build DV, what value addition he/she is bringing to the client, etc.
Understanding and analyzing large, complex, multi-dimensional datasets and building features relevant to the business
Understanding the math behind algorithms and choosing one over another
Understanding approaches like stacking and ensembling, and applying them correctly to increase accuracy

Desired technical requirements

Proficiency with Python and the ability to write production-ready code.
Experience in PySpark, machine learning and deep learning.
Big data experience, e.g. familiarity with Spark and Hadoop, is highly preferred.
Familiarity with SQL or other databases.

MongoDB Administrator

at Getkart Pvt Ltd

1 recruiter

Posted by Pooja Jha

Delhi, Gurugram, Noida, Ghaziabad, Faridabad

3 - 5 yrs

₹5L - ₹12L / yr

MongoDB

NOSQL Databases

Designing, building, and automating the MongoDB Architecture for open-source MongoDB.
Good understanding of DB schema design, performance, tuning and capacity planning.
The ideal candidate has worked with modern open-source MongoDB platforms, cloud deployment models, and test-driven development in a fast-paced agile environment.
In-depth understanding of data management (e.g. permissions, recovery, security and monitoring). Operational experience with MongoDB.
Data modelling. Operational experience with indexes.
Good understanding of MongoDB replica sets, the oplog and journals.
Provide advice and support to other development resources interacting with MongoDB.
Troubleshoot any problems that may come up with the database environments.
Skilled in performance tuning and optimization using native monitoring and troubleshooting tools.
Provide guidance in the creation and modification of standards and procedures.
Experience working with cloud database services is a plus.
Experience working in an Agile Scrum environment.
Experience working with aggregation in MongoDB.
Strong communication and documentation skills, and technology awareness.

Python developer

at Gauge Data Solutions Pvt Ltd

2 recruiters

Posted by Deeksha Dewal

Noida

0 - 4 yrs

₹3L - ₹8L / yr

Data Science

Machine Learning (ML)

Natural Language Processing (NLP)

Computer Vision

Artificial Intelligence (AI)

+4 more

Essential Skills :

- Develop, enhance and maintain Python related projects, data services, platforms and processes.

- Apply and maintain data quality checks to ensure data integrity and completeness.

- Able to integrate multiple data sources and databases.

- Collaborate with cross-functional teams across Decision Sciences, Search, and Database Management to design innovative solutions, capture requirements and drive a common future vision.

Technical Skills/Capabilities :

- Hands-on experience in the Python programming language.

- Understanding and proven application of Computer Science fundamentals in object oriented design, data structures, algorithm design, Regular expressions, data storage procedures, problem solving, and complexity analysis.

- Understanding of natural language processing and basic ML algorithms will be a plus.

- Good troubleshooting and debugging skills.

- Strong individual contributor, self-motivated, and a proven team player.

- Eager to learn and develop new experience and skills.

- Good communication and interpersonal skills.

About Company Profile :

Gauge Data Solutions Pvt Ltd :

- We are a leading company in Data Science, Machine Learning and Artificial Intelligence.

- Within Gauge Data, we have a competitive environment for developers and engineers.

- We at Gauge create solutions for real-world problems. One such example of our engineering is Casemine.

- Casemine is a legal research platform powered by Artificial Intelligence. It helps lawyers, judges and law researchers in their day to day life.

- Casemine provides exhaustive case results to its users with the use of cutting edge technologies.

- It is developed with the efforts of great engineers at Gauge Data.

- One such opportunity is now open for you. We at Gauge Data invite applications from competitive, self-motivated Python developers.

Purpose of the Role :

- This position will play a central role in developing new features and enhancements for the products and services at Gauge Data.

- To know more about what we do and how we do it, feel free to read these articles:

- https://bit.ly/2YfVAsv

- https://bit.ly/2rQArJc

- You can also visit us at https://www.casemine.com/.

- For more information visit us at: - www.gaugeanalytics.com

- Join us on LinkedIn, Twitter & Facebook

Data Governance Engineer

at European Bank headquartered at Copenhagen, Denmark.

Agency job

via Apical Mind by Rajeev T

NCR (Delhi | Gurgaon | Noida)

2 - 12 yrs

₹25L - ₹40L / yr

Data governance

DevOps

Data integration

Data engineering

Python

+14 more

Data Platforms (Data Integration) is responsible for envisioning, building and operating the Bank’s data integration platforms. The successful candidate will work out of Gurgaon as part of a high-performing team that is distributed across our two development centers – Copenhagen and Gurugram. The individual must be driven, passionate about technology and display a level of customer service that is second to none.

Roles & Responsibilities

Designing and delivering a best-in-class, highly scalable data governance platform
Improving processes and applying best practices
Contributing to all scrum ceremonies; assuming the role of ‘scrum master’ on a rotational basis
Development, management and operation of our infrastructure to ensure it is easy to deploy, scalable, secure and fault-tolerant
Flexible on working hours as per business needs

Data Engineer

at Paisabazaar.com

3 recruiters

Posted by Amit Gupta

NCR (Delhi | Gurgaon | Noida)

1 - 5 yrs

₹6L - ₹18L / yr

Spark

MapReduce

Hadoop

ETL

We are looking for a Big Data Engineer with at least 3-5 years of experience as a Big Data Developer/Engineer.
Experience with Big Data technologies and tools like Hadoop, Hive, MapR, Kafka, Spark, etc.
Experience in architecting data ingestion, storage and consumption models.
Experience with NoSQL databases like MongoDB, HBase, Cassandra, etc.
Knowledge of various ETL tools and techniques.

Data Scientist

at YCH Logistics

1 recruiter

Posted by Sanatan Upmanyu

NCR (Delhi | Gurgaon | Noida)

0 - 5 yrs

₹2L - ₹5L / yr

Python

Deep Learning

MySQL

Job Description: Data Science Analyst / Data Science Senior Analyst

KSTYCH is seeking a Data Science Analyst to join our Data Science team. Individuals in this role are expected to be comfortable working as both a software engineer and a quantitative researcher, and should have a significant theoretical foundation in mathematical statistics. The ideal candidate will have a keen interest in the study of the pharma sector, network biology, text mining and machine learning, and a passion for identifying and answering questions that help us build the best consulting resource and provide continuous support to other teams.

Responsibilities

Work closely with product, scientific, medical, business development and commercial teams to identify and answer important healthcare/pharma/biology questions.
Answer questions by using appropriate statistical techniques and tools on available data.
Communicate findings to project managers and team managers.
Drive the collection of new data and the refinement of existing data sources.
Analyze and interpret the results of experiments.
Develop best practices for instrumentation and experimentation and communicate those to other teams.

Requirements

B.Tech, M.Tech, M.S. or Ph.D. in a relevant technical field, or 1+ years of experience in a relevant role.
Extensive experience solving analytical problems using quantitative approaches.
Comfort manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources.
A strong passion for empirical research and for answering hard questions with data.
A flexible analytic approach that allows for results at varying levels of precision.
Ability to communicate complex quantitative analysis in a clear, precise, and actionable manner.
Fluency with at least one scripting language such as Python or PHP.
Familiarity with relational databases and SQL.
Experience working with large data sets; experience working with distributed computing tools is a plus (KNIME, Map/Reduce, Hadoop, Hive, etc.).

Get to hear about interesting companies hiring right now

Follow Cutshort

Why apply via Cutshort?

Connect with actual hiring teams and get their fast response. No spam.

Find more jobs
