Data Engineer - Pyspark

at Citi

Bulge Bracket Investment Banks

Mid Level | No visa sponsorship | Data Engineering

Posted a month ago

Compensation: Not specified
Currency: Not specified
City: Not specified
Country: India

Data Engineer role focused on designing and optimizing big-data systems using Spark (Python/Scala), Hive, Hadoop and cloud-based data management technologies. The position requires hands-on Python/Scala development, Unix scripting, strong SQL skills, ETL/data ingestion experience and experience working with large datasets and data warehouses. Experience with performance tuning and code re-engineering, exposure to ML techniques and familiarity with tools like Talend, Cloudera and container/automation tooling are a plus; excellent communication and stakeholder management skills are required.

Job Req Id: 26931109
Location(s): Haryana, India
Job Type: Hybrid
Posted: Jan. 28, 2026

Discover your future at Citi

Working at Citi is far more than just a job. A career with us means joining a team of more than 230,000 dedicated people from around the globe. At Citi, you’ll have the opportunity to grow your career, give back to your community and make a real impact.

Job Overview

Responsibilities:

  • Engineering degree with 4+ years of experience in big-data systems, Hive, Hadoop, Spark (Python/Scala) and cloud-based data management technologies
  • Hands-on experience in Unix scripting, Python and Scala programming, along with strong experience in SQL.
  • Comfortable working with completely unstructured, undocumented code and turning it into best-in-class code by redesigning costly compute and data processes and aligning them with best development standards
  • Experienced in working with large and multiple datasets and data warehouses, with the ability to pull data using relevant programs and coding.
  • Well versed in the necessary data preprocessing and application engineering skills
  • At least 3 years of experience designing software systems with intense computational needs across real-time and batch processes.
  • Experience with and understanding of supervised and unsupervised machine learning techniques
  • Exposure to data ingestion, ETL tools such as Talend, modeling tools, performance management tooling such as Pepperdata, and the Cloudera stack will be a plus
  • Knowledge of data management, data governance, data security and regulatory practices
  • Ability to identify, clearly articulate and solve complex business problems and present them to management in a structured and simpler form
  • Should have experience working in an onsite/offsite delivery model
  • Experience working with large and multiple datasets and data warehouses, with the ability to pull data using relevant programs and coding.
  • Experience in Credit Cards and Retail Banking
  • Should have excellent communication and interpersonal skills
  • Strong process/project management skills
  • Management of multiple stakeholders
  • Control-oriented, with risk awareness


Qualifications:

  • Fast learner with a desire to excel and an attitude to partner and solve problems in complex environments, placing business objectives at the center of all activity.
  • Experience in performance tuning and code re-engineering is preferred.
  • Experience in broad IT architecture and design across data and channels is preferred
  • Experience in query tuning and automation technologies (Autosys, Jenkins, ServiceNow) is preferred
  • Exposure to container technology and machine learning will be a plus


Education:

  • Bachelor's/University degree or equivalent experience

------------------------------------------------------

Job Family Group: Decision Management

------------------------------------------------------

Job Family: Data/Information Management

------------------------------------------------------

Time Type: Full time

------------------------------------------------------

Most Relevant Skills

Please see the requirements listed above.

------------------------------------------------------

Other Relevant Skills

For complementary skills, please see above and/or contact the recruiter.

------------------------------------------------------

Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.

If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity, review Accessibility at Citi.

View Citi’s EEO Policy Statement and the Know Your Rights poster.
