We are looking Python ETL Developer having more than 3 years of experience in the design and implementation of low-latency, high-availability, and performant applications
Responsibilities:
- Python, PySpark, ETL Tools experience like (DSS or Informatica OR AWS Glue OR Jitterbit)
- Python language for data processing (panda, PySpark)
- Data analysis using PySpark (and SQL)• Knowledge (and if possible experiment) in a Hadoop / HDFS file systems big data environment
- ETL Concepts (preferably exposure to Dataiku DSS (Data Science Studio) or any other ETL tool)
- Writing reusable, testable, and efficient code
- Design and implementation of low-latency, high-availability, and performant applications
- Integration of user-facing elements developed by front-end developers with server-side logic
- Implementation of security and data protection
- Integration of data storage solutions {may include databases, key-value stores, blob stores, etc.}
Required:
- Minimum 4 years of experience is required.
- A computer degree is essential.
- Python Scripting-L3, (Mandatory) and Cloud-IaaS-Compute-Amazon Web Services-AWS-L3, (Optional), Python.
- Application Development (Mandatory)
- As a Lead, you are responsible for managing a small team of analysts, developers, testers, or engineers and drive the delivery of a small module within a project (Delivery/Maintenance/Testing)
- You may serve as an entry-level specialist with expertise in a particular technology/industry domain/a process/application/product.
- You are responsible for the functional/technical track of a project.