Are you a Big Data Engineer or Developer who can deliver consulting services including planning, designing and implementing new solutions using the latest Big Data technologies? Do you want to work remotely to implement and help develop a cutting edge Big Data solutions, create data pipelines which will migrate data from customers on-prem systems and load it into a cloud hosted Enterprise Data Platforms? Do you want to work on large scale custom Big Data consulting projects? If you have a passion for big data and solving complex problems then this could be the job for you!
Role & Responsibilities
Working with the Data Architects to implement data pipelines
Working with our Big Data Principal Architects in the development both proof of concepts and complete implementations.
Working on complex and varied Big Data projects including tasks such as collecting, parsing, managing, analyzing, and visualizing very large datasets.
Translating complex functional and technical requirements into detailed designs.
Writing high-performance, reliable and maintainable code.
Performing data processing requirements analysis.
Performance tuning for batch and real-time data processing.
Securing components of clients' Big Data platforms.
Diagnostics and troubleshooting of operational issues.
Health-checks and configuration reviews.
Data pipelines development - ingestion, transformation, cleansing.
Data flow integration with external systems.
Integration with data access tools and products.
Assisting application developers and advising on efficient data access and manipulations.
Defining and implementing efficient operational processes
Skills & Qualifications
While we realize you might not have everything on the list to be the successful candidate for the Big Data Developer job you will likely have at least 3 years experience in similar roles. The position requires specialized knowledge and experience in performing the following:
Experience building data pipelines in any public cloud (AWS Glue, GCP Dataflow, Azure DataFactory) or any equivalent
Experience writing ETL (Any popular tools)
Experience in data modeling, data design and persistence (e.g. warehousing, data marts, data lakes).
Strong Knowledge of Big Data architectures and distributed data processing frameworks: Hadoop, Spark, Kafka, Hive
Experience and working knowledge of various development platforms, frameworks and languages such as Java, Python, Scala and SQL
Experience with Apache Airflow, Oozie and Nifi would be great
General knowledge of modern data-center and cloud infrastructure including server hardware, networking and storage.
Strong written and verbal English communication skills
Bonus Point Skills & Qualifications
Experience with BI platforms, reporting tools, data visualization products, ETL engines.
Experience with data streaming frameworks.
DevOps experience with a good understanding of continuous delivery and deployment patterns and tools (Jenkins, Artifactory, Maven, etc)
Experience with Hbase.
Experience in data management best practices, real-time and batch data integration, and data rationalization
DATA SCIENCE TECHNOLOGIES LLC is an equal opportunity employer inclusive of female, minority, disability and veterans, (M/F/D/V). Hiring, promotion, transfer, compensation, benefits, discipline, termination and all other employment decisions are made without regard to race, color, religion, sex, sexual orientation, gender identity, age, disability, national origin, citizenship/immigration status, veteran status or any other protected status. DATA SCIENCE TECHNOLOGIES LLC will not make any posting or employment decision that does not comply with applicable laws relating to labor and employment, equal opportunity, employment eligibility requirements or related matters. Nor will DATA SCIENCE TECHNOLOGIES LLC require in a posting or otherwise U.S. citizenship or lawful permanent residency in the U.S. as a condition of employment except as necessary to comply with law, regulation, executive order, or federal, state, or local government contract