The right talent can transform your business—and we make that happen. At Collabera, we go beyond staffing to deliver strategic workforce solutions that drive growth, innovation, and agility. With deep industry expertise, a global talent network, and a people-first approach, we connect you with professionals who don’t just fit the role but elevate your business. Partner with us and build a workforce that powers success.
Lead Data Engineer
Direct Hire: Houston, Texas, US span>
Salary Range: 140000.00 - 155000.00 | Per Annum
Job Code: 368561
End Date: 2026-05-08
Days Left: 15 days, 5 hours left
Job Title: Lead Data Engineer
Location: Houston, TX 77002
Schedule: onsite with half day's on Friday's
Salary: $140-155k range -
Day to day:
- Design and implement reliable data pipelines to integrate disparate data sources into a single Data Lakehouse.
- Design and implement data quality pipelines to ensure data correctness and building trusted datasets.
- Design and implement a Data Lakehouse solution which accurately reflects business operations.
- Assist with data platform performance tuning and physical data model support including partitioning and compaction.
- Provide guidance in data visualizations and reporting efforts to ensure solutions are aligned to business objectives.
- Automate and optimize the data lifecycle, find insights from raw data, and applying DevOps principle to data pipelines.
- Work with business leaders to deliver custom software solutions meeting data needs.
- Build and support a data platform for data engineering teams to build, deploy and manage applications
Education:
- Bachelor's degree in Computer Science, Information Technology, or a related field.
Must Haves:
- 5+ years of experience as a Data Engineer designing and maintaining data pipeline architectures.
- 5+ years of in-depth programming experience in Python and SQL.
- 5+ years in software development lifecycle experience with software engineering, development, testing, version control, refactoring, and deployment.
- Experience with common Python Data Engineering packages including pandas, Numpy, Pyarrow, Pytest, Scikit-Learn, and Boto3.
- Experience in implementing a Data Lakehouse using Apache Iceberg or Delta Lake.
- Experience with data platform architecture responsible for high-level design, strategy and implementation of data infrastructure, including data modelling, designing scalable architectures and ensuring data governance, security and compliance.
- Knowledgeable of modern data platform technologies including Apache Airflow, Kubernetes, and S3 Object Storage.
- Experience with AWS, Snowflake, dbt and Airbyte is preferred.
- Experience with infrastructure as code, building consistent and repeatable cloud infrastructure. Preferred Skills
- Experience in the midstream industry, supply chain logistics, or chemical engineering.
- Familiarity with commercial systems like OpenLink or Quorum.
Soft Skills:
- Strong communication skills and a proactive, go-getter attitude. Interview Process.
This is a direct hire opportunity. The selected candidate will be employed directly by our client. All compensation and benefits, including but not limited to medical insurance, retirement plans, paid time off, and other perks, will be provided by the client in accordance with their internal policies and subject to applicable laws and eligibility requirements.
Job Requirement
- data
- SQL
- Python
Reach Out to a Recruiter
- Recruiter
- Phone
- Jahnavi Jena
- jahnavi.jena@collabera.com