Data Engineer (AWS)
Vwgds
Lisbon/PortoHybridUnknownSalary not listed
Job details
We're looking for Data Engineers to join our team in Lisbon, hiring multiple experience positions, from graduate to senior.
You'll design and operate production-grade data pipelines on AWS, working with vehicle data at massive scale (datasets in the order of multiple terabytes) that power analytics and decision-making across the business.
This role involves working with vehicle signals and telemetry data into structured, analytics-ready datasets — a unique technical challenge that combines distributed processing, domain knowledge and engineering rigor.
You'll own architecture decisions, collaborate with international stakeholders (including German teams), and contribute to a mature data platform
You’ll be focused in
• Designing, building and maintaining data pipeline architecture on AWS for ingestion, transformation and curation
• Processing and decoding large-scale vehicle datasets (multi-terabyte) — raw signals and telemetry transformed into ready-to-use analytics data
• Assembling large, complex datasets meeting business and analytics requirements
• Implementing ETL/ELT infrastructure using Python, PySpark, SQL and AWS big data services
• Building analytics tooling on top of these pipelines to deliver actionable business insights
• Working at scale - getting data into a ready-to-use state in close alignment with business stakeholders
In order to succeed, you’ll need
• +5 years of hands-on experience as a Data Engineer or Software Engineer working on big-data environments
• Strong Python — production-grade, not scripting
• PySpark / Spark — real experience with distributed processing (academic exposure acceptable for junior profiles; production experience expected for senior)
• AWS core stack: S3, Glue, Athena, Lambda, IAM — hands-on experience
• SQL — comfortable with complex queries, optimization and data modeling
• ETL pipelines — proven experience building and operating them
• Knowledge of Data Lake, Data Warehouse and RDS architectures
• Fluent English, written and spoken (you'll work daily with international teams)
Nice to have
• Experience in the automotive sector or working with vehicle/telemetry data
• IaC: AWS CDK, Terraform, or equivalent
• CI/CD pipelines (GitHub Actions, GitLab CI)
• SQS, SNS, EventBridge, Step Functions
• Experience with Apache Iceberg or Hive — or strong willingness to learn Iceberg
• Experience working with large-scale datasets (TB-scale