We are looking for a Data Engineer passionate about building scalable, reliable data infrastructure that powers AI. You will help make data accessible, trustworthy, and ready for machine learning forming the foundation of Aily’s decision intelligence platform.
About the team
Our Data team owns the full data lifecycle—from ingestion and transformation to quality and delivery. We build scalable pipelines and APIs that power insights across domains like Finance, R&D, and Go-to-Market.
What you’ll work on
- Designing and maintaining scalable, multi-tenant data pipelines
- Ensuring data quality, reliability, and governance
- Collaborating with Product, Engineering, and Data Science teams
- Languages & Frameworks: Python is our core for pipelines, APIs, and CLI tooling. We use FastAPI and Pydantic to serve typed, high-performance REST APIs to our platform.
- Data Transformation & Modeling: We use dbt for our SQL-based transformation layer and SQLModel with Alembic for ORM-based schema management and versioned migrations.
- Storage & Analytics: We utilize DuckDB and DuckLake as our embedded analytics engine, all hosted on a robust AWS infrastructure (S3, IAM).
- Orchestration & Workflow: Apache Airflow handles our complex event-driven and scheduled job flows.
- Quality & Engineering Excellence: We prioritize reliability through pytest for QA validations, GitHub for version control, and a rigorous PR-based code review process.
