Software Development Engineer (Data Engineer) - SDE II
Indexed description
Location: Mumbai, MH
- Education: Bachelor’s in CS / IT or related field
Our mission is to build “Unbreakable Systems”—platforms that don’t just survive failures but absorb them, self-heal, and scale with the business calmly and predictably. We are looking for engineers who want to stop “fixing” and start “architecting” at the highest level.
THE PURPLLE WAY: AI-NATIVE ENGINEERING
We Believe That As a Business Scales, Technology Shouldn’t Become More Fragile. We Have Redesigned Our Entire Engineering Lifecycle To Be AI-native, Moving Away From Manual Toil Toward a World Of Intelligent, Self-governing Systems
- Velocity without Fear (70% AI-Assisted): We ship fast because we’ve built the safety nets to do so. You won’t just be a “coder”; you will be a system architect. We empower our engineers to use AI for the heavy lifting of development—aiming for 70% of Pull Requests to be AI-assisted—so you can focus on complex logic and high-level architecture.
- Built-in Guardrails & Self-Healing Systems: Security and quality aren’t “extra steps”—they are baked into our “Golden Paths.” We build systems that use AI for automated rollbacks and deterministic fallback logic, ensuring the platform absorbs mistakes automatically and remains resilient under pressure.
- Intelligence by Default (Zero-Toil Operations): We embed AI across the entire lifecycle—Build, Test, Detect, and Operate. By utilizing AI agents to eliminate “Ops Toil,” we’ve created an environment where anomaly detection and incident diagnosis happen in minutes, not hours.
- Predictable Economics & Growth: We treat cloud costs and system performance as core engineering disciplines. Every engineer at Purplle understands the “unit economics” of their code, ensuring our infrastructure scales as efficiently as our business.
We Aren’t Looking For “feature Factories”; We Are Looking For Owners Of a Resilient Ecosystem. At Purplle, Your Success Is Defined By Your Mindset And The Tangible Impact You Have On Our Platform
The Systems Thinker (Quality & Resilience): You hate fixing the same bug twice. You’d rather build a “self-healing” mechanism than perform a manual fix. Your goal is to maintain a Change Failure Rate of <1.5% and help us reach a state of fewer than 2 Sev-1 incidents per quarter.The AI-Fluent Architect (High Velocity): You are already using AI tools to 10x your own productivity. You leverage these tools to drive a cycle time of The Data-Informed Engineer (Economic Discipline): You care about the “why” behind the code. You understand that latency and cloud spend are engineering disciplines, and you optimize your infrastructure so that our cost-per-order remains predictable as we scale.
In This Role, You Will Be Directly Responsible For
- AI-Augmented Pipeline Architecture: Design and build self-healing, schema-validated data pipelines on GCP (BigQuery, Dataflow, Pub/Sub) that power real-time analytics. Pipelines must be idempotent, observable, and auto-recover from failures without human intervention.
- Streaming & Batch Processing at Scale: Implement and optimise large-scale processing workflows using Apache Spark, Flink, or Beam. Own both real-time Kafka consumer pipelines and scheduled batch jobs, ensuring sub-hour SLAs for data freshness across all critical business domains.
- Schema Contracts & Data Integrity: Implement and enforce Schema Registry and data-contract standards across all producers and consumers: target 100% data quality scores—no silent failures, no downstream surprises.
- ML Pipeline Integration: Build and maintain feature stores and data pipelines that feed production ML models (recommender systems, demand forecasting, pricing). Collaborate with Data Science to ensure training-serving skew is eliminated.
- Data Governance & Cost Engineering: Champion data cataloguing, lineage tracking, and access-control policies. Monitor BigQuery slot usage and pipeline costs, and implement optimizations that reduce cloud spend by measurable percentages each quarter.
- Languages: Python, Scala, Java, SQL
- Processing: Apache Spark, Apache Beam / Dataflow, Apache Flink, Kafka, Databricks
- Cloud & Storage: GCP (BigQuery, Dataflow, Pub/Sub, DataProc, GCS), AWS (S3, EMR)
- Orchestration: Apache Airflow (Cloud Composer)
- Data Quality: Great Expectations, dbt
- Infrastructure: Kubernetes, Docker, Terraform, Databricks
- AI Productivity: Cursor, Claude
- DE2 — 3–5 years; end-to-end ownership of production pipelines at scale. Ability to ship data products independently with clear quality and latency guarantees.
You will be joining a team that is actively moving away from the “chaos” of traditional scaling. At Purplle, we are building a tech culture that is “Operational, not Aspirational.”
We provide the tools, the AI-first environment, and the “Golden Paths” to let you do the best work of your career. If you want to build intelligent, unbreakable systems that scale with confidence, your journey starts here.
Create a free Caio profile to unlock more results and save your role and location preferences.
Unlock free search