Senior Software Engineer - Data Infrastructure
Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business, helping them explore our data quickly and safely to get the insights they need, which ultimately helps Plaid serve our customers more effectively.
Engineers on Data Infrastructure are domain experts in Data Warehouse, Data Lakehouse, Spark, Workflow Orchestration, and Streaming technologies. We scale our existing data pipelines in a performant and cost-efficient way while creating the abstractions needed to make developing on top of this platform extremely simple for other engineers at Plaid.
Responsibilities
- Contributing to the long-term technical roadmap for data-driven and machine learning iteration at Plaid.
- Leading key data infrastructure projects such as improving ML development golden paths, implementing offline streaming solutions for data freshness, building net-new ETL pipeline infrastructure, and evolving data warehouse or data lakehouse capabilities.
- Working with stakeholders in other teams and functions to define technical roadmaps for key backend systems and abstractions across Plaid.
- Debugging, troubleshooting, and reducing operational burden for our Data Platform.
- Growing the team via mentorship and leadership, reviewing technical documents and code changes.
Qualifications
- 5+ years of software engineering experience
- Extensive hands-on software engineering experience, with a strong track record of delivering successful projects within the Data Infrastructure or Platform domain at similar or larger companies.
- Deep understanding of one of: ML Infrastructure systems, including Feature Stores, Training Infrastructure, Serving Infrastructure, and Model Monitoring OR Data Infrastructure systems, including Data Warehouses, Data Lakehouses, Apache Spark, Streaming Infrastructure, Workflow Orchestration.
- Strong cross-functional collaboration, communication, and project management skills, with proven ability to coordinate effectively.
- Proficiency in coding, testing, and system design, ensuring reliable and scalable solutions.
- Demonstrated leadership abilities, including experience mentoring and guiding junior engineers.
- [Nice to have] Experience with Databricks, Airflow, AWS EMR
Plaid is proud to be an equal opportunity employer and values diversity at our company. We do not discriminate based on race, color, national origin, ethnicity, religion or religious belief, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, military or veteran status, disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local laws. Plaid is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance with your application or interviews due to a disability, please let us know at [email protected].
Please review our Candidate Privacy Notice here.
Additional compensation in the form(s) of equity and/or commission is dependent on the position offered. Plaid provides a comprehensive benefit plan, including medical, dental, vision, and 401(k). Pay is based on factors such as (but not limited to) scope and responsibilities of the position, candidate's work experience and skillset, and location. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.
Compensation Range: $190.8K - $262.8K