Data Software Engineer
Indexed description
As a Software Data Engineer, you will collaborate closely with staff and senior engineers to deliver robust data infrastructure in a regulated, compliance-driven healthcare environment. You’ll gain deep exposure to modern tools and cloud technologies while contributing meaningfully to transforming healthcare delivery through data and AI.
Responsibilities
Data Engineering & Pipeline Development
- Implement and maintain reliable batch and real-time data pipelines using modern frameworks such as Apache Spark, Databricks, and Snowflake.
- Support ingestion, transformation, and delivery of structured and semi-structured data across cloud-native platforms.
- Contribute to performance tuning and optimization efforts to ensure scalable and cost-effective workflows.
- Utilize orchestration tools like Airflow, Temporal and dbt to enable automation, scheduling, monitoring, and version control of pipeline operations.
- Enforce data quality through the implementation of validation checks, data lineage tracking, and schema enforcement.
- Assist in meeting healthcare compliance standards (e.g., HIPAA, GDPR) by implementing best practices for data handling, encryption, and access controls.
- Work with metadata and data cataloging systems to ensure accuracy, transparency, and discoverability of enterprise datasets.
- Participate in agile development practices, including daily stand-ups, sprint planning, retrospectives, and backlog grooming.
- Support code reviews and debugging efforts to uphold code reliability, reusability, and maintainability.
- Collaborate with cross-functional teams, including Data Science, Product, Quality, and Security, to deliver coordination and alignment on technical implementations.
- Stay current on evolving technologies in cloud data engineering, including tools such as Delta Lake, Iceberg, and healthcare interoperability standards.
- Actively engage in knowledge sharing and continuous learning through team discussions, peer programming, and internal demos.
- Show initiative in exploring process improvements, automation opportunities, and platform enhancements.
- Bachelor’s degree in Computer Science, Engineering, or related field.
- 2–5 years of hands-on experience in data engineering, platform development, or software engineering focused on data systems.
- Software development experience at a healthcare company (provider space preferred), with exposure to EHR systems or healthcare data domains is strongly preferred.
- Proficiency in Python and SQL for building and maintaining data workflows.
- Experience working with distributed data frameworks like Apache Spark or Databricks.
- Familiarity with data orchestration tools such as Airflow, Temporal and data modeling tools like dbt.
- Experience with data platforms such as Azure data factory, databricks, etc.
- Exposure to cloud providers - Azure preferred.
- Exposure to healthcare data types (e.g., FHIR, HL7, CCDA) and terminologies (e.g., ICD, SNOMED) is a strong plus.
- Strong analytical and problem-solving skills with attention to performance and reliability.
- Effective communicator and team player who thrives in collaborative, cross-functional settings.
- Eagerness to learn new tools and technologies and contribute to platform evolution in a fast-paced environment.
GW RhythmX is proud to be an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age or veteran status.
Create a free Caio profile to unlock more results and save your role and location preferences.
Unlock free search