Persistent Systems Linkedin · Posted 1mo ago

Google Cloud Data Architect – IAM Data Modernization

Dallas, Texas, United States

Continue to application Add your email once, then Caio opens the original posting.

Indexed description

About Persistent

We are a trusted Digital Engineering and Enterprise Modernization partner, combining deep technical expertise and industry experience to help our clients anticipate what’s next. Our offerings and proven solutions create unique competitive advantage for our clients by giving them the power to see beyond and rise above.

We are experiencing tremendous growth, with $566 million in revenue in FY21, representing 12.9% year-over-year growth. Along with that growth, we onboarded over 3,000 new employees in the past year, bringing our total employee count to over 15,000 people located in 18 countries across the globe.

At Persistent, our values are more than a list of ideals to improve our corporate image. We’re dedicated to building an inclusive culture that reflects what’s important to our employees and is based on what they value. As a result, 95% of our employees approve of the CEO and 83% recommend working at Persistent to a friend.

About Position:

Identity & Access Management (IAM) Data Modernization – migration of an on‑premises SQL data warehouse to a target‑state Data Lake on Google Cloud (GCP), enabling metrics & reporting, advanced analytics, and GenAI use cases (natural language querying, accelerated summarization, cross‑domain trend analysis) leveraging PySpark‑based processing, cloud‑native DevOps CI/CD pipelines, and containerized deployments on OpenShift (OCP) to deliver scalable, secure, and high‑performance data solutions.

About Position:

Role: Google Cloud Data Architect – IAM Data Modernization

Location: Dallas, TX / Charlotte, NC/ Iselin, NJ, / Chandler, AZ / Ohio, Delaware (Hybrid)

What You'll Do:

DevOps / CI‑CD

Experience implementing CI/CD pipelines for data and analytics workloads
Familiarity with Git‑based source control, build automation, and deployment strategies

Containers & Platform

Experience with OpenShift Container Platform (OCP) for deploying data workloads and services
Understanding of containerized architecture, scaling, and environment management
Proven ability to build CI/CD pipelines for data and infrastructure workloads
Experience managing secrets securely using GCP Secret Manager
Ownership of observability, SLOs, dashboards, alerts, and runbooks
Proficiency in logging, monitoring, and alerting for data pipelines and platform reliability

Big Data & Processing

Hands‑on experience with PySpark for ETL/ELT, data transformation, and performance optimization
Solid understanding of distributed data processing concepts

Data & Cloud Architecture

Strong experience designing data platforms on Google Cloud Platform (GCP)
Experience with Data Lakes, data warehousing, and large‑scale migration programs

Data Lake Architecture & Storage

Proven experience designing and implementing data lake architectures (e.g., Bronze/Silver/Gold or layered models).
Strong knowledge of Cloud Storage (GCS) design, including bucket layout, naming conventions, lifecycle policies, and access controls

· Experience with Hadoop/HDFS architecture, distributed file systems, and data locality principles

Hands-on experience with columnar data formats (Parquet, Avro, ORC) and compression techniques
Expertise in partitioning strategies, backfills, and large-scale data organization
Ability to design data models optimized for analytics and BI consumption

Data Ingestion & Orchestration

· Experience building batch and streaming ingestion pipelines using GCP-native services

· Knowledge of Pub/Sub-based streaming architectures, event schema design, and versioning

· Strong understanding of incremental ingestion and CDC patterns, including idempotency and deduplication

· Hands-on experience with workflow orchestration tools (Cloud Composer / Airflow)

· Ability to design robust error handling, replay, and backfill mechanisms

Data Processing & Transformation

· Experience developing scalable batch and streaming pipelines using Dataflow (Apache Beam) and/or Spark (Dataproc)

· Strong proficiency in BigQuery SQL, including query optimization, partitioning, clustering, and cost control.

· Hands-on experience with Hadoop MapReduce and ecosystem tools (Hive, Pig, Sqoop)

· Advanced Python programming skills for data engineering, including testing and maintainable code design

· Experience managing schema evolution while minimizing downstream impact

Analytics & Data Serving

· Expertise in BigQuery performance optimization and data serving patterns

· Experience building semantic layers and governed metrics for consistent analytics

· Familiarity with BI integration, access controls, and dashboard standards

· Understanding of data exposure patterns via views, APIs, or curated datasets

Data Governance, Quality & Metadata

· Experience implementing data catalogs, metadata management, and ownership models

· Understanding of data lineage for auditability and troubleshooting

· Strong focus on data quality frameworks, including validation, freshness checks, and alerting

· Experience defining and enforcing data contracts, schemas, and SLAs

Good to have

Security, Privacy & Compliance

· Hands-on experience implementing fine-grained access controls for BigQuery and GCS

· Experience with Sprint planning and helping team technically.

· Strong stakeholder communication and solution‑architecture skills

Expertise You'll Bring:

Experience: [10–14]+ years in DevOps and Data Architecture, 5+ years designing on Pyspark/GCP/OCP at scale; prior on‑prem → cloud migration a must.
Education: Bachelor’s/Master’s in Computer Science, Information Systems, or equivalent experience.
Certifications:Google Cloud Professional Cloud Architect/DevOps/OCP (required or within 3 months). Plus: Professional Data Engineer, Security Engineer

Benefits:

Competitive salary and benefits package
Culture focused on talent development with quarterly promotion cycles and company-sponsored higher education and certifications
Opportunity to work with cutting-edge technologies

How to apply

Send your resume and cover letter to [email protected]

Please format the subject line as follows: VP of Marketing – Last Name, First Name.

Employee engagement initiatives such as project parties, flexible work hours, and ‘Long Service’ awards
Annual health check-ups as well as insurance:
Group term life insurance
Personal accident insurance
Mediclaim hospitalization insurance for self, spouse, two children, and parents

Why Persistent is an employer of choice

Technology Innovation: culture of innovation using cutting-edge technology to bring value to clients.
Growth and Career Progression: learning opportunities for growth, including quarterly promotion cycles.
One Persistent Culture: global outlook with diversity and inclusion at its core.

Mental and Physical Wellness: employee health and mindfulness programs

Free. 20 seconds. No password. See every match in this search.

Create a free Caio profile to unlock more results and save your role and location preferences.

Unlock free search

Want help applying to roles like this? Search Caio for free. If CV tailoring and application tracking get heavy, Full Caio Agent adds a human specialist.

View Full Agent

Persistent Systems Company profile preview

Source: Linkedin
Location: Dallas, Texas, United States
Compensation: Not listed
Open on Caio: 64 roles

Salary insight

Compensation not indexed

Caio highlights salary ranges whenever the original posting exposes them. Compare similar roles as the index fills in.

Similar role details

Full-time roles Location flexible matches Linkedin postings

Company stats

Current index details for Persistent Systems, based on roles Caio has indexed from public sources.

64open roles 1sources 6markets Posted 3d agolatest role