Big Data Engineer
🚀 Hiring: Big Data Engineer (AWS | Spark | SQL | AI Tools)
📍 Location: Rockville, MD or Tysons Corner, VA (hybrid, 3 days per week in office)
We are looking for a highly skilled Big Data Engineer to design, build, and optimize large-scale data processing systems. This role involves working on modern data platforms supporting interactive analytics and reporting solutions in a fast-paced environment.
You will collaborate with cross-functional teams to develop scalable data pipelines, improve system performance, and leverage cutting-edge technologies including AI-assisted development tools.
🔧 Key Responsibilities:
- Design and develop scalable data pipelines using Spark, Hadoop, and cloud technologies
- Build and optimize ETL/data processing workflows handling large datasets (TB–PB scale)
- Implement efficient data ingestion, transformation, and storage solutions
- Optimize performance of distributed systems and troubleshoot production issues
- Collaborate with data scientists, analysts, and engineering teams
- Ensure data quality through testing, monitoring, and automation
- Leverage AI tools (Copilot, ChatGPT, etc.) to improve development workflows
✅ Must Have Skills:
- 5+ years of experience in Big Data Engineering
- Strong hands-on experience with Apache Spark (core + performance tuning)
- Expertise in SQL (window functions, joins, optimization)
- Strong programming skills in Python / Scala / Java
- Experience working with large-scale data (terabytes or more)
- Hands-on experience with AWS (S3, EMR, Glue, Lambda, Athena, etc.)
- Experience with Hadoop ecosystem (Hive, HDFS, etc.)
- Strong understanding of distributed systems & data processing architectures
- Experience with CI/CD pipelines
- Hands-on exposure to AI tools (GitHub Copilot, ChatGPT, etc.)
⭐ Nice to Have:
- Experience with Trino / Presto
- Knowledge of Spark internals & optimization techniques
- Experience with data pipeline monitoring & troubleshooting
- Exposure to Agile/Scrum environments
💡 What We’re Looking For:
- Strong problem-solving mindset with real-world debugging experience
- Ability to work in fast-paced, high-scale environments
- Someone who can own data pipelines end-to-end
📩 If you're interested or know someone who fits, feel free to reach out!