Saransh Inc
Linkedin · Posted 22d ago
AI Tester - Remote (US) - Only W2
Continue to application
Add your email once, then Caio opens the original posting.
Indexed description
Role: AI TesterRemote (US)Job Type: W2 ContractDuration: 5-6 monthsAbout The Project
- We re developing a new customer-facing chatbot integrated into a modern Drupal-based website.
- The chatbot will use GenAI and RAG (Retrieval-Augmented Generation) to provide automated FAQ-style support, with future potential for agentic automation and integration into enterprise systems.
- The solution will be built on AWS, primarily in Python, using frameworks like LangGraph or AWS Strands Agents.
- We re seeking a hands-on QA Engineer to design and execute testing strategies for a GenAI-based, multi-agent, cloud-native chatbot platform.
- The focus will be on validating LLM-driven functionality, ensuring data and prompt security, and verifying integrations, performance, and reliability in AWS.
- Develop and execute test plans for chatbot logic, GenAI responses, and agent interactions.
- Validate RAG functionality: data retrieval accuracy, response relevance, and hallucination detection.
- Test API integrations between chatbot, AWS services, and enterprise systems.
- Implement automation for regression, performance, and cost-efficiency testing.
- Ensure LLM prompt security (e.g., guardrails, injection testing).
- Support analytics validation: user engagement, response accuracy, and model performance metrics.
Create a free Caio profile to unlock more results and save your role and location preferences.
Unlock free search
Want help applying to roles like this?
Search Caio for free. If the repetitive CV tweaking gets heavy, Daniel can help set up Caio Agent.
Ask about Agent