"Possible 3 Month CTH | No Fees | Do Not Re-Post| Confidential
TMR ID: YWTWG2
Role: V&V Engineer AI-Driven Testing & Validation
Work location: Plano, TX
Background and Meet and Greet: MANDATORY
Job Description:
"Key Responsibilities
AI/ML & LLM Development/Validation
Lead end-to-end quality engineering for enterprise AI applications, including LLM-powered products, RAG pipelines, and agentic workflows.
Design and execute prompt validation strategies, evaluating LLM responses for accuracy, semantic relevance, hallucination risk, and safety compliance.
Build automated evaluation pipelines for AI model outputs using metrics such as BLEU, ROUGE, embedding-based similarity, precision, recall, and F1-score.
Validate agentic systems (tool use, multi-step reasoning, planner-executor workflows) for correctness, determinism, and failure mode handling.
Test Automation & Frameworks
Architect and maintain Python...