*GOV* Gen AI Tester

SCIENTEC CONSULTING PTE. LTD.

Contract, Full Time Islandwide $7000 - $12000

Posted: January 21, 2026

Job Description

AI Quality Engineer (LLM/NLP)

  • Working Hours: Mon-Fri
  • Location: Central
  • Remuneration: Up to $12,000 + AWS (Annual Wage Supplement)

Job Summary

We are seeking an AI Quality Engineer to evaluate and ensure the accuracy, reliability, and performance of Large Language Models (LLMs) used in GenAI applications such as chatbots, classification tools, and RAG systems. The role focuses on identifying hallucinations, validating model behaviour, and supporting improvements through structured testing and collaboration.

Key Responsibilities

  • Design and execute test cases to assess LLM accuracy, relevance, and contextual correctness.
  • Detect and analyse hallucinations or fabricated outputs, and document them clearly.
  • Develop automated test scripts (Python, PyTest or similar) to streamline LLM regression testing.
  • Conduct functional and non-functional testing, including performance and stress tests for LLM-based systems.
  • Evaluate model output quality using NLP metrics and business-specific correctness rules.
  • Collaborate with AI engineers, data scientists, and product teams to improve model behaviour based on test findings.
  • Perform regression testing after fine-tuning, retraining, or system updates to ensure no degradation in accuracy.
  • Maintain structured documentation: test plans, test cases, test logs, and issue reports.
  • Use issue tracking tools (e.g., Jira) to report and track LLM-related bugs and inconsistencies.
  • Apply knowledge of LLMs, NLP concepts, and cloud-based AI environments (AWS/GCP/Azure preferred) to support comprehensive QA coverage.

Requirements

  • Experience testing LLMs (e.g., GPT, BERT) for chatbots and conversational AI.
  • Proficiency in test automation (PyTest, custom AI frameworks) to detect inaccuracies and hallucinations.
  • Familiarity with accuracy evaluation methods for high-stakes NLP applications.
  • Understanding of AI/NLP testing methodologies, including hallucination and relevance testing.
  • Strong Python skills for writing test scripts and analysing model outputs.
  • Ability to document and track issues using tools like Jira.
  • Strong problem-solving skills to propose improvements and reduce hallucinations.

By submitting your resume, you consent to the collection, use, and disclosure of your personal information per ScienTec’s Privacy Policy (scientecconsulting.com/privacy-policy).

This authorizes us to:

  • Contact you about potential opportunities.
  • Delete personal data as it is not required at this application stage.

All applications will be processed with strict confidence. Only shortlisted candidates will be contacted.

Aloysius Tan Sheng Rong - R22110441
ScienTec Consulting Pte Ltd - 11C5781
