About | Tan Wee Joe

I'm Wee Joe, an AI research engineer interested in mechanistic interpretability, reinforcement learning, and emerging paradigms in agentic intelligence, particularly world models, multi-agent coordination, and the challenge of keeping post-AGI systems human-aligned.

I am currently pursuing a BSc in Computer Science at University College London. Earlier, I completed a Diploma in Computer Engineering at Singapore Polytechnic (Apr 2020 to May 2023), where I graduated as valedictorian. I have also taken part in the Stanford ASES Entrepreneurship Bootcamp.

I believe these research directions are among the most consequential of our time. Mechanistic interpretability can make AI systems auditable and trustworthy, a prerequisite for deploying them in medicine, education, and governance. World models and multi-agent coordination unlock AI that genuinely reasons and collaborates, compressing decades of scientific progress into years. And getting alignment right is what determines whether the transition to post-AGI systems expands human agency or erodes it. The stakes make it the most meaningful problem I can work on.

Want to collaborate? Reach me on LinkedIn or via email.

Research

Avenir-UX: Automated UX Evaluation via Simulated Human Web Interaction with GUI Grounding

Wee Joe Tan, Zi Rui Lucas Lim, Shashank Durgad, Karim Obegi, Aiden Yiliu Li

2026

An agent system that evaluates website usability through simulated human interaction with GUI grounding, integrating System Usability Scale, Single Ease Questions, and Think Aloud protocols to generate comprehensive UX reports without traditional user studies.

AI Agents, HCI, UX Evaluation, Multimodal

Projects

OpenSRE

github.com

Open-source framework for AI SRE agents that automatically investigate production incidents by correlating logs, metrics, and traces, generate root-cause analysis reports, and execute remediations. Integrates with Grafana, Datadog, PagerDuty, and Slack.

Python, AI Agents, Observability, SRE

OpenFlo

github.com

Autonomous web agent framework that measures website usability at scale through simulated human interactions and GUI grounding, scoring ease, efficiency, clarity, and confidence to generate System Usability Scale reports. Built at UCL Nexus Labs.

Python, AI Agents, UX, Playwright

Sentrix

github.com

Agentic AI security platform with real-time intent classification, anomaly detection, and automated escalation for safe LLM workflows. UCL AI Engine NVIDIA Hackathon and Stanford ASES Bootcamp winner.

Python, LLM, Security, Observability

Onflow

github.com

AI/ML research and startup work on LLM embedding pipelines, synthetic behavioural personas built from interaction data, and retrieval and semantic similarity for large-scale simulation. Google hackathon winner; white paper; incubated in two London accelerators.

TypeScript, LLM, Embeddings, Research

Hackoholics BH25

github.com

DSTA BrainHack TIL-AI 2025: multi-domain AI across reinforcement learning, computer vision, ASR, and optical challenges.

Python, Hackathon, AI

treeminspls

github.com

Platform for builders to stress-test products using agentic personas and iterate on their sites from simulated user feedback.

TypeScript, Agents, Product

anubis

github.com

TypeScript project exploring tooling and automation in the development workflow.

Gendash

github.com

Dynamically generates interactive dashboards from any API by analysing data and building intelligent visualisations.

TypeScript, Data visualisation, APIs