Skip to main content

Salesforce UI Automation: POM vs UTAM vs TestZeus Hercules

Reading Time: 7 minutes

 

 

Having joined my first Salesforce project just over six months ago, we’re now ready to consider automating the UI.  We previously had one automated UI flow for our old system, which was completely different, and this was created as more of a proof of concept (PoC) for automated system integration tests across different products.  The main thing I learned while maintaining this old automation is that Salesforce randomly changes things, such that UI automation breaks for no good reason.  I’m talking different locators, altered HTML structure, and removing auto-load behaviours.  It’s pretty annoying, to say the least.

 

So this time round, we’re taking all the learnings up until now and creating new automation infrastructure from scratch.  I want to share a little bit about which approach we chose, and why.  I struggled to find resources myself, so I hope this will be helpful to others.  I’m also currently going through the Foundation Certificate in Test Automation from Ministry of Testing, which has a section on tool selection.  The timing is a happy coincidence, and I thought it would be nice to share a real example of tool evaluation and selection.

 

 

Additional Context

 

As additional context for this evaluation:

  • I’m the first and only Quality Engineer on the project
  • Although I’ve written some automation in the past, it’s a skill I want to develop further
  • Like many teams, we have many requirements and limited resources; it’s very important that the benefits we get are worth the investment we make
  • We agreed that automating some key flows could be useful, but we do not plan to have hundreds of UI automation scripts
  • Our existing system integration tests use Playwright and TypeScript
  • We have a company Tech Radar, and would like to use tools already on it, where possible

 

 

Criteria

 

Naturally, there are some criteria which our chosen solution should meet:

  • Easy to set up
  • As little maintenance effort as possible
  • Simple to learn
  • Good resources / documentation / community
  • Likely to last / unlikely to be discontinued
  • Free to use
  • “Approved” for company use

 

 

Potential Approaches

 

There are three main approaches that I considered for our Salesforce UI automation:

  1. The classic page object model (POM) pattern
    1. A popular approach, which makes logical sense to my brain and reduces repetition of locator definitions
  2. Salesforce’s own UI test automation model (UTAM) framework
    1. An option which specifically claims to reduce the pain of maintaining locators
  3. TestZeus’ open source testing agent, Hercules
    1. A technology with self-healing AI features, which theoretically means total removal of locator maintenance

 

 

Comparison

 

POM

UTAM

Hercules

Easy to set up

Check Mark No additional tools or frameworks required – our existing tech stack (Playwright + TypeScript) can be used

Check Mark More control over locators

Cross Mark Initial set-up of page objects can be time consuming

Check Mark Built specifically for Salesforce and handling shadow DOMs

Check Mark Standard components are included

Cross Mark Requires third-party web-driver

Cross Mark UTAM files are written in JSON, then compiled to page objects

Cross Mark Custom components need to be manually set up

Cross Mark Less control over locators

Check Mark Supports automation of Salesforce

Check Mark Video guides for set up

Cross Mark Requires knowledge of Python and Gherkin

Cross Mark Free version requires own AI agent, which may not be as effective

As little maintenance effort as possible

Check Mark Locators only need to be updated in one place

Cross Mark Locators need to be updated manually

Cross Mark Generally, no good strategy for Salesforce locators exists

Check Mark Locators for standard components are updated with each new UTAM version

Cross Mark UTAM is sometimes updated after Salesforce, so automation still breaks

Cross Mark Custom components still need to be manually maintained

Check Mark Self-healing locators require no maintenance

Simple for the team to learn

Check Mark Simple, logical structure

Cross Mark Requires knowledge of Playwright and TypeScript

Check Mark Requires knowledge of JSON, not TypeScript

Cross Mark Requires knowledge of UTAM grammar

Check Mark No code required for writing automation

Cross Mark Requires knowledge of Gherkin

Good resources / documentation / community

Check Mark Widely used and documented

Cross Mark Tutorials aren’t based on Salesforce apps

Cross Mark Not many resources or much engagement

Check Mark Video guides

Cross Mark Written documentation is mostly for initial set up

Cross Mark Still fairly new

Likely to last / unlikely to be discontinued

Check Mark Established pattern without third-party dependencies

Cross Mark Hasn’t been updated in a while

Cross Mark Not a focus for Salesforce

Check Mark Creators are passionate about having an open-source option

Cross Mark Still fairly new

Free to use

Check Mark Free

Check Mark Free

Check Mark Hercules is free

Cross Mark AI agent may incur additional costs

“Approved” for company use

Check Mark Technologies on Tech Radar

Cross Mark Not yet on Tech Radar

Cross Mark Not yet on Tech Radar

 

Pros and Cons

 

POM + Playwright + TypeScript

 

Pros

Cons

Adopted company technologies

Flaky locators

Widely used pattern

No good locator strategy

Built-in driver

Separation of concerns

Full control over locators

 

UTAM + JSON + WebDriver

 

Pros

Cons

Built to handle shadow DOMs

New to company

JSON for standard components updated automatically

No built-in driver

UTAM files need to be manually regenerated when JSON changes

JSON for custom components still requires manual maintenance

More logic in automation scripts

Less flexibility for locators

 

Hercules + Playwright + Python + Gherkin

 

Pros

Cons

Al agent

New to company

No code required for writing automation

Free tier doesn’t include specialised AI agents

Self-healing locators

AI agent may incur additional costs

Open-source version still supports Salesforce

Little opportunity for developing programming skills

 

 

Recommendation

Based on my research and evaluation, my recommendation for our project is to use POM.

 

Although POM does not provide us with Salesforce-specific capabilities, it does allow us full control over locators, so we have flexibility in our problem solving.

 

While UTAM is specifically built for Salesforce, locators still need to be maintained, and I don’t have confidence in it’s longevity. As soon as it stops being maintained, we’ll lose the benefits and cannot reuse anything.

 

Hercules has the most appeal, from a maintenance standpoint, but uncertainties around the effectiveness and cost of the AI agent, coupled with the other cons, make it less attractive for our context.

 

 

I hope you found this helpful.  If you’re in a similar situation, I’d love to hear from you.  What approach(es) have you tried, and what challenges have you faced?  Let me know in the comments.

 


 

Disclosure: As part of my research, I spoke with Robin Gupta, the founder of TestZeus, to get his opinion on UTAM.  However, he was very careful not to make the conversation about TestZeus, and we only spoke about it when I specifically asked him for information on it.


Discover more from Cassandra HL

Subscribe to get the latest posts sent to your email.

Share Your Thoughts