Having joined my first Salesforce project just over six months ago, we’re now ready to consider automating the UI. We previously had one automated UI flow for our old system, which was completely different, and this was created as more of a proof of concept (PoC) for automated system integration tests across different products. The main thing I learned while maintaining this old automation is that Salesforce randomly changes things, such that UI automation breaks for no good reason. I’m talking different locators, altered HTML structure, and removing auto-load behaviours. It’s pretty annoying, to say the least.
So this time round, we’re taking all the learnings up until now and creating new automation infrastructure from scratch. I want to share a little bit about which approach we chose, and why. I struggled to find resources myself, so I hope this will be helpful to others. I’m also currently going through the Foundation Certificate in Test Automation from Ministry of Testing, which has a section on tool selection. The timing is a happy coincidence, and I thought it would be nice to share a real example of tool evaluation and selection.
Additional Context
As additional context for this evaluation:
- I’m the first and only Quality Engineer on the project
- Although I’ve written some automation in the past, it’s a skill I want to develop further
- Like many teams, we have many requirements and limited resources; it’s very important that the benefits we get are worth the investment we make
- We agreed that automating some key flows could be useful, but we do not plan to have hundreds of UI automation scripts
- Our existing system integration tests use Playwright and TypeScript
- We have a company Tech Radar, and would like to use tools already on it, where possible
Criteria
Naturally, there are some criteria which our chosen solution should meet:
- Easy to set up
- As little maintenance effort as possible
- Simple to learn
- Good resources / documentation / community
- Likely to last / unlikely to be discontinued
- Free to use
- “Approved” for company use
Potential Approaches
There are three main approaches that I considered for our Salesforce UI automation:
- The classic page object model (POM) pattern
- A popular approach, which makes logical sense to my brain and reduces repetition of locator definitions
- Salesforce’s own UI test automation model (UTAM) framework
- An option which specifically claims to reduce the pain of maintaining locators
- TestZeus’ open source testing agent, Hercules
- A technology with self-healing AI features, which theoretically means total removal of locator maintenance
Comparison
|
POM |
UTAM |
Hercules |
---|---|---|---|
Easy to set up |
|
|
|
As little maintenance effort as possible |
|
|
|
Simple for the team to learn |
|
|
|
Good resources / documentation / community |
|
|
|
Likely to last / unlikely to be discontinued |
|
|
|
Free to use |
|
|
|
“Approved” for company use |
|
|
|
Pros and Cons
POM + Playwright + TypeScript
Pros |
Cons |
---|---|
Adopted company technologies |
Flaky locators |
Widely used pattern |
No good locator strategy |
Built-in driver |
|
Separation of concerns |
|
Full control over locators |
|
UTAM + JSON + WebDriver
Pros |
Cons |
---|---|
Built to handle shadow DOMs |
New to company |
JSON for standard components updated automatically |
No built-in driver |
|
UTAM files need to be manually regenerated when JSON changes |
|
JSON for custom components still requires manual maintenance |
|
More logic in automation scripts |
|
Less flexibility for locators |
Hercules + Playwright + Python + Gherkin
Pros |
Cons |
---|---|
Al agent |
New to company |
No code required for writing automation |
Free tier doesn’t include specialised AI agents |
Self-healing locators |
AI agent may incur additional costs |
Open-source version still supports Salesforce |
Little opportunity for developing programming skills |
Recommendation
Based on my research and evaluation, my recommendation for our project is to use POM.
Although POM does not provide us with Salesforce-specific capabilities, it does allow us full control over locators, so we have flexibility in our problem solving.
While UTAM is specifically built for Salesforce, locators still need to be maintained, and I don’t have confidence in it’s longevity. As soon as it stops being maintained, we’ll lose the benefits and cannot reuse anything.
Hercules has the most appeal, from a maintenance standpoint, but uncertainties around the effectiveness and cost of the AI agent, coupled with the other cons, make it less attractive for our context.
I hope you found this helpful. If you’re in a similar situation, I’d love to hear from you. What approach(es) have you tried, and what challenges have you faced? Let me know in the comments.
Disclosure: As part of my research, I spoke with Robin Gupta, the founder of TestZeus, to get his opinion on UTAM. However, he was very careful not to make the conversation about TestZeus, and we only spoke about it when I specifically asked him for information on it.
Discover more from Cassandra HL
Subscribe to get the latest posts sent to your email.