Skip to content
← All work

Case study · Syndio

Expertise on Demand

Expertise on Demand is Syndio's AI assistant for its pay-equity and pay-reporting products. This case study is about how to render LLM UI experiences, their patterns / architecture, and how to embed this technology into our compliance products.

Role
Frontend Engineer · UX
Timeframe
2025 — Present
Team
AI Engineering, Product, Frontend
Stack
React, TypeScript, Vertex AI, RAG
Expertise on Demand wordmark on a deep-purple band — the case study's hero illustration.

The problem

Syndio's special sauce has always been the expertise underneath the product. With Expertise on Demand we set out to create a RAG (Retrieval-Augmented Generation) LLM chatbot with this special sauce. Our key priority was:



The value proposition was trivial. Putting that expertise behind a chat input, available the moment a question arises, and rendering it intelligently was not. Furthermore, compounding the difficulty was that LLM technology and how to render it was still very much nascent at the start of this project. What followed was a half year of tight back-and-forth between Syndio's LLM engineering team and the frontend team.

RAG, in short

A search step in front of the LLM. The model only answers from documents pulled from a library you control, not from what it learned in training.

Retrieval-Augmented Generation flowA user question enters a retrieval layer that performs vector search over Syndio's expert-curated corpus and returns the most relevant chunks. The chunks plus the original question are sent to the LLM, which synthesizes a streamed answer with citations back to the source documents.RETRIEVAL-AUGMENTED GENERATIONSTEP 1QuestionUser asks a questionin plain language.STEP 2RetrieveVector search findsthe relevant chunks.STEP 3SynthesizeLLM answers usingquestion + chunks.STEP 4RenderStream answerand citations to UI.TOP-K CHUNKSSTREAMCORPUSSyndio's expert docsVectorized. Curated. Cited.

Expertise on Demand draws from Syndio's expert-curated docs: support center articles, regulatory references, methodology notes. Every answer is grounded in those docs, and every answer ships with citations back to them. The LLM never generates numbers. Anything quantitative is computed by PayEQ or GPR upstream and handed in as structured context.

The streaming contract

Markdown expects a finished document. The LLM streams a half-written one.

The contract had two sides:

Neither side could carry it alone. The contract works because both sides know what the other guarantees. We worked in tandem to fine-tune both sides of the contract to deliver a seamless user experience for our customers.

The Expertise on Demand chat surface in its initial shimmer state, before the first token arrives.
The Expertise on Demand chat surface mid-stream, rendering partial markdown as tokens arrive.
A closer look at the streaming chat surface, showing how partial constructs render without reflowing on completion.

Trust is a frontend problem

LLM limitations come from the technology; leveraging it in a user interface comes with user-trust-building requirements.

LLM citations are paramount

For RAG products, the citation UI is the trust UI. The model doesn't write source URLs into its answer. The LLM team returns citations as data on the side, with markers showing where each one applies. The frontend renders those as small numbered chips in the text, plus a deduplicated list of sources below. Hover highlights the source. Click opens it in a side panel; the user stays in the chat.

Citations rendered as superscript chips inline with the answer, with a deduplicated source box below.

LLM limitations aren't just fine print

LLMs hallucinate. In a compliance product, that's a liability, not a footnote. The model team can shrink the rate; they can't drive it to zero. So the surface had to carry the rest.

Persistent under the input, every session: "This is for informational purposes only and should not be considered legal advice. Always seek a lawyer for legal advice." Alongside it, links to the privacy policy and to a page explaining what Expertise on Demand actually does. No one-time modal. No tooltip. The user sees it every time they type.

The model can't promise it's right. The frontend can promise it's honest about that.

The Expertise on Demand chat surface at rest, with a persistent legal disclaimer and links to privacy and product info anchored below the input.

Outcome

30%
Adoption rate
Of customers in PayEQ and GPR who opened Expertise on Demand at least once.
100%
Audited answers
Every response is anchored to a source in our docs. No source, no answer.
30%
Engagement rate
Each engaged session freed CSM bandwidth for higher-touch work.

Reflection