Does One Large Language Model Fit All?

Yesterday, we hosted a webinar designed to dig further into the ways that public and private large language models (LLM) differ from one another – and complement one another. Moderated by Cresta CMO Scott Kolman, the webinar welcomed John Donovan, technologist and former CEO of AT&T Communications and Tim Shi, Cresta’s co-founder and CTO, to share their expertise and unique perspectives on generative AI and the enterprise. 

Tim kicked off with background on LLMs and the impact they have had on applications of AI. As he said, the core of what makes LLMs remarkable is the scale involved; with proper programming, you can scale a model to billions – or even trillions – of parameters. With this, the model starts to have emergent capabilities, meaning that it develops better language understanding and better reasoning. 

So how to consider whether to deploy a public LLM – like ChatGPT, Bard, or some others you may be familiar with – or a private LLM? Here, we got into the pros and cons of deploying a public LLM:

Public LLM – Pros

  • Quick time to value – a very low barrier to entry 
  • Ability to enable self-serve through prompt design
  • Vast public knowledge 

Public LLM – Cons

  • Very large models can be costly and slow – as the model continues to grow, downstream costs in time and resources may be involved
  • Can’t continuously learn from feedback
  • Accuracy ceiling on critical tasks – public LLMs are trained on public data, not truth 

Next, our panelists turned their attention to Domain Models and Fine-tune Models. Domain Models find themselves in the middle of these generic, publicly available LLM and the Fine-tune Models; they are trained on domain-specific data, and thus will show value immediately. An example of this would be Cresta’s Ocean-1 model, the world’s first contact center foundation model, designed specifically to draw on contact center expertise. 

What about Fine-tune Models and their pros and cons?

Fine-tune LLM – Pros

  • Pick the right size for the right task
  • Continuously learn from feedback data, making the model ever more bespoke and tuned to your particular business needs 
  • Reach high accuracy for critical tasks

Fine-tune LLM – Cons

  • Require effort to train
  • Difficult to self-serve
  • Per use case deployment 

Figure 1

During the webinar, Tim also addressed security and privacy considerations when it comes to these different types of LLMs and how exactly those in the contact center space should be thinking about deploying generative AI applications in the most effective ways. 

Next week, we’ll dig into some of the C-level insights that John shared with us – including his take on how boards and the C-suite are evaluating generative AI solutions, three critical drivers that companies and vendors should be addressing with their deployments, and much more. 

In the meantime, download the on-demand webinar replay now and share with your team! 

And to understand what Cresta’s custom-built generative AI products can do for your contact center, get in touch and request a demo now

John Donovan on the Potential of Large Language Models in the Enterprise

READ MORE

The Emerging Stack of Generative AI

READ MORE

Introducing Ocean-1: First Contact Center Foundation Model

READ MORE