Visit our Content Hub!
Access free downloadable content curated by our editors.

How to incorporate AI Small Language Models (SLMs) into packaging & processing equipment

This step-by-step approach breaks down what it really means to add generative AI into your packaging and processing equipment using Small Language Models (SLMs).

Robot+ape Construction Site

For OEM's recent series of articles on incorporating AI into packaging and processing equipment, I've covered the why, and the what, as well as some off-the-shelf tools. (Briefly, the "why" centers around adding ChatGPT-style functionality to your equipment to make it much easier for operators and technicians to keep machines running efficiently on at your CPG customers.)

But so far we haven't touched on the "how" for those who want to roll up their sleeves and try this. Warning, this is not for the faint of heart, and as I point at the end, waiting for off-the-shelf tools and functionality could be easier, albeit with less ability by your engineers to shape the outcome.

1. Define the use case and objectives

Reminder, the application of generative AI that we're discussing is using a Small Language Model to create what is technically referred to as a Retrieval-Augmented Generation (RAG) chatbot, that is to say, fed specifically by all your manuals and documentation vs relying on training from the open web which is what many language models were originally trained on. So the use case is to identify what specific tasks the model needs to perform.

Taking an iterative or crawl/walk/run approach, your crawl phase would mainly consist of troubleshooting and operational guidance. A walk or run phase might consist of a predictive maintenance application. As part of this you'd want to establish metrics for success, which is mainly accuracy in responding to queries, but also speed of response. You may want to rule in or rule out specific scenarios that your chatbot would or would not be able to handle. For example, you may decide to limit the chatbot to just documentation you supply, or you may choose to give customers a chance to include their own standard operating procedures (SOPs).

2. Gather relevant training data

This includes all of the manuals and documentation for a specific machine, SOPs, and so forth. Models are rapidly expanding beyond text and able to identify the contents of a photograph, technical drawings or even video, so you could decide whether to experiment with including such content. A walk or run phase could consider incorporating aggregated operational data collected by other users of the machine across multiple customers. The idea of a customer learning from all other customers (in the aggregate) for a given machine is quite compelling and might overcome privacy and competitive concerns customer may have.