ChatGPT has wowed the world with the depth of its knowledge and the fluency of its responses, but one problem has hobbled its usefulness: It keeps hallucinating.
Yes, large language models (LLMs) hallucinate, a concept popularized by Google AI researchers in 2018. Hallucination in this context refers to mistakes in the generated text that are semantically or syntactically plausible but are in fact incorrect or nonsensical. In short, you can't trust what the machine is telling you.
That's why, while OpenAI's Codex or GitHub's Copilot can write code, an experienced programmer still needs to review the output, approving, correcting, or rejecting it before allowing it to slip into a code base where it might wreak havoc.
High school teachers are learning the same. A ChatGPT-written book report or historical essay may be a breeze to read but could easily contain erroneous "facts" that the student was too lazy to root out.
Hallucinations are a serious problem. Bill Gates has mused that ChatGPT or similar large language models could some day provide medical advice to people without access to doctors. But you can't trust advice from a machine prone to hallucinations.
OpenAI Is Working to Fix ChatGPT's Hallucinations
Ilya Sutskever, OpenAI's chief scientist and one of the creators of ChatGPT, says he's confident that the problem will disappear with time as large language models learn to anchor their responses in reality. OpenAI has pioneered a technique to shape its models' behaviors using something called reinforcement learning with human feedback (RLHF).
RLHF was developed by OpenAI and Google's DeepMind team in 2017 as a way to improve reinforcement learning when a task involves complex or poorly defined goals, making it difficult to design a suitable reward function. Having a human periodically check the system's output and give feedback allows a reinforcement-learning system to learn even when the reward function is hidden.
For ChatGPT, data collected during its interactions are used to train a neural network that acts as a "reward predictor," which reviews ChatGPT's outputs and predicts a numerical score representing how well those outputs align with the system's desired behavior, in this case factual, accurate responses.
Periodically, a human evaluator checks ChatGPT's responses and chooses those that best reflect the desired behavior. That feedback is used to adjust the reward-predictor network, which in turn is used to adjust the behavior of the AI model. The loop repeats, and the model's behavior improves with each pass. Sutskever believes this process will eventually teach ChatGPT to improve its overall performance.
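To make the shape of that loop concrete, here is a deliberately tiny, hypothetical sketch in Python (NumPy only). The three canned "responses," their two-number feature vectors, and the hard-coded human preferences are all invented for illustration; the sketch only mirrors the process described above: fit a reward predictor on human comparisons, then nudge the model toward outputs the predictor scores highly. It is not OpenAI's implementation.

```python
# Toy RLHF-style loop: learn a reward predictor from human preference pairs,
# then adjust a tiny "policy" toward outputs that the predictor scores highly.
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for model outputs: [accuracy, fluency] features.
features = np.array([
    [1.0, 0.1],   # response A: accurate but terse
    [0.9, 0.9],   # response B: accurate and fluent
    [0.1, 0.9],   # response C: fluent but hallucinated
])
policy_logits = np.zeros(3)    # the "model" behavior we want to improve
reward_weights = np.zeros(2)   # the learned reward predictor

def predicted_reward(x, w):
    return x @ w

# Step 1: human evaluators compare pairs of responses (preferred, rejected).
human_preferences = [(1, 2), (0, 2), (1, 0)]

# Step 2: fit the reward predictor so preferred responses score higher
# (gradient ascent on a Bradley-Terry-style log-likelihood).
for _ in range(500):
    for win, lose in human_preferences:
        diff = predicted_reward(features[win], reward_weights) - \
               predicted_reward(features[lose], reward_weights)
        p = 1.0 / (1.0 + np.exp(-diff))              # P(win preferred over lose)
        reward_weights += 0.1 * (1.0 - p) * (features[win] - features[lose])

# Step 3: adjust the policy toward high-reward outputs (bare-bones REINFORCE).
for _ in range(200):
    probs = np.exp(policy_logits) / np.exp(policy_logits).sum()
    choice = rng.choice(3, p=probs)
    r = predicted_reward(features[choice], reward_weights)
    grad = -probs
    grad[choice] += 1.0                              # d log pi(choice) / d logits
    policy_logits += 0.05 * r * grad

print("reward weights:", reward_weights)
print("policy probabilities:", np.exp(policy_logits) / np.exp(policy_logits).sum())
```

In the full-scale version of this idea, the "policy" is the language model itself and the reward predictor is another neural network trained on human rankings of sampled responses.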
"I'm quite hopeful that by simply improving this subsequent reinforcement learning from the human feedback step, we can teach it to not hallucinate," said Sutskever, suggesting that the ChatGPT limitations we see today will dwindle as the model improves.
Hallucinations May Be Inherent to Large Language Models
But Yann LeCun, a pioneer in deep learning and the self-supervised learning used in large language models, believes there is a more fundamental flaw that leads to hallucinations.
"Large language models have no idea of the underlying reality that language describes," he said, adding that most human knowledge is nonlinguistic. "Those systems generate text that sounds fine, grammatically, semantically, but they don't really have some sort of objective other than just satisfying statistical consistency with the prompt."
Humans operate on a lot of knowledge that is never written down, such as customs, beliefs, or practices within a community that are acquired through observation or experience. A skilled craftsperson, likewise, may have tacit knowledge of their craft that never makes it into text.
"Language is built on top of a massive amount of background knowledge that we all have in common, that we call common sense," LeCun said. He believes that computers need to learn by observation to acquire this kind of nonlinguistic knowledge.
"There is a limit to how smart they can be and how accurate they can be because they have no experience of the real world, which is really the underlying reality of language," said LeCun. "Most of what we learn has nothing to do with language."
"We learn how to throw a basketball so it goes through the hoop," said Geoff Hinton, another pioneer of deep learning. "We don't learn that using language at all. We learn it from trial and error."
But Sutskever believes that text already expresses the world. "Our pretrained models already know everything they need to know about the underlying reality," he said, adding that they also have deep knowledge about the processes that produce language.
While learning may be faster through direct visual observation, he argued, even abstract ideas can be learned through text, given the volume (billions of words) used to train LLMs like ChatGPT.
Neural networks represent words, sentences, and concepts through a machine-readable format called an embedding. An embedding maps high-dimensional vectorsâlong strings of numbers that capture their semantic meaningâto a lower-dimensional space, a shorter string of numbers that is easier to analyze or process.
By looking at those strings of numbers, researchers can see how the model relates one concept to another, Sutskever explained. The model, he said, knows that an abstract concept like purple is more similar to blue than to red, and it knows that orange is more similar to red than to purple. "It knows all those things just from text," he said. While the concept of color is much easier to learn from vision, it can still be learned from text alone, just more slowly.
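As an illustration of that geometry, the short Python sketch below compares made-up embedding vectors for four color words using cosine similarity. The three-dimensional vectors are invented for the example (real LLM embeddings have hundreds or thousands of dimensions), but the relationships they encode mirror the ones Sutskever describes.

```python
# Compare hypothetical word embeddings with cosine similarity.
import numpy as np

embeddings = {
    "red":    np.array([0.9, 0.1, 0.1]),
    "orange": np.array([0.8, 0.4, 0.1]),
    "blue":   np.array([0.1, 0.2, 0.9]),
    "purple": np.array([0.5, 0.1, 0.8]),
}

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 = pointing the same way)."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

# Purple sits closer to blue than to red; orange sits closer to red than to purple.
print("purple vs. blue:  ", cosine_similarity(embeddings["purple"], embeddings["blue"]))
print("purple vs. red:   ", cosine_similarity(embeddings["purple"], embeddings["red"]))
print("orange vs. red:   ", cosine_similarity(embeddings["orange"], embeddings["red"]))
print("orange vs. purple:", cosine_similarity(embeddings["orange"], embeddings["purple"]))
```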
Whether inaccurate outputs can be eliminated through reinforcement learning with human feedback remains to be seen. For now, the usefulness of large language models in generating precise outputs is limited.
Mathew Lodge, the CEO of Diffblue, a company that uses reinforcement learning to automatically generate unit tests for Java code, said that "reinforcement systems alone are a fraction of the cost to run and can be vastly more accurate than LLMs, to the point that some can work with minimal human review."
Codex and Copilot, both based on GPT-3, generate possible unit tests that an experienced programmer must review and run before determining which is useful. But Diffblue's product writes executable unit tests without human intervention.
"If your goal is to automate complex, error-prone tasks at scale with AI, such as writing 10,000 unit tests for a program that no single person understands, then accuracy matters a great deal," said Lodge. He agrees that LLMs can be great for freewheeling creative interaction, but he cautions that the last decade has taught us that large deep-learning models are highly unpredictable, and making the models larger and more complicated doesn't fix that. "LLMs are best used when the errors and hallucinations are not high impact," he said.
Nonetheless, Sutskever said that as generative models improve, "they will have a shocking degree of understanding of the world and many of its subtleties, as seen through the lens of text."
Craig S. Smith, a former reporter and executive for The New York Times, now works as a freelancer with a special interest in artificial intelligence. He is the founder of Eye on A.I., an artificial-intelligence-focused podcast and newsletter.