fix(core): Improve model sub-nodes error handling #11418

burivuhster · 2024-10-25T13:54:47Z

Summary

Problem

The current error handling in LLM sub-nodes has two main issues:

Model sub-nodes sometimes appear as "running" even after failing
n8n incorrectly indicate that errors originated in the root node, when they actually occurred in Model sub-nodes

Context

The error handling relies on custom exception classes (e.g., NodeOperationError). However, there's no straightforward mechanism to wrap model exceptions before they propagate to the root node.

Solution Approaches Explored

Using N8nLlmTracing (callback handler passed to the LangChain model's contructor). Using handleLLMError callback handler allows to add proper error info to the sub-node and fix the infinite running issue. But it can not be used for wrapping the exception for the root node.
onFailedAttempt callback can be provided during each model's LangChain class instantiation. It allows to intercept the retry logic built into LangChain, and re-throw the wrapped exceptions.

Implementation

To use the second approach, the new helper function makeN8nLlmFailedAttemptHandler was implemented. It catches LLM errors, wrap them with our custom error classes, provides some default error handling logic (interrupt running if it does not make sense to retry for the specific error), and also allows to customize error handling for the specific cases (for example to use a custom error message for the OpenAI's insufficient quota error).

Known issues

Couple of LangChain model classes do not use the AsyncCaller for their calls. It means, that the common retry logic built into LangChain does not work in such cases. The onFailedAttempt callback is not being called as well.
Thus, the described error wrapping logic does not work yet for the following models:

LMChatOllama
LmChatAwsBedrock
LmChatGoogleVertex

Example 1. Insufficient quota error (OpenAI)
Before:

After:

Example 2. Empty prompt (Anthropic)
Before:

After:

Example 3. Nested sub-node error
Before:

After:

Example 4. Nested sub-node error
Before:

After:

Related Linear tickets, Github issues, and Community forum posts

Review / Merge checklist

PR title and summary are descriptive. (conventions)
Docs updated or follow-up ticket created.
Tests included.
PR Labeled with release/backport (if the PR is an urgent fix that needs to be backported)

jeanpaul · 2024-11-01T08:50:55Z

One thing I noticed, that's a difference now is that I don't really get a sense for what's wrong with my credential, whereas on master I do:

Now it just says something is wrong with my credentials:

Why did that change? I think getting the things to check directly in the toast is a better experience.

jeanpaul · 2024-11-01T10:18:00Z

One thing I noticed, that's a difference now is that I don't really get a sense for what's wrong with my credential, whereas on master I do:
Now it just says something is wrong with my credentials: Why did that change? I think getting the things to check directly in the toast is a better experience.

I don't know why, but now I cannot reproduce this anymore.

jeanpaul · 2024-11-01T10:25:21Z

packages/editor-ui/src/composables/usePushConnection.ts

On line 326 is another occurrence of context.functionality -- can you check if that needs to change too?

And same for packages/editor-ui/src/utils/expressions.ts line 63.

Should be fine as this is for another case, when nodes use "pairedItem" feature (e.g. referencing previous node's item value when the pairedItem connection was interrupted)

but you don't set context.functionality anymore now? does that functionality break?

It's being set in n8nLlmFailedAttemptHandler.ts when we wrap the error into NodeApiError. For the paired item exceptions – nothing changes, I guess it's being set in WorkflowDataProxy.ts and should not be affected by this PR.

Ahh, I see your point now. I found that setting context.functionality to configuration-node on NodeApiError does nothing, as it checks for the functionality on the exception object itself. Now I see that the checks are different for the configuration-node functionality and pairedItem. Good catch, thank you!

packages/@n8n/nodes-langchain/nodes/llms/N8nLlmTracing.ts

…in-the-sub-node-even-if # Conflicts: # packages/@n8n/nodes-langchain/nodes/llms/N8nLlmTracing.ts

codecov · 2024-11-04T17:21:35Z

Codecov Report

Attention: Patch coverage is 39.68254% with 38 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
...ain/nodes/vendors/OpenAi/helpers/error-handling.ts	0.00%	9 Missing ⚠️
...s/@n8n/nodes-langchain/nodes/llms/N8nLlmTracing.ts	0.00%	7 Missing ⚠️
...ges/editor-ui/src/composables/usePushConnection.ts	20.00%	4 Missing ⚠️
...chain/nodes/llms/LMChatOpenAi/LmChatOpenAi.node.ts	0.00%	3 Missing ⚠️
...llms/LmChatGoogleVertex/LmChatGoogleVertex.node.ts	0.00%	2 Missing ⚠️
...nodes/llms/LMChatAnthropic/LmChatAnthropic.node.ts	0.00%	1 Missing ⚠️
...chain/nodes/llms/LMChatOllama/LmChatOllama.node.ts	0.00%	1 Missing ⚠️
...des-langchain/nodes/llms/LMCohere/LmCohere.node.ts	0.00%	1 Missing ⚠️
...des-langchain/nodes/llms/LMOllama/LmOllama.node.ts	0.00%	1 Missing ⚠️
...des-langchain/nodes/llms/LMOpenAi/LmOpenAi.node.ts	0.00%	1 Missing ⚠️
... and 8 more

📢 Thoughts on this report? Let us know!

jeanpaul

Looks good! thanks for addressing the issues with context.functionality, and for adding the extra tests! 👍

cypress · 2024-11-08T08:34:01Z

n8n Run #7788

Run Properties: Passed #7788 • 188f76131f: 🌳 🖥️ browsers:node18.12.0-chrome107 🤖 burivuhster 🗃️ e2e/*

Project	`n8n`
Branch Review	`ai-298-errors-happen-in-the-root-node-not-in-the-sub-node-even-if`
Run status	`Passed #7788`
Run duration	`04m 22s`
Commit	`188f76131f: 🌳 🖥️ browsers:node18.12.0-chrome107 🤖 burivuhster 🗃️ e2e/*`
Committer	`Eugene Molodkin`
View all properties for this run ↗︎

Test results
Failures	`0`
Flaky	`1`
Pending	`0`
Skipped	`0`
Passing	`461`
View all changes introduced in this branch ↗︎

github-actions · 2024-11-08T08:34:05Z

✅ All Cypress E2E specs passed

janober · 2024-11-13T17:11:25Z

Got released with [email protected]

burivuhster added 2 commits October 24, 2024 17:46

wip: Show model sub-node error state

79eed4b

wip: Improve LLM node's error handling

89b1dbd

burivuhster requested a review from OlegIvaniv October 25, 2024 13:54

wip: Fix infinite "running" indication on vector store tool sub-node

fff8149

n8n-assistant bot added core Enhancement outside /nodes-base and /editor-ui n8n team Authored by the n8n team ui Enhancement in /editor-ui or /design-system labels Oct 25, 2024

jeanpaul requested changes Nov 1, 2024

View reviewed changes

Merge branch 'master' into ai-298-errors-happen-in-the-root-node-not-…

be3259c

…in-the-sub-node-even-if # Conflicts: # packages/@n8n/nodes-langchain/nodes/llms/N8nLlmTracing.ts

wip: Harmonize functionality prop of exceptions

70feb25

burivuhster mentioned this pull request Nov 6, 2024

Add tests for LLM sub-node errors n8n-io/test-workflows#280

Merged

burivuhster added 2 commits November 6, 2024 17:58

wip: add tests for the default LLM failedAttemptHandler

1591e8e

wip: add tests for the LLM failedAttemptHandler wrapper

188f761

burivuhster marked this pull request as ready for review November 7, 2024 08:26

burivuhster requested a review from jeanpaul November 7, 2024 08:28

jeanpaul approved these changes Nov 8, 2024

View reviewed changes

burivuhster merged commit 57467d0 into master Nov 8, 2024
35 checks passed

burivuhster deleted the ai-298-errors-happen-in-the-root-node-not-in-the-sub-node-even-if branch November 8, 2024 09:17

OlegIvaniv pushed a commit that referenced this pull request Nov 12, 2024

fix(core): Improve model sub-nodes error handling (#11418)

8b9478f

github-actions bot mentioned this pull request Nov 13, 2024

🚀 Release 1.68.0 #11725

Merged

janober added the Released label Nov 13, 2024

github-actions bot mentioned this pull request Nov 20, 2024

🚀 Release 1.69.0 #11812

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(core): Improve model sub-nodes error handling #11418

fix(core): Improve model sub-nodes error handling #11418

burivuhster commented Oct 25, 2024 •

edited

Loading

jeanpaul commented Nov 1, 2024 •

edited

Loading

jeanpaul commented Nov 1, 2024

jeanpaul Nov 1, 2024

jeanpaul Nov 1, 2024

burivuhster Nov 4, 2024

jeanpaul Nov 5, 2024

burivuhster Nov 5, 2024

burivuhster Nov 5, 2024

codecov bot commented Nov 4, 2024 •

edited

Loading

jeanpaul left a comment

cypress bot commented Nov 8, 2024

github-actions bot commented Nov 8, 2024

janober commented Nov 13, 2024

fix(core): Improve model sub-nodes error handling #11418

fix(core): Improve model sub-nodes error handling #11418

Conversation

burivuhster commented Oct 25, 2024 • edited Loading

Summary

Problem

Context

Solution Approaches Explored

Implementation

Known issues

Related Linear tickets, Github issues, and Community forum posts

Review / Merge checklist

jeanpaul commented Nov 1, 2024 • edited Loading

jeanpaul commented Nov 1, 2024

jeanpaul Nov 1, 2024

Choose a reason for hiding this comment

jeanpaul Nov 1, 2024

Choose a reason for hiding this comment

burivuhster Nov 4, 2024

Choose a reason for hiding this comment

jeanpaul Nov 5, 2024

Choose a reason for hiding this comment

burivuhster Nov 5, 2024

Choose a reason for hiding this comment

burivuhster Nov 5, 2024

Choose a reason for hiding this comment

codecov bot commented Nov 4, 2024 • edited Loading

Codecov Report

jeanpaul left a comment

Choose a reason for hiding this comment

cypress bot commented Nov 8, 2024

n8n Run #7788

github-actions bot commented Nov 8, 2024

janober commented Nov 13, 2024

burivuhster commented Oct 25, 2024 •

edited

Loading

jeanpaul commented Nov 1, 2024 •

edited

Loading

codecov bot commented Nov 4, 2024 •

edited

Loading