tag:google.com,2016:duai-release-notes Document AI - Release notes Google Cloud Platform 2025-11-12T00:00:00-08:00 November 12, 2025 tag:google.com,2016:duai-release-notes#November_12_2025 2025-11-12T00:00:00-08:00 <![CDATA[v1beta3

Feature

Automated schema extraction for custom extractor processors is in Preview.

This feature allows you to automatically extract a document schema from a test document you supply. Then, you can approve or decline the schema and edit it manually. This saves time and effort when defining the document schema for your custom processor and allows you to focus on refining the schema.

When creating a custom extractor processor, find the Generate from document option in the Get started tab of the Google Cloud console.

]]>
November 07, 2025 tag:google.com,2016:duai-release-notes#November_07_2025 2025-11-07T00:00:00-08:00 <![CDATA[v1beta3 & v1

Feature

Gemini layout parser is in Preview. The Gemini layout parser gives better layout quality on table recognition, reading order and text recognition on PDF files. You can enable the feature by default by selecting layout parser processor version pretrained-layout-parser-v1.4-2024-08-25, pretrained-layout-parser-v1.5-2025-08-25 or pretrained-layout-parser-v1.5-pro-2025-08-25 for your processor.

]]>
November 04, 2025 tag:google.com,2016:duai-release-notes#November_04_2025 2025-11-04T00:00:00-08:00 <![CDATA[v1beta3 & v1

Feature

Layout parser support for DOCX, PPTX, XLSX, and XLSM file types in Document AI is in General Availability (GA). It makes content like paragraphs, tables, lists, and structural elements like headings, page headers, and footers easily accessible. It also creates context-aware chunks that facilitate information retrieval in a range of generative AI and discovery applications.

For more information, see Process documents with Layout Parser.

]]>
October 31, 2025 tag:google.com,2016:duai-release-notes#October_31_2025 2025-10-31T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Custom splitter model pretrained-splitter-v1.5-2025-07-14 with zero-shot splitting, classification and confidence scores is available as Release Candidate (Preview).

]]>
October 17, 2025 tag:google.com,2016:duai-release-notes#October_17_2025 2025-10-17T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Layout parser lets you parse images and tables as annotations in Preview.

Layout parser can identify if there are images or tables in parsed documents. When found, images and tables are annotated as a descriptive block of text with the information depicted in the image and table.

]]>
October 06, 2025 tag:google.com,2016:duai-release-notes#October_06_2025 2025-10-06T00:00:00-07:00 <![CDATA[v1beta3 & v1

Announcement

Capacity reservation is available for Document AI in Preview. This lets you grant capacity to selected processors and maintain a steady real-time, high-volume processing flow for document processing requests.

For the necessary steps, read make a capacity reservation request section of "Quotas".

Feature

Custom extractor model pretrained-foundation-model-v1.5.1-2025-08-07 with improved adaptive few-shot learning is available as Release Candidate (Preview).

Feature

Support for confidence scores in Custom classifier models pretrained-foundation-model-v1.4-2025-05-16 and pretrained-classifier-v1.5-2025-08-05 is in Preview.

For best performance, use them with fine-tuned models.

]]>
September 23, 2025 tag:google.com,2016:duai-release-notes#September_23_2025 2025-09-23T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Custom classifier model pretrained-classifier-v1.5-2025-08-05 powered by Gemini 2.5 Flash is in Preview. It has ML processing available for US and EU regions, a maximum page limit of 30 pages, and processing requests of 120 pages per minute.

Unlike the prior custom classifier, which used classical machine learning, this version features a new platform. It accommodates:

  • High accuracy immediately, based on the document classes you define.
  • Few-shot learning to further improve accuracy.
  • Use of descriptions when labeling for more context and insight for document classes.
  • More accurate results with the same training dataset on the fine-tuned generative AI model, compared to the trained version.
  • Autolabeling documents for fine-tuning and evaluation.
  • Generative AI to fine-tune and heighten accuracy.

For more information on processor versions, see Managing processor versions.

]]>
September 10, 2025 tag:google.com,2016:duai-release-notes#September_10_2025 2025-09-10T00:00:00-07:00 <![CDATA[v1beta3 & v1

Deprecated

Custom Extractor version pretrained-foundation-model-v1.4-2025-02-05 will no longer be accessible on February 5, 2026.

To avoid service disruptions, migrate to a later version such as pretrained-foundation-model-v1.5-2025-05-05 or pretrained-foundation-model-v1.5-pro-2025-06-20. To learn more about the migration process, refer to Manage processor versions.

]]>
September 09, 2025 tag:google.com,2016:duai-release-notes#September_09_2025 2025-09-09T00:00:00-07:00 <![CDATA[v1beta3 & v1

Announcement

Document AI supports two service tiers and associated quotas: provisioned and best effort tiers.

The base is the provisioned tier quota, which provides 120 pages per minute for Gemini 2.0 and 2.5 Flash LLM and 30 pages per minute for Gemini 2.5 Pro LLM.

If you require more volume, best effort tier quota provides 120 pages per minute for Gemini 2.0 2.5 Flash and 60 pages per minute for Gemini 2.5 Pro. It's only used when the provisioned quota has been exhausted. This applies to the BestEffortOnlineProcessDocumentPagesPerMinutePerProjectUS and EU quotas and, in the console, best_effort_online_process_document_pages_us and eu.

Best effort can get up to 240 pages per minute for custom data extractor models v1.4 and v1.5 with a quota increase request (QIR). You can make a QIR by contacting your sales team representative.

There is no service level agreement (SLA) for best effort tier.

]]>
September 03, 2025 tag:google.com,2016:duai-release-notes#September_03_2025 2025-09-03T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Custom extractor model pretrained-foundation-model-v1.5-pro-2025-06-20 is available as General Availability (GA).

For more information about available models, see the custom extractor page.

]]>
August 29, 2025 tag:google.com,2016:duai-release-notes#August_29_2025 2025-08-29T00:00:00-07:00 <![CDATA[v1

Feature

Derived entity and signature detection are now supported in custom extractor models pretrained-foundation-model-v1.4-2025-02-05 as General Availability (GA) and in pretrained-foundation-model-v1.5-2025-05-05, as well as pretrained-foundation-model-v1.5-pro-2025-06-20 as Preview.

Signature detection lets you identify handwritten signatures by using visual cues in the document. Derived entity detection lets you deduce entities by inference without requiring the value to be explicitly present in the text. You can use this feature to deduce the country in an address, counting items in a table, or detecting if an ID is fake.

These can be enabled in the console when creating labels or by using the DocumentSchema.EntityType resource in the API.

For more information, read Custom extractor with derived fields and choose label attributes.

]]>
July 22, 2025 tag:google.com,2016:duai-release-notes#July_22_2025 2025-07-22T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Custom extractor model pretrained-foundation-model-v1.5-pro-2025-06-20 powered by Gemini 2.5 Pro is in Preview. It has ML processing available for US and EU regions, a maximum page limit of 30 pages, and processing requests of 30 pages per minute.

For more information, see Managing processor versions.

]]>
July 04, 2025 tag:google.com,2016:duai-release-notes#July_04_2025 2025-07-04T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Document AI now supports Identity and Access Management (IAM) deny policies. These policies allow you to define deny rules that prevent certain principals from using certain permissions to access Google Cloud resources, regardless of the roles they're granted.

For more information, read Deny policy overview and Document AI security and compliance.

Feature

Document AI VPC service controls (VPC-SC) integration now supports identity groups.

For more information on setting up VPC-SC identity groups, read Configure identity groups and third-party identities in ingress and egress rules.

]]>
July 03, 2025 tag:google.com,2016:duai-release-notes#July_03_2025 2025-07-03T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

The Document AI CDE processor now supports merging the child entities of nested entities that extend across several pages. This is supported in custom extractor model pretrained-foundation-model-v1.5-2025-05-05.

This change is automatic in all processors.

For customers with existing v1.5 processors, to make use of this feature, you must relabel the nested entities in different pages.

To learn more about the labeling process, refer to Label documents.

]]>
June 30, 2025 tag:google.com,2016:duai-release-notes#June_30_2025 2025-06-30T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Custom Extractor model pretrained-foundation-model-v1.5-2025-05-05 is in General Availability (GA) and has fine-tuning available for the US and EU.

From version v1.4 and later, we will use a new quota for online processing called Number of online process document pages per minute per processor type and model version. This quota will be enforced at a per-page and per-foundation model level. There will be no change to the batch processing quota.

These can be enabled in the console when creating labels and by using the DocumentSchema.EntityType.

For more information, read Managing processor versions.

]]>
June 19, 2025 tag:google.com,2016:duai-release-notes#June_19_2025 2025-06-19T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

We've increased the maximum file size for online processing requests from 20 MB to 40 MB. This applies to all types of processors.

For more information, see the Document AI limits page.

]]>
May 19, 2025 tag:google.com,2016:duai-release-notes#May_19_2025 2025-05-19T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Cross-regions importing of fine-tuned models is now supported for processor versions based on Gemini 1.5 and later, such as custom extractors pretrained-foundation-model-v1.2-2024-05-10 and later.

For more information, see Managing processor versions.

]]>
May 05, 2025 tag:google.com,2016:duai-release-notes#May_05_2025 2025-05-05T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

Custom extractor model pretrained-foundation-model-v1.5-2025-04-25 powered by Gemini 2.5 Flash LLM is available as Public Preview in US regions. The custom extractor model supports a quota of up to 15 pages per minute for online process requests.

For more information about available models, see Custom extractor model versions.

]]>
April 08, 2025 tag:google.com,2016:duai-release-notes#April_08_2025 2025-04-08T00:00:00-07:00 <![CDATA[v1

Announcement

Previous Custom Extractor versions pretrained-foundation-model-v1.0-2023-08-22 and pretrained-foundation-model-v1.1-2024-03-12 will be deprecated on April 9, 2025. To ensure uninterrupted service, prediction traffic to these versions, including any fine-tuned variants, will be automatically redirected to the latest version, pretrained-foundation-model-v1.4-2025-02-05.

For guidance on how to fine-tune a new version, refer to the fine tuning documentation.

]]>
April 02, 2025 tag:google.com,2016:duai-release-notes#April_02_2025 2025-04-02T00:00:00-07:00 <![CDATA[v1beta3 & v1

Feature

All processors can now extend the Maximum page limit for online and synchronous requests up to 30 pages.

To do so, enable imageless_mode in ProcessRequest.

For Custom Extractor, you will need to first request to be allowlisted for this feature by filling out the form Allowlist Request for 30 Page limit in CDE.

]]>
March 24, 2025 tag:google.com,2016:duai-release-notes#March_24_2025 2025-03-24T00:00:00-07:00 <![CDATA[v1

Deprecated

As we launch Custom Extractor version pretrained-foundation-model-v1.4-2025-02-05 in GA with fine tuning (in Preview), these versions will no longer be accessible effective September 24, 2025:

  • pretrained-foundation-model-v1.2-2024-05-10
  • pretrained-foundation-model-v1.3-2024-08-31

To avoid service disruptions, migrate to a later version, such as pretrained-foundation-model-v1.4-2025-02-05. To learn more about the migration process, refer to our Manage processor versions documentation.

Customers and projects can access pretrained-foundation-model-v1.2-2024-05-10 and pretrained-foundation-model-v1.3-2024-08-31 until September 24, 2025. This includes the ability to create tuning jobs and access fine-tuned processor versions.

Starting March 24, 2025:

  • Newly created processor versions using pretrained-foundation-model-v1.2-2024-05-10 can only be used for batch processing.
  • Newly created processor versions using pretrained-foundation-model-v1.2-2024-05-10 and pretrained-foundation-model-v1.3-2024-08-31 will have a quota limit of 120 pages per minute.

This update requires planning, but if you have questions or need assistance, contact Google Cloud support.

]]>
March 19, 2025 tag:google.com,2016:duai-release-notes#March_19_2025 2025-03-19T00:00:00-07:00 <![CDATA[v1

Feature

Custom Extractor model pretrained-foundation-model-v1.4-2025-02-05 is in General Availability (GA), and has fine-tuning available in Preview for the US and EU.

From version v1.4 and later, we will use a new quota for online processing called Number of online process document pages per minute per processor_type_and_model_version. This quota will be enforced at a per-page and per-foundation model level. There will be no change to the batch processing quota.

]]>
February 14, 2025 tag:google.com,2016:duai-release-notes#February_14_2025 2025-02-14T00:00:00-08:00 <![CDATA[v1

Feature

Custom extractor model pretrained-foundation-model-v1.4-2025-02-05 powered by Gemini 2.0 Flash LLM is available as Public Preview in US and EU regions with improved accuracy. The Custom Extractor Model supports a quota of up to 120 pages per minute for online process requests.

For more information about available models, see Custom extractor model versions.

]]>
February 03, 2025 tag:google.com,2016:duai-release-notes#February_03_2025 2025-02-03T00:00:00-08:00 <![CDATA[v1

Feature

Model pretrained-ocr-v2.1.1-2025-01-31 is available as a Release Candidate in the regions asia-south1, australia-southeast1, europe-west2, europe-west3 and northamerica-northeast1.

For more information about available models, see Enterprise Document OCR.

Feature

Model pretrained-ocr-v2.1-2024-08-07 has General Availability (GA) in the US and EU.

For more information about available models, see Enterprise Document OCR and Regional and multi-regional support availability.

]]>
January 27, 2025 tag:google.com,2016:duai-release-notes#January_27_2025 2025-01-27T00:00:00-08:00 <![CDATA[v1.3

Feature

For processor versions pretrained-foundation-model-v1.2-2024-05-10 and pretrained-foundation-model-v1.3-2024-08-31 custom extractors, customer-managed encryption keys (CMEK) is now supported when importing fine-tuned processor versions.

For more information, see Import processor versions.

]]>
January 23, 2025 tag:google.com,2016:duai-release-notes#January_23_2025 2025-01-23T00:00:00-08:00 <![CDATA[v1

Breaking

Effective January 27, 2025, new and existing processors require explicit storage.objects.get permissions to access Google Cloud Storage buckets for training dataset imports and offline/batch processing.

You will need to review your use of training dataset imports and offline/batch processing to verify that the users of these APIs have appropriate permissions to access Google Cloud Storage buckets.

Ensure that users of these APIs have been granted one of the predefined or legacy Cloud Storage roles that includes the storage.objects.get permission (such as Storage Object Viewer). You can assign these roles in the Permissions tab of the relevant Cloud Storage bucket.

We understand that this update requires planning, but we're here to support you during this process. If you have questions or need assistance, contact Google Cloud support.

]]>
December 19, 2024 tag:google.com,2016:duai-release-notes#December_19_2024 2024-12-19T00:00:00-08:00 <![CDATA[v1

Feature

Property description is now Generally Available (GA) as part of the custom extractor in both the Document AI section of the Google Cloud console and the API, with additional support for parent entities in hierarchies.

Property description allows you to provide additional context, insights, and prior knowledge for each entity to improve extraction accuracy.

]]>
December 12, 2024 tag:google.com,2016:duai-release-notes#December_12_2024 2024-12-12T00:00:00-08:00 <![CDATA[v1

Feature

You can copy processor versions of pretrained-foundation-model-v1.2-2024-05-10 and pretrained-foundation-model-v1.3-2024-08-31 between projects by following the steps in Import a processor version.

]]>
October 22, 2024 tag:google.com,2016:duai-release-notes#October_22_2024 2024-10-22T00:00:00-07:00 <![CDATA[v1.3

Feature

The Document AI section of the Google Cloud console now allows you to configure property descriptions as part of the Custom extractor processor-creation process.

Property description allows you to provide additional context, insights, and prior knowledge for each entity to improve extraction accuracy.

Property descriptions can be edited after schema creation. After you update the property descriptions, you will need to either call the pretrained models or create or fine-tune a new processor version for the changes to take effect.

]]>
October 01, 2024 tag:google.com,2016:duai-release-notes#October_01_2024 2024-10-01T00:00:00-07:00 <![CDATA[v1

Feature

Custom Extractor pretrained-foundation-model-v1.2-2024-05-10 and pretrained-foundation-model-v1.3-2024-08-31 are now Stable versions.

v1.2 and v1.3 now have the following features:

  • Fine-tuning is now available in Public preview.
  • They were internally upgraded to a higher quality model.
  • The labeling system has been upgraded to use the latest version of the OCR model.

v1.2 is recommended for the best quality. v1.3 is recommended for the lowest latency.

We recommend creating a new processor and relabeling the training and evaluation documents to benefit from both the improved quality with the new processor versions of Custom Extractor (v1.2 and v1.3) and the enhanced labeling system.

]]>