Cloud Bloghttps://cloud.google.com/blog/Cloud BlogenFri, 22 Nov 2024 11:00:01 +0000https://cloud.google.com/blog/static/blog/images/google.a51985becaa6.pngCloud Bloghttps://cloud.google.com/blog/How Commerzbank is transforming financial advisory workflows with gen AIhttps://cloud.google.com/blog/products/ai-machine-learning/how-commerzbank-is-transforming-financial-advisory-workflows-with-gen-ai/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">In today's fast-paced financial landscape, staying competitive requires embracing innovation and efficiency. Commerzbank, a leading German bank, recognized the potential to streamline its internal workflows, particularly within its financial advisory division for corporate clients. </span></p>
<p><span style="vertical-align: baseline;">Given regulatory requirements, sales advisors need to carefully document investment suggestions in detailed protocols. It’s a highly manual and time consuming task.This has led to significant productivity bottlenecks and reduces the time available for advising customers.</span></p>
<p><span style="vertical-align: baseline;">"Our Sales advisor team spends a considerable amount of time in documentation of advisory calls," </span><span style="vertical-align: baseline;">Ulrich Heitbaum, COO, Corporate Clients segment at Commerzbank</span><span style="vertical-align: baseline;">. "By partnering with Google to build a sophisticated GenAI system to automate this process, we considerably boost productivity and efficiency. One thought leads us: Only lean, smoothly functioning processes and reliable technology lead to an outstanding - excellent - service delivery to the customer."</span></p>
<p><span style="vertical-align: baseline;">Recognizing the potential for improvement, Commerzbank partnered with Google Cloud to develop an advanced gen AI-powered solution that automates this labor-intensive process. By leveraging </span><a href="https://cloud.google.com/ai/generative-ai"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud's AI and machine learning</span></a><span style="vertical-align: baseline;"> capabilities, Commerzbank was able to automate this process and achieve a 300% gain in sales advisor productivity.</span></p></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">The Challenge: Time-Consuming Manual Processes</strong></h3>
<p><span style="vertical-align: baseline;">Financial advisors play a crucial role in providing personalized financial advice to clients. However, the process of reviewing client interactions and extracting and summarizing relevant domain and client-specific information was highly manual and inefficient. Sales advisors had to dedicate significant time to listening to call recordings, identifying key details, and manually entering data into various systems. This process not only consumed valuable time but also increased the risk of errors and inconsistencies.</span></p>
<h3><strong style="vertical-align: baseline;">The Technical Solution: A Deep Dive into Commerzbank's gen-AI system</strong></h3>
<p><span style="vertical-align: baseline;">Commerzbank's solution for automating financial advisory workflows leverages a sophisticated multi-step gen-AI architecture built using </span><a href="https://cloud.google.com/vertex-ai"><span style="text-decoration: underline; vertical-align: baseline;">Vertex AI</span></a><span style="vertical-align: baseline;"> and designed for quality, scalability and extensibility. </span><span style="vertical-align: baseline;">Gemini 1.5 Pro</span><span style="vertical-align: baseline;">'s ability to </span><a href="https://cloud.google.com/transform/historic-year-for-ai-momentous-multimodal-moment-the-prompt"><span style="text-decoration: underline; vertical-align: baseline;">understand multiple modalities</span></a><span style="vertical-align: baseline;"> and process long context information played a key role in building this system that would not have been possible with any other model</span><span style="vertical-align: baseline;">. Here's a breakdown of the key steps involved:</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/image1_v6TdxD7.max-1000x1000.png"
alt="image1">
</a>
<figcaption class="article-image__caption "><p data-block-key="fperr">An end-to-end architecture of the AI Advisor system</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><strong style="vertical-align: baseline;">1. User interaction and data import (1, 2, 3):<br/></strong><span style="vertical-align: baseline;">The process begins with the sales advisor using a user-friendly frontend interface (1) to select the client calls they need to process. This interface communicates with a Java Spring backend (2) that manages the workflow. The backend then initiates the import of the selected audio recordings from Commerzbank's recording system (3) into Google Cloud Platform (GCP) storage buckets. This ensures the data is readily available for the AI processing pipeline.</span></p>
<p><strong style="vertical-align: baseline;">2. Audio chunking and storage (4.1, 4.2):<br/></strong><span style="vertical-align: baseline;">To handle potentially lengthy client calls, the imported audio recordings are divided into smaller, manageable segments (4.1). This "chunking" process allows the system to process even multi-hour calls efficiently. These audio chunks are then stored securely within GCP storage (4.2), ensuring data durability and accessibility for subsequent steps.</span></p>
<p><strong style="vertical-align: baseline;">3. Advanced diarization and transcription (4.3):<br/></strong><span style="vertical-align: baseline;">This step is crucial for generating a high-quality, structured transcript that captures the nuances of the conversation. Gemini 1.5 Pro is employed to create a diarized transcript, meaning each speaker is identified and their contributions are accurately attributed. This process occurs sequentially, with each audio chunk processed in order. To maximize accuracy, the model receives the transcript generated up to that point, along with carefully engineered prompts and a few-shot example of audio-to-text transcription. This ensures the final transcript is not only accurate in terms of content but also includes correct speaker identification and especially numerical information, which is crucial in a financial context. Once the final transcript is generated, the individual audio chunks from step 4.2 are deleted to optimize storage.</span></p>
<p><strong style="vertical-align: baseline;">4. Fact extraction (4.4):<br/></strong><span style="vertical-align: baseline;">With a comprehensive and long transcript in hand, Gemini 1.5 Pro long context is then used to analyze and extract relevant facts (4.4). This involves identifying key information related to the specific financial advisory document that needs to be completed. The model is prompted to recognize and extract crucial details such as client names, investment preferences, risk tolerance, and financial goals.</span></p>
<p><strong style="vertical-align: baseline;">5. Summary generation (4.5):<br/></strong><span style="vertical-align: baseline;">This step focuses on generating concise and accurate summaries for each field within the financial advisory document. Leveraging the extracted facts from the previous step and employing Chain-of-Thought (CoT) prompting, Gemini 1.5 Pro creates multiple German-language summaries tailored to the specific domain and the requirements of each form field. This ensures the generated summaries are not only informative but also comply with Commerzbank's internal guidelines and regulatory requirements.</span></p>
<p><strong style="vertical-align: baseline;">6. Summary optimization and explanation (4.6):<br/></strong><span style="vertical-align: baseline;">To ensure the highest quality output, the multiple summaries generated for each form field are </span><a href="https://cloud.google.com/blog/products/ai-machine-learning/enhancing-llm-quality-and-interpretability-with-the-vertex-gen-ai-evaluation-service?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">evaluated and the best summary for each field is selected</span></a><span style="vertical-align: baseline;"> using the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview"><span style="text-decoration: underline; vertical-align: baseline;">Vertex AI Gen AI Evaluation Service</span></a><span style="vertical-align: baseline;"> (4.6). Importantly, the service also provides a human-readable explanation for its selection, enabling sales advisors to understand the reasoning behind the AI's choices and maintain trust in the automated process.</span></p>
<p><span style="vertical-align: baseline;">This multi-stage architecture, combining the power of Gemini 1.5 Pro with Vertex AI's evaluation capabilities, enables Commerzbank to automate a complex and time-consuming process with high accuracy and efficiency. By streamlining these workflows, Commerzbank empowers its sales advisors to focus on higher-value tasks, ultimately improving client service and driving business growth.</span></p>
<h2><strong style="vertical-align: baseline;">The Benefits: Increased Efficiency and Productivity</strong></h2>
<p><span style="vertical-align: baseline;">The impact of this AI-powered automation has been significant. By automating the manual tasks associated with financial advisory documentation, Commerzbank has achieved substantial productivity gains. Sales advisors now have more time to focus on higher-value activities, such as building client relationships and providing personalized financial advice.</span></p>
<p><strong style="vertical-align: baseline;">Key benefits of the solution include:</strong></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Reduced processing time:</strong><span style="vertical-align: baseline;"> The automated solution significantly reduces the time required to process client interactions by achieving what takes a client 60-plus minutes in just a few minutes with manual human overview. This greatly accelerates time to business. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Increased productivity:</strong><span style="vertical-align: baseline;"> By automating manual tasks, the solution empowers sales advisors to focus on more strategic activities, leading to increased productivity by 3x and improved client service.</span></li>
</ul>
<h2><strong style="vertical-align: baseline;">Looking into the Future</strong></h2>
<p><span style="vertical-align: baseline;">Commerzbank's collaboration with Google Cloud exemplifies the transformative power of AI in the financial services industry. By embracing innovative technologies, Commerzbank is streamlining its operations, empowering its employees, and enhancing the client experience. “Therefore, we set up a Strategic Initiative Corporate Clients AI powered sales force - to make our sales focus on high value activities” Sebastian Kauck, CIO Corporate Clients at Commerzbank. </span></p>
<p><span style="vertical-align: baseline;">They plan to scale this solution to other use cases and enhance its functionality, providing new and additional value to their sales team. This AI-powered solution is just one example of how Commerzbank is leveraging technology to stay ahead of the curve and deliver exceptional financial services, in addition to many other cloud and GenAI use cases.</span></p>
<p><span style="vertical-align: baseline;">This partnership has not only delivered significant productivity gains but has also laid the foundation for future innovation. Commerzbank plans to expand the use of AI and automation across other areas of its business, further optimizing its operations and enhancing its offerings to clients.</span></p>
<hr/>
<p><sup><em><span style="vertical-align: baseline;">This project was a joint collaboration between Anant Nawalgaria, Patrick Nestler, Florian Baumert and Markus Staab from Google and Tolga Bastürk, </span><span style="vertical-align: baseline;">Otto Franke, Mirko Franke,</span><span style="vertical-align: baseline;"> Gregor Wilde, Janine Unger, Enis Muhaxhiri, Andre Stubig, Ayse-Maria Köken and Andreas Racke from Commerzbank.</span></em></sup></p></div>Fri, 22 Nov 2024 11:00:00 +0000https://cloud.google.com/blog/products/ai-machine-learning/how-commerzbank-is-transforming-financial-advisory-workflows-with-gen-ai/Financial ServicesCustomersDevelopers & PractitionersGoogle Cloud in EuropeAI & Machine LearningarticleHow Commerzbank is transforming financial advisory workflows with gen AIhttps://storage.googleapis.com/gweb-cloudblog-publish/original_images/Commerzbank-v3.gifGooglehttps://cloud.google.com/blog/products/ai-machine-learning/how-commerzbank-is-transforming-financial-advisory-workflows-with-gen-ai/Tolga BastürkProduct Owner, CommerzbankAnant NawalgariaSr. Staff ML Engineer, GoogleSeeing Through a GLASSBRIDGE: Understanding the Digital Marketing Ecosystem Spreading Pro-PRC Influence Operationshttps://cloud.google.com/blog/topics/threat-intelligence/glassbridge-pro-prc-influence-operations/<div class="block-paragraph_advanced"><p>Written by: Vanessa Molter</p>
<p><span style="font-style: italic; vertical-align: baseline;">Special thanks to Mandiant's Ryan Serabian for his contributions to this analysis.</span></p>
<hr/></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">This blog post details GLASSBRIDGE—an umbrella group of four different companies that operate networks of inauthentic news sites and newswire services tracked by the Google Threat Intelligence Group (consisting of Google’s Threat Analysis Group (TAG) and Mandiant). Collectively these firms bulk-create and operate hundreds of domains that pose as independent news websites from dozens of countries, but are in fact publishing thematically similar, inauthentic content that emphasizes narratives aligned to the political interests of the People’s Republic of China (PRC). Since 2022, Google has blocked more than a thousand GLASSBRIDGE-operated websites from eligibility to appear in Google News features and Google Discover because these sites violated our </span><a href="https://support.google.com/news/publisher-center/answer/6204050?hl=en" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">policies</span></a><span style="vertical-align: baseline;"> that prohibit deceptive behavior and require editorial transparency. </span></p>
<p><span style="vertical-align: baseline;">We cannot attribute who hired these services to create the sites and publish content, but assess the firms may be taking directions from a shared customer who has outsourced the distribution of pro-PRC content via imitation news websites.</span></p>
<p><span style="vertical-align: baseline;">These campaigns are </span><a href="https://blog.google/threat-analysis-group/prigozhin-interests-and-russian-information-operations/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">another example</span></a><span style="vertical-align: baseline;"> of private public relations (PR) firms conducting coordinated influence campaigns—in this case, spreading content aligned with the PRC’s views and political agenda to audiences dispersed across the globe. By using private PR firms, the actors behind the information operations (IO) gain plausible deniability, obscuring their role in the dissemination of coordinated inauthentic content.</span></p>
<h2><span style="vertical-align: baseline;">The Basics</span></h2>
<p><span style="vertical-align: baseline;">These inauthentic news sites are operated by a small number of stand-alone digital PR firms that offer newswire, syndication and marketing services. They pose as independent outlets that republish articles from PRC state media, press releases, and other content likely commissioned by other PR agency clients. In some cases, they publish localized news content copied from legitimate news outlets. We have also observed content from </span><a href="https://blog.google/threat-analysis-group/google-disrupted-dragonbridge-activity-q1-2024/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">DRAGONBRIDGE</span></a><span style="vertical-align: baseline;">, the most prolific IO actor TAG tracks, disseminated in these campaigns. </span></p>
<p><span style="vertical-align: baseline;">Although the four PR firms discussed in this post are separate from one another, they operate in a similar fashion, bulk-creating dozens of domains at a time and sharing thematically similar inauthentic content. Based on the set of inauthentic news domain names, the firms target audiences outside the PRC, including Australia, Austria, Czechia, Egypt, France, Germany, Hungary, Kenya, India, Indonesia, Japan, Luxemburg, Macao, Malaysia, New Zealand, Nigeria, Poland, Portugal, Qatar, Russia, Saudi Arabia, Singapore, South Korea, Spain, Switzerland, Taiwan, Thailand, Turkey, the United States, Vietnam, and the Chinese-speaking diaspora.</span></p>
<p><span style="vertical-align: baseline;">The use of newswire services is a shared tactic across all campaigns, and two of the PR firms directly control and operate the newswire services.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/glassbridge-fig1a.gif"
alt="GLASSBRIDGE is an ecosystem of companies and newswire services that publish inauthentic news content">
</a>
<figcaption class="article-image__caption "><p data-block-key="lc3u0">Figure 1: GLASSBRIDGE is an ecosystem of companies and newswire services that publish inauthentic news content</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h2><span style="vertical-align: baseline;">The Most Prolific: Shanghai Haixun Technology</span></h2>
<p><span style="vertical-align: baseline;">Of the PR and marketing firms we have observed supporting pro-China IO campaigns, the most prolific is Shanghai Haixun Technology Co., Ltd or “Haixun”. Since TAG first began </span><a href="https://blog.google/threat-analysis-group/tag-bulletin-q3-2022/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">tracking</span></a><span style="vertical-align: baseline;"> Haixun, Google has removed more than 600 policy-violating domains linked to the firm from the ability to appear in Google News features. The sites target English- and Chinese-speaking audiences, as well as audiences in a number of countries such as Brazil, India, Japan, Kenya, Korea, Malaysia, Saudi Arabia, Singapore, Spain, Russia, Thailand, Qatar, and Vietnam. Google has also terminated a limited number of policy-violating YouTube channels tied to the group. </span></p>
<p><span style="vertical-align: baseline;">In July 2023, </span><a href="https://cloud.google.com/blog/topics/threat-intelligence/pro-prc-haienergy-us-news/"><span style="text-decoration: underline; vertical-align: baseline;">Mandiant identified Haixun using both Times Newswire and World Newswire</span></a><span style="vertical-align: baseline;"> to place pro-Beijing content on the subdomains of legitimate news outlets. Mandiant also identified Haixun’s use of freelance services such as Fiverr to recruit </span><span style="vertical-align: baseline;">for-hire social media accounts to promote pro-Beijing content.</span></p>
<p><span style="vertical-align: baseline;">Haixun’s inauthentic news sites are generally low quality, and much of the content on the domains is spammy and repetitive. Mixed in with “filler” articles on topics such as the metaverse, the sites publish news content that is politically aligned to the views of the PRC government. This includes articles from the Global Times, a PRC state-controlled media outlet, and narratives aligned to common PRC talking points on Beijing’s territorial claims in the South China Sea, Taiwan, ASEAN, Falun Gong, Xinjiang, and the COVID-19 pandemic.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/glassbridge-fig2.max-1000x1000.png"
alt="Haixun inauthentic news featuring a mix of content, including PRC government talking points, Global Times articles, and content on the metaverse">
</a>
<figcaption class="article-image__caption "><p data-block-key="ol9gk">Figure 2: Haixun inauthentic news featuring a mix of content, including PRC government talking points, Global Times articles, and content on the metaverse</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h2><span style="vertical-align: baseline;">Times Newswire and Shenzhen Haimai Yunxiang Media </span></h2>
<p><span style="vertical-align: baseline;">In February 2024, we removed policy-violating domains from appearing on Google News surfaces associated with a pro-PRC coordinated influence campaign reported by Citizen Lab as </span><a href="https://citizenlab.ca/2024/02/paperwall-chinese-websites-posing-as-local-news-outlets-with-pro-beijing-content/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">PAPERWALL</span></a><span style="vertical-align: baseline;"> that operated a network of over 100 websites in more than 30 countries masquerading as local news outlets. The imitation news sites published localized news content copied from legitimate local news outlets alongside articles republished from PRC state-controlled media, as well as press releases, conspiracy theories, and ad hominem attacks targeting specific individuals. </span></p>
<p><span style="vertical-align: baseline;">Based on technical indicators, TAG determined the inauthentic news websites were operated and controlled directly by Times Newswire, one of the news wire services that has distributed content on behalf of Haixun. TAG believes Times Newswire is, in turn, operated by another Chinese media company, Shenzhen Haimai Yunxiang Media Co., Ltd., or “Haimai”, which bills itself as a service provider specialized in global media communication and overseas network promotion. </span></p>
<p><span style="vertical-align: baseline;">The views expressed in the conspiracy and smear content were similar to past pro-PRC IO campaigns—for example, character attacks against the Chinese virologist Yan Limeng and claims that the US is conducting biological experiments on humans. Much of the smear content targeting specific individuals was ephemeral—it was posted on imitation news sites for a short period of time and then removed. </span></p>
<h2><span style="vertical-align: baseline;">DURINBRIDGE</span></h2>
<p><span style="vertical-align: baseline;">Another example of a commercial firm distributing content linked to pro-China IO campaigns is DURINBRIDGE, an alias we use to track a technology and marketing company that has multiple subsidiaries that provide news and PR services. DURINBRIDGE operates a network of over 200 websites designed to look like independent media outlets that publish news content on various topics. These domains violated our policies and have been blocked from appearing on Google News surfaces and Discover.</span></p>
<p><span style="vertical-align: baseline;">Importantly, DURINBRIDGE itself is not an IO actor and likely published the IO content on behalf of a customer or partner. Most of the content on the sites is news and press releases from various sources and has no apparent links to coordinated influence campaigns. However, a small portion of the content includes pro-PRC narratives and content directly linked to IO campaigns from Haixun and DRAGONBRIDGE. DURINBRIDGE sites also used articles and images from Times Newswire, which is operated by the aforementioned Chinese PR firm Haimai. </span></p>
<p><span style="vertical-align: baseline;">We identified multiple DRAGONBRIDGE articles published to DURINBRIDGE’s news sites. The content included narratives focused on exiled businessman Guo Wengui, a perennial topic for DRAGONBRIDGE, and multiple narratives amplified by DRAGONBRIDGE in the lead up to the Taiwanese presidential election. </span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/glassbridge-fig3.max-1000x1000.png"
alt="DRAGONBRIDGE content published to inauthentic news sites operated by DURINBRIDGE">
</a>
<figcaption class="article-image__caption "><p data-block-key="ol9gk">Figure 3: DRAGONBRIDGE content published to inauthentic news sites operated by DURINBRIDGE</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/glassbridge-fig4.max-1000x1000.png"
alt="“Secret History of Tsai Ing-Wen,” on DURINBRIDGE-operated inauthentic news site">
</a>
<figcaption class="article-image__caption "><p data-block-key="ol9gk">Figure 4: “Secret History of Tsai Ing-Wen,” on DURINBRIDGE-operated inauthentic news site</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/glassbridge-fig5.max-1000x1000.png"
alt="Narratives about then-candidate Lai Ching-te promoted by DRAGONBRIDGE prior to the Taiwanese presidential election">
</a>
<figcaption class="article-image__caption "><p data-block-key="ol9gk">Figure 5: Narratives about then-candidate Lai Ching-te promoted by DRAGONBRIDGE prior to the Taiwanese presidential election</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h2><span style="vertical-align: baseline;">Shenzhen Bowen Media</span></h2>
<p><span style="vertical-align: baseline;">In early 2024, TAG and Mandiant identified a fourth marketing firm that operates a network of over 100 domains that pose as independent news sites focused on countries and cities across Europe, the Americas, Asia, and Australia. These domains violated our policies and have been blocked from appearing on Google News surfaces and Discover. The operator of the sites, Shenzhen Bowen Media Information Technology Co., Ltd., is a PRC-based marketing firm that also operates World Newswire, the same press release service used by Haixun to place content on the subdomains of legitimate news outlets. </span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/glassbridge-fig6.max-1000x1000.png"
alt="Sites linked to Shenzhen Bowen with localized content for Brazil and Germany">
</a>
<figcaption class="article-image__caption "><p data-block-key="ol9gk">Figure 6: Sites linked to Shenzhen Bowen with localized content for Brazil and Germany</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Shenzhen Bowen’s sites present themselves as local outlets focused on a particular country or city, with articles in the local language about business, sports, and politics. The content is in multiple languages, aligned to each target audience, including Chinese, English, French, German, Japanese, and Thai. The sites do not disclose their connection to the marketing firm. </span></p>
<p><span style="vertical-align: baseline;">Side-by-side with local content, the sites include narratives promoting the Chinese government’s interests, much of it sourced from World Newswire. In more than one case, TAG and Mandiant have identified content linked to DRAGONBRIDGE published on Shenzhen Bowen-operated sites. </span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/glassbridge-fig7.max-1000x1000.png"
alt="DRAGONBRIDGE content on “Boston Journal” website linked to Shenzhen Bowen Media">
</a>
<figcaption class="article-image__caption "><p data-block-key="ol9gk">Figure 7: DRAGONBRIDGE content on “Boston Journal” website linked to Shenzhen Bowen Media</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h2><span style="vertical-align: baseline;">Conclusion</span></h2>
<p><span style="vertical-align: baseline;">The inauthentic news sites operated by GLASSBRIDGE illustrate how information operations actors have embraced methods beyond social media in an attempt to spread their narratives. We have observed similar behavior from </span><a href="https://blog.google/threat-analysis-group/prigozhin-interests-and-russian-information-operations/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Russian</span></a><span style="vertical-align: baseline;"> and </span><a href="https://blog.google/threat-analysis-group/tag-bulletin-q3-2024/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Iranian</span></a><span style="vertical-align: baseline;"> IO actors. By posing as independent, and often local news outlets, IO actors are able to tailor their content to specific regional audiences and present their narratives as seemingly legitimate news and editorial content. In fact, the content has been crafted or amplified by PR and newswire firms who conceal their role, or actively misrepresent their content as local and independent news coverage. In the case of GLASSBRIDGE, the consistency in content, behavioral similarities, connections across firms, and pro-PRC messaging suggests the private firms take direction from a shared customer who outsourced the creation of influence campaigns. Google is committed to information transparency, and we will continue tracking GLASSBRIDGE and blocking their inauthentic content on Google’s platforms. We regularly disclose our latest enforcement actions in the </span><a href="https://blog.google/threat-analysis-group/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">TAG Bulletin</span></a><span style="vertical-align: baseline;">. </span></p></div>Fri, 22 Nov 2024 10:00:00 +0000https://cloud.google.com/blog/topics/threat-intelligence/glassbridge-pro-prc-influence-operations/Threat IntelligencearticleSeeing Through a GLASSBRIDGE: Understanding the Digital Marketing Ecosystem Spreading Pro-PRC Influence OperationsGooglehttps://cloud.google.com/blog/topics/threat-intelligence/glassbridge-pro-prc-influence-operations/Google Threat Intelligence Group Don't let resource exhaustion leave your users hanging: A guide to handling 429 errorshttps://cloud.google.com/blog/products/ai-machine-learning/learn-how-to-handle-429-resource-exhaustion-errors-in-your-llms/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Large language models (LLMs) give developers immense power and scalability, but managing resource consumption is key to delivering a smooth user experience. LLMs demand significant computational resources, which means it's essential to anticipate and handle potential resource exhaustion. If not, you might encounter </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/error-code-429"><span style="text-decoration: underline; vertical-align: baseline;">429 “resource exhaustion” errors</span></a><span style="vertical-align: baseline;">, which can disrupt how users interact with your AI application. </span></p>
<p><span style="vertical-align: baseline;">Today, we'll delve into why 429 errors occur with LLMs and equip you with three practical strategies to address them effectively. By understanding the root causes and implementing the right solutions, you can help ensure a smooth and uninterrupted experience, even during times of peak demand.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud AI and ML'), ('body', <wagtail.rich_text.RichText object at 0x3e712ce599a0>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/vertex-ai/'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Backoff!</strong></h3>
<p><span style="vertical-align: baseline;">Exponential backoff and retry logic have been around for a number of years. These basic techniques for handling resource exhaustion or API unavailability also apply to LLMs. When a generative AI application’s calls floods a model’s API, or when an excessive amount of queries overloads a system, backoff and retry logic in the code can help. The waiting time increases exponentially with each retry until the overloaded system recovers.</span></p>
<p><span style="vertical-align: baseline;">In Python, there are decorators available to implement backoff logic in your application code. For example, </span><a href="https://github.com/jd/tenacity" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">tenacity</span></a><span style="vertical-align: baseline;"> is a useful general-purpose retrying library written in Python to simplify the task of adding retry behavior to your code. 429 errors are more likely to occur with asynchronous code and multimodal models such as Gemini with large context windows. Below is a sample of an asynchronous code for retry using tenacity. To view the entire notebook, please visit this </span><a href="https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/community/generative_ai/backoff_and_retry_for_LLMs.ipynb" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">link</span></a><span style="vertical-align: baseline;">. </span></p></div>
<div class="block-code"><dl>
<dt>code_block</dt>
<dd><ListValue: [StructValue([('code', 'from tenacity import retry, wait_random_exponential\r\n\r\nasync def async_ask_gemini(contents, model_name=DEFAUL_MODEL_NAME):\r\n # This basic function calls Gemini asynchronously without a retry logic\r\n multimodal_model = GenerativeModel(model_name)\r\n response = await multimodal_model.generate_content_async(\r\n contents=contents, generation_config=config)\r\n return response.text\r\n\r\n\r\n@retry(wait=wait_random_exponential(multiplier=1, max=60))\r\nasync def retry_async_ask_gemini(contents, model_name=DEFAUL_MODEL_NAME):\r\n """This is the same code as the async_ask_gemini function but implements a\r\n retry logic using tenacity decorator.\r\n wait_random_exponential(multiplier=1, max=60) means that it will\r\n Retry “Randomly wait up to 2^x * 1 seconds between each retry until the \r\n range reaches 60 seconds, then randomly up to 60 seconds afterwards.\r\n """\r\n multimodal_model = GenerativeModel(model_name)\r\n response = await multimodal_model.generate_content_async(contents=contents, generation_config=config)\r\n return response.text'), ('language', ''), ('caption', <wagtail.rich_text.RichText object at 0x3e712ce59e20>)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">We tested passing a large amount of input to Gemini 1.5 Pro to demonstrate how backoff and retry is crucial to the success of your gen AI application. We’re using images and videos stored in Google Cloud Storage to heavily tax the Gemini system.</span></p>
<p><span style="vertical-align: baseline;">Below are the results </span><span style="font-style: italic; vertical-align: baseline;">without</span><span style="vertical-align: baseline;"> backoff and retry configured; where four out of five attempts failed.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_1_-_Test_Results_WITHOUT_BACKOFF_AND.max-1000x1000.png"
alt="Image 1 - Test Results WITHOUT BACKOFF AND RETRY">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Below are the results </span><span style="font-style: italic; vertical-align: baseline;">with</span><span style="vertical-align: baseline;"> backoff and retry configured. Implementing backoff and retry allowed all five attempts to succeed. Even with a successful API call and a response from the model, there is a trade-off. The backoff and retry adds increasing latency to a response. There may be other changes to make to the code, changes to the model, or a different cloud region that could improve performance. Nonetheless, backoff and retry is an overall improvement during moments of peak traffic and congestion. </span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_2_-_Test_Results_WITH_BACKOFF_AND_RE.max-1000x1000.png"
alt="Image 2 - Test Results WITH BACKOFF AND RETRY">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Additionally, when working with LLMs, you may often encounter issues from the underlying APIs such as rate-limiting or downtime. As you move your LLM applications into production it becomes more and more important to safeguard against these. That's why LangChain introduced the concept of a </span><a href="https://python.langchain.com/api_reference/core/runnables/langchain_core.runnables.fallbacks.RunnableWithFallbacks.html" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">fallback</span></a><span style="vertical-align: baseline;">, an alternative plan that may be used in an emergency. A fallback can be to a different model or even to another LLM provider altogether. Fallbacks can be implemented in code along with backoff and retry methods for greater resilience of your LLM applications. </span></p>
<p><span style="vertical-align: baseline;">Another robust option for LLM resiliency is </span><a href="https://github.com/GoogleCloudPlatform/apigee-samples/tree/main/llm-circuit-breaking" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">circuit breaking with Apigee</span></a><span style="vertical-align: baseline;">. By placing Apigee between a retrieval-augmented generation (RAG) application and LLM endpoints, you can manage traffic distribution and graceful failure handling. Of course, each model will provide a different answer so fallbacks and circuit breaking architecture should be thoroughly tested to ensure it meets your users needs.</span></p>
<h3><strong style="vertical-align: baseline;">Dynamic shared quota</strong></h3>
<p><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/dsq"><span style="text-decoration: underline; vertical-align: baseline;">Dynamic shared quota</span></a><span style="vertical-align: baseline;"> is one way that Google Cloud manages resource allocation for certain models, aiming to provide a more flexible and efficient user experience. Here's how it works: </span></p>
<p><strong style="vertical-align: baseline;">Traditional quota vs. dynamic shared quota</strong></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Traditional quota:</strong><span style="vertical-align: baseline;"> In a traditional quota system, you're assigned a fixed limit for a specific resource (e.g., API requests per day, per minute, per region). If you need more capacity, you usually have to submit a quota increase request and wait for approval. This can be slow and inconvenient. Of course, simply having quota allocated does not guarantee capacity, as it is still on-demand and not dedicated capacity. We will talk more about dedicated capacity when we discuss Provisioned Throughput later in this blog.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Dynamic shared quota:</strong><span style="vertical-align: baseline;"> With dynamic shared quota, Google Cloud has a pool of available capacity for a service. This capacity is dynamically distributed among all users who are making requests. Instead of having a fixed individual limit, you draw from this shared pool based on your needs at any given moment. </span></li>
</ul>
<p><strong style="vertical-align: baseline;">Benefits of dynamic shared quota</strong></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Eliminates quota increase requests:</strong><span style="vertical-align: baseline;"> You no longer need to submit quota increase requests for services that use dynamic shared quota. The system automatically adjusts to your usage patterns. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Improved efficiency:</strong><span style="vertical-align: baseline;"> Resources are used more efficiently because the system can allocate capacity where it's needed most at any given time.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Reduced latency:</strong><span style="vertical-align: baseline;"> By dynamically allocating resources, Google Cloud can minimize latency and provide faster responses to your requests.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Simplified management:</strong><span style="vertical-align: baseline;"> It simplifies capacity planning because you don't have to worry about hitting fixed limits.</span></li>
</ul>
<p><strong style="vertical-align: baseline;">Dynamic shared quota in action</strong></p>
<p><span style="vertical-align: baseline;">429 resource exhaustion errors are more likely to occur with asynchronous calls to Gemini with large multimodal input such as large video files. Below is a comparison of model performance of Gemini-1.5-pro-001 with traditional quota versus Gemini-1.5-pro-002 with dynamic shared quota. We can see even without retry (not recommended) the second-generation Gemini Pro model outperforms the previous-generation model because of dynamic shared quota.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_3_-_Test_Results_WITHOUT_RETRY_AND_T.max-1000x1000.png"
alt="Image 3 - Test Results WITHOUT RETRY AND TRADITIONAL QUOTA">
</a>
</figure>
</div>
</div>
</div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_4_-_Test_Results_WITHOUT_RETRY_WITH_.max-1000x1000.png"
alt="Image 4 - Test Results WITHOUT RETRY WITH DYNAMIC SHARED QUOTA">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Backoff and retry mechanisms should be combined with dynamic shared quota, especially as request volume and token size increase. During our testing of the -002 model with larger video input, we encountered 429 errors in all our initial attempts. However, the test results below demonstrate that incorporating backoff and retry logic allowed all five subsequent attempts to succeed. This highlights the necessity of this strategy for the newer -002 Gemini model to be consistently successful.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_5_-_Test_Results_WITHOUT_RETRY_WITH_.max-1000x1000.png"
alt="Image 5 - Test Results WITHOUT RETRY WITH DYNAMIC SHARED QUOTA">
</a>
</figure>
</div>
</div>
</div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_6_-_Test_Results_LARGE_VIDEO_WITH_RE.max-1000x1000.png"
alt="Image 6 - Test Results LARGE VIDEO WITH RETRY AND DYNAMIC SHARED QUOTA">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p style="text-align: justify;"><span style="vertical-align: baseline;">Dynamic shared quota represents a shift towards a more flexible and efficient way of managing resources in Google Cloud. By dynamically allocating capacity, it aims to provide a tightly integrated experience for users while optimizing resource utilization. Dynamic shared quota is not user-configurable. Google enables it only on specific models’ versions like Gemini-1.5-pro-002 and Gemini-1.5-flash-002. Check </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/dsq#google-models"><span style="text-decoration: underline; vertical-align: baseline;">supported Google Model versions</span></a><span style="vertical-align: baseline;"> for more details.</span></p>
<p><span style="vertical-align: baseline;">Alternatively, there are times where you want a hard-stop threshold to prevent excessive API calls to Gemini. Abuse, budget limits and controls, or security reasons all play a factor in purposely setting a customer-defined quota in Vertex AI. This is where the feature of </span><a href="https://cloud.google.com/service-usage/docs/manage-quota#create_consumer_quota_override"><span style="text-decoration: underline; vertical-align: baseline;">consumer quota override</span></a><span style="vertical-align: baseline;"> comes in. This can be a useful tool to protect your AI applications and systems. You can manage consumer quota with Terraform using the </span><a href="https://registry.terraform.io/providers/hashicorp/google/latest/docs/resources/service_usage_consumer_quota_override" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">google_service_usage_consumer_quota_override</span></a><span style="vertical-align: baseline;"> schema. </span></p>
<h3><strong style="vertical-align: baseline;">Provisioned Throughput</strong></h3>
<p><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/provisioned-throughput"><span style="text-decoration: underline; vertical-align: baseline;">Provisioned Throughput</span></a><span style="vertical-align: baseline;"> from Google Cloud is a service that allows you to reserve dedicated capacity for generative AI models on the Vertex AI platform. This means you can have predictable and reliable performance for your AI workloads, even during peak demand. </span></p>
<p><span style="vertical-align: baseline;">Here's a breakdown of what it offers and why it's useful:</span></p>
<p><strong style="vertical-align: baseline;">Benefits:</strong></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Predictable performance:</strong><span style="vertical-align: baseline;"> You get consistent response times and avoid performance variability, helping your AI applications run smoothly. </span></li>
<li><strong style="vertical-align: baseline;">Reserved capacity:</strong><span style="vertical-align: baseline;"> No more worrying about resource contention or queuing. You have your own dedicated capacity for your AI models. By default, when Provisioned Throughput capacity is surpassed, the excess traffic is billed at the pay-as-you-go rate.</span></li>
</ul></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Image_7_-_Provisioned_Throughput_reserved_.max-1000x1000.png"
alt="Image 7 - Provisioned Throughput reserved capacity">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><ul>
<li role="presentation"><strong style="vertical-align: baseline;">Cost-effective:</strong><span style="vertical-align: baseline;"> It can be more cost-effective than pay-as-you-go pricing if you have consistent, high-volume AI workloads. To </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/provisioned-throughput#place-an-order"><span style="text-decoration: underline; vertical-align: baseline;">estimate</span></a><span style="vertical-align: baseline;"> if you can save money using Provisioned Throughput, follow steps one through ten in the order process.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Scalable:</strong><span style="vertical-align: baseline;"> You can easily scale your reserved capacity up or down as your needs change.</span></li>
</ul>
<p><span style="vertical-align: baseline;">If you have an application with a large user base and need to provide fast response times then this is definitely going to help. This is designed especially for applications that require immediate AI processing, such as chatbots or interactive content generation. Provisioned Throughput can also help computationally-intensive AI tasks, such as processing massive datasets or generating complex outputs.</span></p>
<h3><strong style="vertical-align: baseline;">Leave 429s behind</strong></h3>
<p><span style="vertical-align: baseline;">When using generative AI in production, reliable performance is crucial. To achieve this, consider implementing these three strategies. They are designed to work together, and incorporating backoff and retry mechanisms into all your gen AI applications is a best practice. To get started building with generative AI, you can use these </span><a href="https://github.com/GoogleCloudPlatform/vertex-ai-samples/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Vertex AI samples</span></a><span style="vertical-align: baseline;"> on GitHub or take advantage of the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/overview"><span style="text-decoration: underline; vertical-align: baseline;">beginner's guide</span></a><span style="vertical-align: baseline;">, </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/start/quickstarts/quickstart-multimodal"><span style="text-decoration: underline; vertical-align: baseline;">quickstarts</span></a><span style="vertical-align: baseline;">, or </span><a href="https://github.com/GoogleCloudPlatform/generative-ai/tree/main/gemini/sample-apps/e2e-gen-ai-app-starter-pack" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">starter pack</span></a><span style="vertical-align: baseline;"> Google Cloud offers.</span></p></div>Thu, 21 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/ai-machine-learning/learn-how-to-handle-429-resource-exhaustion-errors-in-your-llms/Application DevelopmentAI & Machine LearningarticleDon't let resource exhaustion leave your users hanging: A guide to handling 429 errorsGooglehttps://cloud.google.com/blog/products/ai-machine-learning/learn-how-to-handle-429-resource-exhaustion-errors-in-your-llms/Pedro MelendezCustomer EngineerMatt DalidaTechnical Account ManagerMake IAM for GKE easier to use with Workload Identity Federationhttps://cloud.google.com/blog/products/identity-security/make-iam-for-gke-easier-to-use-with-workload-identity-federation/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">At Google Cloud, we work to continually improve our platform’s security capabilities to deliver the most trusted cloud. As part of this goal, we’re helping our users move away from less secure authentication methods such as long-lived, unauditable, service account keys towards </span><a href="https://cloud.google.com/docs/authentication#auth-decision-tree"><span style="text-decoration: underline; vertical-align: baseline;">more secure alternatives</span></a><span style="vertical-align: baseline;"> when authenticating to Google Cloud APIs and services. </span></p>
<p><span style="vertical-align: baseline;">In the context of Kubernetes workloads, there have been three ways users can do this:</span></p>
<ol>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Export credentials and mount as a secret in the Pod at runtime. This is done using service account keys but could be a security risk if the keys are not managed correctly.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Use the worker node identity or credential, such as the node service account. The security issue is that the credential is shared by all workloads deployed on that node. This can result in over-provisioning of permissions, which violates the principle of least privilege and is not recommended for multi-tenant clusters nor microservices in general. </span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Use GKE Workload Identity, which allows you to grant access to Cloud APIs using OpenID Connect without needing manual configuration or less secure methods like the aforementioned options. </span></p>
</li>
</ol>
<p><span style="vertical-align: baseline;">The preferred option has been to use GKE Workload Identity. Earlier this year, we renamed it </span><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/workload-identity"><span style="text-decoration: underline; vertical-align: baseline;">Workload Identity Federation for GKE</span></a><span style="vertical-align: baseline;">, and rolled out a significant update that made it even easier to use. The update also enabled deeper integration into Google Cloud’s IAM platform. Here’s what you need to know about the changes.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud security products'), ('body', <wagtail.rich_text.RichText object at 0x3e712c76eb50>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/welcome'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Why you should avoid service account impersonation (for most cases)</strong></h3>
<p><span style="vertical-align: baseline;">Previously, a workload on GKE needed to impersonate a Google Cloud service account with its Kubernetes service account (KSA) in order to enable Pod access to Google Cloud services. Although this improved security, we heard from many users that it was difficult to set up. Having to maintain both a Kubernetes service account and a Google Cloud service account for every Kubernetes workload also contributed to a sprawl of identities that needed to be managed and audited.</span></p>
<p><span style="vertical-align: baseline;">While the previous configuration is still possible and will continue to be supported, there is now a </span><strong style="vertical-align: baseline;">simpler way to configure this access</strong><span style="vertical-align: baseline;">: Google Cloud IAM policies can directly reference GKE workloads and Kubernetes service accounts. This </span><span style="vertical-align: baseline;">significantly simplifies the setup (six configuration steps to three), removing the need to manage another set of Google Cloud service accounts, and perform annotations on Kubernetes service accounts referencing the Google Cloud service account.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/image2_3LeKyZU.max-1000x1000.png"
alt="image2">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">More integrated experience inside Cloud IAM</strong></h3>
<p><span style="vertical-align: baseline;">Moving towards this new configuration also gives your Kubernetes identities first class principal and principalSet representations inside Google Cloud IAM. When you use IAM tools such as </span><a href="https://cloud.google.com/policy-intelligence/docs/role-recommendations-overview"><span style="text-decoration: underline; vertical-align: baseline;">IAM recommender</span></a><span style="vertical-align: baseline;"> and IAM policies, you no longer need to apply changes to the impersonated service account and then mentally map the service account to the Kubernetes workload. </span></p>
<p><span style="vertical-align: baseline;">For example, inside the IAM recommender you can now directly see least privileged recommendations for your Kubernetes workloads and apply these recommendations directly to the Kubernetes principal.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/image1_RoD03F0.max-1000x1000.png"
alt="image1">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Referencing multiple GKE workloads inside IAM using principalSet notation</strong></h3>
<p><span style="vertical-align: baseline;">Previously, you could only reference a single principal (such as a single Kubernetes service account). The new configuration supports the principalSet notation which enables an attribute-based selection of multiple identities just like any other Workload Identity Federation principal. As a result, you can now </span><a href="https://cloud.google.com/iam/docs/principal-identifiers#v2"><span style="text-decoration: underline; vertical-align: baseline;">refer to multiple GKE workloads in a single IAM policy</span></a><span style="vertical-align: baseline;">, for example:</span></p>
<p><span style="vertical-align: baseline;">1) All workloads or Pods that belong to a Kubernetes namespace.</span></p></div>
<div class="block-code"><dl>
<dt>code_block</dt>
<dd><ListValue: [StructValue([('code', 'principalSet://iam.googleapis.com/projects/PROJECT_NUMBER/locations/global/workloadIdentityPools/PROJECT_ID.svc.id.goog/namespace/NAMESPACE'), ('language', ''), ('caption', <wagtail.rich_text.RichText object at 0x3e712c750070>)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">2) All workloads or Pods that belong to a Kubernetes cluster.</span></p></div>
<div class="block-code"><dl>
<dt>code_block</dt>
<dd><ListValue: [StructValue([('code', 'principalSet://iam.googleapis.com/projects/PROJECT_NUMBER/locations/global/workloadIdentityPools/PROJECT_ID.svc.id.goog/kubernetes.cluster/https://container.googleapis.com/v1/projects/PROJECT_ID/locations/LOCATION/clusters/CLUSTER_NAME'), ('language', ''), ('caption', <wagtail.rich_text.RichText object at 0x3e712c7501f0>)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Limitations and next steps</strong></h3>
<p><span style="vertical-align: baseline;">There are still a few limitations when using this capability that you should be aware of. If any of these apply, you will need to continue to use the </span><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/workload-identity#kubernetes-sa-to-iam"><span style="text-decoration: underline; vertical-align: baseline;">previous service account impersonation method</span></a><span style="vertical-align: baseline;"> to perform authentication:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">A small number of </span><a href="https://cloud.google.com/iam/docs/federated-identity-supported-services"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud services</span></a><span style="vertical-align: baseline;"> don’t yet support Workload and Workforce Identity Federation principals.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/security/vpc-service-controls"><span style="text-decoration: underline; vertical-align: baseline;">VPC Service Controls</span></a><span style="vertical-align: baseline;"> ingress and egress rules </span><a href="https://cloud.google.com/iam/docs/federated-identity-supported-services#vpc-service-controls"><span style="text-decoration: underline; vertical-align: baseline;">do not support</span></a><span style="vertical-align: baseline;"> Workload Identity Federation principal and principalSets.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">The specific permission to invoke a </span><a href="https://cloud.google.com/run"><span style="text-decoration: underline; vertical-align: baseline;">Cloud Run</span></a><span style="vertical-align: baseline;"> instance </span><a href="https://cloud.google.com/iam/docs/federated-identity-supported-services#cloud-run"><span style="text-decoration: underline; vertical-align: baseline;">does not support</span></a><span style="vertical-align: baseline;"> Workload Identity Federation principal and principalSets.</span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">We encourage you to update your existing configurations and use this new simplified IAM policy syntax where you can, moving forward. </span></p>
<p><span style="vertical-align: baseline;">To get started, consult the following guidance:</span></p>
<ul>
<li role="presentation"><span style="vertical-align: baseline;">View our updated documentation (</span><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/workload-identity"><span style="text-decoration: underline; vertical-align: baseline;">Concept</span></a><span style="vertical-align: baseline;">,</span><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/workload-identity"><span style="vertical-align: baseline;"> </span><span style="text-decoration: underline; vertical-align: baseline;">How-to</span></a><span style="vertical-align: baseline;">) pages and update your configurations.</span></li>
<li role="presentation"><span style="vertical-align: baseline;">Check out our </span><a href="https://www.youtube.com/watch?v=XmjTKVI93NA" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Container Native Security Con lighting talk</span></a><span style="vertical-align: baseline;"> on removing the need for impersonation.</span></li>
<li><span style="vertical-align: baseline;">Check out our </span><a href="https://www.youtube.com/watch?v=Gx_MsdeocAM" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Container Bytes video</span></a><span style="vertical-align: baseline;"> on Workload Identity Federation for GKE.</span></li>
</ul></div>Thu, 21 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/identity-security/make-iam-for-gke-easier-to-use-with-workload-identity-federation/Containers & KubernetesSecurity & IdentityarticleMake IAM for GKE easier to use with Workload Identity FederationGooglehttps://cloud.google.com/blog/products/identity-security/make-iam-for-gke-easier-to-use-with-workload-identity-federation/Cynthia ThomasProduct ManagerShaun LiuProduct ManagerAnnouncing Mistral AI’s Large-Instruct-2411 on Vertex AIhttps://cloud.google.com/blog/products/ai-machine-learning/announcing-new-mistral-large-model-on-vertex-ai/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">In July, we </span><a href="https://cloud.google.com/blog/products/ai-machine-learning/codestral-and-mistral-large-v2-on-vertex-ai?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">announced</span></a><span style="vertical-align: baseline;"> the availability of Mistral AI’s models on Vertex AI: Codestral for code generation tasks, Mistral Large 2 for high-complexity tasks, and the lightweight Mistral Nemo for reasoning tasks like creative writing. Today, we’re announcing the availability of Mistral AI’s newest model on </span><a href="https://cloud.google.com/model-garden"><span style="text-decoration: underline; vertical-align: baseline;">Vertex AI Model Garden</span></a><span style="vertical-align: baseline;">: </span><strong style="vertical-align: baseline;">Mistral-Large-Instruct-2411</strong><span style="vertical-align: baseline;"> is now generally available</span></p>
<p><a href="https://console.cloud.google.com/vertex-ai/publishers/mistralai/model-garden/mistral-large-2411"><strong style="text-decoration: underline; vertical-align: baseline;">Large-Instruct-2411</strong></a><strong style="vertical-align: baseline;"> </strong><span style="vertical-align: baseline;"> is an advanced dense large language model (LLM) of 123B parameters with strong reasoning, knowledge and coding capabilities extending its predecessor with better long context, function calling and system prompt. The model is ideal for use cases that include complex agentic workflows with precise instruction following and JSON outputs, or large context applications requiring strong adherence for retrieval-augmented generation (RAG), and code generation. </span></p>
<p><span style="vertical-align: baseline;">You can access and deploy the new Mistral AI Large-Instruct-2411 model on Vertex AI through our Model-as-a-Service (MaaS) or self-service offering today.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud AI and ML'), ('body', <wagtail.rich_text.RichText object at 0x3e712c770970>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/vertex-ai/'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">What can you do with the new Mistral AI models on Vertex AI?</strong></h3>
<p><span style="vertical-align: baseline;">By building with Mistral’s models on Vertex AI, you can:</span></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Select the best model for your use case: </strong><span style="vertical-align: baseline;"> Choose from a range of Mistral AI models, including efficient options for low-latency needs and powerful models for complex tasks like agentic workflows. Vertex AI makes it easy to evaluate and select the optimal model.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Experiment with confidence: </strong><span style="vertical-align: baseline;">Mistral AI models are available as fully managed Model-as-a-Service on Vertex AI. You can explore Mistral AI models through simple API calls and comprehensive side-by-side evaluations within our intuitive environment.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Manage models without overhead: </strong><span style="vertical-align: baseline;">Simplify how you deploy the new Mistral AI models at scale with fully managed infrastructure designed for AI workloads and the flexibility of pay-as-you-go pricing.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Tune the models to your needs:</strong><span style="vertical-align: baseline;"> In the coming weeks, you will be able to fine-tune Mistral AI's models to create bespoke solutions, with your unique data and domain knowledge. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Craft intelligent agents:</strong><span style="vertical-align: baseline;"> Create and orchestrate agents powered by Mistral AI models, using Vertex AI's comprehensive set of tools, including LangChain on Vertex AI. Integrate Mistral AI models into your production-ready AI experiences with Genkit’s Vertex AI plugin.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Build with enterprise-grade security and compliance: </strong><span style="vertical-align: baseline;">Leverage Google Cloud's built-in security, privacy, and compliance measures. Enterprise controls, such as Vertex AI Model Garden’s </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/control-model-access"><span style="text-decoration: underline; vertical-align: baseline;">new organization policy</span></a><span style="vertical-align: baseline;">, provide the right access controls to make sure only approved models can be accessed.</span></li>
</ul>
<h3><strong style="vertical-align: baseline;">Get started with Mistral AI models on Google Cloud</strong></h3>
<p><span style="vertical-align: baseline;">These additions continue Google Cloud’s commitment to open and flexible AI ecosystems that help you build solutions best-suited to your needs. Our collaboration with Mistral AI is a testament to our open approach, within a unified and an enterprise ready environment. Vertex AI provides a curated collection of first-party, open-source, and third-party models, many of which — including the new Mistral AI models — can be delivered as a fully-managed Model-as-a-service (MaaS) offering — providing you with the simplicity of a single bill and enterprise-grade security on our fully-managed infrastructure. </span></p>
<p><span style="vertical-align: baseline;">To start building with Mistral’s newest models, visit </span><a href="https://cloud.google.com/model-garden"><span style="text-decoration: underline; vertical-align: baseline;">Model Garden</span></a><span style="vertical-align: baseline;"> and select the Mistral Large model tile. The models are also available on Google Cloud Marketplace here: </span><a href="https://console.cloud.google.com/marketplace/product/mistralai/mistral-large-2411"><span style="text-decoration: underline; vertical-align: baseline;">Mistral Large</span></a><span style="vertical-align: baseline;">. </span></p>
<p><span style="vertical-align: baseline;">You can check out our </span><a href="https://github.com/GoogleCloudPlatform/vertex-ai-samples/blob/main/notebooks/official/generative_ai/mistralai_intro.ipynb" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">sample code</span></a><span style="vertical-align: baseline;"> and </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/partner-models/mistral"><span style="text-decoration: underline; vertical-align: baseline;">documentation</span></a><span style="vertical-align: baseline;"> to help you get started.</span></p></div>Thu, 21 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/ai-machine-learning/announcing-new-mistral-large-model-on-vertex-ai/AI & Machine LearningarticleAnnouncing Mistral AI’s Large-Instruct-2411 on Vertex AIhttps://storage.googleapis.com/gweb-cloudblog-publish/images/071024c_GC_Banner_Mistral_v1_1.max-600x600.jpgGooglehttps://cloud.google.com/blog/products/ai-machine-learning/announcing-new-mistral-large-model-on-vertex-ai/Nenshad BardoliwallaDirector, Product Management, Vertex AIAnnouncing new updates to Cloud Translation AI, now covering 189 languageshttps://cloud.google.com/blog/products/ai-machine-learning/latest-updates-to-google-clouds-translation-ai/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Your next big customer doesn't speak your language. In fact, </span><a href="https://csa-research.com/Blogs-Events/CSA-in-the-Media/Press-Releases/Consumers-Prefer-their-Own-Language" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">40% of global consumers</span></a><span style="vertical-align: baseline;"> won't even consider buying from websites not in their native tongue. With </span><a href="https://www.internetworldstats.com/stats7.htm" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">51.6% of internet users</span></a><span style="vertical-align: baseline;"> speaking languages other than English, you're potentially missing half your market. </span></p>
<p><span style="vertical-align: baseline;">Until now, enterprises faced an impossible choice in addressing translation use cases. They had to choose one of the following:</span></p>
<ul>
<li role="presentation"><span style="vertical-align: baseline;">Human translators: High quality but slow and expensive</span></li>
<li role="presentation"><span style="vertical-align: baseline;">Basic machine translation: Fast but lacks nuance</span></li>
<li role="presentation"><span style="vertical-align: baseline;">DIY solutions: Inconsistent and risky</span></li>
</ul>
<p><span style="vertical-align: baseline;">But the challenge with translation is, you need all three – and traditional translation methods can't keep up. This isn't just about converting words - it's about connecting with people using the right context and tone. </span></p>
<p><span style="vertical-align: baseline;">That’s why at Google Cloud, we built </span><a href="https://cloud.google.com/translate?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">Translation AI</span></a><span style="vertical-align: baseline;"> in Vertex AI. We’re excited to share the latest updates, and how you can apply it to your business.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud AI and ML'), ('body', <wagtail.rich_text.RichText object at 0x3e712cbc8250>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/vertex-ai/'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Translation AI: Unmatched translation quality, but your way</strong></h3>
<p><span style="vertical-align: baseline;">Google Cloud’s Translation AI includes two offerings for you to choose from:</span></p>
<ul>
<li role="presentation"><a href="https://cloud.google.com/translate/docs/basic/translate-text-basic"><span style="text-decoration: underline; vertical-align: baseline;">Translation API Basic</span></a><span style="vertical-align: baseline;">: An essential toolkit for translation capabilities. You can instantly detect languages and translate text using our advanced Neural Machine Translation (NMT) model. Translation AI Basic is perfect for chat conversations, short-form content, and scenarios where speed and consistency are crucial.</span></li>
<li><a href="https://cloud.google.com/translate/docs/advanced/translate-text-advance"><span style="text-decoration: underline; vertical-align: baseline;">Translation API Advanced</span></a><span style="vertical-align: baseline;">: Process entire documents, run batch translations, and maintain terminology consistency with custom glossaries. You can leverage our Gemini-powered Translation model for long-form content, or use Adaptive Translation to capture your brand's unique voice and tone. You can even customize translations by applying glossary, </span><a href="https://cloud.google.com/translate/docs/advanced/custom-translations#automl-translation"><span style="text-decoration: underline; vertical-align: baseline;">fine tuning</span></a><span style="vertical-align: baseline;"> our industry leading translation models, or </span><a href="https://cloud.google.com/translate/docs/advanced/custom-translations#adaptive_translation"><span style="text-decoration: underline; vertical-align: baseline;">adapting translation</span></a><span style="vertical-align: baseline;"> predictions in real time.</span></li>
</ul></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/image3_jLArRkU.gif"
alt="image3">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">What’s new in Translation AI</strong></h3>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Expanded reach and accuracy:</strong><span style="vertical-align: baseline;"> You can now reach global audiences with our expanded language support, now covering 189 languages — including Cantonese, Fijian, and Balinese — while maintaining lightning-fast performance, perfect for user content and contact centers.<br/></span><span style="font-style: italic; font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;">"Per our evaluations, Google NMT is among the best-performing real-time NMT models for 97% of the language-domain combinations we've tested (87 out of 90) -- which is 15% more than the closest competitor." -- Konstantin Savenkov, CEO & Co-Founder, Intento, Inc. </span></p>
</li>
</ul>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Smarter adaptive translation:</strong><span style="vertical-align: baseline;"> You can customize your translations' tone and style with as few as five examples, or use up to 30,000 for ultimate precision. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Model selection based on your use case: </strong><span style="vertical-align: baseline;">Using Cloud Translation Advanced</span><strong style="vertical-align: baseline;">, </strong><span style="vertical-align: baseline;">you have the option to choose from </span><a href="https://cloud.google.com/translate/docs/advanced/translating-text-v3"><span style="text-decoration: underline; vertical-align: baseline;">multiple approaches</span></a><span style="vertical-align: baseline;"> based on the complexity of your translation use case. For example, you can use our NMT model for translating general text or choose Adaptive Translation for customization in real-time.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Quality without compromise:</strong><span style="vertical-align: baseline;"> </span><span style="vertical-align: baseline;">While leaderboards and reports offer insights into overall model performance, they don't reveal how a model handles your specific needs. The </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview"><span style="text-decoration: underline; vertical-align: baseline;">gen AI evaluation service</span></a><span style="vertical-align: baseline;"> helps you pick your own evaluation criteria, giving you a clear understanding of how well AI models and applications align with your use case.</span><span style="vertical-align: baseline;"> For example, </span><a href="https://slator.com/with-metricx-24-google-presents-latest-machine-translation-evaluation-metric/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Google’s MetricX</span></a><span style="vertical-align: baseline;"> and the widely used </span><a href="https://github.com/Unbabel/COMET" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">COMET</span></a><span style="vertical-align: baseline;"> correlate strongly with human evaluation, are widely used for evaluating translation quality, and available now on </span><span style="vertical-align: baseline;">the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview"><span style="text-decoration: underline; vertical-align: baseline;">Vertex gen AI evaluation service</span></a><span style="vertical-align: baseline;">. </span><span style="vertical-align: baseline;">Compare models, prototype solutions, and select the best translation approach for your needs.</span></li>
</ul>
<p><span style="vertical-align: baseline;">We built Translation AI with a dual focus – transform how you translate and transform how you work with translation. While most vendors offer either powerful translation </span><span style="font-style: italic; vertical-align: baseline;">or</span><span style="vertical-align: baseline;"> easy implementation, we deliver on both in four critical ways. </span></p>
<ul>
<li role="presentation"><a href="https://console.cloud.google.com/vertex-ai/studio/translation"><strong style="text-decoration: underline; vertical-align: baseline;">Vertex AI</strong></a><strong style="vertical-align: baseline;"> for rapid prototyping: </strong><span style="vertical-align: baseline;">Instantly test translations across 189 languages. </span><a href="http://google.com/translation/adaptive-translation" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Compare NMT or our latest translation fine-tuned Gemini</span><span style="text-decoration: underline; vertical-align: baseline;">-</span><span style="text-decoration: underline; vertical-align: baseline;">powered model to find your perfect fit</span></a><span style="vertical-align: baseline;">. See how your custom adaptations perform without writing a single line of code, and get immediate quality metrics to validate your choices.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Production-ready APIs for your existing workflows:</strong><span style="vertical-align: baseline;"> Plug our Translation API (NMT) directly into your applications for real-time, high-volume translations. Switch your model selection to our Adaptive Translation Gemini-powered model via the same Translation API when tone and context matter most. Both models integrate into your existing workflows and automatically scale with your needs.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Customization without coding:</strong><span style="vertical-align: baseline;"> Train custom translation models on your specific industry terms and phrases. Simply upload your domain-specific data and let Translation AI build a custom model that speaks your language. It’s perfect for specialized content in legal, medical, or technical fields—no ML expertise required.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Full control with Vertex AI:</strong><span style="vertical-align: baseline;"> Own your complete translation pipeline using Translation AI through our comprehensive platform – Vertex AI. With Vertex AI, you can select your preferred models, customize their behavior, and monitor real-world performance. Integrate seamlessly with your existing CI/CD processes for true enterprise-grade translation at scale.</span></li>
</ul>
<h3><strong style="vertical-align: baseline;">Real impact: The Uber story</strong></h3>
<p><a href="https://www.uber.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Uber</span></a><span style="vertical-align: baseline;"> is leveraging Google Cloud Translation AI product suite to achieve their mission to </span><strong style="vertical-align: baseline;">help people go anywhere and get anything and earn their way</strong><span style="vertical-align: baseline;">. </span></p>
<p style="padding-left: 40px;"><span style="font-style: italic; vertical-align: baseline;">"Operating in tens of thousands of cities worldwide, Uber prioritizes seamless communication and support for riders, drivers, couriers, and eaters across diverse languages. Misinterpretations can result in delays, frustration, and even safety concerns. For years, Google has been our trusted translation platform. With recent advancements in Translation models, automatic quality metrics, and language expansion, we’re excited to partner with Google to deliver innovative multilingual experiences to our users." — </span><span style="vertical-align: baseline;">Megha Yethadka, Senior Director, Uber.</span></p>
<h3><strong style="vertical-align: baseline;">Get started with Translation AI</strong></h3>
<p><span style="vertical-align: baseline;">Here are a few resources to help you get started with the latest features in Translation AI. Gemini-powered Translation model and Adaptive Translation are publicly available to use today. You can try them out in Vertex AI Studio. </span></p>
<ul>
<li role="presentation"><a href="https://cloud.google.com/translate/docs/advanced/translating-text-v3"><span style="text-decoration: underline; vertical-align: baseline;">Translation API - Translate Text</span></a></li>
<li role="presentation"><a href="https://cloud.google.com/translate/docs/advanced/adaptive-translation"><span style="text-decoration: underline; vertical-align: baseline;">Adaptive Translation overview</span></a></li>
<li role="presentation"><a href="https://cloud.google.com/translate"><span style="text-decoration: underline; vertical-align: baseline;">Translation product page</span></a></li>
<li role="presentation"><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/models/evaluation-overview"><span style="text-decoration: underline; vertical-align: baseline;">GenAI Evaluation service overview </span></a></li>
<li role="presentation"><a href="https://cloud.google.com/translate/docs/languages"><span style="text-decoration: underline; vertical-align: baseline;">Supported Languages</span></a></li>
</ul></div>Wed, 20 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/ai-machine-learning/latest-updates-to-google-clouds-translation-ai/AI & Machine LearningarticleAnnouncing new updates to Cloud Translation AI, now covering 189 languagesGooglehttps://cloud.google.com/blog/products/ai-machine-learning/latest-updates-to-google-clouds-translation-ai/Sarah WeldonProduct Manager, Google CloudBuild, deploy, and promote AI agents through Google Cloud’s AI agent ecosystemhttps://cloud.google.com/blog/topics/partners/build-deploy-and-promote-ai-agents-through-the-google-cloud-ai-agent-ecosystem-program/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">We’ve seen a sharp rise in demand from enterprises that want to use AI agents to automate complex tasks, personalize customer experiences, and increase operational efficiency. Today, we’re announcing a Google Cloud AI agent ecosystem program to help partners build and co-innovate AI agents with technical and go-to-market resources from Google Cloud. We’re also launching AI Agent Space, a new category in our </span><a href="https://console.cloud.google.com/marketplace/browse?filter=category:ai-agent"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud Marketplace</span></a><span style="vertical-align: baseline;"> for customers to easily find and deploy partner-built AI agents. </span></p>
<p><span style="vertical-align: baseline;">Through this program, we’ll provide product support, marketing amplification, and co-selling opportunities to help our services and ISV partners bring these solutions to market faster, reach more customers, and grow their AI agent businesses. Our goal is to provide customers with a rich ecosystem of solutions that sit on top of our world-class infrastructure and offer the choice and optionality needed to tailor AI for their businesses and maximize value from AI investments. </span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', 'Try Google Cloud for free'), ('body', <wagtail.rich_text.RichText object at 0x3e712cbe8f40>), ('btn_text', 'Get started for free'), ('href', 'https://console.cloud.google.com/freetrial?redirectPath=/welcome'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">New resources for partners building AI Agents</strong></h3>
<p><span style="vertical-align: baseline;">To increase the development and adoption of AI agents, we’re focusing on supporting partners in three key areas:</span></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Accelerated agent development:</strong><span style="vertical-align: baseline;"> We'll provide partners with direct access to Google Cloud's product and engineering teams for guidance and optimization of their AI agents. Partners will also receive early access to our latest AI technologies, technical enablement and best practices, and dedicated support for bringing their solutions to market quickly via Google Cloud Marketplace.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Go-to-market success:</strong><span style="vertical-align: baseline;"> New go-to-market programs and co-selling opportunities specifically designed for AI agent solutions will help partners more effectively promote their offerings and drive adoption across a wider range of customers.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Increased customer visibility:</strong><span style="vertical-align: baseline;"> We will highlight the innovative work of our partners through targeted marketing resources, blogs, and dedicated events, which will increase visibility of partner-built AI agents and help them stand out in our growing AI ecosystem.</span></li>
</ul>
<h3><strong style="vertical-align: baseline;">Offerings from services partners</strong></h3>
<p><span style="vertical-align: baseline;">We’ve seen significant momentum from services partners who have used Google Cloud’s technology to help customers successfully build and deploy AI agents. Through this program, our services partners will make their AI agents available to even more customers, including on AI Agent Space in the future. Here are some of their innovative agent solutions: </span></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Accenture</strong><span style="vertical-align: baseline;"> is transforming customer support at a major retailer by offering convenient self-service options through virtual assistants, enhancing the overall customer experience.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Bain</strong><span style="vertical-align: baseline;"> supports SEB’s wealth management division with an AI agent that enhances end-customer conversations with suggested responses and generates call summaries that help increase efficiency by 15%. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">BCG</strong><span style="vertical-align: baseline;"> provides a sales optimization tool to improve the effectiveness and impact of insurance advisors. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Capgemini</strong><span style="vertical-align: baseline;"> optimizes the ecommerce experience by helping retailers accept customer orders through new revenue channels and to accelerate the order-to-cash process for digital stores.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Cognizant</strong><span style="vertical-align: baseline;"> helps legal teams draft contracts, assigning risk scores and recommendations for how to optimize operational impact. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Deloitte</strong><span style="vertical-align: baseline;"> offers a “Care Finder” agent as part of its Agent Fleet, helping care seekers find in-network providers often in less than a minute — significantly faster than the average call time of 5-8 minutes.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">HCLTech</strong><span style="vertical-align: baseline;"> </span><span style="vertical-align: baseline;">helps predict and eliminate different types of defects on manufacturing products with its manufacturing quality agent, Insight.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Infosys</strong><span style="vertical-align: baseline;"> optimizes digital marketplaces for a </span><span style="vertical-align: baseline;">leading consumer brand manufacturer</span><span style="vertical-align: baseline;">, providing actionable insights on inventory planning, promotions, and product descriptions. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">PwC</strong><span style="vertical-align: baseline;"> uses AI agent technology to help oncology clinics streamline administrative work so that doctors can optimize their time with patients.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">TCS</strong><span style="vertical-align: baseline;"> helps build persona-based AI agents contextualized with enterprise knowledge to accelerate software development.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Wipro</strong><span style="vertical-align: baseline;"> </span><span style="vertical-align: baseline;">supports a national healthcare provider in using agent technology to develop and adjust contracts, streamlining a complex and time-consuming task while improving accuracy. </span></li>
</ul>
<p><span style="vertical-align: baseline;">Partners have already given us positive feedback about the support we’ve provided to more effectively scale their agent solutions, including Datatonic, Kyndryl, Quantiphi, and Slalom who plan to bring new agents to market soon. Here’s what partners had to say:</span></p>
<ul>
<li role="presentation"><span style="vertical-align: baseline;">“Leaders who prioritize and invest in agentic architecture will be at the forefront of their industries, driving future growth with generative AI. For example, Accenture's marketing team is using autonomous agents to streamline campaign creation and execution, reducing manual steps by 25-35%, saving 6% in costs, and speeding up time-to-market by 25-55%.” - </span><strong style="vertical-align: baseline;">Scott Alfieri, Global Lead, Google Business Group, Accenture</strong></li>
<li role="presentation"><span style="vertical-align: baseline;">“BCG continues to see strong business value partnering with Google Cloud to deliver gen AI transformations for our joint clients across industries. Google Cloud's support for a robust ecosystem of AI agents demonstrates its commitment to innovation and democratization of AI.” - </span><strong style="vertical-align: baseline;">Val Elbert, Managing Director and Senior Partner, BCG</strong></li>
<li role="presentation"><span style="vertical-align: baseline;">“By partnering with Google Cloud, we are building AI agents that transform customer experiences and bring efficiency to business processes. Google Cloud's Agent Marketplace empowers Capgemini to continue developing and deploying innovative AI agents, leveraging our deep understanding of our customers.” – </span><strong style="vertical-align: baseline;">Fernando Alvarez, Chief Strategy and Development Officer and Group Executive Board Member, Capgemini</strong></li>
<li role="presentation"><span style="vertical-align: baseline;">“Deloitte has helped some of its largest clients improve how they operate with AI agents built with Google Cloud’s technology. As agentic AI takes off, this initiative can enhance our agent-building and distribution capabilities, thus enabling us to accelerate our clients’ time to business value with AI solutions.” – </span><strong style="vertical-align: baseline;">Gopal Srinivasan, Alphabet Google Alliance Generative AI Leader, Deloitte Consulting LLP</strong></li>
</ul>
<h3><strong style="vertical-align: baseline;">Offerings from ISV partners</strong></h3>
<p><span style="vertical-align: baseline;">Our ISV partners are leveraging the power of Google Cloud's AI technology, including Vertex AI and Gemini models, to develop cutting-edge AI agent solutions. Many have already made their offerings available on Google Cloud Marketplace, and we're thrilled that they will be expanding their reach through AI Agent Space to make it even easier for customers to deploy and benefit from these innovative AI agents. </span></p>
<p><span style="vertical-align: baseline;">Here are some examples of their agent capabilities: </span></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Bud Financial</strong><span style="vertical-align: baseline;"> uses its "Financial LLM" to provide personalized answers to customer queries and supports automation of banking tasks such as moving money between accounts to avoid overdrafts.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Dun & Bradstreet </strong><span style="vertical-align: baseline;">uses its Hoovers SmartSearch AI to help customers quickly build targeted lists of companies and contacts matching specific criteria such as location, industry, and company size, making it easier to identify and action targeted opportunities.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Elastic </strong><span style="vertical-align: baseline;">helps SREs and SecOps interpret log messages and errors, optimize code, write reports, and even identify and execute a runbook. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Exabeam</strong><span style="vertical-align: baseline;"> enhances cybersecurity with natural language search, visualization, and investigation acceleration, automating threat explanations and next steps for multi-terabyte datasets.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">FullStory </strong><span style="vertical-align: baseline;">integrates its real-time data capture with Google Cloud's AI to create context-aware conversational agents, enabling faster data discovery and analysis of web and mobile interactions and more intelligent AI responses.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">GrowthLoop </strong><span style="vertical-align: baseline;">gives marketers tools that automate audience building, suggest optimal targeting, and create custom attributes, optimizing the power of BigQuery data.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">OpenText </strong><span style="vertical-align: baseline;">enables users to quickly find fast, accurate answers to inquiries that span a broad set of business domains, such as DevOps, customer service, and content management. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Quantum Metric</strong><span style="vertical-align: baseline;"> uses its Felix AI agent to help customer service associates quickly summarize and identify important takeaways from consumer engagements, with reporting metrics that help businesses enhance inquiry resolutions. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Sprinklr </strong><span style="vertical-align: baseline;">offers multiple AI agents that can help businesses improve decision-making, resolve service queries, and handle complex tasks end-to-end. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Teradata</strong><span style="vertical-align: baseline;"> helps analyze, categorize, and summarize customer inquiries or complaints by using multimodal capabilities that process text and voice data, identifying key trends and actionable insights to enhance customer loyalty.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">ThoughtSpot </strong><span style="vertical-align: baseline;">uses its Spotter agent to empower customers with autonomous analytics capabilities and a natural-language chat interface that brings deep data analysis and contextual reasoning to any user. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Typeface </strong><span style="vertical-align: baseline;">enables users to automate marketing workloads and across teams with its Arc Agent, which supports marketers with campaign performance, creative content creation updates, and audience optimization. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">UKG </strong><span style="vertical-align: baseline;">enhances the workplace experience with Bryte AI, a conversational agent that enables HR administrators and people managers to request information about company policies, business insights, and more. </span></li>
</ul>
<p><span style="vertical-align: baseline;">ISV partners are successfully using our AI to enhance their agent solutions, which they expect to grow through our ecosystem. Here’s what they had to say: </span></p>
<ul>
<li role="presentation"><span style="vertical-align: baseline;">“Dun & Bradstreet built Hoovers SmartSearch AI with Google's AI to revolutionize sales prospecting by instantly generating targeted lists of companies and contacts. Through this innovative initiative, customer adoption of our AI agent will be accelerated to help users effortlessly identify ideal customers and accelerate revenue growth.” - </span><strong style="vertical-align: baseline;">Michael Manos, Chief Technology Officer, Dun & Bradstreet </strong></li>
<li role="presentation"><span style="vertical-align: baseline;">“Elastic AI Assistant uses Vertex AI and Gemini models to empower SREs and SecOps teams to build intelligent agents that interpret log messages, optimize code, automate reports, and even generate runbooks. This is the future of agentic architecture, and it's available now in partnership with Google Cloud.” - </span><strong style="vertical-align: baseline;">Ken Exner, CPO, Elastic</strong><span style="vertical-align: baseline;"> </span></li>
<li role="presentation"><span style="vertical-align: baseline;">“By leveraging Google's advanced AI capabilities, ThoughtSpot Spotter delivers an autonomous analytics agent that empowers users to extract valuable insights from their data through natural language interactions. We're excited to scale our AI agent to even more customers in partnership with Google Cloud." </span><strong style="vertical-align: baseline;">- </strong><strong style="vertical-align: baseline;">Sumeet Arora, Chief Development Officer, ThoughtSpot</strong></li>
<li role="presentation"><span style="vertical-align: baseline;">“UKG leverages Vertex AI to power UKG Bryte AI, a gen AI sidekick for UKG’s Pro and Ready HCM solutions. Bryte AI is built on UKG’s proprietary people, culture, and work data to enhance insights and decision-making, and to enable more conversational AI experiences” - </span><strong style="vertical-align: baseline;">Venkat Ramamurthy, Head of Product, AI, and Data, UKG</strong></li>
</ul>
<p><span style="vertical-align: baseline;">We’re pleased by how quickly partners have built AI agents to help customers improve their businesses. Additional partners with powerful AI agent capabilities available through Google Cloud include AUI.io, Automation Anywhere, Big SUR AI, BigCommerce, DataStax, Decagon.ai, Dialpad, Elastic, ema.co, Livex.ai, Lyzr.ai, Mojix, Moveo.ai, Regnology, Tamr, Tektonic AI, Vijil, VMware, Wisdom AI, and Zeotap. </span></p>
<h3><strong style="vertical-align: baseline;">Joining AI Agent Space</strong></h3>
<p><a href="https://console.cloud.google.com/marketplace/browse?filter=category:ai-agent"><span style="text-decoration: underline; vertical-align: baseline;">AI Agent Space</span></a><span style="vertical-align: baseline;"> is available today with solutions from select partners, and we plan to add hundreds of additional AI agents over the coming months. Partners interested in learning more can visit </span><a href="https://cloud.google.com/marketplace/sell"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud Marketplace</span></a><span style="vertical-align: baseline;"> to start listing AI agent solutions, and they can apply to the program </span><a href="https://docs.google.com/forms/d/10U-lTYO4J1fgSTjfKWl-EiqjFDnLLqaYvxtLVHSpUf0/viewform?edit_requested=true" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">here</span></a><span style="vertical-align: baseline;"> or reach out to their partner representative to explore additional collaboration opportunities. </span></p>
<p><span style="vertical-align: baseline;">We’re dedicated to empowering our partners with the tools, resources, and support they need to build and deploy successful AI agents. We're excited to see the transformative solutions they create and the positive impact they'll have on customers in the coming year.</span></p></div>Wed, 20 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/topics/partners/build-deploy-and-promote-ai-agents-through-the-google-cloud-ai-agent-ecosystem-program/PartnersarticleBuild, deploy, and promote AI agents through Google Cloud’s AI agent ecosystemGooglehttps://cloud.google.com/blog/topics/partners/build-deploy-and-promote-ai-agents-through-the-google-cloud-ai-agent-ecosystem-program/Kevin IchhpuraniPresident, Global Partner Organization, Google CloudGoogle Cloud NetApp Volumes now available for OpenShift on Google Cloudhttps://cloud.google.com/blog/topics/partners/netapp-volumes-now-available-for-openshift-on-google-cloud/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">As a result of </span><a href="https://developers.redhat.com/articles/2024/09/23/red-hat-openshift-dedicated-google-cloud-integration?sc_cid=701f2000000txokAAA&utm_source=bambu&utm_medium=organic_social" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">new joint efforts across NetApp, Red Hat and Google Cloud</span></a><span style="vertical-align: baseline;">, we are announcing support for </span><a href="https://cloud.google.com/netapp-volumes"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud NetApp Volumes</span></a><span style="vertical-align: baseline;"> in OpenShift on Google Cloud through </span><a href="https://www.netapp.com/blog/trident-24-10-best-storage-kubernetes/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">NetApp Trident Version 24.10</span></a><span style="vertical-align: baseline;">. This enables joint customers to take advantage of Google Cloud infrastructure that’s optimized for OpenShift, reduce operational toil, and streamline migration of complex workloads.</span></p>
<h3><strong style="vertical-align: baseline;">The power of OpenShift-optimized infrastructure</strong></h3>
<p><span style="vertical-align: baseline;">Red Hat and Google Cloud have a long history of collaborating on and contributing to Kubernetes as well as other Cloud Native Compute Foundation (CNCF) projects including Istio, Knative and Tekton. Together, these projects make up the basis for </span><a href="https://www.redhat.com/en/technologies/cloud-computing/openshift" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">OpenShift</span></a><span style="vertical-align: baseline;">, Red Hat’s platform that helps developers build, deploy, and manage applications. In fact, Google and Red Hat have been collaborating since before Kubernetes was even conceived, including co-developing Cgroups, a precursor to Linux containers. When Google open-sourced Kubernetes, Red Hat was one of the first to jump on board, betting the Red Hat OpenShift platform on the new open-source standard. Today, Google and Red Hat hold prominent leadership roles in Kubernetes governance, and are </span><a href="https://k8s.devstats.cncf.io/d/9/companies-table" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">#1 and #2 largest contributors to Kubernetes</span></a><span style="vertical-align: baseline;">, respectively</span><strong style="vertical-align: baseline;">.</strong></p>
<p><span style="vertical-align: baseline;">Google Cloud infrastructure is highly optimized for OpenShift. </span><a href="https://cloud.google.com/compute/docs/instances/creating-instance-with-custom-machine-type"><span style="text-decoration: underline; vertical-align: baseline;">Custom machine shapes</span></a><span style="vertical-align: baseline;"> let you optimize OpenShift Pods:Nodes bin-packing, reducing how much compute capacity you need to provision for a typical OpenShift workload. </span><a href="https://cloud.google.com/compute/docs/disks/storage-pools"><span style="text-decoration: underline; vertical-align: baseline;">Hyperdisk Storage Pools</span></a><span style="vertical-align: baseline;"> enables thin-provisioning for OpenShift PersistentVolumes, reducing the amount of storage that needs to be provisioned. Additionally, support for </span><a href="https://cloud.google.com/compute/docs/instances/live-migration-process"><span style="text-decoration: underline; vertical-align: baseline;">live migration</span></a><span style="vertical-align: baseline;"> in a wide array of Compute Engine families lets you provide higher uptime guarantees for stateful OpenShift workloads, which are common in enterprise application portfolios.</span></p>
<p><span style="vertical-align: baseline;">And when you deploy OpenShift workloads on Google Cloud, you can count on access to a deep bench of L3/ L4 engineers who are experts in the OpenShift runtime core components (in Kubernetes) given Google’s staff strong participation as core maintainers and technical leads in Kubernetes, providing you with enterprise-grade support and coverage for mission-critical workloads.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', 'Try Google Cloud for free'), ('body', <wagtail.rich_text.RichText object at 0x3e712c735100>), ('btn_text', 'Get started for free'), ('href', 'https://console.cloud.google.com/freetrial?redirectPath=/welcome'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">NetApp Volumes storage comes to OpenShift on Google Cloud</strong></h3>
<p><span style="vertical-align: baseline;">When you deploy OpenShift workloads on Google Cloud, there’s a wide array of options for modernizing your operations, with OpenShift-native integrations into managed infrastructure services across compute, networking, storage, monitoring/logging, secrets/encryption, serverless, CI/CD, etc.</span></p>
<p><span style="vertical-align: baseline;">These managed infrastructure services give you the ability to “carry much fewer pagers” than you typically would with an on-prem OpenShift deployment. However, sometimes you are migrating applications that have requirements or dependencies on specific solutions for infrastructure pillars (such as storage). The typical approach is to rely on self-managing the architecture — and going back to carrying pagers…</span></p>
<p><span style="vertical-align: baseline;">With support for Google Cloud NetApp Volumes in OpenShift, you benefit from the best of both worlds for your file storage needs: the modernization, toil-reduction, and efficiency benefits of a managed service, with enterprise-ready features, compatibility, and familiarity of NetApp on-premises storage.</span></p>
<p><span style="vertical-align: baseline;">You can maximize data performance and reliability for your Red Hat OpenShift workloads on Google Cloud by leveraging high-performance file storage on Google Cloud infrastructure while using NetApp Volumes features like automated snapshots, and Red Hat OpenShift-native persistent storage integration helps ensure high availability and fault tolerance across your workloads. </span></p>
<h3><strong style="vertical-align: baseline;">Streamlined deployment for a variety of workloads</strong></h3>
<p><span style="vertical-align: baseline;">Collaboration between Google Cloud, NetApp and Red Hat makes it easier to quickly configure and deploy Red Hat OpenShift clusters and workloads in Google Cloud with NetApp Volumes for file storage, with streamlined access to Google Cloud IAM, service account management, and the Certificate Authority Service, among others. NetApp Volumes provides as small as 1 GiB volumes, read-write many (RWX) PVs, low-latency performance and up to 12.5 GiB/sec throughput with large volumes, all while protecting your applications and data with customer managed encryption keys (CMEK). </span></p>
<p style="padding-left: 40px;"><span style="font-style: italic; vertical-align: baseline;">“Google Cloud is heavily invested in our partner community with the common goal of providing a world-class experience for our customers. Building on our long-standing technical collaborations with industry-leading partners like Red Hat and NetApp, we are deeply aligned on the core principles of both openness and reliability to help enterprise customers get what they need done. Our customers are increasingly turning to us to help them transform their business and together, and through a joint partnership with NetApp and Red Hat, we can help customers with a new way to cloud, while leveraging familiarity and consistency that brings together innovations across their business.” </span><span style="vertical-align: baseline;">- Stephen Orban, Google Cloud VP of Migrations, ISVs, and Marketplace </span></p>
<h3><strong style="vertical-align: baseline;">D</strong><strong style="vertical-align: baseline;">e-risk cloud adoption while accelerating time to value</strong></h3>
<p><span style="vertical-align: baseline;">We are partnering with one of our trusted resellers,</span> <a href="https://convergetp.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Converge Technology Solutions</span></a><span style="vertical-align: baseline;">, to bridge on-premises, multi-cloud, and Google Cloud, and provide unified management and operational experience for your Red Hat OpenShift and NetApp workloads. Converge’s expertise in Red Hat OpenShift and NetApp technologies helps ensure solutions are architected for peak performance, scalability, and reliability. You can take advantage of their deep understanding of hybrid cloud and Kubernetes/Red Hat OpenShift to provide a smooth transition to Google Cloud, minimizing disruption and maximizing uptime. At the same time, Converge’s best-practice-aligned methodologies help streamline Google Cloud deployments while integrating Red Hat OpenShift and NetApp Volumes, so you can run persistent containerized workloads on an enterprise-class hybrid cloud environment.</span></p>
<p style="padding-left: 40px;"><span style="font-style: italic; vertical-align: baseline;">"Converge is thrilled to partner with Google Cloud, Red Hat, and NetApp to deliver this powerful new solution for OpenShift on Google Cloud. Our deep expertise in hybrid cloud and Kubernetes, coupled with our proven methodologies, ensures a seamless transition and rapid time-to-value for clients adopting this innovative offering. This collaboration empowers enterprises to modernize their operations, optimize their infrastructure, and unlock the full potential of containerized workloads in a secure and reliable hybrid cloud environment." </span><span style="vertical-align: baseline;">- David Luftig, Executive Vice President Strategy and Solutions, Converge Technology Solutions</span></p>
<h3><strong style="vertical-align: baseline;">Connect with us to learn more</strong></h3>
<p><span style="vertical-align: baseline;">Are you ready to get more value from the cloud? Learn more about </span><a href="https://console.cloud.google.com/marketplace/browse?filter=partner:Red%20Hat"><span style="text-decoration: underline; vertical-align: baseline;">Red Hat solutions on Google Cloud Marketplace</span></a><span style="vertical-align: baseline;">, and </span><a href="https://cloud.google.com/netapp-volumes?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud NetApp Volumes</span></a><span style="vertical-align: baseline;">. You can also contact your Red Hat, NetApp, or </span><a href="https://cloud.google.com/contact/?utm_source=google&utm_medium=cpc&utm_campaign=na-none-all-en-dr-sitelink-all-all-trial-b-gcp-1707554&utm_content=text-ad-none-any-DEV_c-CRE_665735422256-ADGP_Hybrid%20%7C%20BKWS%20-%20MIX%20%7C%20Txt-Management%20Tools-Cloud%20Console-KWID_43700078974892313-kwd-296393718382-userloc_9032024&utm_term=KW_google%20cloud%20console-ST_google%20cloud%20console-NET_g-&gad_source=1&gclid=CjwKCAjw48-vBhBbEiwAzqrZVBb-QVEQkCMsFolm9kdz54UTJFnPKTW2Q9Z7ruunhFcf6p7sU9z1-BoChyYQAvD_BwE&gclsrc=aw.ds"><span style="text-decoration: underline; vertical-align: baseline;">Google sales</span></a><span style="vertical-align: baseline;"> representatives, or reach out to </span><a href="https://convergetp.com/contact-us/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Converge Technology specialists</span></a><span style="vertical-align: baseline;"> to discuss your specific use cases. </span></p>
<p><strong style="vertical-align: baseline;">About NetApp<br/></strong><a href="https://www.netapp.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">NetApp</span></a><span style="vertical-align: baseline;"> is the intelligent data infrastructure company, combining unified data storage, integrated data services, and CloudOps solutions to turn a world of disruption into opportunity for every customer. NetApp creates silo-free infrastructure, harnessing observability and AI to enable the industry’s best data management. As the only enterprise-grade storage service natively embedded in the world’s biggest clouds, our data storage delivers seamless flexibility. In addition, our data services create a data advantage through superior cyber resilience, governance, and application agility. Our CloudOps solutions provide continuous optimization of performance and efficiency through observability and AI. No matter the data type, workload, or environment, with NetApp you can transform your data infrastructure to realize your business possibilities.</span></p>
<p><strong style="vertical-align: baseline;">About Red Hat, Inc. <br/></strong><span><a href="https://www.redhat.com/en" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Red Hat</span></a><span style="vertical-align: baseline;"> is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver reliable and high-performing Linux, hybrid cloud, container, and Kubernetes technologies. Red Hat helps customers integrate new and existing IT applications, develop cloud-native applications, standardize on our industry-leading operating system, and automate, secure, and manage complex environments. </span><a href="https://access.redhat.com/recognition" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Award-winning</span></a><span style="text-decoration: underline; vertical-align: baseline;"> </span><span style="vertical-align: baseline;">support, training, and consulting services make Red Hat a </span><a href="https://www.redhat.com/en/about/company" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">trusted adviser to the Fortune 500</span></a><span style="vertical-align: baseline;">. As a strategic partner to cloud providers, system integrators, application vendors, customers, and open source communities, Red Hat can help organizations prepare for the digital future. </span></span></p>
<p><strong style="vertical-align: baseline;">About Converge <br/></strong><a href="https://convergetp.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Converge Technology Solutions</span></a><span style="vertical-align: baseline;"> Corp. is a services-led, software-enabled, IT & Cloud Solutions provider focused on delivering industry-leading solutions. Converge’s global approach delivers advanced analytics, artificial intelligence (AI), application modernization, cloud platforms, cybersecurity, digital infrastructure, and digital workplace offerings to clients across various industries. The Company supports these solutions with advisory, implementation, and managed services expertise across all major IT vendors in the marketplace. This multi-faceted approach enables Converge to address the unique business and technology requirements for all clients in the public and private sectors. For more information, visit convergetp.com.</span></p></div>Tue, 19 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/topics/partners/netapp-volumes-now-available-for-openshift-on-google-cloud/Containers & KubernetesPartnersarticleGoogle Cloud NetApp Volumes now available for OpenShift on Google CloudGooglehttps://cloud.google.com/blog/topics/partners/netapp-volumes-now-available-for-openshift-on-google-cloud/Pablo OsinagaProduct ManagerSean DerringtonGroup Product Manager, StorageRealizing AI's Full Potential where Workforce, Security, & Collaboration Matterhttps://cloud.google.com/blog/topics/public-sector/realizing-ais-full-potential-where-workforce-security-collaboration-matter/<div class="block-paragraph"><p data-block-key="pmwmd">AI is rapidly reshaping the public sector, ushering in a new era of intelligent and AI-powered service delivery and mission impact. Chief AI Officers (CAIOs) and other agency leaders play a critical role as AI becomes more pervasive. At Google, we've long believed that AI is a foundational and transformational technology, with the potential to benefit people and society. Realizing its full potential to improve government services, enhance decision-making, and ultimately create a more efficient and effective public sector requires leadership and a clear commitment.</p><p data-block-key="6o8j0">Google recently commissioned IDC to conduct a study that surveyed 161 federal CAIOs, government AI leaders and other decision makers to understand how agency leaders are leading in this new AI era – and the value they are already bringing when it comes to AI governance, collaboration, and building public trust and citizen engagement¹. I recently sat down with Ruthbea Yesner, Vice-President of IDC Government Insights to explore the key findings of this research and what it means for the public sector - see excerpts of our discussion and key insights below.</p><h3 data-block-key="brqaa"><b>Key Finding #1: 62% of those surveyed say strengthening cybersecurity is a top motivator for AI investments</b></h3><p data-block-key="bo2io">Agencies are embracing AI to enhance cybersecurity and protect critical infrastructure - with 60% of respondents indicating that internal cybersecurity protection is their top AI/ML use case. Over 40% of federal agencies surveyed state that protecting critical infrastructure is a key driver for their AI investments going forward. Additionally, respondents believe that applying AI to strengthen cybersecurity and protect critical infrastructure will deliver positive outcomes in just 9 months; the second fastest time to value of any expected outcome of AI.</p><p data-block-key="1ogg8">CAIOs and other agency leaders play a crucial role in driving AI adoption and ensuring that agencies are able to leverage this powerful technology. While 50% of federal agencies have already appointed a CAIO, the rest are expected to follow soon. As adoption accelerates and AI maturity grows, CAIOs need to prioritize robust cybersecurity measures and risk mitigation strategies in all AI initiatives, ensuring the protection of sensitive data and systems.</p><h3 data-block-key="55h7h"><b>Key Finding #2: Higher AI maturity increases likelihood to explore other Gen AI use cases by 4x</b></h3><p data-block-key="adeg9">IDC created a 5-phase approach to assessing AI maturity and the findings are remarkable - 50% of agencies surveyed reported high levels of AI maturity which corresponds to mature behaviors like piloting and implementing generative AI use cases to drive innovation and mission impact. Mature AI agencies are embracing an innovation culture and are focused on AI use cases and projects with high potential for impact.</p><p data-block-key="3dntd">We’re seeing some agencies solving for one specific problem or use case and creating quick wins and the appetite to do more, and in other cases, they are tackling big, complex challenges head-on. By adopting an AI-first mindset, incorporating AI into their workflows and scaling their use of AI, they are creating the groundswell to do more. This has a compounding effect as AI becomes more pervasive across the agency, and individuals increasingly feel part of its positive cultural change and impact.</p></div>
<div class="block-pull_quote"><div class="uni-pull-quote h-c-page">
<section class="h-c-grid">
<div class="uni-pull-quote__wrapper h-c-grid__col h-c-grid__col--8 h-c-grid__col-m--6 h-c-grid__col-l--6
h-c-grid__col--offset-2 h-c-grid__col-m--offset-3 h-c-grid__col-l--offset-3">
<div class="uni-pull-quote__inner-wrapper h-c-copy h-c-copy">
<q class="uni-pull-quote__text">This has a catalyst effect, it just takes one person doing something amazing with AI to motivate others to learn and apply AI</q>
<cite class="uni-pull-quote__author">
<span class="uni-pull-quote__author-meta">
<strong class="h-u-font-weight-medium">Ruthbea Yesner</strong><br />
Vice-President of IDC Government Insights
</span>
</cite>
</div>
</div>
</section>
</div>
</div>
<div class="block-paragraph"><p data-block-key="pmwmd">Generative AI is the future - attracting 42% of AI investments. Agencies are eager to explore its potential - and innovation will be a key motivator for continued AI investment going forward. As organizations prioritize AI, the CAIO role becomes even more multifaceted, demanding not just technical expertise but also visionary leadership to drive organizational culture change and develop a truly AI-enabled workforce.</p><p data-block-key="2nhld">We believe that a robust AI maturity model, as outlined in <a href="https://inthecloud.withgoogle.com/caio-playbook/dl-cd.html" target="_blank">IDC’s The Chief Artificial Intelligence Officer (CAIO) Playbook: A Practical Guide for Advancing AI Innovation in Government</a>, is key to guiding agencies in their adoption of AI and fostering organizational readiness. By providing a clear framework for progress, agencies can strategically navigate the complexities of AI implementation and unlock its full potential.</p><h3 data-block-key="501k5"><b>Key Finding #3: An AI-ready workforce is the key to unlocking AI’s potential</b></h3><p data-block-key="c3jed">The rapid pace of AI adoption has highlighted a significant challenge: a shortage of AI expertise. 39% percent of survey respondents report that their biggest challenge is a lack of in-house AI skills and expertise, and 68% are focused on training and retaining their workforce.</p><p data-block-key="6gtdf">Google is tackling this skills challenge head-on. We recently announced our <a href="https://cloud.google.com/blog/topics/training-certifications/google-cloud-launchpad-veterans-to-cloud-careers?e=48754805">Google Cloud Launchpad for Veterans</a> - a no-cost training and certification journey to equip veterans in all roles and at all levels with the cloud knowledge and skills needed to drive innovation, and contribute to their employer’s digital transformation strategy. And we also announced a new AI training initiative through Google.org’s AI Opportunity Fund – with <a href="https://blog.google/outreach-initiatives/google-org/google-org-public-sector-ai-funding/" target="_blank"><i>$15 million</i> for AI skills training for US government workers</a> for the Partnership for Public Service and InnovateUS. This also includes a grant to the Partnership for Public Service to establish the new Center for Federal AI to provide AI skills and literacy to federal leaders and workers, including 2,000 senior government officials.</p><p data-block-key="a0954">One thing is clear - AI requires leadership, and the CAIO is an important new C-suite role signaling the government’s commitment to harness AI and reach its full potential. CAIOs and other agency leaders are critical to charting this new AI era and providing the expertise and leadership necessary to leverage AI for the public good.</p><p data-block-key="abqvu">To learn more about how CAIOs are leading in this new AI era, download <a href="https://inthecloud.withgoogle.com/caio-playbook/dl-cd.html" target="_blank">The Chief Artificial Intelligence Officer (CAIO) Playbook: A Practical Guide for Advancing AI Innovation in Governmen</a>t. By embracing its recommendations, agencies can create their own roadmap to drive AI adoption to accelerate mission outcomes and impact. To hear the full interview with Ruthbea Yesner, Vice-President of IDC Government Insights, please <a href="https://cloudonair.withgoogle.com/events/public-sector-2024-vod?utm_source=Website&utm_medium=Digital&utm_campaign=Google%2BPublic%2BSector%2BSummit%2B2024" target="_blank"><b>register</b></a> to join the Google Public Sector Summit On-Demand on December 3rd.<br/></p><p data-block-key="8p095"><i><sub>¹ IDC Signature White Paper, The Chief Artificial Intelligence Officer (CAIO) Playbook: A Practical Guide for Advancing AI Innovation in Government, sponsored by Google Public Sector, Doc# US52616824, October 2024.</sub></i></p></div>Tue, 19 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/topics/public-sector/realizing-ais-full-potential-where-workforce-security-collaboration-matter/Public SectorarticleRealizing AI's Full Potential where Workforce, Security, & Collaboration Matterhttps://storage.googleapis.com/gweb-cloudblog-publish/images/Screenshot_2024-11-18_10.53.22_AM.max-600x600.pngGooglehttps://cloud.google.com/blog/topics/public-sector/realizing-ais-full-potential-where-workforce-security-collaboration-matter/Katharyn WhiteDirector of Marketing, Public SectorGoogle CloudThe overwhelmed person’s guide to Google Cloud: week of November 14https://cloud.google.com/blog/products/gcp/the-overwhelmed-persons-guide-to-google-cloud/<div class="block-paragraph"><p data-block-key="2u57j"><i>The content in this blog post was originally published last week as a members-only email to the Google Cloud Innovators community. To get this content directly in your inbox (not to mention</i> <a href="https://cloud.google.com/innovators/?utm_source=cgc-blog&utm_medium=blog&utm_campaign=FY23-Q4-Global-OPGGCWeekly-CGCBlog-EN&utm_content=opggc&utm_term=-"><i>lots of other benefits</i></a><i>),</i> <a href="https://cloud.google.com/innovators/?utm_source=cgc-blog&utm_medium=blog&utm_campaign=FY23-Q4-Global-OPGGCWeekly-CGCBlog-EN&utm_content=opggc&utm_term=-"><i>sign up to be an Innovator today</i></a><i>.</i></p><hr/><p data-block-key="di9mo"></p></div>
<div class="block-paragraph"><h3 data-block-key="a3czl"><b>New and shiny</b></h3><p data-block-key="6s4ls"><i>Three new things to know this week</i></p><ul><li data-block-key="cdnqa"><b>Ground Gemini’s answers with Google Search in Vertex AI and Google AI Studio.</b> There’s brand new support in <a href="https://aistudio.google.com/welcome" target="_blank">Google AI Studio</a> for connecting the Gemini model’s output to verifiable sources of data through Google Search. This functionality is already part of Vertex AI, but both platforms now support <i>dynamic retrieval</i>. This means that grounding only happens if we predict that the query needs it. See <a href="https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/ground-gemini">how it works in Vertex AI</a>, and <a href="https://ai.google.dev/gemini-api/docs/grounding" target="_blank">learn how to get started in Google AI Studio</a>.</li><li data-block-key="evql6"><b>Use Google’s own Arm-based CPUs.</b> The Axion CPU is now ready for you! Get some excellent price performance and better energy efficiency by <a href="https://cloud.google.com/blog/products/compute/try-c4a-the-first-google-axion-processor">deploying C4A VMs</a> powered by our first Arm-based processors.</li><li data-block-key="88s5"><b>Business process automation service gets AI upgrade, increased sophistication.</b> It’s likely flying below your radar, but take a look at Google Cloud Application Integration. You can model out workflows for connecting systems in all sorts of ways. There’s <a href="https://cloud.google.com/application-integration/docs/build-integrations-gemini">now Gemini Code Assist functionality</a> to help you build integrations, model out <a href="https://cloud.google.com/application-integration/docs/configure-data-transformer-script-task">data transformations</a>, create <a href="https://cloud.google.com/application-integration/docs/test-cases">test cases</a>, and even apply <a href="https://cloud.google.com/application-integration/docs/error-handling-strategy">complex retry strategies</a>.</li><li data-block-key="e6rpa"><b>Build and Deploy Gen AI Applications on Google Cloud with Genkit and Go.</b> <a href="https://cloudonair.withgoogle.com/events/build-and-deploy-gen-ai-apps-on-google-cloud-with-genkit-and-go?utm_source=cloud_sfdc&utm_medium=email&utm_campaign=FY24-Q4-global-GOO27246-onlineevent-er-Build-Gen-AI-With-Genkit-and-Go-GLOBAL&utm_content=innovators_newsletter" target="_blank">Join us</a> on November 19th for a hands-on workshop to build and deploy a generative AI app on Google Cloud! Use Genkit, Vertex AI, and Go to create and automate a reusable app deployment pipeline—perfect for beginners and pros alike.</li></ul><hr/><h3 data-block-key="742le"><b>Watch this</b></h3><p data-block-key="f7e37"><b>Learn advanced RAG techniques</b>. Watch this excellent video series to get up to speed on LLM fundamentals. This edition looks at retrieval augmented generation and enhancing quality of responses.</p></div>
<div class="block-video">
<div class="article-module article-video ">
<figure>
<a class="h-c-video h-c-video--marquee"
href="https://youtube.com/watch?v=sGvXO7CVwc0"
data-glue-modal-trigger="uni-modal-sGvXO7CVwc0-"
data-glue-modal-disabled-on-mobile="true">
<div class="article-video__aspect-image"
style="background-image: url(https://storage.googleapis.com/gweb-cloudblog-publish/original_images/Watch_This_Vst77H0.png);">
<span class="h-u-visually-hidden">A captured graphic of a Google Cloud YouTube video where Aja and Jason explain advanced retrieval augmented generation techniques</span>
</div>
<svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white">
<use xlink:href="#mi-youtube-icon"></use>
</svg>
</a>
</figure>
</div>
<div class="h-c-modal--video"
data-glue-modal="uni-modal-sGvXO7CVwc0-"
data-glue-modal-close-label="Close Dialog">
<a class="glue-yt-video"
data-glue-yt-video-autoplay="true"
data-glue-yt-video-height="99%"
data-glue-yt-video-vid="sGvXO7CVwc0"
data-glue-yt-video-width="100%"
href="https://youtube.com/watch?v=sGvXO7CVwc0"
ng-cloak>
</a>
</div>
</div>
<div class="block-paragraph_advanced"><hr/>
<h3 data-draftjs-conductor-fragment='{"blocks":[{"key":"21jp","text":"Community cuts","type":"header-three","depth":0,"inlineStyleRanges":[{"offset":0,"length":14,"style":"BOLD"}],"entityRanges":[],"data":{}}],"entityMap":{}}'>Community cuts</h3></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Updated_CommunityCuts-header_v5.max-1000x1000.png"
alt="Community Cuts">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="h9w35"><i>Every week I round up some of my favorite links from builders around the Google Cloud-iverse. Want to see your blog or video in the next issue?</i> <a href="mailto:[email protected]"><i>Drop Richard a line!</i></a></p><ul><li data-block-key="d40qc"><b>“Perfect” is a strong word, but yeah, we’re pretty good.</b> Simon at SADA <a href="https://sada.com/blog/why-google-cloud-is-perfect-for-genai-developers/" target="_blank">makes the case</a> that you should be looking at Google Cloud for your AI work because of our models, security posture, tools, and expertise. Who am I to disagree?</li><li data-block-key="7sdng"><b>Build an ETL pipeline locally and then transition to the cloud.</b> <a href="https://medium.com/@thomas.sigmund/data-pipelines-transitioning-from-local-environments-to-the-cloud-with-gcp-8ca5d9293b25" target="_blank">Thomas looks at</a> the exercise of taking a working data pipeline and using our data and compute services to get it running successfully in Google Cloud.</li><li data-block-key="dou96"><b>What exactly is Firebase?</b> It’s been part of Google for a while and is used by mobile devs around the world. Now it’s appealing to new audiences, and this post from <a href="https://medium.com/@hemantkumarceo001/understanding-google-firebase-everything-you-need-to-know-3a443650aaa3" target="_blank">Hermant explains</a> what it offers to modern developers.</li></ul><hr/><p data-block-key="eh7ss"></p></div>
<div class="block-paragraph_advanced"><div data-draftjs-conductor-fragment='{"blocks":[{"key":"12an8","text":"Learn and grow","type":"header-three","depth":0,"inlineStyleRanges":[{"offset":0,"length":14,"style":"BOLD"}],"entityRanges":[],"data":{}},{"key":"2ferk","text":"Three ways to build your cloud muscles this week","type":"unstyled","depth":0,"inlineStyleRanges":[{"offset":0,"length":48,"style":"ITALIC"}],"entityRanges":[],"data":{}},{"key":"fs3o9","text":"Test-driven development with Ruby and Gemini Code Assist. This codelab offers a useful look at how to do iterative test-driven development, and then deploy an app to Cloud Functions.","type":"unordered-list-item","depth":0,"inlineStyleRanges":[{"offset":0,"length":56,"style":"BOLD"}],"entityRanges":[{"offset":58,"length":12,"key":0}],"data":{}},{"key":"9mns8","text":"What can you use to build generative AI apps. This one is from me. I wanted to learn more about the options for building generative AI apps using native APIs, meta frameworks, and low-code tools. ","type":"unordered-list-item","depth":0,"inlineStyleRanges":[{"offset":0,"length":44,"style":"BOLD"}],"entityRanges":[{"offset":67,"length":22,"key":1}],"data":{}},{"key":"ejv3j","text":"Let’s optimize our continuous integration. Darren wrote up a great post that explores a handful of ways to tune your CI pipelines. Even if you don’t use Cloud Build, you’ll take something useful away from this. ","type":"unordered-list-item","depth":0,"inlineStyleRanges":[{"offset":0,"length":41,"style":"BOLD"}],"entityRanges":[{"offset":43,"length":28,"key":2}],"data":{}},{"key":"soj8","text":"New Skills Boost training for learning advanced GKE topics. We just published four new courses for learners ready to take the next step with GKE:","type":"unordered-list-item","depth":0,"inlineStyleRanges":[{"offset":0,"length":60,"style":"BOLD"}],"entityRanges":[],"data":{}},{"key":"bc8a4","text":"Manage Multi-cluster Workloads at Scale with GKE Fleets and Teams","type":"unordered-list-item","depth":0,"inlineStyleRanges":[],"entityRanges":[{"offset":0,"length":65,"key":3}],"data":{}},{"key":"40s8b","text":"Automate GKE Configurations with Config Sync and Policy Controller","type":"unordered-list-item","depth":0,"inlineStyleRanges":[],"entityRanges":[{"offset":0,"length":66,"key":4}],"data":{}},{"key":"8n4uo","text":"Manage and Secure Distributed Services with GKE Managed Service Mesh ","type":"unordered-list-item","depth":0,"inlineStyleRanges":[],"entityRanges":[{"offset":0,"length":68,"key":5}],"data":{}},{"key":"bhurc","text":"Deploying a Multi-Cluster Gateway Across GKE Clusters","type":"unordered-list-item","depth":0,"inlineStyleRanges":[],"entityRanges":[{"offset":0,"length":53,"key":6}],"data":{}},{"key":"9448n","text":" ","type":"atomic","depth":0,"inlineStyleRanges":[],"entityRanges":[{"offset":0,"length":1,"key":7}],"data":{}},{"key":"ff8qh","text":"One more thing","type":"header-three","depth":0,"inlineStyleRanges":[{"offset":0,"length":14,"style":"BOLD"}],"entityRanges":[],"data":{}}],"entityMap":{"0":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://codelabs.developers.google.com/tdd-ruby-app-duetai?hl=en#0"}},"1":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://seroter.com/2024/04/30/heres-what-id-use-to-build-a-generative-ai-application-in-2024/"}},"2":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://medium.com/google-cloud/optimizing-ci-in-google-cloud-build-1ae2562ccaa1"}},"3":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://www.cloudskillsboost.google/focuses/89749?parent=catalog"}},"4":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://www.cloudskillsboost.google/focuses/89750?parent=catalog"}},"5":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://www.cloudskillsboost.google/focuses/89751?parent=catalog"}},"6":{"type":"LINK","mutability":"MUTABLE","data":{"url":"https://www.cloudskillsboost.google/focuses/89752?parent=catalog"}},"7":{"type":"HORIZONTAL_RULE","mutability":"IMMUTABLE","data":{}}}}'>
<h3 class="public-DraftStyleDefault-block public-DraftStyleDefault-ltr" data-offset-key="atitm-0-0"><span data-offset-key="atitm-0-0">Learn and grow</span></h3>
<div class="Draftail-block--unstyled" data-block="true" data-editor="d7ej8" data-offset-key="93ovk-0-0">
<div class="public-DraftStyleDefault-block public-DraftStyleDefault-ltr" data-offset-key="93ovk-0-0"><span data-offset-key="93ovk-0-0">Three ways to build your cloud muscles this week</span></div>
<div class="public-DraftStyleDefault-block public-DraftStyleDefault-ltr" data-offset-key="93ovk-0-0">
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">In-person Workshop! AI In Action - AlloyDB and Vertex AI Agent Builder. </strong><span style="vertical-align: baseline;">Level up your AI skills with a hands-on journey in building knowledge-driven chat applications! Dive into AlloyDB and Vertex AI Agent Builder to create intelligent, interactive customer solutions. Sign up now! </span><a href="https://rsvp.withgoogle.com/events/ai-in-action-new_york" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">New York - 11/20</span></a><span style="vertical-align: baseline;">, </span><a href="https://rsvp.withgoogle.com/events/ai-in-action-toronto" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Toronto - 11/22</span></a><span style="vertical-align: baseline;">, </span><a href="https://rsvp.withgoogle.com/events/ai-in-action-bay_area" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Bay Area - 12/3</span></a><span style="vertical-align: baseline;">, </span><a href="https://rsvp.withgoogle.com/events/ai-in-action-seattle" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Seattle - 12/5</span></a><span style="vertical-align: baseline;">. </span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Let’s modernize our old apps. </strong><span style="vertical-align: baseline;">Here’s a </span><a href="https://codelabs.developers.google.com/codelabs/app-mod-workshop?hl=en#0" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">great code lab</span></a><span style="vertical-align: baseline;"> that walks you through, step-by-step, the process to modernize an old PHP app. Learn what it takes to containerize the app, automate the path to production, add generative AI features, and introduce modern operations.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Running Apache Airflow? You have choices. </strong><span style="vertical-align: baseline;">If you’re orchestrating data, there’s a good chance you’ve come across Apache Airflow. </span><a href="https://cloud.google.com/blog/products/data-analytics/different-ways-to-run-apache-airflow-on-google-cloud"><span style="text-decoration: underline; vertical-align: baseline;">This post points out</span></a><span style="vertical-align: baseline;"> that you can run it yourself on VMs, use a more managed GKE environment, or embrace a fully managed service with Cloud Composer.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Cloud Workstations is ready for devs in government. </strong><span style="vertical-align: baseline;">You deserve nice things, wherever you may work. Those in restricted environments sometimes have to settle for less. But now, </span><a href="https://cloud.google.com/blog/topics/public-sector/fedramp-high-development-in-the-cloud-code-with-cloud-workstations"><span style="text-decoration: underline; vertical-align: baseline;">Cloud Workstations is FedRAMP High Authorized</span></a><span style="vertical-align: baseline;">. Curious about Cloud Workstations? Stanal has a good new </span><a href="https://medium.com/google-cloud/what-is-gcp-cloud-workstation-df144ff86584" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">overview post</span></a><span style="vertical-align: baseline;">.</span></li>
<li><strong style="vertical-align: baseline;">Standard storage format, and all the BigQuery goodness. </strong><span style="vertical-align: baseline;">We just shipped a </span><a href="https://cloud.google.com/blog/products/data-analytics/announcing-bigquery-tables-for-apache-iceberg"><span style="text-decoration: underline; vertical-align: baseline;">preview of BigQuery tables for Apache Iceberg</span></a><span style="vertical-align: baseline;">. Use this open format to store data, but get all the lakehouse goodness that BigQuery offers.</span></li>
</ul>
</div>
</div>
<figure class="Draftail-block--atomic" contenteditable="false" data-block="true" data-editor="d7ej8" data-offset-key="qahn2-0-0"></figure>
<hr/>
<h3 class="public-DraftStyleDefault-block public-DraftStyleDefault-ltr" data-offset-key="ati7h-0-0"><span data-offset-key="ati7h-0-0">One more thing</span></h3>
</div></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--medium
h-c-grid__col
h-c-grid__col--4 h-c-grid__col--offset-4
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/One_More_Thing.max-1000x1000.png"
alt="222 One More Thing..">
</a>
<figcaption class="article-image__caption "><p data-block-key="s5t1j"><a href="https://x.com/JeffDean/status/1851471466543620513">https://x.com/JeffDean/status/1851471466543620513</a></p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="w6t8h"><b>Gemini is used across Google to create helpful AI experiences.</b> Jeff highlights Sundar’s message about the billions of Google users with access to Gemini.</p><hr/><p data-block-key="6lron"><a href="https://cloud.google.com/innovators/?utm_source=cgc-blog&utm_medium=blog&utm_campaign=FY23-Q4-Global-OPGGCWeekly-CGCBlog-EN&utm_content=opggc&utm_term=-"><i>Become an Innovator</i></a><i> to stay up-to-date on the latest news, product updates, events, and learning opportunities with Google Cloud.</i></p></div>Tue, 19 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/gcp/the-overwhelmed-persons-guide-to-google-cloud/Application ModernizationApplication DevelopmentGoogle CloudarticleThe overwhelmed person’s guide to Google Cloud: week of November 14https://storage.googleapis.com/gweb-cloudblog-publish/images/overwhelmed_persons_guide.max-600x600.pngGooglehttps://cloud.google.com/blog/products/gcp/the-overwhelmed-persons-guide-to-google-cloud/Richard SeroterChief Evangelist, Google CloudNew ways to protect your sensitive data with Chrome Enterprisehttps://cloud.google.com/blog/products/chrome-enterprise/new-ways-to-protect-your-sensitive-data-with-chrome-enterprise/<div class="block-paragraph"><p data-block-key="jy5m0">Protecting sensitive company data is no longer just a best practice—it’s business critical. In today's world, data breaches can have serious consequences, from financial losses and reputational damage to legal repercussions and operational disruptions. That’s why <a href="https://chromeenterprise.google/products/chrome-enterprise-premium/" target="_blank">Chrome Enterprise Premium</a>, our advanced secure enterprise browser offering, includes a growing suite of <b>Data Loss Prevention</b> (DLP) capabilities to help organizations safeguard their sensitive information and maintain compliance.</p><p data-block-key="b4j91">We <a href="https://cloud.google.com/blog/products/chrome-enterprise/end-user-computing-thats-made-for-today-and-ready-for-whats-next?e=48754805">recently launched</a> a number of enhancements to our DLP capabilities, giving you even more granular control over your company's data. This blog post will explore how these new capabilities support your organization’s comprehensive DLP journey—from <b>discovering</b> potential risks and user behavior, to <b>controlling</b> data flow with robust security measures, to <b>investigating</b> potential incidents with detailed reporting and analysis, and finally, to <b>expanding</b> protection beyond desktops.</p><p data-block-key="9olhe"><b>Discover and understand user behavior</b></p><p data-block-key="8q9ih">Understanding how your users interact with data is the first step in preventing data leaks. Chrome Enterprise provides powerful tools to gain visibility into user activity and to identify potential risks:</p><p data-block-key="1pj8c"><b>1. Chrome Security Insights</b></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/Chrome_Security_Insights.max-1000x1000.png"
alt="Chrome Security Insights">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="hguig">Chrome Security Insights empowers <a href="https://chromeenterprise.google/products/chrome-enterprise-premium/" target="_blank">Chrome Enterprise</a> customers to proactively identify potential threats with simplified security monitoring. This feature monitors key security configurations, security event logging, and 50 common DLP detectors with just a few clicks. Administrators gain valuable insights into high-risk activities through detailed reports on users, domains, and sensitive data transfers, enabling swift identification and remediation of security concerns. Start your 30-day Chrome Enterprise Premium trial and enable Chrome Security Insights <a href="https://support.google.com/chrome/a/answer/14804659?hl=en" target="_blank">here</a>.</p><p data-block-key="fpbpu"><b>2. URL Filtering Audit Mode</b> [Currently in Public Preview (beta), general availability coming soon]</p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/2_3.png"
alt="4 (1)">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="hguig">Chrome Enterprise Premium’s URL Filtering Audit Mode offers a valuable tool for organizations seeking to refine their web access policies. It allows administrators to selectively activate monitoring of employee browsing activity without enforcing restrictions, providing insights into users behavior and potential security risks. By analyzing this data, IT and security teams can make informed decisions regarding URL filtering rules, striking an effective balance between security and user productivity. See <a href="https://support.google.com/a/answer/10104358?hl=en#zippy=%2Cexample-audit-chrome-navigations-to-urls-that-match-a-regular-expression-beta" target="_blank">here</a> to learn how to configure URL Filtering Audit Mode.</p><p data-block-key="1nq5i"><b>Enforce DLP controls</b></p><p data-block-key="5hrne">Once you understand your users' behavior, it's time to put the right controls in place to prevent data leaks. Chrome Enterprise offers a robust set of in-browser protections.</p><p data-block-key="13tgn"><b>1. Copy and paste protections</b></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/3_1_OXyOiNZ.max-1000x1000.png"
alt="3 (1)">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="hguig">Controlling how users interact with sensitive data is crucial. Chrome Enterprise Premium's <a href="https://support.google.com/chrome/a/answer/15450083#zippy=%2Crestricted-sources-for-pasting%2Crestricted-destinations-for-pasting%2Cscreenshot-prevention" target="_blank">copy and paste protections</a> allow you to restrict or block users from copying sensitive information from web pages or pasting it into unauthorized applications or websites. This granular control helps prevent data exfiltration and ensures that sensitive information stays within designated boundaries, reducing the risk of data breaches and helping with compliance with data protection regulations. The copy and paste protections include:</p><ul><li data-block-key="2mjp0"><b>Preventing data leakage to Incognito mode:</b> Concerned about sensitive data being copied into incognito mode, where it can potentially bypass security measures? Chrome Enterprise Premium now allows you to block or warn users when they attempt to copy data between regular browsing sessions and incognito windows.</li><li data-block-key="39gde"><b>Controlling data sharing between applications:</b> For organizations looking to prevent data leakage to external applications, Chrome Enterprise Premium now allows you to block or warn users when they attempt to copy data from your web applications into external programs like Notepad, Microsoft Word, or other apps.</li><li data-block-key="7drjo"><b>Isolating data between Chrome profiles:</b> Shared devices or users with multiple Chrome profiles can introduce risks of data cross-contamination. Chrome Enterprise Premium’s new copy-paste controls now allow you to block or warn users when they attempt to move data between different profiles.</li><li data-block-key="1a1h6"><b>Securing sensitive emails</b>: Emails often contain highly confidential information requiring stringent protection. With Chrome Enterprise Premium, you can implement specific rules, such as blocking any copying from Gmail unless it’s being pasted back into Gmail.</li></ul><p data-block-key="21c30">See more details about setting up copy and paste protections <a href="https://support.google.com/chrome/a/answer/15450083#zippy=%2Crestricted-sources-for-pasting%2Crestricted-destinations-for-pasting%2Cscreenshot-prevention" target="_blank">here</a>.</p><p data-block-key="2b3tg"><b>2. Watermarking</b></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_2.max-1000x1000.png"
alt="4 (2)">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="hguig">Watermarking acts as a deterrent to unauthorized data sharing. Chrome Enterprise Premium allows you to apply visible watermarks to sensitive company documents viewed in the browser, displaying information like the user's email address, date, or a custom message. This helps discourage data exfiltration and makes it easier to trace the source of any leaked information. See <a href="https://support.google.com/chrome/a/answer/14792155?visit_id=638618595108193616-1174257236&p=watermark&rd=1" target="_blank">here</a> on how to set up watermarking with Chrome Enterprise Premium</p><p data-block-key="8rjon"><b>3. Screenshot protections</b></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/5_g61QgbO.gif"
alt="5">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="hguig">Screenshots can be a convenient way to capture information, but they also pose a data leak risk. Chrome Enterprise Premium's screenshot protection allows you to prevent users from taking screenshots of sensitive content within the browser. This adds another layer of protection to your DLP strategy, limiting the potential for unauthorized data capture. Learn how to set up screenshot protection rules <a href="https://support.google.com/a/answer/10104358?visit_id=638670424982161017-1017088533&p=dlp-block-screenshot&rd=1#dlp-block-screenshot&zippy=%2Cexample-block-screenshots-and-screen-sharing-from-a-specific-website" target="_blank">here</a>.</p><p data-block-key="ed2u6">These controls work together to create a comprehensive security strategy, limiting the ways in which data can be exfiltrated from your organization.</p><p data-block-key="b5j38"><b>Investigate potential data leaks</b></p><p data-block-key="ev1hh">Even with the best preventative measures in place, it's crucial to be prepared to investigate potential security incidents. Chrome Enterprise provides tools to help you quickly identify and address threats:</p><p data-block-key="35et7"><b>1. Evidence Locker</b> [Currently in Private Preview, general availability coming soon]</p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/6_1.max-1000x1000.png"
alt="6 (1)">
</a>
</figure>
</div>
</div>
</div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/7_U4sqpyY.max-1000x1000.png"
alt="7">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="hguig">The evidence locker provides a secure repository for storing files and data that require further investigation by security teams. For instance, if an employee attempts to download a non-public financial report, Chrome Enterprise Premium can block the action and retain a copy of the file in the evidence locker. This triggers a detailed report for IT and security teams, enabling them to take appropriate investigation and remediation steps. Stay tuned for more information on the upcoming release of Evidence Locker.</p><p data-block-key="ecul1"><b>2. Chrome Extension Telemetry in Google Security Operations</b></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/8_kT4Rak9.max-1000x1000.png"
alt="8">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="7njvf">Chrome Enterprise Core integrates with <a href="https://cloud.google.com/security/products/security-operations?e=48754805">Google Security Operations</a>, our cloud-native security analytics platform, to provide deeper visibility into browser activity. Previously, detection and response teams were limited to analyzing static extension attributes. Now, you can set dynamic rules that continuously monitor extension behavior in your production environment, enabling proactive identification and remediation of risks before they escalate into threats. For example, you can monitor if extensions are unexpectedly contacting remote hosts or accessing cookies. This enhanced visibility empowers your security team to detect and mitigate data theft and infrastructure attacks in near real-time, significantly reducing your organization's vulnerability to malicious extensions. See how to set this up <a href="https://support.google.com/a/answer/9393909?hl=en" target="_blank">here</a>.</p><p data-block-key="9d9np"><b>Expand protection to other platforms</b></p><p data-block-key="8lpp2">Chrome Enterprise is committed to extending its threat protection capabilities beyond the desktop.</p><p data-block-key="epa35"><b>1. Mobile threat protections</b></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/9_3.max-1000x1000.png"
alt="9 (3)">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph"><p data-block-key="7njvf">With the growing use of mobile devices for work, securing the browser on these devices is essential. <a href="https://chromeenterprise.google/products/cloud-management/" target="_blank">Chrome Enterprise Core</a> is extending its threat protection capabilities to Android devices with <a href="https://chromeenterprise.google/policies/#DownloadRestrictions" target="_blank">download blocking</a>. This feature will allow organizations to set policies to prevent users from downloading malicious files flagged by Google Safe Browsing from the web onto their mobile devices, bringing threat protections beyond desktops. Organizations can also choose to block all downloads on Android on managed Chrome. <a href="https://chromeenterprise.google/products/cloud-management/" target="_blank">Get started</a> with Chrome Enterprise Core today at no additional costs.</p><p data-block-key="6cs5v"><b>Chrome Enterprise Premium: Your partner in DLP</b></p><p data-block-key="9uq7j">These features are just a glimpse into the comprehensive DLP capabilities offered by Chrome Enterprise. We are consistently enhancing our security capabilities to help organizations like yours take a proactive approach to data loss prevention, safeguarding sensitive information at the critical browser layer and ensuring compliance in today's increasingly complex digital landscape.</p><p data-block-key="fftig"><a href="https://chromeenterprise.google/products/cloud-management/" target="_blank">Start using Chrome Enterprise Core</a> today at no additional cost to gain foundational security capabilities. Or, experience Chrome Enterprise Premium’s advanced security and DLP features with a free 60-day trial and enable Chrome Security Insights <a href="https://support.google.com/chrome/a/answer/14804659?hl=en" target="_blank">here</a>.</p></div>Tue, 19 Nov 2024 15:30:00 +0000https://cloud.google.com/blog/products/chrome-enterprise/new-ways-to-protect-your-sensitive-data-with-chrome-enterprise/Chrome EnterprisearticleNew ways to protect your sensitive data with Chrome Enterprisehttps://storage.googleapis.com/gweb-cloudblog-publish/images/23261_DLP_Deep_Dive_announcement__Blog_Heade.max-600x600.pngGooglehttps://cloud.google.com/blog/products/chrome-enterprise/new-ways-to-protect-your-sensitive-data-with-chrome-enterprise/Niamh CunninghamSenior Product ManagerTanisha RaiProduct ManagerEmpowering Gemini for Malware Analysis with Code Interpreter and Google Threat Intelligencehttps://cloud.google.com/blog/topics/threat-intelligence/gemini-malware-analysis-code-interpreter-threat-intelligence/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;"><span style="vertical-align: baseline;">One of Google Cloud's major missions is to arm security professionals with modern tools to help them defend against the latest threats. Part of that mission involves moving closer to a more autonomous, adaptive approach in threat intelligence automation.</span></span></p>
<p><span style="vertical-align: baseline;">In our latest advancements in malware analysis, we’re equipping Gemini with new capabilities to address obfuscation techniques and obtain real-time insights on indicators of compromise (IOCs). By integrating the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/extensions/code-interpreter"><span style="text-decoration: underline; vertical-align: baseline;">Code Interpreter extension</span></a><span style="vertical-align: baseline;">, Gemini can now dynamically create and execute code to help deobfuscate specific strings or code sections, while </span><a href="https://cloud.google.com/security/products/threat-intelligence"><span style="text-decoration: underline; vertical-align: baseline;">Google Threat Intelligence</span></a><span style="vertical-align: baseline;"> (GTI) </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/multimodal/function-calling"><span style="text-decoration: underline; vertical-align: baseline;">function calling</span></a><span style="vertical-align: baseline;"> enables it to query GTI for additional context on URLs, IPs, and domains found within malware samples. These tools are a step toward transforming Gemini into a more adaptive agent for malware analysis, enhancing its ability to interpret obfuscated elements and gather contextual information based on the unique characteristics of each sample.</span></p>
<p><span style="vertical-align: baseline;">Building on this foundation, we previously explored critical preparatory steps with Gemini 1.5 Pro, leveraging its expansive 2-million-token input window to process substantial sections of </span><a href="https://cloud.google.com/blog/topics/threat-intelligence/gemini-for-malware-analysis"><span style="text-decoration: underline; vertical-align: baseline;">decompiled code in a single pass</span></a><span style="vertical-align: baseline;">. To further enhance scalability, we introduced Gemini 1.5 Flash, incorporating </span><a href="https://cloud.google.com/blog/topics/threat-intelligence/scaling-up-malware-analysis-with-gemini"><span style="text-decoration: underline; vertical-align: baseline;">automated binary unpacking</span></a><span style="vertical-align: baseline;"> through Mandiant Backscatter before the decompilation phase to tackle certain obfuscation techniques. Yet, as any seasoned malware analyst knows, the true challenge often begins once the code is exposed. Malware developers frequently employ obfuscation tactics to conceal critical IOCs and underlying logic. Malware may also download additional malicious code, making it challenging to fully understand the behavior of a given sample.</span></p>
<p><span style="vertical-align: baseline;">For large language models (LLMs), obfuscation techniques and additional payloads create unique challenges. When dealing with obfuscated strings such as URLs, IPs, domains, or file names, LLMs often “hallucinate” without explicit decoding methods. Additionally, LLMs cannot access, for example, URLs that host additional payloads, often resulting in speculative interpretations about the sample’s behavior.</span></p>
<p><span style="vertical-align: baseline;">To help with these challenges, Code Interpreter and GTI function calling tools provide targeted solutions. Code Interpreter enables Gemini to autonomously create and execute custom scripts, as needed, using its own judgment to decode obfuscated elements within a sample, such as strings encoded with XOR-based algorithms. This capability minimizes interpretation errors and enhances Gemini's ability to reveal hidden logic without requiring manual intervention. </span></p>
<p><span style="vertical-align: baseline;">Meanwhile, GTI function calling expands Gemini’s reach by retrieving contextualized information from Google Threat Intelligence on suspicious external resources such as URLs, IPs, or domains, providing verified insights without speculative guesses. Together, these tools equip Gemini to better handle obfuscated or externally hosted data, bringing it closer to the goal of functioning as an autonomous agent for malware analysis.</span></p>
<p><span style="vertical-align: baseline;">To illustrate how these enhancements boost Gemini's capabilities, let's look at a practical example. In this case, we’re analyzing a PowerShell script that contains an obfuscated URL that hosts a second-stage payload. This particular sample was previously analyzed with some of the most advanced publicly available LLM models, which incorporate code generation and execution as part of their reasoning process. Despite these capabilities, each model “hallucinated,” generating completely fabricated URLs instead of accurately revealing the correct one.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/gemini-for-malware3-fig1.max-1000x1000.png"
alt="Obfuscated PowerShell code sample to be analyzed by Gemini">
</a>
<figcaption class="article-image__caption "><p data-block-key="rb3he">Obfuscated PowerShell code sample to be analyzed by Gemini</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Utilizing Code Interpreter and GTI function calling as part of its reasoning process, Gemini autonomously generated the following report without any human intervention. When deemed necessary, it applies these tools to process and extract additional information from the sample.</span></p></div>
<div class="block-paragraph_advanced"><div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"><table border="1" style="border-collapse: collapse; width: 99.9641%;">
<tbody>
<tr>
<td style="width: 98.1839%;">
<h3><span style="vertical-align: baseline;">Final Report: PowerShell Malware with URL Deobfuscation</span></h3>
<p><span style="vertical-align: baseline;">This report analyzes a PowerShell script designed to download and execute a file from an obfuscated URL. The script employs a custom obfuscation routine resembling the RC4 stream cipher to conceal the malicious URL.</span></p>
<p><strong style="vertical-align: baseline;">Functionality:</strong></p>
<ol>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Obfuscation Routine:</strong><span style="vertical-align: baseline;"> The script defines a function </span><code style="vertical-align: baseline;">F</code><span style="vertical-align: baseline;"> that performs byte-level XOR encryption using a key derived from the string "tox2". This function is used to deobfuscate the malicious URL.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">URL Decryption:</strong><span style="vertical-align: baseline;"> The obfuscated URL, stored as a byte array, is decrypted using the </span><code style="vertical-align: baseline;">F</code><span style="vertical-align: baseline;"> function with "tox2" as the key.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">File Download:</strong><span style="vertical-align: baseline;"> The script downloads a file from the decrypted URL (</span><code style="vertical-align: baseline;">hxxps://filedn[.]eu/lODWTgN8sswHA6Pn8HXWe1J/tox2/Scan_docs%2398097960[.]msi</code><span style="vertical-align: baseline;">) and saves it to the system's temporary directory under a randomly generated filename.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Execution:</strong><span style="vertical-align: baseline;"> Finally, the script executes the downloaded file.</span></p>
</li>
</ol>
<p><strong style="vertical-align: baseline;">Indicators of Compromise (IOCs):</strong></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">URL:</strong><span style="vertical-align: baseline;"> </span><code style="vertical-align: baseline;">hxxps://filedn[.]eu/lODWTgN8sswHA6Pn8HXWe1J/tox2/Scan_docs%2398097960[.]msi</code></p>
</li>
</ul>
<p><strong style="vertical-align: baseline;">Deobfuscation Code:</strong></p>
<pre class="language-plain"><code>def deobfuscate(x, y):
s = list(range(256))
j = 0
k = [ord(c) for c in y]
for i in range(256):
j = (j + s[i] + k[i % len(k)]) % 256
s[i], s[j] = s[j], s[i]
i = 0
j = 0
output = []
for m in range(len(x)):
i = (i + 1) % 256
j = (j + s[i]) % 256
s[i], s[j] = s[j], s[i]
output.append(x[m] ^ s[(s[i] + s[j]) % 256])
return output
a = [0x25, 0x7B, 0x5A, 0xD8, 0x9C, 0xFF, 0xC3, 0xA4, 0x95, 0xB7,
0x9D, 0x6E, 0x5F, 0x4D, 0x49, 0xFB, 0x78, 0x32, 0xB8, 0x17, 0x72,
0x01, 0x31, 0xFB, 0xC0, 0x58, 0x45, 0xCB, 0x5C, 0x3C, 0xDD, 0x00,
0x60, 0x87, 0x25, 0xB2, 0x44, 0xA6, 0xC5, 0xEA, 0x71, 0xAA, 0x9D,
0xA1, 0x05, 0x5F, 0xBD, 0x41, 0x28, 0x70, 0x30, 0x6B, 0x96, 0x24,
0x92, 0xA0, 0x9D, 0x92, 0x23, 0x79, 0x69, 0x2C, 0x1B, 0x1E, 0xEB,
0x69, 0x8F, 0x04, 0x10, 0x74, 0xBC]
b = "tox2"
c = deobfuscate(a, b)
decoded_url = "".join(chr(i) for i in c)
print(decoded_url)</code></pre>
<p><strong style="vertical-align: baseline;">Google Threat Intelligence (GTI) Lookup:</strong></p>
<p><span style="vertical-align: baseline;">The GTI lookup provides valuable context regarding the identified IOC:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Threat Actor:</strong><span style="vertical-align: baseline;"> UNC5687, known for using MESHAGENT in phishing campaigns impersonating the Security Service of Ukraine.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Campaign:</strong><span style="vertical-align: baseline;"> This specific URL is linked to a campaign spreading MESHAGENT via phishing emails with malicious links. The campaign involved a captcha page leading to the download of either a malicious MSI or EXE file.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Malware:</strong><span style="vertical-align: baseline;"> The campaign delivers MESHAGENT, an open-source remote access framework, configured to communicate with C2 domains linked to a service called "AnonVNC."</span></p>
</li>
</ul>
<p><strong style="vertical-align: baseline;">Conclusion:</strong></p>
<p><span style="vertical-align: baseline;">The analyzed PowerShell script is confirmed to be malicious, associated with a phishing campaign by UNC5687 distributing the MESHAGENT remote access tool.</span></p>
</td>
</tr>
</tbody>
</table></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Gemini identified that the script employs an XOR-based obfuscation algorithm that resembles RC4 to conceal the download URL. Recognizing this pattern, Gemini autonomously generates and executes a Python deobfuscation script within the Code Interpreter sandbox, successfully revealing the external resource.</span></p>
<p><span style="vertical-align: baseline;">With the URL in hand, Gemini then utilizes GTI function calling to query Google Threat Intelligence for further context. This analysis links the URL to UNC5687, a threat cluster known for using a remote access tool in phishing campaigns impersonating the Security Service of Ukraine.</span></p>
<p><span style="vertical-align: baseline;">As we’ve seen, the integration of these tools has strengthened Gemini’s ability to function as a malware analyst capable of adapting its approach to address obfuscation and gathering vital context on IOCs. By incorporating the Code Interpreter and GTI function calling, Gemini is better equipped to navigate complex samples by autonomously interpreting hidden elements and contextualizing external references.</span></p>
<p><span style="vertical-align: baseline;">While these are significant advancements, many challenges remain, especially given the vast diversity of malware and scenarios that exist in the threat landscape. We’re committed to making steady progress, and future updates will continue to enhance Gemini's capabilities, moving us closer to a more autonomous, adaptive approach in threat intelligence automation.</span></p></div>Tue, 19 Nov 2024 14:00:00 +0000https://cloud.google.com/blog/topics/threat-intelligence/gemini-malware-analysis-code-interpreter-threat-intelligence/Threat IntelligencearticleEmpowering Gemini for Malware Analysis with Code Interpreter and Google Threat IntelligenceGooglehttps://cloud.google.com/blog/topics/threat-intelligence/gemini-malware-analysis-code-interpreter-threat-intelligence/Bernardo QuinteroAndrés RamírezNew Cassandra to Spanner adapter simplifies Yahoo's migration journeyhttps://cloud.google.com/blog/products/databases/new-proxy-adapter-eases-cassandra-to-spanner-migration/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Cassandra, a key-value NoSQL database, is prized for its speed and scalability, and used broadly for applications that require rapid data retrieval and storage such as caching, session management, and real-time analytics. Its simple key-value pair structure helps ensure high performance and easy management, especially for large datasets. </span></p>
<p><span style="vertical-align: baseline;">But this simplicity also leads to limitations like poor support for complex queries, potential data redundancy, and difficulty in modeling intricate relationships. Spanner, Google Cloud’s always-on, globally consistent, and virtually unlimited-scale database, combines the scalability and availability of NoSQL with the strong consistency and relational model of traditional databases, positioning it for traditional Cassandra workloads. And today, it’s easier than ever to switch from Cassandra to Spanner, with the introduction of the </span><a href="https://github.com/cloudspannerecosystem/cassandra-to-spanner-proxy" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Cassandra to Spanner Proxy Adapter,</span></a><span style="vertical-align: baseline;"> an open-source tool for plug-and-play migrations of Cassandra workloads to Spanner, without any changes to the application logic.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud databases'), ('body', <wagtail.rich_text.RichText object at 0x3e71416b4550>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/products?#databases'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Spanner for NoSQL workloads</strong></h3>
<p><span style="vertical-align: baseline;">Spanner provides strong consistency, high availability, virtually unlimited scalability, and a familiar relational data model with support for SQL and ACID transactions for data integrity. As a fully managed service, it helps simplify operations, allowing teams to focus on application development rather than database administration. Furthermore, Spanner's high availability, even at a massive global scale, supports business continuity by minimizing database downtime. </span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_G0mCmB0.max-1000x1000.png"
alt="1">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">We’re constantly evolving Spanner to meet the needs of modern businesses. Some of the latest Spanner capabilities include enhanced multi-model capabilities such as </span><a href="https://cloud.google.com/blog/products/databases/announcing-spanner-graph"><span style="text-decoration: underline; vertical-align: baseline;">graph</span></a><span style="vertical-align: baseline;">, </span><a href="https://cloud.google.com/spanner/docs/full-text-search"><span style="text-decoration: underline; vertical-align: baseline;">full-text search</span></a><span style="vertical-align: baseline;">, </span><a href="https://cloud.google.com/spanner/docs/find-approximate-nearest-neighbors"><span style="text-decoration: underline; vertical-align: baseline;">vector search</span></a><span style="vertical-align: baseline;">, improved performance for analytical queries with </span><a href="https://cloud.google.com/blog/products/databases/understanding-cloud-spanner-data-boost"><span style="text-decoration: underline; vertical-align: baseline;">Spanner Data Boost</span></a><span style="vertical-align: baseline;">, and unique enterprise features such as </span><a href="https://cloud.google.com/blog/products/databases/spanner-gets-geo-partitioning"><span style="text-decoration: underline; vertical-align: baseline;">geo-partitioning</span></a><span style="vertical-align: baseline;"> and </span><a href="https://cloud.google.com/blog/products/databases/spanner-dual-region-configurations-for-data-residency"><span style="text-decoration: underline; vertical-align: baseline;">dual-region configurations</span></a><span style="vertical-align: baseline;">. For Cassandra users, these powerful features, along with Spanner’s compelling </span><a href="https://cloud.google.com/blog/products/databases/announcing-cloud-spanner-price-performance-updates?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">price-performance</span></a><span style="vertical-align: baseline;">, unlock a world of new, exciting possibilities.</span></p>
<h3><strong style="vertical-align: baseline;">The Cassandra to Spanner adapter — battle-tested by Yahoo!</strong></h3>
<p><span style="vertical-align: baseline;">If you’re wondering, “Spanner sounds like a leap forward from Cassandra. How do I get started?” the proxy adapter provides a plug-n-play way to forward your client applications' Cassandra Query Language (CQL) traffic to Spanner. Under the hood, the adapter functions as a Cassandra client for the application but operates internally by interacting with Spanner for all data manipulation tasks. With the Cassandra to Spanner proxy adapter there is no migration for your application code needed — it just works! </span></p>
<p><span style="vertical-align: baseline;">Yahoo successfully migrated from Cassandra to Spanner, reaping the benefits of improved performance, scalability, consistency, and operational efficiency. And the proxy adapter made it easy to migrate. </span></p>
<p style="padding-left: 40px;"><span style="font-style: italic; vertical-align: baseline;">“The Cassandra Adapter has provided a foundation for migrating the Yahoo Contacts workload from Cassandra to Spanner without changing any of our CQL queries. Our migration strategy has more flexibility, and we can focus on other engineering activities while utilizing the scale, redundancy, and support of Spanner without updating the codebase. Spanner is cost-effective for our specific needs, delivering the performance required for a business of our scale. This transition enables us to maintain operational continuity while optimizing cost and performance.”</span><span style="vertical-align: baseline;"> - Patrick JD Newnan, Principal Product Manager, Core Mail and Analytics, Yahoo </span></p>
<p><span style="vertical-align: baseline;">Another Google Cloud customer that successfully migrated from Cassandra to Spanner recently is </span><a href="https://cloud.google.com/blog/products/spanner/reltio-migrates-from-cassandra-to-spanner"><strong style="text-decoration: underline; vertical-align: baseline;">Reltio</strong></a><span style="vertical-align: baseline;">. Reltio benefited from an effortless migration process to minimize downtime and disruption to their services while reaping the benefits of a fully managed, globally distributed, and strongly consistent database.</span></p>
<p><span style="vertical-align: baseline;">These success stories demonstrate that migrating from Cassandra to Spanner can be a transformative step for businesses seeking to modernize their data infrastructure, unlock new capabilities, and accelerate innovation.</span></p>
<p><span style="vertical-align: baseline;">How does the new proxy adapter simplify your migration? </span><span style="vertical-align: baseline;">A typical </span><a href="https://cloud.google.com/spanner/docs/migration-overview"><span style="text-decoration: underline; vertical-align: baseline;">database migration</span></a><span style="vertical-align: baseline;"> involves the following steps:</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--medium
h-c-grid__col
h-c-grid__col--4 h-c-grid__col--offset-4
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_t87AI4p.max-1000x1000.png"
alt="2">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Some of these steps — migrate your application (step 4) and migrate the data (step 6) — are more complex than others. The proxy adapter vastly simplifies migrating a Cassandra-backed application to point to Spanner. Here's a high-level overview of the steps involved when using the new proxy adapter:</span></p>
<p role="presentation"><strong style="vertical-align: baseline;">1. Assessment:</strong><span style="vertical-align: baseline;"> Evaluate your Cassandra schema, data model, and query patterns which ones you can simplify after moving to Spanner. </span></p>
<p role="presentation"><strong style="vertical-align: baseline;">2. Schema design:</strong><span style="vertical-align: baseline;"> Spanner’s table declaration syntax and data types are similar to Cassandra’s; the </span><a href="https://cloud.google.com/spanner/docs/non-relational/spanner-for-cassandra-users#data_modeling"><span style="text-decoration: underline; vertical-align: baseline;">documentation</span></a><span style="vertical-align: baseline;"> covers these similarities and differences in depth. With Spanner, you can also take advantage of relational capabilities and features like </span><a href="https://cloud.google.com/spanner/docs/schema-and-data-model#parent-child"><span style="text-decoration: underline; vertical-align: baseline;">interleaved tables</span></a><span style="vertical-align: baseline;"> for optimal performance.</span></p>
<p role="presentation"><strong style="vertical-align: baseline;">3. Data migration:</strong><span style="vertical-align: baseline;"> There are several steps to migrate your data:</span></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Bulk load:</strong><span style="vertical-align: baseline;"> Export data from Cassandra and import it into Spanner using tools like the </span><a href="https://cloud.google.com/spanner/docs/dataflow-connector"><span style="text-decoration: underline; vertical-align: baseline;">Spanner Dataflow connector</span></a><span style="vertical-align: baseline;"> or </span><a href="https://cloud.google.com/bigquery/docs/export-to-spanner"><span style="text-decoration: underline; vertical-align: baseline;">BigQuery reverse ETL</span></a><span style="vertical-align: baseline;">.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Replicate incoming data: </strong><span style="vertical-align: baseline;">Replicate incoming updates to your Cassandra cluster to Spanner in real-time using Cassandra’s Change Data Capture (CDC). <br/><br/></span><span style="font-family: -apple-system, BlinkMacSystemFont, 'Segoe UI', Roboto, Oxygen, Ubuntu, Cantarell, 'Open Sans', 'Helvetica Neue', sans-serif;">Another possibility is to update your application logic to perform dual-writes to Cassandra and Spanner. We don’t recommend this approach if you’re trying to minimize changes to your application code.</span></li>
</ul>
<p role="presentation"><strong style="vertical-align: baseline;">4. Set up the proxy adapter and update your Cassandra configuration:</strong><span style="vertical-align: baseline;"> Download and run the </span><a href="https://github.com/cloudspannerecosystem/cassandra-to-spanner-proxy" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Cassandra to Spanner Proxy Adapter</span></a><span style="vertical-align: baseline;">, which runs as a sidecar next to your application. By default, the proxy adapter runs on port 9042. In case you decide to use a different port, don’t forget to update your application code to point to the proxy adapter.</span></p>
<p role="presentation"><strong style="vertical-align: baseline;">5. Testing:</strong><span style="vertical-align: baseline;"> Thoroughly test your migrated application and data in a non-production environment to ensure everything works as expected.</span></p>
<p role="presentation"><strong style="vertical-align: baseline;">6. Cutover:</strong><span style="vertical-align: baseline;"> Once you're confident in the migration, switch your application traffic to Spanner. Monitor closely for any issues and fine-tune performance as needed.</span></p>
<h3><strong style="vertical-align: baseline;">What’s under the hood of the new proxy adapter?</strong></h3>
<p><span style="vertical-align: baseline;">The new proxy adapter presents itself as a Cassandra client to the application. From the application's perspective, the only noticeable change is the IP address or hostname of the Cassandra endpoint, which now points to the proxy adapter. This streamlines the Spanner migration, without requiring extensive modifications to application code.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/3a.png"
alt="3">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">We designed the proxy adapter to establish a one-to-one mapping between each Cassandra cluster and a corresponding Spanner database. The proxy instance employs a multi-listener architecture, with each listener bound to a distinct port. This facilitates concurrent handling of multiple client connections, where each listener manages a distinct connection with the specified Spanner database. </span></p>
<p><span style="vertical-align: baseline;">The proxy’s translation layer handles the intricacies of the Cassandra protocol. This layer performs message decoding and encoding, manages buffers and caches, and crucially, parses incoming CQL queries and translates them into Spanner-compatible equivalents.</span></p>
<p><span style="vertical-align: baseline;">The proxy adapter supports </span><a href="https://opentelemetry.io/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">OpenTelemetry</span></a><span style="vertical-align: baseline;"> to collect and export traces to </span><a href="https://cloud.google.com/trace/docs"><span style="text-decoration: underline; vertical-align: baseline;">Cloud Trace</span></a><span style="vertical-align: baseline;">. </span></p>
<p><span style="vertical-align: baseline;">For more details about different ways of setting up the adapter, limitations, mapping of CQL data types to Spanner, and more, refer to the </span><a href="https://github.com/cloudspannerecosystem/cassandra-to-spanner-proxy?tab=readme-ov-file" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">proxy adapter documentation</span></a><span style="vertical-align: baseline;">.</span></p>
<h3><strong style="vertical-align: baseline;">Addressing common concerns and challenges</strong></h3>
<p><span style="vertical-align: baseline;">Let's address a few concerns you may have with your migrations:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Cost:</strong><span style="vertical-align: baseline;"> Have a look at </span><a href="https://www.accenture.com/content/dam/accenture/final/accenture-com/document-2/Spanner-Benchmark-With-YCSB-Final.pdf" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Accenture’s benchmark result</span></a><span style="vertical-align: baseline;"> that demonstrates that Spanner ensures not only consistent latency and throughput but also cost efficiency. Furthermore, Spanner now offers a new tiered pricing model (</span><a href="https://cloud.google.com/blog/products/databases/announcing-spanner-editions?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">Spanner editions</span></a><span style="vertical-align: baseline;">) that delivers better cost transparency and cost savings opportunities to help you take advantage of all of Spanner’s capabilities.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Latency increases: </strong><span style="vertical-align: baseline;">To minimize an increase in query latencies, we recommend running the proxy adapter on the same host as the client application (as a side-car proxy) or running on the same Docker network when running the proxy adapter in a Docker container. We also recommend keeping the CPU utilization of the proxy adapter host to under 80%.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Schema flexibility:</strong><span style="vertical-align: baseline;"> While Cassandra offers schema flexibility, Spanner's stricter relational schema provides advantages in terms of data integrity, query power, and consistency.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Learning curve:</strong><span style="vertical-align: baseline;"> Spanner’s data types have some differences with Cassandra’s. Have a look at this </span><a href="https://cloud.google.com/spanner/docs/non-relational/spanner-for-cassandra-users"><span style="text-decoration: underline; vertical-align: baseline;">comprehensive documentation</span></a><span style="vertical-align: baseline;"> that can ease the transition.</span></p>
</li>
</ul>
<h3><strong style="vertical-align: baseline;">Get started today </strong></h3>
<p><span style="vertical-align: baseline;">The benefits of strong consistency, simplified operations, enhanced data integrity, and global scalability make Spanner a compelling option for businesses looking to leverage the cloud's full potential for NoSQL workloads. With the new Cassandra to Spanner proxy adapter, we are making it easier to plan and execute on your migration strategy, so you can unlock a new era of data-driven innovation for your organization.</span></p>
<p><span style="vertical-align: baseline;">Download the new </span><a href="https://github.com/cloudspannerecosystem/cassandra-to-spanner-proxy" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Cassandra to Spanner proxy adapter</span></a><span style="vertical-align: baseline;">, and try it out on a </span><a href="https://cloud.google.com/spanner/docs/free-trial-instance"><span style="text-decoration: underline; vertical-align: baseline;">Spanner Free Trial instance</span></a><span style="vertical-align: baseline;"> at no cost today.</span></p></div>Mon, 18 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/databases/new-proxy-adapter-eases-cassandra-to-spanner-migration/SpannerCustomersDatabasesarticleNew Cassandra to Spanner adapter simplifies Yahoo's migration journeyGooglehttps://cloud.google.com/blog/products/databases/new-proxy-adapter-eases-cassandra-to-spanner-migration/Nitin SagarSr. Product Manager, Google CloudEike FalkenbergEngineering Manager, GoogleWhat’s new with Google Cloudhttps://cloud.google.com/blog/topics/inside-google-cloud/whats-new-google-cloud/<div class="block-paragraph"><p data-block-key="kgod7">Want to know the latest from Google Cloud? Find it here in one handy location. Check back regularly for our newest updates, announcements, resources, events, learning opportunities, and more. </p><hr/><p data-block-key="ru1z9"><b>Tip</b>: Not sure where to find what you’re looking for on the Google Cloud blog? Start here: <a href="https://cloud.google.com/blog/topics/inside-google-cloud/complete-list-google-cloud-blog-links-2021">Google Cloud blog 101: Full list of topics, links, and resources</a>.</p><hr/><p data-block-key="b0lnw"></p></div>
<div class="block-paragraph"><h3 data-block-key="3rhcn">Week of Nov 11 - Nov 15</h3><ul><li data-block-key="6lmq3">Subsea cable connectivity is coming to Tuvalu for the first time with the addition of the Tuvalu Vaka cable. Building on the <a href="https://cloud.google.com/blog/products/infrastructure/introducing-bulikula-and-halaihai-subsea-cables-to-connect-the-central-pacific">Bulikula subsea cable system</a> announced last year, this new network infrastructure is a collaboration among several partners including Australia, Japan, New Zealand, Taiwan, Tuvalu, Tuvalu Telecommunications Corporation and the United States, and will help reduce the digital divide in the Pacific.</li></ul><h3 data-block-key="ejca9">Week of Nov 4-8</h3><ul><li data-block-key="bfd8a">We are excited to announce the <a href="https://github.com/GoogleCloudPlatform/reCAPTCHA-PLD" target="_blank">reCAPTCHA Password Leak Detection Container App</a>, a new tool that makes it easier than ever to protect your users from account takeovers. This container app simplifies the integration of <a href="https://cloud.google.com/blog/products/identity-security/announcing-recaptcha-enterprise-password-leak-detection-in-ga">reCAPTCHA's powerful password leak detection</a>, allowing you to instantly detect compromised credentials and proactively prompt users to change their password before their account is compromised. With pre-built libraries and a streamlined process, you can significantly reduce integration time and enhance your website's security with ease.</li></ul><h3 data-block-key="blhn">Week of Oct 21-25</h3><ul><li data-block-key="f8lua">We're excited to announce GA support for scanning: Rocky Linux, Alma, SUSE (SLES), Red Hat (UBI), <a href="https://www.chainguard.dev/chainguard-images" target="_blank">Chainguard</a>, Wolfi & <a href="https://github.com/GoogleContainerTools/distroless/blob/main/README.md" target="_blank">Google Distroless</a>. These operating systems are now supported in both <a href="https://cloud.google.com/artifact-analysis/docs/artifact-analysis#automatic_scanning">Artifact Registry scanning</a>, as well as <a href="https://cloud.google.com/artifact-analysis/docs/artifact-analysis#on-demand_scanning">On Demand Scanning</a>.<br/> When the Container Scanning API is enabled, any container with these new operating systems or distroless images will automatically be scanned for vulnerabilities when pushed to Artifact Registry. We've also upgraded our On Demand scan to include; NPM, Python, Ruby, Rust, .Net & PHP language packages. See all <a href="https://cloud.google.com/artifact-analysis/docs/container-scanning-overview?skip_cache=true#feature-support">supported package types</a>.</li><li data-block-key="1eq2j"><b>Term Extension Now Available for Compute Engine Committed Use Discounts:</b> You can now extend the term length of your Compute Engine resource-based <a href="https://cloud.google.com/compute/docs/instances/signing-up-committed-use-discounts">Committed Use Discounts</a> (“CUDs”) beyond the preset 1-year and 3-year options. CUDs offer significant cost savings for predictable workloads. You can now choose a CUDs term length beyond the original commitment end date that perfectly aligns with your workload needs, from one year and one day up to 6 years. <a href="https://cloud.google.com/compute/docs/committed-use-discounts/extend-commitment-term">Learn more</a></li></ul><h3 data-block-key="fer4l">Week of Oct 14-18</h3><ul><li data-block-key="bfpvh">Announcing <a href="https://cloud.google.com/blog/topics/partners/enhancing-google-cloud-marketplace-private-offers">Google Cloud Marketplace private offer enhancements</a> that enable additional payment flexibility for enterprises, including when transacting generative AI models.</li></ul><h3 data-block-key="n8m9">Week of Oct 7-11</h3><ul><li data-block-key="2i00">We are excited to announce the launch of new <a href="https://cloud.google.com/solutions/cortex?hl=en"><b>Google Cloud Cortex Framework</b></a> data integration and analytics solution content for BigQuery and Looker with Oracle EBS data. To learn more read our announcement <a href="https://cloud.google.com/blog/products/data-analytics/google-cloud-cortex-framework-integrated-with-oracle-ebs">blog</a>.</li><li data-block-key="ekm0r">Google Cloud is partnering with leading AI and cybersecurity startups to accelerate their growth and innovation, through the<a href="https://cloud.google.com/programs/startups/isv-startup-springboard"> <b>ISV Startup Springboard program</b></a>, announced this week at the <a href="https://cloudonair.withgoogle.com/events/summit-startup-24" target="_blank">Google Cloud Startup Summit.</a> <a href="https://cloud.google.com/programs/startups/isv-startup-springboard">Learn more</a><a href="https://cloudonair.withgoogle.com/events/summit-startup-24" target="_blank"> and register interest.</a></li><li data-block-key="9g2oq">Privileged Access Manager (PAM) is now Generally Available. The GA release offers new capabilities in addition to recently released public preview and includes features such as <a href="https://cloud.google.com/iam/docs/pam-overview#pubsub-notifications">Pub/Sub integration</a> for custom alerting and monitoring, <a href="https://cloud.google.com/iam/docs/pam-view-grants#modified-through-iam">alerts on IAM grant modifications</a> outside of PAM, and <a href="https://cloud.google.com/iam/docs/secure-iam-vpc-sc#pam">integration with VPC Service Controls</a> to tackle data exfiltration. <a href="https://cloud.google.com/iam/docs/pam-overview">Learn more</a>.</li></ul><h3 data-block-key="fm45r">Week of Sept 23-27</h3><ul><li data-block-key="55ro8">We are excited to announce that registration is open for the<a href="https://cloudonair.withgoogle.com/events/summit-app-dev-infrastructure-24?utm_source=cgc-blog&utm_medium=blog&utm_campaign=FY24-Q4-global-EXP135-onlineevent-er-app-dev-infra-summit-2024-mc&utm_content=whats-new-blog&utm_term=-" target="_blank"> <b>App Dev & Infrastructure Summit</b></a> <b>on October 30 (AMER) and October 31 (EMEA).</b> Google Technology Fellows - our luminary technical leaders - and industry experts will share strategies and learnings on how to improve efficiency, reduce costs, and speed up AI innovation for your cloud and application infrastructure at this global digital event. Register<a href="https://cloudonair.withgoogle.com/events/summit-app-dev-infrastructure-24?utm_source=cgc-blog&utm_medium=blog&utm_campaign=FY24-Q4-global-EXP135-onlineevent-er-app-dev-infra-summit-2024-mc&utm_content=whats-new-blog&utm_term=-" target="_blank"> here</a>.</li></ul><h3 data-block-key="54k2r">Week of Sept 16-20</h3><ul><li data-block-key="5oc4u">Starting this week, Google Cloud customers with eligible support plans can access assistance for the<a href="https://cloud.google.com/cluster-toolkit/docs/overview"> Cluster Toolkit</a> through the Cloud Console. Cluster Toolkit, formerly known as Cloud HPC Toolkit, is open-source software offered by Google Cloud which simplifies the process for you to deploy HPC, AI and ML workloads on Google Cloud. The Cloud Support team will handle filed cases, ensuring that users receive timely and effective support for their Cluster Toolkit implementations. Select 'Cluster Toolkit' as the sub-category under 'Compute Engine' when creating a support ticket in your Cloud Console to get in touch about any Cluster Toolkit issues.</li><li data-block-key="1hsaa">Backup and DR service is excited to announce the public preview of backup vaults and simplified VM backup offering. Backup vaults provide secure backups for cyber resilience through immutable and indelible backups for VMs and databases, delivering security against accidental or malicious data deletion. Simplified Compute Engine VM backup with a fully-managed experience, directly integrated into the cloud console makes backing of VMs as easy as 1-2-3. The solution also enables backup admins to empower application developers to self-protect their VMs while retaining centralized governance and oversight. Read the <a href="https://cloud.google.com/blog/products/storage-data-transfer/backup-and-dr-service-adds-immutable-indelible-backups">full blog</a> to learn more and try out the new features.</li></ul><h3 data-block-key="8inrl">Week of Sept 2-6</h3><ul><li data-block-key="dq7kb">We’re excited to share that Topaz will be extended to Taiwan. <a href="https://cloud.google.com/blog/products/infrastructure/topaz-subsea-cable-connects-canada-and-asia?e=0">Announced</a> in 2022, the transpacific subsea cable system was the first to connect Canada and Japan. Now, with the extension of Topaz to Taiwan, we’ll provide the region with increased reliability and resilience for network operators, for Google, and for users.</li></ul><h3 data-block-key="e21uc">Week of Aug 26-30</h3><ul><li data-block-key="5oqo6">We are excited to announce the general availability of Instant snapshots for Google Compute Engine Persistent Disks, which provide near-instantaneous, high-frequency, point-in-time checkpoints of a disk that can be rapidly restored as needed. Read the full <a href="https://cloud.google.com/blog/products/compute/introducing-compute-engine-instant-snapshots">blog</a> to try it out.</li><li data-block-key="9n7q">In response to customer and partner requests for pollen data in Japan, we are excited to announce that data for Japanese Cedar and Cypress trees-the 2 main sources of pollen allergens in Japan-have been added to our <a href="https://mapsplatform.google.com/maps-products/pollen/" target="_blank">Pollen API from Google Maps Platform</a>.</li></ul><h3 data-block-key="4k6tl">Week of Aug 19-23</h3><ul><li data-block-key="90p3t">We are excited to announce we’re adding support for NVIDIA L4 GPUs to<a href="https://cloud.google.com/run"> Cloud Run</a>, in preview. Developers love Cloud Run for its simplicity, fast autoscaling, scale-to-zero capabilities, and pay-per-use pricing. Those same benefits come into play for real-time inference apps serving open gen AI models. Check out this launch <a href="https://cloud.google.com/blog/products/application-development/run-your-ai-inference-applications-on-cloud-run-with-nvidia-gpus">blog</a>. Also watch demos from this launch event webinar <a href="https://cloudonair.withgoogle.com/events/run-ai-with-cloud-run" target="_blank">Run AI on Cloud run</a>.</li><li data-block-key="2kttd">We are excited to announce that Google Cloud Functions is now Cloud Run functions — event-driven programming in one unified serverless platform. This goes beyond a simple name change. We’ve unified the Cloud Functions infrastructure with Cloud Run, and developers of Cloud Functions (2nd gen) get immediate access to all new Cloud Run features, including NVIDIA GPUs. Read the launch <a href="https://cloud.google.com/blog/products/serverless/google-cloud-functions-is-now-cloud-run-functions">blog</a> and watch demos from this launch event webinar <a href="https://cloudonair.withgoogle.com/events/run-ai-with-cloud-run" target="_blank">Run AI on Cloud run</a>.</li></ul><h3 data-block-key="cjpk0">Week of Aug 5-9</h3><ul><li data-block-key="f0041">Google’s Workforce Identity federation now enables Microsoft Entra ID users to access Google BigQuery from Microsoft Power BI with Single-Sign-On. No users or groups need to be provisioned in Google Cloud as Workforce identity Federation leverages a syncless federation capability using attribute based access control to authorize access to Google BigQuery using Microsoft Entra user attributes such as user group membership. You can refer to our <a href="https://cloud.google.com/iam/docs/workforce-sign-in-power-bi">documentation</a> to learn more.</li><li data-block-key="fej89">We are excited to announce the preview of <a href="https://cloud.google.com/bigtable/docs/introduction-sql">SQL support in Bigtable</a> to bring Google’s pioneering NoSQL database to a broader developer audience. Bigtable leverages GoogleSQL ─the same SQL dialect used by BigQuery─ making it easier to use Bigtable as low-latency analytics serving layer in combination with BigQuery’s newly announced <a href="https://cloud.google.com/bigquery/docs/continuous-queries-introduction">continuous queries</a> but does so with extensions to support its signature data model so you can use SQL without giving up on all the flexibility that comes with a NoSQL database. It also simplifies migrations from open source databases such as Apache Cassandra. With over 100 new functions from JSON processing capabilities, kNN for GenAI and HLL for real-time analytics, SQL opens the door to many new possibilities with Bigtable. Learn more in our detailed <a href="https://cloud.google.com/blog/products/databases/announcing-sql-support-for-bigtable">blog post</a>.</li><li data-block-key="fcjh7">We are excited to announce the public preview of <a href="https://cloud.google.com/bigquery/docs/continuous-queries-introduction">BigQuery continuous queries</a>, a groundbreaking new feature that empowers users to run continuously processing SQL statements that can process, analyze, and transform data as new events arrive in BigQuery, ensuring insights are always up to date. Native integration with the Google Cloud ecosystem unlocks the ability of Vertex AI and Gemini to perform machine learning inference on incoming data in real time. As well as streaming replication of continuous query results to Pub/Sub topics, Bigtable instances, or other BigQuery tables. Read the <a href="https://cloud.google.com/blog/products/data-analytics/bigquery-continuous-queries-makes-data-analysis-real-time">full blog</a> and try it out!</li><li data-block-key="cjcln">AlloyDB’s AutoPilot capabilities- Automatic memory management, Adaptive AutoVacuum<b>,</b> Automatic storage tiering ,Automatic data columnarization and query rewrite- makes management super efficient and easy. AlloyDB eliminates the drudgery of maintaining a PostgreSQL database by using, behind the scenes , advanced self-tuning machine learning algorithm. In this <a href="https://medium.com/google-cloud/alloydb-adaptive-autovacuum-and-how-alloydb-cluster-storage-space-is-released-41be54b8b8c8" target="_blank">blog</a> we will look into a real world example of AlloyDB Adaptive AutoVacuum in work and how AlloyDB Cluster Storage Space is Released</li><li data-block-key="dvta7">Google Cloud Identity Platform, our consumer identity solution, now supports Passkeys. With Passkeys, developers can authenticate their app's end users securely, protecting them from account takeover attacks like phishing and leaked credentials. To join the private preview, contact your Google account team.</li></ul><h3 data-block-key="5jj2o">Week of July 15-19</h3><ul><li data-block-key="lvjd">Google Cloud is excited to launch the Modern SecOps Masterclass, now available on Coursera. This course equips security professionals with cutting-edge skills to modernize their Security Operations Centers (SOCs) using our Autonomic Security Operations framework and Continuous Detection, Continuous Response (CD/CR) methodology. Read the <a href="https://cloud.google.com/blog/products/identity-security/modern-secops-masterclass-now-available-on-coursera">full blog</a> and <a href="https://coursera.org/learn/modern-security-operations" target="_blank">enroll now</a>.</li><li data-block-key="6ase5">Learn how to potentially achieve a strong consistency in Cloud Bigtable for your next big data solution. Bigtable offers high throughput at low latency. It is ideal for storing large amounts of data in a key-value store while supporting high read and write throughput at low latency for fast access. Bigtable provides eventual consistency as well as strong consistency. This blog talks about achieving strong data consistency in a multi-cluster Bigtable instance. <a href="https://medium.com/google-cloud/data-consistency-across-bigtable-clusters-cec93ed8adda" target="_blank">Read the full blog</a>.</li></ul><h3 data-block-key="amrr6">Week of June 24-28</h3><ul><li data-block-key="7m6v4">Introducing <b>Google Cloud Marketplace Channel Private Offers</b>, enabling customers, ISV partners, and channel partners to efficiently transact private offers via reseller-initiated sales of third-party solutions listed on the Google Cloud Marketplace. This differentiated program also empowers channel partners to manage the customer relationship from billing, collections to revenue recognition. <a href="https://cloud.google.com/blog/topics/partners/announcing-google-cloud-marketplace-private-offers">Read the full blog</a>.</li><li data-block-key="c9ine">A blog on benchmark study (collaborated with Yahoo) by comparing the cost and performance of Apache Flink and Google Cloud Dataflow for two specific streaming data processing use cases. The goal of the study was to determine the most cost-effective platform for these use cases by establishing a fair comparison methodology and controlling variables such as throughput and workload. The results indicate that, with some optimization on Dataflow can perform on-par with Apache Flink. <a href="https://medium.com/google-cloud/yahoo-benchmarks-dataflow-vs-b189c809ff49" target="_blank">Read the full blog</a>.</li><li data-block-key="1sllq">A Blog on Secure Gateways: Mutual TLS for Ingress Gateway Secure Gateways: Mutual TLS for Ingress Gateway," discusses the implementation of mutual TLS (mTLS) for enhanced security in ingress gateways. It explains how mTLS ensures both client and server authentication through certificates, going beyond the traditional server-only verification. The article explores the setup process and the benefits of using mTLS, emphasizing its role in establishing secure communication channels in modern cloud architectures. <a href="https://medium.com/google-cloud/secure-gateways-mutual-tls-for-ingress-gateway-05f8daa8f41f" target="_blank">Read the full blog</a>.</li><li data-block-key="9hq5n">A Blog on Wildcard certificates with Ingress Gateway "Wildcard certificates with Ingress Gateway" provides a guide on how to use wildcard certificates to secure multiple services behind a single Istio Ingress Gateway. This simplifies certificate management and improves the user experience by allowing seamless connections across different services within the same domain. The article demonstrates the configuration process step-by-step and explains how wildcard certificates are matched to incoming requests. <a href="https://medium.com/google-cloud/wildcard-certificates-with-ingress-gateway-904ce822a50e" target="_blank">Read the full blog</a></li></ul><h3 data-block-key="8b8l9">Week of June 17-21</h3><ul><li data-block-key="5qe61">Learn how to leverage BigQuery vector search to analyze your logs and asset metadata stored in BigQuery. Using vector search, you can find semantically similar logs which can be helpful in several use cases such as outlier detection, triage and investigation. This how-to blog walks you through the setup from processing logs, generating vector embeddings, to analyzing vector search results. It includes sample SQL queries which can be adapted for your own logs and use case. <a href="https://cloud.google.com/blog/products/data-analytics/bigquery-vector-search-for-log-analysis">Read the full blog</a>.</li><li data-block-key="cnhc0"><a href="https://cloud.google.com/blog/products/infrastructure/introducing-the-nuvem-subsea-cable">Nuvem</a>, first announced last year, is a transatlantic subsea cable system that will connect Portugal, Bermuda, and the United States. We are now working with the Regional Government of Azores to enable extending the system to the Azores as well. Named after the Portuguese word for “cloud,” Nuvem will improve network resiliency across the Atlantic, helping meet growing demand for digital services and further establishing its landing locations as digital hubs.</li></ul><h3 data-block-key="f5umb">Week of June 10-14</h3><ul><li data-block-key="a0cl0">General Availability of A3 Mega, a new instance type in the A3 VM family. A3 Mega is powered by the NVIDIA H100 Tensor Core GPU, delivers a 2.4x improvement in large scale training performance over multiple A3 instances.<ul><li data-block-key="4mv6b">2x the GPU-to-GPU networking bandwidth over A3 Instances.</li><li data-block-key="cculf">Enhanced GPUDirect-TCPXO networking offloads GPUDirect memory access from the CPU, providing direct access through through the NIC (Network Interface Card) to GPU memory, based on <a href="https://cloud.google.com/titanium">Titanium TOPs</a>, which improves performance of multi-node distributed training workloads.</li></ul></li><li data-block-key="b7445">Simplify your Network: The Cloud Networking Product Management and Engineering team will be traveling across US cities in June/July and Sept. Learn how Cross-Cloud Network can transform your infrastructure. The workshop will address Cross-Cloud Networking for hybrid and multicloud enterprises with distributed applications, internet-facing content and applications, security, and AI-assisted network operations with Gemini Cloud Assist. Join us at one of the following Google office locations and meet the experts who will share the latest innovations, use cases, and demos. <a href="https://inthecloud.withgoogle.com/cross-cloud-network-innovation-workshop/register.html#form" target="_blank">Register here</a>.</li><li data-block-key="e0aud">Learn how you can leverage the cloud deployment archetypes (zonal, regional, multi-regional, global, hybrid, & multicloud) to architect cloud topologies that meet your workload’s requirements for reliability, cost, performance, & operational simplicity. <a href="https://cloud.google.com/blog/products/application-modernization/new-google-cloud-deployment-archetypes-guide">Read the full blog</a>.</li></ul><h3 data-block-key="1ihqo">Week of May 20-24</h3><ul><li data-block-key="2hdc">Maximize performance and optimize spend with Compute Engine’s latest General Purpose VMs, N4 and C4. N4's flexible configurations and price-performance gains help optimize costs, while C4 provides top-tier performance for demanding applications. With N4 and C4, you get tailored solutions for all your general-purpose workloads, so you can lower the total cost of running your business without compromising on performance or workload-specific requirements. <a href="https://cloud.google.com/blog/products/compute/a-closer-look-at-compute-engine-c4-and-n4-machine-series">Learn more here</a>.</li></ul><h3 data-block-key="8e14k">Week of Apr 22 - April 26</h3><ul><li data-block-key="er89f">Simplify your connectivity to Google by using a Verified Peering Provider to connect to Google, instead of using Direct Peering. Verified Peering Providers handle all of the complex connectivity allowing you to focus on your core business. <a href="https://cloud.google.com/blog/products/networking/verified-peering-provider-simplifies-enterprise-connectivity">Learn more here</a>.</li></ul><h3 data-block-key="9lq9v">Week of Apr 15- Apr 19</h3><ul><li data-block-key="8b1g">New training in AI, data analytics and cybersecurity, designed to expand onramps to tech careers through colleges and employers. <a href="https://cloud.google.com/blog/topics/training-certifications/new-introductory-courses-in-gen-ai-data-analytics-cybersecurity">Learn more.</a></li></ul><h3 data-block-key="7eco8">Week of Apr 1- Apr 5</h3><ul><li data-block-key="36qr1"><a href="https://cloud.google.com/security/products/security-command-center?hl=en">Security Command Center (SCC) Enterprise</a> is now generally available (GA). It is the industry’s first cloud risk management solution that converges cloud security and enterprise security operations into a single platform, supercharged by Mandiant expertise and AI. Learn more in our <a href="https://cloud.google.com/blog/products/identity-security/introducing-security-command-center-enterprise">announcement blog</a>.</li><li data-block-key="1cepm">Identify common container runtime attacks, analyzes suspicious code, and use natural language processing to pinpoint malicious scripts with <a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-gke-threat-detection">GKE threat detection</a>, powered by Security Command Center. Now in public preview.</li><li data-block-key="bhrmb">Get a fully managed compliance service that automatically delivers end-to-end coverage for GKE, scanning for issues against the most important benchmarks with <a href="https://cloud.google.com/kubernetes-engine/fleet-management/docs/about-compliance-dashboard">GKE compliance</a>, now in public preview. Near-real-time insights are available in a centralized dashboard, with compliance reports automatically produced for you.</li><li data-block-key="6b37f">Streamline your GCE backup strategy! With tag-based backups in Google Backup and DR, protection is automated – new VMs with the right tags are protected immediately, saving you time and increasing reliability. Read more on the blog here. <a href="https://cloud.google.com/blog/products/storage-data-transfer/tags-support-in-backup-and-dr-service-simplifies-vm-protection">Differential privacy enforcement with privacy budgeting is now available in BigQuery data clean rooms so organizations can prevent data from being reidentified when it is shared.</a></li></ul><h3 data-block-key="5cdt9">Week of Mar 18- Mar 22</h3><ul><li data-block-key="83c2e">Google Kubernetes Engine (GKE) and NVIDIA NeMo framework are used to train large language models (LLMs). Due to the increasing demand for efficient and scalable training of LLMs, the need for GPUs at a large scale with high speed networking is rapidly growing. GKE offers a comprehensive set of features that make it suitable for enterprise-level training and inference. This blog post shows how generative AI models can be adapted to your use cases by demonstrating how to train models on Google Kubernetes Engine (GKE) using the <a href="https://cloud.google.com/blog/products/compute/gke-and-nvidia-nemo-framework-to-train-generative-ai-models">NVIDIA NeMo framework</a>.</li><li data-block-key="4ubqp">Cloud Run now supports volume mounts! Mount a Cloud Storage bucket or NFS file share as a volume to easily serve static assets, access app configuration data, or access an AI/ML model. <a href="https://cloud.google.com/blog/products/serverless/introducing-cloud-run-volume-mounts">Learn more in our blog post</a>.</li></ul><h3 data-block-key="64l59">Week of Mar 11- Mar 15</h3><ul><li data-block-key="8qscm">Datastream adds support for SQL Server sources, now in preview. With existing support for MySQL, PostgreSQL, and Oracle, support for SQL Server sources extends the reach of Datastream and empowers you to replicate data from a range of relational sources to several Google Cloud services, such as BigQuery, Cloud Storage, AlloyDB, and Spanner. Read more in the blog <a href="https://cloud.google.com/blog/products/databases/datastream-supports-sql-server-sources">here</a>.</li></ul><h3 data-block-key="c9sv4">Week of Feb 5- Feb 9</h3><ul><li data-block-key="1rd42">Check out this new blog and learn more about the <a href="https://cloud.google.com/blog/topics/partners/new-integrated-commerce-network-streamlines-digital-commerce">Integrated Commerce Network (ICN)</a> delivered by Kin + Carta and built on Google Cloud. The ICN features 3 of our premier digital commerce partners for an integrated end-to-end solution including Bloomreach, commercetools and Quantum Metric.</li></ul><h3 data-block-key="94gdi">Week of Jan 29- Feb 2</h3><ul><li data-block-key="3q5r8"><a href="https://cloud.google.com/blog/products/infrastructure-modernization/idc-finds-318-percent-roi-from-migrating-to-google-cloud-iaas">IDC finds 318% ROI from migrating to Google Cloud IaaS</a>: Check out the latest IDC research study to learn how organizations worldwide are benefitting by adopting Google Cloud Infrastructure as a Service.</li></ul><h3 data-block-key="8uba3">Week of Jan 15-19</h3><ul><li data-block-key="a35er">Check out the latest generative AI training available from Google Cloud : Take a look at our <a href="https://cloud.google.com/blog/topics/training-certifications/ten-new-generative-ai-trainings-to-upskill-in-2024-with-duet-ai">top ten trainings in Duet AI</a> to help boost your productivity in 2024.</li></ul><h3 data-block-key="1p2na">Week of Jan 1-5</h3><ul><li data-block-key="l749"><a href="https://cloud.google.com/blog/products/gcp/google-cloud-top-news-of-2023">The year in Google Cloud: Top news of 2023</a>: A look back at the biggest stories of 2023 from Google Cloud, covering generative AI, DevOps, containers, data and databases, security, and more.</li></ul></div>Fri, 15 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/topics/inside-google-cloud/whats-new-google-cloud/Google CloudInside Google CloudarticleWhat’s new with Google Cloudhttps://storage.googleapis.com/gweb-cloudblog-publish/original_images/52_-_Whats_new.jpgGooglehttps://cloud.google.com/blog/topics/inside-google-cloud/whats-new-google-cloud/Google Cloud Content & Editorial What’s new with HPC and AI infrastructure at Google Cloudhttps://cloud.google.com/blog/topics/hpc/whats-new-with-hpc/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">At Google Cloud, we’re rapidly advancing our high-performance computing (HPC) capabilities, providing researchers and engineers with powerful tools and infrastructure to tackle the most demanding computational challenges. Here's a look at some of the key developments driving HPC innovation on Google Cloud, as well as our presence at Supercomputing 2024.</span></p>
<p><span style="vertical-align: baseline;">You can also stay apprised of our HPC and AI advances by joining the new </span><a href="https://rsvp.withgoogle.com/events/google-cloud-advanced-computing-community" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud Advanced Computing Community</span></a><span style="vertical-align: baseline;"> (details below). </span></p>
<h3><strong style="vertical-align: baseline;">Next-generation HPC VMs</strong></h3>
<p><span style="vertical-align: baseline;">We began our H-series with </span><a href="https://cloud.google.com/compute/docs/compute-optimized-machines#h3_series"><span style="text-decoration: underline; vertical-align: baseline;">H3 VMs</span></a><span style="vertical-align: baseline;">, specifically designed to meet the needs of demanding HPC workloads. Now, we’re excited to share some key features of the next generation of the H family, bringing even more innovation and performance to the table. The upcoming VMs will feature:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Improved workload scalability via RDMA-enabled 200 Gbps networking</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Native support to directly provision full, tightly-coupled HPC clusters on demand </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/blog/products/compute/introducing-dynamic-workload-scheduler?e=0"><span style="text-decoration: underline; vertical-align: baseline;">Dynamic Workload Scheduler</span></a><span style="vertical-align: baseline;"> to provision fixed-lifetime clusters now or in the future</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/titanium?e=0&hl=en"><span style="text-decoration: underline; vertical-align: baseline;">Titanium</span></a><strong style="vertical-align: baseline;"> </strong><span style="vertical-align: baseline;">technology that delivers superior performance, reliability, and security </span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">We provide </span><a href="https://github.com/GoogleCloudPlatform/cluster-toolkit/blob/main/examples/hpc-enterprise-slurm.yaml" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">system blueprints</span></a><span style="vertical-align: baseline;"> for setting up turnkey, pre-configured HPC clusters on our H series VMs.</span></p>
<p><span style="vertical-align: baseline;">The next generation of H series is coming in early 2025.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', 'Try Google Cloud for free'), ('body', <wagtail.rich_text.RichText object at 0x3e71416b4520>), ('btn_text', 'Get started for free'), ('href', 'https://console.cloud.google.com/freetrial?redirectPath=/welcome'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Parallelstore: World’s first fully-managed DAOS offering</strong></h3>
<p><a href="https://cloud.google.com/parallelstore?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">Parallelstore</span></a><span style="vertical-align: baseline;"> is a fully managed, scalable, high-performance storage solution based on next-generation </span><a href="https://daos.io/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">DAOS technology</span></a><span style="vertical-align: baseline;">, designed for demanding HPC and AI workloads. It is now generally available and provides:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Up to 6x greater read throughput performance compared to competitive Lustre scratch offerings</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Low latency (<0.5ms at p50) and high throughput (>1GiB/s per TiB) to access data with minimal delays, even at massive scale</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">High IOPS (30K IOPS per TiB) for metadata operations</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Simplified management that reduces operational overhead with a fully managed service </span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">Parallelstore is great for applications requiring fast access to large datasets, such as:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Analyzing massive genomic datasets for personalized medicine</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Training large language models (LLMs) and other AI applications efficiently </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Running complex HPC simulations with rapid data access</span></p>
</li>
</ul>
<h3><strong style="vertical-align: baseline;">A3 Ultra VMs with NVIDIA H200 Tensor Core GPUs</strong></h3>
<p><span style="vertical-align: baseline;">For GPU-based HPC workloads, we recently announced </span><a href="https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview?e=0"><span style="text-decoration: underline; vertical-align: baseline;">A3 Ultra VMs</span></a><span style="vertical-align: baseline;">, which feature NVIDIA H200 Tensor Core GPUs. A3 Ultra VMs offer a significant leap in performance over previous generations. They are built on servers with our new </span><a href="https://cloud.google.com/titanium"><span style="text-decoration: underline; vertical-align: baseline;">Titanium ML network adapter</span></a><span style="vertical-align: baseline;">, optimized to deliver a secure, high-performance cloud experience for AI workloads, and powered by NVIDIA ConnectX-7 networking. Combined with our datacenter-wide 4-way rail-aligned network, A3 Ultra VMs deliver non-blocking 3.2 Tbps of GPU-to-GPU traffic with RDMA over Converged Ethernet (RoCE). </span></p>
<p><span style="vertical-align: baseline;">Compared with A3 Mega, A3 Ultra offers: </span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">2x the GPU-to-GPU networking bandwidth, powered by Google Cloud’s Titanium ML network adapter and backed by our Jupiter data center network</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Up to 2x higher LLM inferencing performance with nearly double the memory capacity and 1.4x more memory bandwidth</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Ability to scale to tens of thousands of GPUs in a dense, performance-optimized cluster for large AI and HPC workloads</span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">With </span><a href="https://cloud.google.com/cluster-toolkit/docs/deploy/a3-mega-cluster-overview"><span style="text-decoration: underline; vertical-align: baseline;">system blueprints</span></a><span style="vertical-align: baseline;">, available through </span><a href="https://cloud.google.com/cluster-toolkit/docs/overview"><span style="text-decoration: underline; vertical-align: baseline;">Cluster Toolkit</span></a><span style="vertical-align: baseline;">, customers can quickly and easily create turnkey, pre-configured HPC clusters with Slurm support on A3 VMs.</span></p>
<p><span style="vertical-align: baseline;">A3 Ultra VMs will also be available through </span><a href="https://cloud.google.com/kubernetes-engine?e=0"><span style="text-decoration: underline; vertical-align: baseline;">Google Kubernetes Engine</span></a><span style="vertical-align: baseline;"> (GKE), which provides an open, portable, extensible, and highly-scalable platform for large-scale training and serving of AI workloads.</span></p>
<h3><strong style="vertical-align: baseline;">Trillium: Ushering in a new era of TPU performance for AI</strong></h3>
<p><span style="vertical-align: baseline;">Tensor Processing Units, or TPUs, power our most advanced AI models such as </span><a href="https://cloud.google.com/products/gemini?e=0"><span style="text-decoration: underline; vertical-align: baseline;">Gemini</span></a><span style="vertical-align: baseline;">, popular Google services like Search, Photos, and Maps, as well as scientific breakthroughs like AlphaFold 2 — which </span><a href="https://www.nature.com/articles/d41586-024-03214-7" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">led to a Nobel Prize this year</span></a><span style="vertical-align: baseline;">!</span></p>
<p><span style="vertical-align: baseline;">We recently announced that </span><a href="https://cloud.google.com/blog/products/compute/trillium-sixth-generation-tpu-is-in-preview?e=0"><span style="text-decoration: underline; vertical-align: baseline;">Trillium, our sixth-generation TPU</span></a><span style="vertical-align: baseline;">, is available to Google Cloud customers in preview. </span></p>
<p><span style="vertical-align: baseline;">Compared with TPU v5e, Trillium delivers: </span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Over 4x improvement in training performance </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Up to 3x increase in inference throughput </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">67% increase in energy efficiency</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">4.7x increase in peak compute performance per chip </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Double the high bandwidth memory capacity </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Double the interchip interconnect bandwidth </span></p>
</li>
</ul>
<h3><strong style="vertical-align: baseline;">Cluster Toolkit: Streamlining HPC deployments</strong></h3>
<p><span style="vertical-align: baseline;">We continue to improve </span><a href="https://cloud.google.com/cluster-toolkit/docs/overview"><span style="text-decoration: underline; vertical-align: baseline;">Cluster Toolkit</span></a><span style="vertical-align: baseline;">, providing open-source tools for deploying and managing HPC environments on Google Cloud. Recent updates include:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://github.com/GoogleCloudPlatform/cluster-toolkit/tree/main/examples#major-changes-in-from-slurm-gcp-v5-to-v6" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Slurm-gcp V6</span></a><span style="vertical-align: baseline;"> is now generally available, providing faster deployments and robust reconfiguration among other benefits.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/support?e=48754805&hl=en"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud Customer Care</span></a><span style="vertical-align: baseline;"> is now available for Toolkit. You can find more information </span><a href="https://cloud.google.com/cluster-toolkit/docs/getting-support"><span style="text-decoration: underline; vertical-align: baseline;">here</span></a><span style="vertical-align: baseline;"> on how to get support via the Cloud Customer Care console.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/blog/topics/hpc/ga-rocky-linux-8-and-centos-7-versions-of-hpc-vm-image?e=0"><span style="text-decoration: underline; vertical-align: baseline;">HPC VM Image</span></a><span style="vertical-align: baseline;"> Rocky Linux 8 is now generally available, making it easy to build an HPC-ready VM instance, incorporating our </span><a href="https://cloud.google.com/solutions/hpc?hl=en&e=0#section-7"><span style="text-decoration: underline; vertical-align: baseline;">best practices running HPC on Google Cloud</span></a><span style="vertical-align: baseline;">. </span></p>
</li>
</ul>
<h3><strong style="vertical-align: baseline;">GKE: Container orchestration with scale and performance</strong></h3>
<p><span style="vertical-align: baseline;">GKE continues to lead the way for containerized workloads with </span><a href="https://cloud.google.com/blog/products/containers-kubernetes/gke-65k-nodes-and-counting?e=4875480"><span style="text-decoration: underline; vertical-align: baseline;">the support of the largest Kubernetes clusters in the industry</span></a><span style="vertical-align: baseline;">. With support for up to 65,000 nodes, we believe GKE offers more than 10X larger scale than the other two largest public cloud providers.</span></p>
<p><span style="vertical-align: baseline;">At the same time, we continue to invest in automating and simplifying the building of HPC and AI platforms, with:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/data-container-image-preloading"><span style="text-decoration: underline; vertical-align: baseline;">Secondary boot disk</span></a><span style="vertical-align: baseline;">, which provides faster workload startups through container image caching </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/kubernetes-engine/docs/how-to/dcgm-metrics"><span style="text-decoration: underline; vertical-align: baseline;">Fully-managed DCGM metrics</span></a><span style="vertical-align: baseline;"> for improved accelerator monitoring </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://cloud.google.com/kubernetes-engine/docs/concepts/about-custom-compute-classes"><span style="text-decoration: underline; vertical-align: baseline;">Custom compute classes</span></a><span style="vertical-align: baseline;">, offering greater control over compute resource allocation and scaling</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Extensive innovations in </span><a href="http://kueue.sh/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Kueue.sh</span></a><span style="vertical-align: baseline;">, which is becoming the de facto standard for job queueing on Kubernetes with topology-aware scheduling, priority and fairness in queueing, multi-cluster support (</span><a href="https://www.youtube.com/watch?v=xMmskWIlktA&list=PLj6h78yzYM2Pw4mRw4S-1p_xLARMqPkA7&index=4" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">see demo by Google and CERN engineers</span></a><span style="vertical-align: baseline;">), and more</span></p>
</li>
</ul>
<h3><strong style="vertical-align: baseline;">Customer success stories: Atommap and beyond</strong></h3>
<p><a href="https://cloud.google.com/blog/topics/hpc/atommap-builds-elastic-supercomputer-on-google-cloud?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">Atommap</span></a><span style="vertical-align: baseline;">, a company specializing in atomic-scale materials design, is using Google Cloud HPC to accelerate its research and development efforts. With H3 VMs and Parallelstore, Atommap has achieved: </span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Significant speedup in simulations: Reduced time-to-results by more than half, enabling faster innovation </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Improved scalability: Easily scaled resources for 1,000s to 10,000s of molecular simulations, to meet growing computational demands </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Better cost-effectiveness: Optimized infrastructure costs, with savings of up to 80%, while achieving high performance </span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">Atommap's success story highlights the transformative potential of Google Cloud HPC for organizations pushing the boundaries of scientific discovery and technological advancement.</span></p>
<h3><strong style="vertical-align: baseline;">Looking ahead</strong></h3>
<p><span style="vertical-align: baseline;">Google Cloud is committed to continuous innovation for HPC. Expect further enhancements to HPC VMs, Parallelstore, Cluster Toolkit, Slurm-gcp, and other HPC products and solutions. With a focus on performance, scalability, compatibility, and ease of use, we’re empowering researchers and engineers to tackle the world's most complex computational challenges.</span></p>
<h2><strong style="vertical-align: baseline;">Google Cloud Advanced Computing Community</strong></h2>
<p><span style="vertical-align: baseline;">We’re excited to announce the launch of the </span><strong style="vertical-align: baseline;">Google Cloud Advanced Computing Community</strong><span style="vertical-align: baseline;">, a new kind of community of practice for sharing and growing HPC, AI, and quantum computing expertise, innovation, and impact.</span></p>
<p><span style="vertical-align: baseline;">This community of practice will bring together thought leaders and experts from Google, its partners, and HPC, AI, and quantum computing organizations around the world for engaging presentations and panels on innovative technologies and their applications. The Community will also leverage Google’s powerful, comprehensive, and cloud-native tools to create an interactive, dynamic, and engaging forum for discussion and collaboration.</span></p>
<p><span style="vertical-align: baseline;">The Community launches now, with meetings starting in December 2024 and a full rollout of learning and collaboration resources in early 2025. To learn more, register </span><a href="https://rsvp.withgoogle.com/events/google-cloud-advanced-computing-community" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">here</span></a><span style="vertical-align: baseline;">. </span></p>
<h2><strong style="vertical-align: baseline;">Google Cloud at Supercomputing 2024</strong></h2>
<p><span style="vertical-align: baseline;">The annual </span><a href="https://supercomputing.org/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Supercomputing Conference</span></a><span style="vertical-align: baseline;"> series brings together the global HPC community to showcase the latest advancements in HPC, networking, storage and data analysis. Google Cloud is excited to return to </span><a href="https://sc24.supercomputing.org/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Supercomputing 2024</span></a><span style="vertical-align: baseline;"> in Atlanta with our largest presence ever. </span></p>
<p><span style="vertical-align: baseline;">Visit Google Cloud at </span><strong style="vertical-align: baseline;">booth #1730</strong><span style="vertical-align: baseline;"> to jump in and learn about our HPC, AI infrastructure, and quantum solutions. The booth will feature a Trillium TPU board, NVIDIA H200 GPU and ConnectX-7 NIC, hands-on labs, a full schedule of talks, a comfortable lounge space, and plenty of great swag!</span></p>
<p><span style="vertical-align: baseline;">The booth theater will include talks from ARM, Altair, Ansys, Intel, NAG, SchedMD, Siemens, Sycomp, Weka, and more. Booth labs will get you deploying Slurm clusters to fine-tune the Llama2 model or run GROMACS using Cloud Batch to run microbenchmarks or quantum simulations, and more.</span></p>
<p><span style="vertical-align: baseline;">We’re also involved in several parts of SC24's technical program, including BoFs, User Groups, and Workshops. Googlers will participate in the following technical sessions: </span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://sc24.conference-program.com/presentation/?id=bof236&sess=sess586" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Converged HPC and Cloud Computing in the Era of Generative AI</span></a><span style="vertical-align: baseline;"> (</span><a href="https://sc24.conference-program.com/presenter/?uid=169204" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Bill Magro</span></a><span style="vertical-align: baseline;"> speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://sc24.conference-program.com/presentation/?id=bof239&sess=sess667" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">HPC & Cloud Convergence: drivers, triggers, and constraints</span></a><span style="vertical-align: baseline;"> (</span><a href="https://sc24.conference-program.com/presenter/?uid=222953" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Felix Schürmann </span></a><span style="vertical-align: baseline;">speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://daos.io/dug24" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">DAOS User Group (DUG) ‘24</span></a><span style="vertical-align: baseline;"> (</span><a href="https://sc24.conference-program.com/presenter/?uid=648153" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Dean Hildebrand</span></a><span style="vertical-align: baseline;"> speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://sc24.conference-program.com/presentation/?id=bof199&sess=sess639" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">DAOS BoF</span></a><span style="vertical-align: baseline;"> (Dean Hildebrand speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://www.pdsw.org/index.shtml" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">9th International Parallel Data Systems Workshop (PDSW)</span></a><span style="vertical-align: baseline;"> (Dean Hildebrand speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://sc24.conference-program.com/presentation/?id=bof108&sess=sess606" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">IO500: The High-Performance Storage Community BoF</span></a><span style="vertical-align: baseline;"> (Dean Hildebrand speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://sc24.conference-program.com/presentation/?id=tut143&sess=sess417" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">High-Performance Object Storage: I/O for the Exascale Era Tutorial</span></a><span style="vertical-align: baseline;"> (Dean Hildebrand speaking)</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://womeninhpc.org/events/sc-2024-workshop" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Women in HPC Workshop</span></a></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">Google is also hosting or sponsoring the following exciting events during SC24. We’re looking forward to seeing you there!</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://d126qb04.na1.hubspotlinks.com/Ctc/2M+113/d126qB04/VVt2492XYN3JW1_zCzQ21wWdCW6tdK5Q5mCdXBN6D7jYv3qn9gW7Y8-PT6lZ3pXW1dLqXH8DHPZwW7MKvrq761rrQW2L76ML8K8xFDN8rtGLzR1rPDW2W_Vhd7WLvTMW1r77qY4xVGbdW7gb9d72rp-S7W4PjwX73Zbp5lW7qQb138JVdmjN4dzXC8KGkkwVqn3091JTxz4W1kPDm26rfKJjW1ps5d06tgM2VW49hWyz5G-vYpW6zFBT51tkwgbW6Y2x_33PdjMJW4Hn3xM672S4rW7cQz4S2CFDqRN6FRq-1lKCcqW2kjp7m8CZTq-W4x6nVm4yP08KW8_F1z518GbkjW29VsDr8CBfDbW246K4578Lm_dW4Q_kln19yjxBW7hS4bP5Z92wjf5XTdKd04" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Sycomp Reception</span></a><span style="text-decoration: underline; vertical-align: baseline;"> </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://beowulfbash.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Beowulf Bash</span></a></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://hyperionresearch.com/register-breakfast-briefing/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Hyperion Research - Breakfast Briefing</span></a></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://womeninhpc.org/events/sc-2024-networking-reception" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Women in HPC Reception</span></a></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><a href="https://carahevents.carahsoft.com/Event/Register/544427-google" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Carahsoft Reception</span></a></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">Finally, we’ll be holding private meetings and roadmap briefings with our HPC leadership throughout the conference. To schedule a meeting, please contact </span><a href="mailto:[email protected]"><span style="text-decoration: underline; vertical-align: baseline;">[email protected]</span></a><span style="vertical-align: baseline;">.</span></p></div>Fri, 15 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/topics/hpc/whats-new-with-hpc/AI & Machine LearningComputeHPCarticleWhat’s new with HPC and AI infrastructure at Google CloudGooglehttps://cloud.google.com/blog/topics/hpc/whats-new-with-hpc/Annie Ma-WeaverGroup Product Manager, HPC, Google CloudWyatt GormanSolutions Manager, HPC & AI Infrastructure, Google CloudDataproc Serverless: Now faster, easier and smarterhttps://cloud.google.com/blog/products/data-analytics/dataproc-serverless-performance-and-usability-updates/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">We are thrilled to announce new capabilities that make running Dataproc Serverless even faster, easier, and more intelligent.</span></p>
<p><span style="vertical-align: baseline;">Elevate your Spark experience with:</span></p>
<ul>
<li role="presentation"><strong style="vertical-align: baseline;">Native query execution: </strong><span style="vertical-align: baseline;">Experience significant performance gains with the new </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/native-query-execution"><span style="text-decoration: underline; vertical-align: baseline;">Native query execution</span></a><span style="vertical-align: baseline;"> in the Premium tier.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Seamless monitoring with Spark UI: </strong><span style="vertical-align: baseline;">Track job progress in real time with a built-in </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#spark_ui"><span style="text-decoration: underline; vertical-align: baseline;">Spark UI</span></a><span style="vertical-align: baseline;"> available by default for all Spark batches and sessions.</span></li>
<li role="presentation"><strong style="vertical-align: baseline;">Streamlined investigation</strong><span style="vertical-align: baseline;">: Troubleshoot batch jobs from a central </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#gemini-assisted_troubleshooting_for_dataproc"><span style="text-decoration: underline; vertical-align: baseline;">"Investigate" tab </span></a><span style="vertical-align: baseline;">displaying all the essential metrics highlights and logs filtered by errors automatically.</span></li>
<li><strong style="vertical-align: baseline;">Proactive autotuning and assisted troubleshooting with Gemini: </strong><span style="vertical-align: baseline;">Let </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#gemini-assisted_troubleshooting_for_dataproc"><span style="text-decoration: underline; vertical-align: baseline;">Gemini</span></a><span style="vertical-align: baseline;"> minimize failures and autotune performance based on historical patterns. Quickly resolve issues using Gemini-powered insights and recommendations.</span></li>
</ul></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud data analytics'), ('body', <wagtail.rich_text.RichText object at 0x3e712c7355b0>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/bigquery/'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Accelerate your Spark jobs with native query execution</strong></h3>
<p><span style="vertical-align: baseline;">You can unlock considerable speed improvements for your Spark batch jobs in the </span><a href="https://cloud.google.com/dataproc-serverless/pricing"><span style="text-decoration: underline; vertical-align: baseline;">Premium tier</span></a><span style="vertical-align: baseline;"> on Dataproc Serverless Runtimes 2.2.26+ or 1.2.26+ by enabling </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/native-query-execution"><span style="text-decoration: underline; vertical-align: baseline;">native query execution</span></a><span style="vertical-align: baseline;"> — no application changes required.</span></p>
<p><strong style="vertical-align: baseline;">This new feature in Dataproc Serverless Premium tier improved the query performance by ~47%in our tests on queries derived from TPC-DS and TPC-H benchmarks.</strong></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_0utxRaq.max-1000x1000.png"
alt="1">
</a>
<figcaption class="article-image__caption "><p data-block-key="2ixne">Note: Performance results are based on 1TB GCS Parquet data and queries derived from the <a href="https://www.tpc.org/tpcds/">TPC-DS standard</a> and <a href="https://www.tpc.org/tpch/">TPC-H standard</a>. These runs as such aren’t comparable to published <a href="https://www.tpc.org/tpcds/">TPC-DS standard</a> and <a href="https://www.tpc.org/tpch/">TPC-H standard</a> results, as these runs don’t comply with all requirements of the <a href="https://www.tpc.org/tpcds/">TPC-DS standard</a> and and <a href="https://www.tpc.org/tpch/">TPC-H standard</a> specification.</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Start now by running the </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/native-query-execution#native_query_execution_qualification_tool"><span style="text-decoration: underline; vertical-align: baseline;">native query execution qualification tool</span></a><span style="vertical-align: baseline;"> that can help you easily identify eligible jobs and estimate potential performance gains. Once you have the list of batch jobs identified for native query execution, you can enable it and have the jobs run faster and potentially save costs.</span></p>
<h3><strong style="vertical-align: baseline;">Seamless monitoring with Spark UI</strong></h3>
<p><span style="vertical-align: baseline;">Tired of wrestling with setting up the persistent history server (PHS) clusters and maintaining them just to debug your Spark batches? Wouldn't it be easier if you could avoid the ongoing costs of the history server and yet see the Spark UI in real-time?</span></p>
<p><span style="vertical-align: baseline;">Until now, monitoring and troubleshooting Spark jobs in Dataproc Serverless required setting up and managing a separate </span><a href="https://cloud.google.com/dataproc/docs/concepts/jobs/history-server#use_phs_with_spark_batch_workloads"><span style="text-decoration: underline; vertical-align: baseline;">Spark persistent history server</span></a><span style="vertical-align: baseline;">. Crucially, each batch job had to be configured to use the history server. Otherwise, the open-source UI would be unavailable for analysis for the batch job. Additionally, the open-source UI suffered from slow navigation between applications.</span></p>
<p><span style="vertical-align: baseline;">We’ve heard you, loud and clear. </span><strong style="vertical-align: baseline;">We’re excited to announce a fully managed </strong><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#spark_ui"><strong style="text-decoration: underline; vertical-align: baseline;">Spark UI</strong></a><strong style="vertical-align: baseline;"> in Dataproc Serverless that makes monitoring and troubleshooting a breeze.</strong></p>
<p><span style="vertical-align: baseline;">The new </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#spark_ui"><span style="text-decoration: underline; vertical-align: baseline;">Spark UI</span></a><span style="vertical-align: baseline;"> is built-in and automatically available for every batch job and session in both Standard and Premium tiers of Dataproc Serverless at no additional cost. Simply submit your job and start analyzing performance in real time with the Spark UI right away.</span></p>
<p><span style="vertical-align: baseline;">Here's why you'll love the Serverless Spark UI:<br/><br/></span></p>
<div align="left">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"><table><colgroup><col/><col/><col/></colgroup>
<tbody>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;"> </td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Traditional Approach</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">The new Dataproc Serverless Spark UI</span></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Effort</strong></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><a href="https://cloud.google.com/dataproc/docs/concepts/jobs/history-server#create_a_phs_cluster"><span style="text-decoration: underline; vertical-align: baseline;">Create and manage</span></a><span style="vertical-align: baseline;"> a Spark history server cluster. Configure each batch job to use the cluster.</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">No cluster setup or management required. Spark UI is available by default for all your batches without any extra configuration.The UI can be accessed directly from the Batch / Session details page in the Google Cloud console.</strong></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Latency</strong></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">UI performance can degrade with increased load. Requires active resource management.</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Enjoy a responsive UI that automatically scales to handle even the most demanding workloads.</strong></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Availability</strong></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">The UI is only available as long as the history server cluster is running.</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Access your Spark UI for 90 days after your batch job is submitted.</strong></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Data freshness</strong></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Wait for a stage to complete to see that its events are in the UI.</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">View regularly updated data without waiting for the stage to complete.</strong></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Functionality</strong></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Basic UI based on open-source Spark.</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Enhanced UI with ongoing improvements based on user feedback.</strong></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">Cost</strong></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Ongoing cost for the PHS cluster.</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><strong style="vertical-align: baseline;">No additional charge.</strong></p>
</td>
</tr>
</tbody>
</table></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<h3><strong style="vertical-align: baseline;">Accessing the Spark UI</strong></h3>
<p><span style="vertical-align: baseline;">To gain deeper insights into your Spark batches and sessions — whether they’re still running or completed — simply navigate to the Batch Details or Session Details page in the Google Cloud console. You'll find a "</span><strong style="vertical-align: baseline;">VIEW SPARK UI</strong><span style="vertical-align: baseline;">" link in the top right corner.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_knlkcby.max-1000x1000.jpg"
alt="2">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">The new </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#spark_ui"><span style="text-decoration: underline; vertical-align: baseline;">Spark UI</span></a><span style="vertical-align: baseline;"> provides the same powerful features as the open-source Spark History Server, giving you deep insights into your Spark job performance. Easily browse both running and completed applications, explore jobs, stages, and tasks, and analyze SQL queries for a comprehensive understanding of the execution of your application. Quickly identify bottlenecks and troubleshoot issues with detailed execution information. For even deeper analysis, the 'Executors' tab provides direct links to the relevant logs in Cloud Logging, allowing you to quickly investigate issues related to specific executors.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/3-Spark-UI-GIF-video.gif"
alt="3">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">You can still use the "VIEW SPARK HISTORY SERVER" link to </span><span style="vertical-align: baseline;">view the Persistent Spark History Server if you had already configured one</span><span style="vertical-align: baseline;">.</span></p>
<p><strong style="vertical-align: baseline;">Explore this feature now.</strong><span style="vertical-align: baseline;"> Click </span><strong style="vertical-align: baseline;">"VIEW SPARK UI" </strong><span style="vertical-align: baseline;">on the top right corner of the Batch details page of any of your </span><a href="https://cloud.google.com/console/dataproc/batches"><span style="text-decoration: underline; vertical-align: baseline;">recent Spark batch jobs</span></a><span style="vertical-align: baseline;"> to get started. Learn more in the </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#spark_ui"><span style="text-decoration: underline; vertical-align: baseline;">Dataproc Serverless user guide</span></a><span style="vertical-align: baseline;">.</span></p>
<h3><strong style="vertical-align: baseline;">Streamlined investigation (Preview)</strong></h3>
<p><span style="vertical-align: baseline;">A new "</span><strong style="vertical-align: baseline;">Investigate</strong><span style="vertical-align: baseline;">" tab in the Batch details screen gives you instant diagnostic highlights collected at a single place.</span></p>
<p><span style="vertical-align: baseline;">In the “</span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#batch_metric_highlights"><span style="text-decoration: underline; vertical-align: baseline;">Metrics highlights</span></a><span style="vertical-align: baseline;">” section, the essential metrics are automatically displayed, giving you a clear picture of your batch job's health. You can further create a custom dashboard if you need more metrics.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/4_zGtB1Fg.max-1000x1000.png"
alt="4">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Below the metrics highlights, a widget “</span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#job_logs"><span style="text-decoration: underline; vertical-align: baseline;">Job Logs</span></a><span style="vertical-align: baseline;">” shows the logs filtered by errors, so you can instantly spot and address problems. If you would like to dig further into the logs, you can go to the Logs Explorer.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/5_mls8Usg.max-1000x1000.png"
alt="5">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Proactive autotuning and assisted troubleshooting with Gemini (Preview)</strong></h3>
<p><span style="vertical-align: baseline;">Last but not least, </span><a href="https://cloud.google.com/gemini/docs/bigquery/overview"><span style="text-decoration: underline; vertical-align: baseline;">Gemini in BigQuery</span></a><span style="vertical-align: baseline;"> can help reduce the complexity of optimizing hundreds of Spark properties in your batch job configurations while submitting the job. If the job fails or runs slow, Gemini can save the effort of wading through several GBs of logs to troubleshoot the job. </span></p>
<p><strong style="vertical-align: baseline;">Optimize performance: </strong><span style="vertical-align: baseline;">Gemini can automatically </span><a href="https://cloud.google.com/dataproc-serverless/docs/concepts/autotuning"><span style="text-decoration: underline; vertical-align: baseline;">fine-tune the Spark configurations</span></a><span style="vertical-align: baseline;"> of your Dataproc Serverless batch jobs for optimal performance and reliability.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/6_RlKPyLK.max-1000x1000.png"
alt="6">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><strong style="vertical-align: baseline;">Simplify troubleshooting: </strong><span style="vertical-align: baseline;">You can quickly diagnose and resolve issues with slow or failed jobs by clicking </span><a href="https://cloud.google.com/dataproc-serverless/docs/guides/monitor-troubleshoot-batches#advanced_troubleshooting_preview"><span style="text-decoration: underline; vertical-align: baseline;">"Ask Gemini"</span></a><span style="vertical-align: baseline;"> for AI-powered analysis and guidance.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/7_0FdVhfE.max-1000x1000.png"
alt="7">
</a>
</figure>
</div>
</div>
</div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/8_dCcqmIi.max-1000x1000.png"
alt="8">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><a href="http://goo.gle/gemini-in-bq-preview" rel="noopener" target="_blank"><strong style="text-decoration: underline; vertical-align: baseline;">Sign up here</strong></a><strong style="vertical-align: baseline;"> for a free preview of the Gemini features and “Investigate” tab for Dataproc Serverless</strong><a href="http://goo.gle/gemini-in-bq-preview" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">.</span></a></p></div>Fri, 15 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/data-analytics/dataproc-serverless-performance-and-usability-updates/StreamingData AnalyticsarticleDataproc Serverless: Now faster, easier and smarterGooglehttps://cloud.google.com/blog/products/data-analytics/dataproc-serverless-performance-and-usability-updates/Pardha Saradhi UppalaProduct ManagerVirat BansalSoftware EngineerShift-left your cloud compliance auditing with Audit Managerhttps://cloud.google.com/blog/products/identity-security/shift-left-your-cloud-compliance-auditing-with-audit-manager/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Cloud compliance can present significant regulatory and technical challenges for organizations. These complexities often include delineating compliance responsibilities and accountabilities between the customer and cloud provider. </span></p>
<p><span style="vertical-align: baseline;">At Google Cloud, we understand these challenges faced by our customers’ cloud engineering, compliance, and audit teams, and want to help make them easier to manage. That's why we’re pleased to announce that our </span><a href="https://cloud.google.com/products/audit-manager"><span style="text-decoration: underline; vertical-align: baseline;">Audit Manager</span></a><span style="vertical-align: baseline;"> service, which can digitize and help streamline the compliance auditing process, is now generally available.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/1_-_Understanding_compliance_across_layers.max-1000x1000.jpg"
alt="1 - Understanding compliance across layers in Google Cloud">
</a>
<figcaption class="article-image__caption "><p data-block-key="luerq">Understanding compliance across layers in Google Cloud.</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud security products'), ('body', <wagtail.rich_text.RichText object at 0x3e712caaf520>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/welcome'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Traditional compliance methodologies, reliant on manual processes for evidence collection, are inefficient, prone to errors, and resource-intensive. According to the </span><a href="https://www.gartner.com/en/audit-risk/trends/audit-survey-audit-top-priorities" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Gartner® Audit Survey</span></a><span style="vertical-align: baseline;">, “When surveyed on their key priorities for 2024, 75% of chief audit executives (CAEs) cited audit’s ability to keep up with the fast-evolving cybersecurity landscape as their top priority — making it the most commonly cited priority.”</span></p>
<h3><strong style="vertical-align: baseline;">Introducing </strong><strong style="vertical-align: baseline;">Audit Manager </strong></h3>
<p><span style="vertical-align: baseline;">Audit Manager can help organizations accelerate compliance efforts by providing: </span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Clear shared responsibility outlines</strong><span style="vertical-align: baseline;">: A matrix of </span><a href="https://cloud.google.com/architecture/framework/security/shared-responsibility-shared-fate"><span style="text-decoration: underline; vertical-align: baseline;">shared responsibilities</span></a><span style="vertical-align: baseline;"> that delineates compliance duties between the cloud provider and customers, offering actionable recommendations tailored to your workloads. </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Automated compliance assessments</strong><span style="vertical-align: baseline;">: Evaluation of your workloads against industry-standard technical control requirements in a simple and automated manner. Audit manager already supports popular industry and regulatory frameworks including NIST 800-53, ISO, SOC, and CSA-CCM. You can see the full list of supported frameworks</span><a href="https://cloud.google.com/audit-manager/docs/overview#supported_compliance"><span style="vertical-align: baseline;"> </span><span style="text-decoration: underline; vertical-align: baseline;">here</span></a><span style="vertical-align: baseline;">. </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Audit-ready evidence</strong><span style="vertical-align: baseline;">:</span><strong style="vertical-align: baseline;"> </strong><span style="vertical-align: baseline;">Automated generation of comprehensive verifiable evidence reports to support your compliance claims and overarching governance activity. Audit Manager provides you with a quick execution summary of compliance at a framework level and the ability to deep-dive using control level reports. </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Actionable remediation guidance</strong><span style="vertical-align: baseline;">: Insights to swiftly address each compliance gap that is identified. </span></p>
</li>
</ul>
<h3><strong style="vertical-align: baseline;">The compliance audit journey with Audit Manager</strong></h3>
<p><span style="vertical-align: baseline;">The cloud compliance audit process involves defining responsibilities, identifying and mitigating risks, collecting supporting data, and generating a final report. This process requires collaboration between Governance, Risk, and Compliance analysts, compliance managers, developers, and auditors, each with their own specific tasks. Audit Manager streamlines this process for all involved roles, which can help simplify their work and improve efficiency.</span></p></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/images/2_-_Shift_left_your_compliance_audit_proce.max-1000x1000.jpg"
alt="2 - Shift left your compliance audit process with Audit Manager">
</a>
<figcaption class="article-image__caption "><p data-block-key="luerq">Shift left your compliance audit process with Audit Manager.</p></figcaption>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">Customer case study: Deutsche Börse Group</strong><strong style="vertical-align: baseline;"> </strong></h3>
<p><span style="vertical-align: baseline;">Deutsche Börse Group, an international stock exchange organization and innovative market infrastructure provider, began their strategic partnership with Google Cloud in 2022. Their cloud transformation journey is well under way, which brings with it the challenge of achieving and documenting compliance in their environment. </span><span style="vertical-align: baseline;"> </span></p>
<p><span style="vertical-align: baseline;">Florian Rodeit, head of cloud governance for Google Cloud, Deutsche Börse Group, first heard about Audit Manager during a Las Vegas Google Cloud Next 2024 session. </span></p>
<p><span style="vertical-align: baseline;">“The Audit Manager product promises a level of automation and audit control that has a lot of potential. At Deutsche Börse Group, we were excited to access the preview, explore the functionality further and build out a joint solution,” he said. </span></p>
<p><span style="vertical-align: baseline;">Following the European preview launch of Audit Manager, Deutsche Börse Group and Google Cloud set up a collaborative project to explore automating cloud controls via Audit Manager. Deutsche Börse Group had already created a comprehensive control catalog</span><span style="font-style: italic; vertical-align: baseline;"> </span><span style="vertical-align: baseline;">to manage their cloud control requirements across the organization. They analyzed the Cloud Security Alliance’s </span><a href="https://cloudsecurityalliance.org/research/cloud-controls-matrix" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Cloud Controls Matrix</span></a><span style="vertical-align: baseline;"> against their written rules framework to create inputs for Audit Manager, and set out ownership and implementation guidelines for cloud-specific controls. </span></p>
<p><span style="vertical-align: baseline;">Now, Deutsche Börse Group</span><span style="font-style: italic; vertical-align: baseline;"> </span><span style="vertical-align: baseline;">can use Audit Manager to check if there are resources configured that deviate from the control framework, such as any resources that have been set up outside of approved regions. This provides automated, auditable evidence to support their specific requirements for compliant usage of Google Cloud resources. </span></p>
<p><span style="vertical-align: baseline;">Benjamin Möller, expert cloud governance, vice-president, Deutsche Börse Group, has been leading the collaborative project. “Moving forward, we hope that Audit Manager will allow us to automate many of our technical controls — giving us robust assurance that we are compliant, enabling us to quickly identify and rectify non-compliance, and minimizing the manual over-head of audit evidence. We are excited to continue making progress on our joint venture,” he said. </span></p>
<h3><strong style="vertical-align: baseline;">Take the next step </strong><span style="vertical-align: baseline;"> </span></h3>
<p><span style="vertical-align: baseline;">To use Audit Manager, access the tool directly from your </span><a href="https://console.cloud.google.com/compliance/auditmanager"><span style="text-decoration: underline; vertical-align: baseline;">Google Cloud console</span></a><span style="vertical-align: baseline;">. Navigate to the Compliance tab in your Google Cloud console, and select Audit Manager. For a comprehensive guide on using Audit Manager, please refer to our </span><a href="https://cloud.google.com/audit-manager/docs"><span style="text-decoration: underline; vertical-align: baseline;">detailed product documentation</span></a><span style="vertical-align: baseline;">. We encourage you to share your </span><a href="https://forms.gle/KivNGB9XjVW1HSnw9" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">feedback</span></a><span style="vertical-align: baseline;"> on this service to help us improve Audit Manager’s user experience.</span></p></div>Fri, 15 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/identity-security/shift-left-your-cloud-compliance-auditing-with-audit-manager/Security & IdentityarticleShift-left your cloud compliance auditing with Audit ManagerGooglehttps://cloud.google.com/blog/products/identity-security/shift-left-your-cloud-compliance-auditing-with-audit-manager/Pratik BhangaleProduct Manager, Google CloudElise BaileyCloud Program Manager, Google CloudAlloyDB Omni supercharges performance: Faster transactions, analytics, and vector searchhttps://cloud.google.com/blog/products/databases/whats-new-in-alloydb-omni-version-15-7-0/<div class="block-paragraph_advanced"><p><a href="https://cloud.google.com/alloydb/omni"><span style="text-decoration: underline; vertical-align: baseline;">AlloyDB Omni</span></a><span style="vertical-align: baseline;"> is back with a new release, version 15.7.0, and it's bringing serious enhancements to your PostgreSQL workflows, including:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Faster performance</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">A new ultra-fast disk cache</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">An enhanced columnar engine</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">The general availability of ScANN vector indexing </span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">A new release of the AlloyDB Omni Kubernetes operator</span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">From transactional and analytical workloads to cutting-edge vector search, this update delivers across the board – in your data center, at the edge, on your laptop, and in any cloud and with 100% PostgreSQL compatibility.</span></p>
<p><span style="vertical-align: baseline;">Let’s jump in.</span></p>
<h3><strong style="vertical-align: baseline;">Better performance</strong></h3>
<p><span style="vertical-align: baseline;">Many workloads already get a boost compared to standard PostgreSQL. In our performance tests, AlloyDB Omni is more than 2x faster than standard PostgreSQL for transactional workloads, with most of the tuning being done for you automatically, without special configurations. One of the key advantages is the memory agent that optimizes shared buffers while at the same time avoiding out-of-memory errors. In general, the more memory you configure for AlloyDB Omni, the better it performs, serving more queries from the shared buffers and reducing the need to make calls to disk, which can be magnitudes slower than memory, particularly when using durable network storage</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', '$300 in free credit to try Google Cloud databases'), ('body', <wagtail.rich_text.RichText object at 0x3e71402ad220>), ('btn_text', 'Start building for free'), ('href', 'http://console.cloud.google.com/freetrial?redirectPath=/products?#databases'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><h3><strong style="vertical-align: baseline;">An ultra-fast disk cache</strong></h3>
<p><span style="vertical-align: baseline;">This trade-off between memory and disk storage also just got more flexible, with the introduction of an ultra-fast disk cache. It allows you to configure a fast, local, and not necessarily durable storage device as an extension of Postgres’ buffer cache. Instead of aging data out of memory to make space for new data, AlloyDB Omni can keep a copy of not-quite-hot data in the disk cache, where it can be accessed faster than from persistent disk.</span></p>
<h3><strong style="vertical-align: baseline;">Enhanced columnar engine</strong></h3>
<p><span style="vertical-align: baseline;">AlloyDB Omni's analytics accelerator is changing the game for mixed workloads. Developers are finding it invaluable for gaining real-time analytical insights from their transactional data, all without the overhead of managing extra data pipelines or separate databases. You can instead enable the columnar engine, assign a portion of your memory to it, and let AlloyDB Omni decide which columns or tables to populate in the columnar engine to speed up queries. In our benchmarks, the columnar engine speeds up analytical queries up to 100x compared to standard PostgreSQL.</span></p>
<p><span style="vertical-align: baseline;">The practical size limit to the analytics accelerator was determined by the amount of memory you are able to assign to the columnar engine. What’s new is a feature that allows you to configure a fast local storage device for the columnar engine to spill to. This increases the volume of data that you can run analytical queries on.</span></p>
<h3><strong style="vertical-align: baseline;">SCaNN goes GA</strong></h3>
<p><span style="vertical-align: baseline;">Lastly, for vector database use-cases, AlloyDB Omni already offers great performance with pgvector using either the ivf or hnsw indexes. But while vector indexes are a great way to accelerate queries, they can be slow to build and rebuild. At Google Cloud Next 2024 </span><a href="https://cloud.google.com/blog/products/databases/scann-for-alloydb-index-is-ga?e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">we introduced ScaNN index</span></a><span style="vertical-align: baseline;"> as another available index type. AlloyDB AI's ScaNN index surpasses standard PostgreSQL's HNSW index by offering up to 4x faster vector queries. Beyond pure speed, ScaNN delivers significant advantages for real-world applications:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Rapid indexing: Accelerate development and eliminate bottlenecks in large-scale deployments with significantly faster index build times.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Optimized memory utilization: Reduce memory consumption by 3-4x compared to PostgreSQL's HNSW index. This allows larger workloads to run on smaller hardware and boosts performance for diverse, hybrid workloads.</span></p>
</li>
</ul>
<p><span style="vertical-align: baseline;">As of AlloyDB Omni version 15.7.0, AlloyDB AI ScANN indexing is generally available.</span></p>
<h3><strong style="vertical-align: baseline;">A new Kubernetes operator</strong></h3>
<p><span style="vertical-align: baseline;">In addition to the new version of AlloyDB Omni, we have also released version 1.2.0 of the AlloyDB Omni Kubernetes operator. This release adds support for more configuration options for health checks when high availability is enabled, support for configuring high availability to be enabled when a disaster recovery secondary cluster is promoted to primary, and support for log rotation to help manage storage space used by PostgreSQL log files. </span></p>
<p><span style="vertical-align: baseline;">Ready to explore? Dive into the </span><a href="https://cloud.google.com/alloydb/docs/release-notes#November_08_2024"><span style="text-decoration: underline; vertical-align: baseline;">full release notes</span></a><span style="vertical-align: baseline;"> and experience the new features. Try them now in our </span><a href="https://console.cloud.google.com/marketplace/product/click-to-deploy-images/alloydb-omni"><span style="text-decoration: underline; vertical-align: baseline;">marketplace image</span></a><span style="vertical-align: baseline;"> or grab the </span><a href="https://hub.docker.com/r/google/alloydbomni" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">latest container image from Docker Hub</span></a><span style="vertical-align: baseline;">! Also, learn why AlloyDB is the new way to PostgreSQL in this </span><a href="https://inthecloud.withgoogle.com/alloydb-ebook-lp-email/dl-cd.html" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">ebook</span></a><span style="vertical-align: baseline;">.</span></p></div>Fri, 15 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/products/databases/whats-new-in-alloydb-omni-version-15-7-0/DatabasesarticleAlloyDB Omni supercharges performance: Faster transactions, analytics, and vector searchGooglehttps://cloud.google.com/blog/products/databases/whats-new-in-alloydb-omni-version-15-7-0/Bjoern RostProduct ManagerRavi MurthyEngineering Director, AlloyDBHow cloud and AI are bringing scale to corporate climate mitigation and adaptationhttps://cloud.google.com/blog/topics/sustainability/at-cop29-thoughts-on-cloud-ai-and-climate-change/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Climate change is the biggest challenge our society faces. As scientists, governments, and industry leaders gather in Baku, Azerbaijan for the 2024 United Nations Climate Change Conference, a.k.a. </span><a href="https://cop29.az/en/home" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">COP29</span></a><span style="vertical-align: baseline;">, it’s incumbent upon all of us to find innovative solutions that can drive impact at a global scale. </span></p>
<p><span style="vertical-align: baseline;">The gravity of climate change requires solutions that go beyond incremental change. To find those solutions, we need the ability to make better decisions about how to approach climate mitigation and adaptation across every human activity — from transport, industry, and agriculture to communications, finance, and housing. This requires processing vast volumes of data generated by these industries. The combination of AI and cloud technologies offer the potential to unlock climate change solutions that can be both transformational and global in scale.</span></p>
<p><span style="vertical-align: baseline;">We already have a lot of examples that we can draw from.</span></p></div>
<div class="block-aside"><dl>
<dt>aside_block</dt>
<dd><ListValue: [StructValue([('title', 'Try Google Cloud for free'), ('body', <wagtail.rich_text.RichText object at 0x3e712c770be0>), ('btn_text', 'Get started for free'), ('href', 'https://console.cloud.google.com/freetrial?redirectPath=/welcome'), ('image', None)])]></dd>
</dl></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Today, for example, </span><a href="https://earthengine.google.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Google Earth Engine</span></a><span style="vertical-align: baseline;"> is being used by the </span><a href="https://www.forestdatapartnership.org/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Forest Data Partnership</span></a><span style="vertical-align: baseline;">, a collaboration for global monitoring of commodity-driven deforestation, to monitor every oil palm plantation around the globe</span><strong style="vertical-align: baseline;">, </strong><span style="vertical-align: baseline;">providing participating companies live early-warning signals for deforestation risks, and dramatically reducing the costs involved in forest monitoring. Similarly, </span><a href="https://ngis.com.au/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">NGIS</span></a><span style="vertical-align: baseline;"> is using Google Earth Engine to power TraceMark, helping businesses deliver traceability and transparency across global supply chains. </span></p>
<p><span style="vertical-align: baseline;">Another example is </span><a href="https://globalfishingwatch.org/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Global Fishing Watch</span></a><span style="vertical-align: baseline;">, an international nonprofit co-founded by Google that is </span><a href="https://cloud.google.com/customers/global-fishing-watch"><span style="text-decoration: underline; vertical-align: baseline;">using geospatial analytics and AI</span></a><span style="vertical-align: baseline;"> to understand how human activity impacts the seas, global industries, climate, biodiversity and more. The datasets map global ocean infrastructure and vessels that don’t publicly broadcast their positions. This helps to advance policy conversations about offshore renewables development, provides insight into carbon dioxide emissions from maritime vessels, and enables marine protection. </span></p>
<p><span style="vertical-align: baseline;">It’s clear that AI can process large volumes of data, optimize complex systems, and drive the development of new business models. We see businesses harnessing the technology in the fight against climate change in four ways:</span></p>
<h3 role="presentation"><strong style="vertical-align: baseline;">1. Measuring business performance </strong></h3>
<p><span style="vertical-align: baseline;">Businesses are using AI-powered insights to help monitor their advance towards sustainability targets, which ultimately contributes to building business resilience. </span></p>
<p><span style="vertical-align: baseline;">In today's business landscape, this is of paramount importance as companies face growing demands for transparency and accountability regarding their environmental and social impact. </span></p>
<p><span style="vertical-align: baseline;">We are seeing cloud and AI being used to de-risk investments, improve transparency, and increase profitability through the use of large-scale datasets, machine learning, and generative AI. These technologies allow companies to analyze their ESG performance, gain insights into climate risks, and monitor supplier behaviors. </span></p>
<p><span style="vertical-align: baseline;">For example, Palo Alto Networks partnered with Watershed, a Google Cloud Ready - Sustainability Partner, to measure and track their carbon emissions across their entire business using Google Cloud. This partnership enabled them to gain a comprehensive understanding of their environmental impact and set actionable targets for reducing emissions.</span></p></div>
<div class="block-video">
<div class="article-module article-video ">
<figure>
<a class="h-c-video h-c-video--marquee"
href="https://youtube.com/watch?v=Gdh15VB7yk8"
data-glue-modal-trigger="uni-modal-Gdh15VB7yk8-"
data-glue-modal-disabled-on-mobile="true">
<div class="article-video__aspect-image"
style="background-image: url(https://storage.googleapis.com/gweb-cloudblog-publish/images/maxresdefault_52p8G3p.max-1000x1000.jpg);">
<span class="h-u-visually-hidden">Securing a Sustainable Future with Google Cloud, Palo Alto Networks & Watershed</span>
</div>
<svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white">
<use xlink:href="#mi-youtube-icon"></use>
</svg>
</a>
</figure>
</div>
<div class="h-c-modal--video"
data-glue-modal="uni-modal-Gdh15VB7yk8-"
data-glue-modal-close-label="Close Dialog">
<a class="glue-yt-video"
data-glue-yt-video-autoplay="true"
data-glue-yt-video-height="99%"
data-glue-yt-video-vid="Gdh15VB7yk8"
data-glue-yt-video-width="100%"
href="https://youtube.com/watch?v=Gdh15VB7yk8"
ng-cloak>
</a>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Another example is HSBC, which developed a new credit ranking tool on Google Cloud that allows them to </span><a href="https://cloud.google.com/customers/hsbc-risk-advisory-tool"><span style="text-decoration: underline; vertical-align: baseline;">run multiple climate risk scenarios simultaneously</span></a><span style="vertical-align: baseline;">. This tool empowers HSBC to make more informed investment decisions while considering the potential impact of climate change on their portfolio.</span></p>
<p><span style="vertical-align: baseline;">Or Swiss Re, which is using Google Earth Engine and AI for flood modeling for </span><a href="https://cloud.google.com/customers/swiss"><span style="text-decoration: underline; vertical-align: baseline;">better risk calculation in insurance</span></a><span style="vertical-align: baseline;">.</span></p>
<h3 role="presentation"><strong style="vertical-align: baseline;">2. Optimizing operations and supply chains </strong></h3>
<p><span style="vertical-align: baseline;">Secondly, businesses are using AI to optimize their operations and supply chains for energy and resource efficiency, as well as to cut costs. </span></p>
<p><span style="vertical-align: baseline;">This is crucial for companies seeking to enhance their sustainability performance while simultaneously improving their bottom line. Through the use of AI and machine learning, cloud technologies empower organizations to optimize their existing operations, improve cost efficiency, and minimize waste.</span></p>
<p><span style="vertical-align: baseline;">For example, Geotab, another Google Cloud Ready - Sustainability partner, is managing 75 billion data records in BigQuery for 4 million commercial fleet vehicles every day to optimize vehicle routes, increase driver safety behaviors and accelerate the path to fleet electrification.</span></p></div>
<div class="block-video">
<div class="article-module article-video ">
<figure>
<a class="h-c-video h-c-video--marquee"
href="https://youtube.com/watch?v=zqE2QdXmT0o"
data-glue-modal-trigger="uni-modal-zqE2QdXmT0o-"
data-glue-modal-disabled-on-mobile="true">
<img src="//img.youtube.com/vi/zqE2QdXmT0o/maxresdefault.jpg"
alt="Geotab and Google Cloud - accelerate the road to sustainability"/>
<svg role="img" class="h-c-video__play h-c-icon h-c-icon--color-white">
<use xlink:href="#mi-youtube-icon"></use>
</svg>
</a>
</figure>
</div>
<div class="h-c-modal--video"
data-glue-modal="uni-modal-zqE2QdXmT0o-"
data-glue-modal-close-label="Close Dialog">
<a class="glue-yt-video"
data-glue-yt-video-autoplay="true"
data-glue-yt-video-height="99%"
data-glue-yt-video-vid="zqE2QdXmT0o"
data-glue-yt-video-width="100%"
href="https://youtube.com/watch?v=zqE2QdXmT0o"
ng-cloak>
</a>
</div>
</div>
<div class="block-paragraph_advanced"><h3 role="presentation"><strong style="vertical-align: baseline;">3. Identifying cleaner business models </strong></h3>
<p><span style="vertical-align: baseline;">As the world shifts towards more sustainable practices, businesses must adapt and identify new avenues for growth. Cloud and AI is helping businesses do just that. Cloud and AI allow organizations to reimagine their business models, explore new markets, and create innovative products and services that align with their sustainability goals.</span></p>
<p><a href="https://recykal.com/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Recykal</span></a><span style="vertical-align: baseline;">, for instance, has partnered with Google Cloud to build Asia's largest circular economy marketplace. By leveraging Google Cloud's AI and machine learning capabilities, Recykal is revolutionizing waste management and promoting sustainable practices in Asia.</span></p>
<p><span style="vertical-align: baseline;">Another example is </span><a href="https://einride.tech/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">Einride</span></a><span style="vertical-align: baseline;">, a company that is reimagining freight transport by using electric, self-driving vehicles and an AI-powered platform. Their innovative approach to logistics is disrupting the transportation industry and contributing to a more sustainable future.</span></p>
<p><span style="vertical-align: baseline;">More recently, Climate Engine and Robeco have started using AI and geospatial technologies with their scientific expertise and investment knowledge to inform how publicly traded companies’ actions impact biodiversity. You can read their joint thought leadership paper </span><a href="https://climateengine.com/story/climate-engine-and-robeco-launch-white-paper-on-biodiversity-finance/" rel="noopener" target="_blank"><span style="text-decoration: underline; vertical-align: baseline;">here</span></a><span style="vertical-align: baseline;">. </span></p>
<h3 role="presentation"><strong style="vertical-align: baseline;">4. Building more sustainably</strong></h3>
<p><span style="vertical-align: baseline;">Finally, and very importantly, businesses want to ensure that the actual use of cloud and AI technologies doesn’t lead to increased climate impacts. From the get-go, developers need to take concrete steps towards reducing the carbon footprint and cost of their applications in the cloud. </span></p>
<p><span style="vertical-align: baseline;">This is why, through our </span><a href="https://cloud.google.com/carbon-footprint?hl=en&e=48754805"><span style="text-decoration: underline; vertical-align: baseline;">Carbon Sense suite</span></a><span style="vertical-align: baseline;">, we provide developers with the tools and resources they need to build and deploy applications in a way that minimizes their environmental impact, all while maintaining cost efficiency. </span></p>
<p><span style="vertical-align: baseline;">L'Oréal, for example, leverages Google Cloud's Carbon Footprint tool to track the gross carbon emissions associated with their cloud usage. This allows L'Oréal to understand the environmental impact of their technology decisions and implement strategies to reduce their footprint.</span></p></div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Finally, Google takes its own carbon footprint very seriously, and is pursuing an ambitious goal to achieve net-zero emissions across all of its operations and value chain, supported by a goal to run on 24/7 carbon-free energy on every grid where it operates by 2030.</span></p>
<p><span style="vertical-align: baseline;">Google Cloud is committed to helping organizations of all sizes achieve their sustainability goals. With cloud, data analytics, and AI, we’re delivering new ways to build resilience, reduce costs, and unlock sustainable growth, while also accelerating the impact of organizations’ sustainability initiatives through the smarter use of data. This is an opportunity to drive tangible business results and create a more sustainable future for all.</span></p></div>Fri, 15 Nov 2024 17:00:00 +0000https://cloud.google.com/blog/topics/sustainability/at-cop29-thoughts-on-cloud-ai-and-climate-change/AI & Machine LearningSustainabilityarticleHow cloud and AI are bringing scale to corporate climate mitigation and adaptationhttps://storage.googleapis.com/gweb-cloudblog-publish/images/COP29_8XtHe1W.max-600x600.jpgGooglehttps://cloud.google.com/blog/topics/sustainability/at-cop29-thoughts-on-cloud-ai-and-climate-change/Justin KeebleManaging Director for Global SustainabilityDenise PearlSustainability and New Energy PartnershipsUse AI to build AI: Save time on prompt design with AI-powered prompt writinghttps://cloud.google.com/blog/products/ai-machine-learning/save-time-on-prompt-design-with-ai-powered-prompt-writing/<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Crafting the perfect prompt for generative AI models can be an art in itself. The difference between a useful and a generic AI response can sometimes be a well-crafted prompt. But, getting there often requires time-consuming tweaking, iteration, and a learning curve. That's why we're thrilled to announce new updates to the </span><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/ai-powered-prompt-writing"><span style="text-decoration: underline; vertical-align: baseline;">AI-powered prompt writing tools</span></a><span style="vertical-align: baseline;"> in Vertex AI, designed to make prompting easier and more accessible for all developers.</span></p>
<p><span style="vertical-align: baseline;">We're introducing two powerful features designed to streamline your prompt engineering workflow:</span><strong style="vertical-align: baseline;"> </strong><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/ai-powered-prompt-writing#generate_prompts"><strong style="text-decoration: underline; vertical-align: baseline;">Generate prompt</strong></a><span style="vertical-align: baseline;"> and</span><strong style="vertical-align: baseline;"> </strong><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/ai-powered-prompt-writing#refine_prompts"><strong style="text-decoration: underline; vertical-align: baseline;">Refine</strong><span style="text-decoration: underline; vertical-align: baseline;"> </span><strong style="text-decoration: underline; vertical-align: baseline;">prompt</strong></a><span style="vertical-align: baseline;">.</span></p></div>
<div class="block-paragraph_advanced"><h3><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/ai-powered-prompt-writing#generate_prompts"><strong style="text-decoration: underline; vertical-align: baseline;">Generate prompt</strong></a><strong style="vertical-align: baseline;">: From objective to prompt in seconds</strong></h3>
<p><span style="vertical-align: baseline;">Imagine you need a prompt to summarize customer reviews about your latest product. Instead of crafting the prompt yourself, you can simply tell the </span><strong style="vertical-align: baseline;">Generate prompt </strong><span style="vertical-align: baseline;">feature your goal. It will then create a comprehensive prompt, including placeholders for the reviews, which you can easily populate with your own data later. </span><strong style="vertical-align: baseline;">Generate prompt</strong><span style="vertical-align: baseline;"> takes the guesswork out of prompt engineering by:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Turning simple objectives into tailor-made, effective prompts. This way, you don’t need to agonize over phrasing and keywords.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Generating placeholders for context, like customer reviews, news articles, or code snippets. This allows you to quickly add your specific data and get immediate results.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Speeding up the prompt writing process. Focus on your core tasks, not on perfecting prompt syntax.</span></p>
</li>
</ul></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/1_PHzrxEg.gif"
alt="1">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><h3><a href="https://cloud.google.com/vertex-ai/generative-ai/docs/learn/prompts/ai-powered-prompt-writing#refine_prompts"><strong style="text-decoration: underline; vertical-align: baseline;">Refine prompt</strong></a><strong style="vertical-align: baseline;">: Iterate and improve with AI-powered suggestions</strong></h3>
<p><span style="vertical-align: baseline;">Once you have a prompt, either crafted by </span><strong style="vertical-align: baseline;">Generate prompt </strong><span style="vertical-align: baseline;">or one you've written yourself, </span><strong style="vertical-align: baseline;">Refine prompt </strong><span style="vertical-align: baseline;">helps you modify it for optimal performance. Here’s how it works:</span></p>
<ol>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Provide feedback:</strong><span style="vertical-align: baseline;"> After running your prompt, simply provide feedback on the response, the same way you would critique a writer.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Instant suggestions: </strong><span style="vertical-align: baseline;">Vertex AI generates a new, suggested prompt in one step, taking your feedback into account.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Iterate and improve: </strong><span style="vertical-align: baseline;">You can accept or reject the suggestion and continue iterating by running the refined prompt and providing further feedback.</span></p>
</li>
</ol></div>
<div class="block-image_full_width">
<div class="article-module h-c-page">
<div class="h-c-grid">
<figure class="article-image--large
h-c-grid__col
h-c-grid__col--6 h-c-grid__col--offset-3
"
>
<img
src="https://storage.googleapis.com/gweb-cloudblog-publish/original_images/2_7yRN5Zm.gif"
alt="2">
</a>
</figure>
</div>
</div>
</div>
<div class="block-paragraph_advanced"><p><span style="vertical-align: baseline;">Prompt refinement boosts the quality of the prompt, while also saving significant times during prompt design. The quality is typically improved by augmenting the prompt instructions in a way that Gemini will better understand. </span></p>
<p><span style="vertical-align: baseline;">Below are some sample prompts that were revised with </span><strong style="vertical-align: baseline;">Refine prompt</strong><span style="vertical-align: baseline;">:<br/><br/></span></p>
<div align="left">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;">
<div style="color: #5f6368; overflow-x: auto; overflow-y: hidden; width: 100%;"><table><colgroup><col/><col/></colgroup>
<tbody>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Original prompts</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">After using Prompt Refinement</span></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Suggest engaging lesson plan ideas for art class</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Suggest 3 engaging lesson plan ideas for a high school art class, each focusing on a different art form. Be concise and only include the most relevant information, such as the art form, target age group, and key activity.</span></p>
</td>
</tr>
<tr>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Plan a schedule for a week with focus time and meeting time. Take in account that there are 2 teams with 6 hour delay</span></p>
</td>
<td style="vertical-align: top; border: 1px solid #000000; padding: 16px;">
<p><span style="vertical-align: baseline;">Create a detailed weekly schedule for a team with a 6-hour time difference. The schedule should include:</span></p>
<ul>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Specific time blocks for focus time and meetings.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Consideration of overlapping work hours to ensure effective communication and collaboration.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">A balance of individual work and team interactions.</span></p>
</li>
<li aria-level="1" style="list-style-type: disc; vertical-align: baseline;">
<p role="presentation"><span style="vertical-align: baseline;">Suggestions for time zone conversion tools or strategies to facilitate scheduling.</span></p>
</li>
</ul>
</td>
</tr>
</tbody>
</table></div>
</div>
</div>
</div>
</div>
</div>
<h3><strong style="vertical-align: baseline;">A powerful duo: Generate prompt meets Refine prompt</strong></h3>
<p><span style="vertical-align: baseline;">These two features work in tandem to help you craft the most effective prompt for your objective – irrespective of your skill level. </span><strong style="vertical-align: baseline;">Generate prompt</strong><span style="vertical-align: baseline;"> gets you started quickly, while </span><strong style="vertical-align: baseline;">Refine prompt</strong><span style="vertical-align: baseline;"> allows for iterative improvement in five steps:</span></p>
<ol>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Define</strong><span style="vertical-align: baseline;"> your objective: Tell </span><strong style="vertical-align: baseline;">Generate prompt</strong><span style="vertical-align: baseline;"> what you want to achieve.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Generate</strong><span style="vertical-align: baseline;"> a prompt: </span><strong style="vertical-align: baseline;">Generate prompt</strong><span style="vertical-align: baseline;"> creates a ready-to-use prompt, often with helpful placeholders for context.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Run</strong><span style="vertical-align: baseline;"> the prompt and review the output: Execute the prompt with your chosen LLM in Vertex AI.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Refine</strong><span style="vertical-align: baseline;"> with feedback: Use </span><strong style="vertical-align: baseline;">Refine prompt</strong><span style="vertical-align: baseline;"> to provide feedback on the output and receive AI-powered suggestions for prompt improvement.</span></p>
</li>
<li aria-level="1" style="list-style-type: decimal; vertical-align: baseline;">
<p role="presentation"><strong style="vertical-align: baseline;">Iterate</strong><span style="vertical-align: baseline;"> until ideal performance: Continue refining and rerunning your prompt until you achieve your desired results.</span></p>
</li>
</ol>
<h3><strong style="vertical-align: baseline;">How to get started</strong></h3>
<p><span style="vertical-align: baseline;">Go ahead and try out an AI-assisted prompt-writing through our interactive critiquing workflow. Vertex AI’s easy-to-use UI for refining prompts can be tested without setting up a Google Cloud account through </span><a href="https://cloud.google.com/generative-ai-studio"><span style="text-decoration: underline; vertical-align: baseline;">this link</span></a><span style="vertical-align: baseline;"> (to demo without a Google Cloud account, be sure you are logged out of your Google account in your web browser or use incognito mode). For those with an account, you’ll have the ability to save, manage, and fine-tune your prompts.</span></p></div>Thu, 14 Nov 2024 20:00:00 +0000https://cloud.google.com/blog/products/ai-machine-learning/save-time-on-prompt-design-with-ai-powered-prompt-writing/AI & Machine LearningarticleUse AI to build AI: Save time on prompt design with AI-powered prompt writingGooglehttps://cloud.google.com/blog/products/ai-machine-learning/save-time-on-prompt-design-with-ai-powered-prompt-writing/Colby HawkerProduct Manager