[B! llm] rawwellã®ãƒ–ãƒƒã‚¯ãƒžãƒ¼ã‚¯

rawwell id:rawwell

llmã«é–¢ã™ã‚‹rawwellã®ãƒ–ãƒƒã‚¯ãƒžãƒ¼ã‚¯ (16)

${{author_name}}$

{{author_name}} {{created}}

{{{comment_expanded}}}

{{label}}

{{#is_bookmark}}ãƒªã‚¹ãƒˆ{{/is_bookmark}}{{^is_bookmark}}ãƒªãƒ³ã‚¯{{/is_bookmark}}

${{author_name}}$
{{author_name}}{{created}}
{{ #comment }}{{ comment }}{{ /comment }}
- {{ label }}

{{#following_bookmarks}}

${{author_name}}$

{{author_name}} {{created}}

{{{comment_expanded}}}

{{label}}

{{#is_bookmark}}ãƒªã‚¹ãƒˆ{{/is_bookmark}}{{^is_bookmark}}ãƒªãƒ³ã‚¯{{/is_bookmark}}

{{/following_bookmarks}}

{{/is_wiped}}

Provable unlearning in topic modeling and downstream tasks
rawwell 2025/05/14
unlearning

ai

llm
ãƒªãƒ³ã‚¯
UnSTAR: Unlearning with Self-Taught Anti-Sample Reasoning for LLMs
rawwell 2025/05/14
unlearning

ai

llm
ãƒªãƒ³ã‚¯
Low Compute Unlearning via Sparse Representations
rawwell 2025/05/14
unlearning

ai

llm
ãƒªãƒ³ã‚¯
Controllable Unlearning for Image-to-Image Generative Models via...
rawwell 2025/05/14
unlearning

ai

llm
ãƒªãƒ³ã‚¯
ICLR 2025 A Probabilistic Perspective on Unlearning and Alignment for Large Language Models Oral
rawwell 2025/05/14
unlearning

ai

llm
ãƒªãƒ³ã‚¯
PanGu-$Ï€$: Enhancing Language Model Architectures via Nonlinearity Compensation
rawwell 2025/05/14
PanGu

LLM

ascend910
ãƒªãƒ³ã‚¯
PanGu-Î£: Towards Trillion Parameter Language Model with Sparse Heterogeneous Computing
The scaling of large language models has greatly improved natural language understanding, generation, and reasoning. In this work, we develop a system that trained a trillion-parameter language model on a cluster of Ascend 910 AI processors and MindSpore framework, and present the language model with 1.085T parameters named PanGu-Î£. With parameter inherent from PanGu-Î±, we extend the dense Transfo
rawwell 2025/05/14
PanGu

LLM

ascend910
ãƒªãƒ³ã‚¯
Evaluating World Models with LLM for Decision Making
rawwell 2025/05/14
worldmodel

llm
ãƒªãƒ³ã‚¯
https://dl.acm.org/doi/abs/10.1145/3664647.3681488
rawwell 2025/05/14
worldmodel

llm
ãƒªãƒ³ã‚¯
Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents
rawwell 2025/05/14
worldmodel

llm
ãƒªãƒ³ã‚¯
WorldCoder, a Model-Based LLM Agent: Building World Models by Writing Code and Interacting with the Environment
rawwell 2025/05/14
worldmodel

llm
ãƒªãƒ³ã‚¯
What if we drop the causal mask in auto-regressive Transformer?
rawwell 2024/08/10
â€œThis property ensures that the model can only attend to previous positions in the sequence, not future positions, in order to generate predictions sequentially.â€

LLM

research
ãƒªãƒ³ã‚¯
å®Ÿå‹™ã«ãŠã‘ã‚‹RAG ã€œå¦ã³ã¨ç¾å ´ã®ãƒŽã‚¦ãƒã‚¦ã€œ | ãƒ‰ã‚¯ã‚»ãƒ«
ã‚¹ãƒ©ã‚¤ãƒ‰æ¦‚è¦ å®Ÿå‹™ã«ãŠã„ã¦RAGï¼ˆRetrieval-Augumented Generationï¼‰ã‚’ãŸã£ã·ã‚ŠçµŒé¨“ã—ãŸhoxo-mã®å¦ã³ã¨ç¾å ´ã®ãƒŽã‚¦ãƒã‚¦ã‚’ã¾ã¨ã‚ãŸ
rawwell 2024/07/24
LLM

RAG
ãƒªãƒ³ã‚¯
Pretraining LLMs - DeepLearning.AI
rawwell 2024/07/18
LLM

research
ãƒªãƒ³ã‚¯
GitHub - janakiramm/Intel-Xeon-LLM-RAG-Inference-Setup: This repository provides a comprehensive guide to setting up and running a LLM inference server optimized for Intel Xeon machines, with a focus on Retrieval Augmented Generation (RAG). The repository
rawwell 2024/07/13
rag

llm
ãƒªãƒ³ã‚¯
LLM Inference Series: 4. KV caching, a deeper look
In the previous post, we introduced KV caching, a common optimization of the inference process of LLMs that make compute requirements of the (self-)attention mechanism to scale linearly rather than quadratically in the total sequence length (prompt + generated completions). More concretely, KV caching consists to spare the recomputation of key and value tensors of past tokens at each generation st
rawwell 2024/07/12
"Another possible optimization not covered by PagedAttention is reusing the key-value cache across requests. This would apply when the prompts share a common prefix, which commonly occurs in multi-round use cases like chat and agents or when using prompt templates (Figure 4)."

llm

kv_cache
ãƒªãƒ³ã‚¯
1

ãŠçŸ¥ã‚‰ã›

ã‚‚ã£ã¨èªã‚€

å…¬å¼Twitter

@HatenaBookmark
ãƒªãƒªãƒ¼ã‚¹ã€éšœå®³æƒ…å ±ãªã©ã®ã‚µãƒ¼ãƒ“ã‚¹ã®ãŠçŸ¥ã‚‰ã›
@hatebu
æœ€æ–°ã®äººæ°—ã‚¨ãƒ³ãƒˆãƒªãƒ¼ã®é…ä¿¡

ã‚ãƒ¼ãƒœãƒ¼ãƒ‰ã‚·ãƒ§ãƒ¼ãƒˆã‚«ãƒƒãƒˆä¸€è¦§

jæ¬¡ã®ãƒ–ãƒƒã‚¯ãƒžãƒ¼ã‚¯

kå‰ã®ãƒ–ãƒƒã‚¯ãƒžãƒ¼ã‚¯

lã‚ã¨ã§èªã‚€

eã‚³ãƒ¡ãƒ³ãƒˆä¸€è¦§ã‚’é–‹ã

oãƒšãƒ¼ã‚¸ã‚’é–‹ã

è¨å®šã‚’å¤‰æ›´ã—ã¾ã—ãŸx