Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Metadata correction for 2024.acl-long.142 #4688

Open
howard-yen opened this issue Feb 19, 2025 · 1 comment
Open

Metadata correction for 2024.acl-long.142 #4688

howard-yen opened this issue Feb 19, 2025 · 1 comment
Assignees
Labels
correction for corrections submitted to the anthology metadata Correction to metadata

Comments

@howard-yen
Copy link

JSON data block

{
  "anthology_id": "2024.acl-long.142",
  "abstract": "Extending large language models (LLMs) to process longer inputs is crucial for a wide range of applications. However, the substantial computational cost of transformers and limited generalization of positional encoding restrict the size of their context window. We introduce Context Expansion with Parallel Encoding (CEPE), a framework that can be applied to any existing decoder-only LLMs to extend their context window. CEPE employs a small encoder to process long inputs chunk by chunk, enabling the frozen decoder to utilize additional contexts via cross-attention. CEPE is efficient, generalizable, and versatile: trained with 8K-token documents, it extends the context window of LLAMA-2 to 128K tokens, offering 10x the throughput with only 1/6 of the memory. CEPE yields strong performance on language modeling and in-context learning. CEPE also excels in retrieval-augmented applications, while existing long-context models degenerate with retrieved contexts. We further introduce a CEPE variant that can extend the context window of instruction-tuned models using only unlabeled data, and showcase its effectiveness on LLAMA-2-CHAT, leading to a strong instruction-following model that can leverage very long contexts on downstream tasks."
}
@howard-yen howard-yen added correction for corrections submitted to the anthology metadata Correction to metadata labels Feb 19, 2025
Copy link

Found ACL Anthology entry: https://aclanthology.org/2024.acl-long.142

Thumbnail

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
correction for corrections submitted to the anthology metadata Correction to metadata
Projects
None yet
Development

No branches or pull requests

2 participants