07 Nov 14:21

danielfleischer

8cf1762

v3.1.0 Latest

What's Changed

Update llava.py by @mosheber in #54
Remove indexing function by @mosheber in #55
IPEX benchmarking fix by @peteriz in #58
Removing Handlers with Phi3.5 Support by @mosheber in #59
replaced list[str] with List[str] by @mosheber in #67
Adding files for multi modal pipeline by @mosheber in #68
Lazy initialization of OVModel by @danielfleischer in #66
update protobuf version to 5.28.3 by @mosheber in #70
Update one link in nutrition_data.json by @bilgeyucel in #72

New Contributors

@bilgeyucel made their first contribution in #72

Full Changelog: v3.0.2...v3.1.0

Contributors

peteriz, danielfleischer, and 2 other contributors

09 Jul 09:49

peteriz

v3.0.2

722860d

v3.0.2

What's Changed

Fix IPEX embedders performance by @peteriz in #52
Fix supported Python versions by @peteriz in #53

Full Changelog: v3.0.1...v3.0.2

Contributors

peteriz

02 Jul 13:38

peteriz

v3.0.1

07ceba6

v3.0.1

What's Changed

Gaudi Generator by @mosheber in #50
Adding pypi packaging support by @peteriz in #51

Full Changelog: v3.0...v3.0.1

Contributors

peteriz and mosheber

22 May 15:13

danielfleischer

v3.0

8087aea

v3.0.0

Compatibility with Haystack v2

⚡ All our classes are now compatible with 🤖 Haystack v2, including the example notebooks and yaml pipeline configurations.
💻 We based our demos on the Chainlit UI library; examples include RAG chat with multi-modality! 🖼️

❤️ Feel free to report any issue, bug or question!

24 Dec 16:24

danielfleischer

v2.0

373e546

v2.0.0

fastRAG 2.0: Let's do RAG Efficiently 🔥

fastRAG 2.0 includes new highly-anticipated efficiency-oriented components, an updated chat-like demo experience with multi-modality and improvements to existing components.

The library now uses Intel optimizations through Intel Extension for PyTorch (IPEX), 🤗 Optimum Intel and 🤗 Optimum-Habana to run as efficiently as possible on Intel® Xeon® Processors and Intel® Gaudi® AI accelerators.

🚀 Intel Habana Gaudi 1 and Gaudi 2 Support

fastRAG is the first RAG framework to support Habana Gaudi accelerators for running LLMs efficiently; more details here.

🌀 Running LLMs with the ONNX Runtime and LlamaCPP Backends

Added support for running quantized LLMs on the ONNX Runtime and LlamaCPP backends, for higher efficiency and speed across your RAG pipelines.

⚡ CPU Efficient Embedders

We added support for running bi-encoder embedders and cross-encoder rankers as efficiently as possible on Intel CPUs using Intel-optimized software.

We integrated the optimized embedders into the following two components:

QuantizedBiEncoderRanker - a bi-encoder ranker; encodes the documents provided in the input and re-orders them by similarity to the query.
QuantizedBiEncoderRetriever - a bi-encoder retriever; encodes documents into vectors, given a vector store engine.
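
To illustrate the re-ordering step that a bi-encoder ranker performs, here is a minimal sketch in plain Python. It is not the fastRAG API: the function names `cosine` and `rerank` are hypothetical, and the hand-written vectors stand in for embeddings that a real (quantized) bi-encoder model would produce.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rerank(query_vec, doc_vecs):
    """Return document ids ordered from most to least similar to the query.

    `doc_vecs` maps a document id to its embedding vector.
    """
    scored = [(cosine(query_vec, vec), doc_id) for doc_id, vec in doc_vecs.items()]
    scored.sort(reverse=True)
    return [doc_id for _, doc_id in scored]


# Toy embeddings (assumed values, for illustration only).
query_vec = [1.0, 0.0]
doc_vecs = {
    "a": [0.0, 1.0],   # orthogonal to the query
    "b": [1.0, 0.1],   # nearly parallel to the query
    "c": [1.0, 1.0],   # 45 degrees from the query
}

ranking = rerank(query_vec, doc_vecs)
print(ranking)
```

In a bi-encoder setup the query and the documents are encoded independently, so document vectors can be computed once and reused across queries; only the similarity computation runs per query.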

⏳ REPLUG

Added an implementation of REPLUG, an advanced technique for ensemble prompting over retrieved documents: the documents are processed in parallel and their next-token predictions are combined for better results.
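
The combining step can be sketched as a weighted average of per-document next-token distributions, with weights taken from a softmax over retrieval scores, as described in the REPLUG paper. The sketch below is an illustration under those assumptions and not fastRAG's implementation: `ensemble_next_token` is a hypothetical name, and the scores and probabilities are made-up values over a three-token vocabulary.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_next_token(doc_scores, per_doc_probs):
    """Combine next-token distributions, one per retrieved document.

    doc_scores:    retrieval similarity score for each document.
    per_doc_probs: for each document, the model's next-token distribution
                   when that document is prepended to the prompt.
    """
    weights = softmax(doc_scores)
    vocab_size = len(per_doc_probs[0])
    combined = [0.0] * vocab_size
    for weight, probs in zip(weights, per_doc_probs):
        for token_id, p in enumerate(probs):
            combined[token_id] += weight * p
    return combined


# Assumed toy inputs: three retrieved documents, vocabulary of three tokens.
doc_scores = [2.0, 1.0, 0.5]
per_doc_probs = [
    [0.7, 0.2, 0.1],
    [0.1, 0.8, 0.1],
    [0.3, 0.3, 0.4],
]

combined = ensemble_next_token(doc_scores, per_doc_probs)
next_token = max(range(len(combined)), key=combined.__getitem__)
print(combined, next_token)
```

Because each document is paired with the prompt separately, the per-document forward passes are independent of each other and can be batched or run in parallel before the distributions are merged.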

🏆 New Demos

We updated our demos (and demo page) to include two new demos that showcase a chat-like experience combined with multi-modal RAG.

🐠 Enhancements

Added documentation for most models and components, containing examples and notebooks ready to run!
Support for the Fusion-in-Decoder (FiD) model using a dedicated invocation layer.
Various bug fixes and compatibility updates supporting the Haystack framework.

Full Changelog: v1.3.0...v2.0

20 Jun 12:06

danielfleischer

v1.3.0

81b7a1a

v1.3.0

What's Changed

ColBERT Upstream Updates by @danielfleischer in #19

Full Changelog: v1.2.1...v1.3.0

Contributors

danielfleischer

13 Jun 10:57

peteriz

v1.2.1

bf4df83

v1.2.1

What's Changed

Update plaid_colbert_pipeline.ipynb by @mosheber in #17
Update colbert.py by @mosheber in #18

Full Changelog: v1.2.0...v1.2.1

Contributors

mosheber

21 May 11:05

danielfleischer

v1.2.0

af86bc2

v1.2.0: New: Retrieval Augmented Generation with LLM

Retrieval Augmented Generation with LLM Demo (#16)

- Added a new RAG + prompt + LLM UI (demo).
- Added an example config and notebook.
- Updated main README with "updates" sub-section.
- Updated `run_demo.py` to include all the options to run a demo (UI, UI + service, UI + <user_defined_service>)

Releases: IntelLabs/fastRAG