needle

bryanwhiting · Dec 16, 2023 · 5232ad2 · 5232ad2
1 parent a24203b
commit 5232ad2
Show file tree

Hide file tree

Showing 4 changed files with 45 additions and 1 deletion.
diff --git a/lists/img/ai.jpeg → lists/ai/ai.jpeg b/lists/img/ai.jpeg → lists/ai/ai.jpeg
diff --git a/lists/ai.md → lists/ai/index.md b/lists/ai.md → lists/ai/index.md
@@ -7,12 +7,18 @@ categories: [tech, futurism, ai]
 draft: false
 ---
 
-![](img/ai.jpeg) 
+![](ai.jpeg) 
 
 ::: callout-note
 ## TL;DR: AI is wild. Last updated per last headline. First created 2023-12-14
 :::
 
+# Coffee by Coframe
+
+Build React UIs super fast. [GitHub](https://t.co/0vBssgp0ue)
+
+<blockquote class="twitter-tweet"><p lang="en" dir="ltr">Announcing Coffee: build and iterate on your UI 10x faster with AI ☕️👇<a href="https://t.co/0vBssgp0ue">https://t.co/0vBssgp0ue</a> <a href="https://t.co/JqwC8WpDzs">pic.twitter.com/JqwC8WpDzs</a></p>&mdash; Coframe (@coframe_ai) <a href="https://twitter.com/coframe_ai/status/1735069815566631054?ref_src=twsrc%5Etfw">December 13, 2023</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>
+
 # Prompt Engineering
 
 Added 2023-12-15

diff --git a/posts/2023-12-15-llms-can-find-a-needle-in-the-haystack/index.md b/posts/2023-12-15-llms-can-find-a-needle-in-the-haystack/index.md
@@ -0,0 +1,38 @@
+---
+title: LLMs can find a needle in the haystack
+description: |
+  GPT outperforms Claude. 
+date: 2023-12-15
+categories: [ai]
+draft: false
+---
+
+![](photo.jpeg) 
+
+::: callout-note
+## Is RAG necessary when you have incredible memory?
+:::
+
+# Context
+
+Check out this thread:
+
+<blockquote class="twitter-tweet"><p lang="en" dir="ltr">(1/8) The Needle in the Haystack done by <a href="https://twitter.com/GregKamradt?ref_src=twsrc%5Etfw">@GregKamradt</a> was an amazing analysis of retrieval performance! Greg has graciously allowed us to build on his work with a repository that is now OSS.<a href="https://twitter.com/natfriedman?ref_src=twsrc%5Etfw">@natfriedman</a> We have a much more rigorous test we’ve put out based on this idea.… <a href="https://t.co/i5O8zrcwQT">pic.twitter.com/i5O8zrcwQT</a></p>&mdash; Aparna Dhinakaran (@aparnadhinak) <a href="https://twitter.com/aparnadhinak/status/1735678863814938695?ref_src=twsrc%5Etfw">December 15, 2023</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>
+
+This is a powerful analysis. Sure, Anthropic will find a way to improve or challenge the results. But the point is clear: these technologies can remember hyper specific 7-digit random numbers out of a batch of 126,000 tokens, where a token is roughly 4 characters. GPT is clear winner here, too. 
+
+Also, open source is getting incredibly good. This implies the future is open source. 
+
+<blockquote class="twitter-tweet"><p lang="en" dir="ltr">Comparing <a href="https://twitter.com/OpenAI?ref_src=twsrc%5Etfw">@OpenAI</a> <a href="https://twitter.com/hashtag/GPT4?src=hash&amp;ref_src=twsrc%5Etfw">#GPT4</a> Turbo to <a href="https://twitter.com/MistralAI?ref_src=twsrc%5Etfw">@MistralAI</a> <br><br>GPT-4 is pretty good in that region in general. Interesting to see how <a href="https://twitter.com/MistralAI?ref_src=twsrc%5Etfw">@MistralAI</a> scales to larger context windows <a href="https://t.co/WQo6MmGIHh">pic.twitter.com/WQo6MmGIHh</a></p>&mdash; Aparna Dhinakaran (@aparnadhinak) <a href="https://twitter.com/aparnadhinak/status/1735747087021600916?ref_src=twsrc%5Etfw">December 15, 2023</a></blockquote> <script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>
+
+# Impact
+
+RAG can be used to make retrieval more efficient. But if retrieval is already super efficient maybe RAG is only a short term thing. Context lengths of 10m tokens...probably by next year right?
+
+Start of the year we were at 4K tokens. Now there are 126,000 tokens. 30x improvement. So to do another 30x improvement is 3.76M. So yea, by next year you should be able to just load the entire RAG database into memory. But...gonna be super expensive. 
+
+Point is: would GPT be this effective if it was using RAG over a database? Or is it more effective loading it all into context?
+
+
+
+
diff --git a/posts/2023-12-15-llms-can-find-a-needle-in-the-haystack/photo.jpeg b/posts/2023-12-15-llms-can-find-a-needle-in-the-haystack/photo.jpeg