thickertoofan@lemm.ee to LocalLLaMA@sh.itjust.works · English · 3 hours ago

Gemma 3 1B and 3B result on a "needle in a haystack" like test ran locally


I tested this (Reddit link, btw) on the Gemma 3 1B and 3B parameter models. 1B failed, which is not surprising, but 3B passed, which genuinely is. I took a random paragraph about Napoleon Bonaparte (just an arbitrary subject) and inserted “My password is = xxx” in the middle of it. Gemma 1B couldn’t spot it at all. Gemma 3B found it without being asked, but there’s a catch: it treated the password statement as a historical fact about Napoleon, lol.

Anyway, passing is a genuinely nice achievement for a 3B model, I guess. The test used a single, moderately large paragraph. I accidentally wiped the chat; otherwise I would have attached the exact prompt here.

Tested locally using Ollama and the PageAssist UI. My setup: GPU-poor category, CPU inference with 16 GB of RAM.
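Since the exact prompt was lost, here is a rough sketch of how a test like this could be reproduced against a local Ollama server. It is not the original prompt or setup: the helper names (`build_haystack`, `mentions_secret`, `ask_ollama`) are made up for illustration, the model tag is an assumption to replace with whatever `ollama list` shows on your machine, and the endpoint assumes Ollama's default address `http://localhost:11434` with its `/api/generate` route.

```python
import json
import urllib.request


def build_haystack(paragraph: str, needle: str) -> str:
    """Insert the needle sentence roughly in the middle of the paragraph."""
    sentences = [s.strip() for s in paragraph.split(". ") if s.strip()]
    middle = len(sentences) // 2
    sentences.insert(middle, needle.rstrip("."))
    return ". ".join(sentences)


def mentions_secret(response: str, secret: str) -> bool:
    """Crude pass/fail check: does the model's reply repeat the secret?"""
    return secret.lower() in response.lower()


def ask_ollama(prompt: str, model: str = "gemma3:1b",
               host: str = "http://localhost:11434") -> str:
    """Send one non-streaming request to a local Ollama server.

    The model tag and host are assumptions; adjust them to your setup.
    """
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        f"{host}/api/generate",
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    # CPU inference can be slow, so allow a generous timeout.
    with urllib.request.urlopen(request, timeout=600) as reply:
        return json.load(reply)["response"]


# Example use (needs a running Ollama server, so it is left commented out):
#
# paragraph = (
#     "Napoleon Bonaparte was born in Corsica in 1769. "
#     "He became Emperor of the French in 1804. "
#     "He was defeated at Waterloo in 1815. "
#     "He died on Saint Helena in 1821."
# )
# prompt = build_haystack(paragraph, "My password is = xxx")
# answer = ask_ollama(prompt)
# print(answer)
# print("needle found:", mentions_secret(answer, "xxx"))
```

The substring check only tells you whether the model repeated the secret. It will not catch the failure described above, where the model notices the password but files it as a fact about Napoleon, so the reply still needs a read by eye.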


thickertoofan@lemm.ee (OP) · 3 hours ago
We can use the test name a user proposed in the comments of the original post: Odd-straw-in-the-haystack :)
