Context Length - LLMs Unplugged Facilitator Guide

Overview:

Give the structure that documents should follow
Train a context length 2 model (using N^2 bins for N words)
Compute a context length 1 model by summing over the earlier word to marginalize (can be done later by pouring, or by copying if there are enough bins)
Generate samples from context length 2 and show that the structure is preserved
Generate samples from context length 1 and show that it is not
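
The steps above can be sketched in code. This is a minimal sketch, not the activity's official procedure: the toy documents, the "<s>" padding token, and the function names are all assumptions made for illustration. It counts next words into one bin per pair of previous words, sums those bins over the earlier word to get the context length 1 bins, and draws samples from either table.

```python
import random
from collections import Counter, defaultdict

# Hypothetical toy documents (not from the guide); each one ends with "white".
DOCS = [
    ["purple", "blue", "blue", "white"],
    ["blue", "green", "pink", "white"],
    ["purple", "purple", "white"],
    ["blue", "green", "green", "white"],
]
START = "<s>"  # assumed padding token so the first words also have two words of context


def train_context2(docs):
    """Count next words for every (previous two words) bin."""
    table = defaultdict(Counter)
    for doc in docs:
        padded = [START, START] + doc
        for a, b, nxt in zip(padded, padded[1:], padded[2:]):
            table[(a, b)][nxt] += 1
    return table


def marginalize(table2):
    """Sum the context length 2 bins over the earlier word."""
    table1 = defaultdict(Counter)
    for (_, b), counts in table2.items():
        table1[b].update(counts)  # Counter.update adds the counts together
    return table1


def sample(table, context_len, rng, max_words=20):
    """Generate one document, drawing each next word in proportion to its count."""
    words = [START, START]
    # Stop at "white", or after max_words in case the model keeps looping.
    while words[-1] != "white" and len(words) < max_words + 2:
        key = tuple(words[-2:]) if context_len == 2 else words[-1]
        counts = table[key]
        words.append(rng.choices(list(counts), weights=list(counts.values()))[0])
    return words[2:]


table2 = train_context2(DOCS)
table1 = marginalize(table2)

# With two words of context, a double is only ever followed by "white".
print(dict(table2[("blue", "blue")]))  # {'white': 1}
# With one word of context, "blue" can also be followed by "blue" or "green".
print(dict(table1["blue"]))

rng = random.Random(0)
print(sample(table2, 2, rng))
print(sample(table1, 1, rng))
```

In this toy data, every context length 2 sample puts "white" after a double, because the (blue, blue) bin holds nothing else. The context length 1 table has merged that bin with the other bins ending in "blue", so it can generate blue, blue, green.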

Rules for Context Two

After a double word, the next word has to be white.
Words have a sequence, but doubles are allowed and skips can occur.
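
The double-word rule can be checked mechanically. The sketch below is an assumption about how to read the rule: a "double" means the same word twice in a row, and the function name is hypothetical. Note that the check needs the two previous words to decide what may come next, which is why the rule calls for context length 2.

```python
def obeys_double_rule(words, end="white"):
    """Return True if every doubled word is immediately followed by `end`."""
    for a, b, nxt in zip(words, words[1:], words[2:]):
        # (a, b) is the two-word context; nxt is the word being checked.
        if a == b and nxt != end:
            return False
    # A double at the very end (nothing after it yet) is not counted as a violation.
    return True


print(obeys_double_rule(["purple", "blue", "blue", "white"]))  # True
print(obeys_double_rule(["purple", "blue", "blue", "green"]))  # False
```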

Sequence

Words can only go in order, unless the word is a double. One skip is okay. The first word can follow the last in the sequence.

Example order: {'purple': 0, 'blue': 1, 'green': 2, 'pink': 3}

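purple, green, purple is okay (one skip each time, wrapping from the end back to purple)
purple, green, blue is not (blue comes before green)
green, green, pink is okay (a double, then the next word in order)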
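
The three examples above can be reproduced with a small checker. This is a sketch under assumptions: it reads "in order" as a forward step of 0 (a double), 1 (next word), or 2 (one skip) around the cycle, and it checks only the ordering rule from this section, not the white-after-a-double rule. The function name and parameters are hypothetical.

```python
ORDER = {"purple": 0, "blue": 1, "green": 2, "pink": 3}


def follows_sequence(words, order=ORDER, max_skip=1, end="white"):
    """Return True if each word is at most `max_skip` words ahead of the previous one."""
    if words and words[-1] == end:
        words = words[:-1]  # a closing "white" is allowed at any point
    if any(w not in order for w in words):
        return False  # unknown word, or "white" before the end
    n = len(order)
    # Forward distance around the cycle: 0 = double, 1 = next word, 2 = one skip.
    return all(
        (order[b] - order[a]) % n <= max_skip + 1
        for a, b in zip(words, words[1:])
    )


print(follows_sequence(["purple", "green", "purple"]))  # True
print(follows_sequence(["purple", "green", "blue"]))    # False
print(follows_sequence(["green", "green", "pink"]))     # True
```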

A document can end at any time with a white.

Discussion Guide

How might you train it better so that it answers questions?
k-shot prompting