Overview:
Give a structure that documents should have
Train a context length 2 model (using N^2 bins for N words)
Compute a context length 1 model by summing to marginalize (can be done later by pouring, or by copying if there are enough bins)
Generate samples from length 2, show preserved
Generate samples from length 1, show not there
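The overview steps can be sketched in Python. This is only a rough sketch under assumptions, not the activity's actual procedure: count tables stand in for the bins, and the toy documents and all function names (`train_context2`, `marginalize`, `generate`, `breaks_double_rule`) are hypothetical. Marginalizing here means merging every length 2 bin that shares the same most recent word.

```python
import random
from collections import Counter, defaultdict

def train_context2(docs):
    """Count which word follows each pair of words (one bin per pair)."""
    counts2 = defaultdict(Counter)
    for doc in docs:
        for w1, w2, w3 in zip(doc, doc[1:], doc[2:]):
            counts2[(w1, w2)][w3] += 1
    return counts2

def marginalize(counts2):
    """Sum out the older context word to get a context length 1 table."""
    counts1 = defaultdict(Counter)
    for (_, w2), nexts in counts2.items():
        counts1[(w2,)].update(nexts)  # Counter.update adds the counts
    return counts1

def generate(counts, start, k, rng, max_len=10):
    """Sample a document using the last k words as the context."""
    doc = list(start)
    while doc[-1] != "white" and len(doc) < max_len:
        nexts = counts.get(tuple(doc[-k:]))
        if not nexts:  # context never seen in training
            break
        words, weights = zip(*nexts.items())
        doc.append(rng.choices(words, weights=weights)[0])
    return doc

def breaks_double_rule(doc):
    """True if a double word is followed by anything other than white."""
    return any(a == b and c != "white"
               for a, b, c in zip(doc, doc[1:], doc[2:]))

# Hypothetical toy documents, invented for illustration only.
docs = [
    ["purple", "blue", "blue", "white"],
    ["purple", "blue", "green", "white"],
]
counts2 = train_context2(docs)
counts1 = marginalize(counts2)

rng = random.Random(0)
samples2 = [generate(counts2, ["purple", "blue"], 2, rng) for _ in range(200)]
samples1 = [generate(counts1, ["purple", "blue"], 1, rng) for _ in range(200)]
print(any(breaks_double_rule(d) for d in samples2))
print(any(breaks_double_rule(d) for d in samples1))
```

With this toy data the length 2 samples cannot break the double rule, since the only word ever seen after (blue, blue) is white. The merged length 1 bin for blue holds blue, green, and white, so length 1 samples can follow a double with a non-white word.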
Rules for Context Two¶
After a double word (the same word twice in a row), the next word has to be white
Words have a sequence, but doubles are allowed and skips can occur
Sequence¶
Words can only go in order, unless a double word. One skip is okay. The first word can follow the last in the sequence.
Example order: {'purple': 0, 'blue': 1, 'green': 2, 'pink': 3}
purple, green, purple is okay
purple, green, blue is not
green, green, pink is okay
A doc can end at any time with a white
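The sequence rules can be checked with a short function. This is a sketch under stated assumptions: the name `follows_sequence` and the `max_skip` parameter are hypothetical, a step is measured modulo the number of words so the first word can follow the last, and white is assumed to appear only as the final word. It covers only the Sequence rules, not the Context Two rule about doubles.

```python
ORDER = {"purple": 0, "blue": 1, "green": 2, "pink": 3}

def follows_sequence(doc, order=ORDER, max_skip=1):
    """Check the order rule: each word repeats, advances, or skips at most max_skip."""
    # Assumption: a trailing white only marks the end of the doc.
    words = doc[:-1] if doc and doc[-1] == "white" else doc
    if "white" in words:
        return False
    n = len(order)
    for prev, cur in zip(words, words[1:]):
        # Step 0 is a double, 1 is the next word, 2 is one skip (wrapping around).
        step = (order[cur] - order[prev]) % n
        if step > max_skip + 1:
            return False
    return True

print(follows_sequence(["purple", "green", "purple"]))  # True
print(follows_sequence(["purple", "green", "blue"]))    # False
print(follows_sequence(["green", "green", "pink"]))     # True
```

The three calls mirror the examples above: purple to green and green to purple are each one skip, while green to blue would mean going backwards.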
Discussion Guide¶
How might you train it better so that it answers questions?
k-shot prompting