LESSWRONG
LW

2321
Bartlomiej Lewandowski
4140
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
Why no major LLMs with memory?
Answer by Bartlomiej LewandowskiMar 29, 202321

I think there has been a lot of research in the past in this space. The first thing that popped into my mind was https://huggingface.co/docs/transformers/model_doc/rag 

Currently, there are some approaches using langchain to persist the history of a conversation into an embeddings database, and retrieve the relevant parts performing a similar query / task.

Reply
GPT can write Quines now (GPT-4)
Bartlomiej Lewandowski3y31

OpenAI has hired a lot of software engineers to code simple tasks, maybe these quines were a part of the fine tuning set? 

Reply
Deepmind's Gopher--more powerful than GPT-3
Bartlomiej Lewandowski4y20

How is retro different from https://ai.facebook.com/blog/retrieval-augmented-generation-streamlining-the-creation-of-intelligent-natural-language-processing-models/ ?

Reply
A closer look at chess scalings (into the past)
Bartlomiej Lewandowski4y10

Isn't ELO a reference metric that changes with time? I would assume that 2800 ELO in the 90s is a different level to today's 2800. Can we still make the same conclusions with that in mind?

Reply
1SOLAR model paper questions
Q
2y
Q
0