I am perplexed by this model and theory on a couple fronts.
1. As an R&D Developer working in RAD and AI, I often go to my tools. And my battle buddy is an idiot. By that I mean LLM. LLM's look snazzy, but they're parrots. Yes I can glean good code from them - sometimes, but I have to keep starting new conversations. Why? No context retention at all! No matter what the sellers of these tools say. They start drifting into very dumb code, and I start getting annoyed. So the best bet is to restart and rebuild context with a new conversation.
That's the best we've done with sequence-to-sequence modeling?... (read more)
I am perplexed by this model and theory on a couple fronts.
1. As an R&D Developer working in RAD and AI, I often go to my tools. And my battle buddy is an idiot. By that I mean LLM. LLM's look snazzy, but they're parrots. Yes I can glean good code from them - sometimes, but I have to keep starting new conversations. Why? No context retention at all! No matter what the sellers of these tools say. They start drifting into very dumb code, and I start getting annoyed. So the best bet is to restart and rebuild context with a new conversation.
That's the best we've done with sequence-to-sequence modeling?... (read more)