Dominik here with what maybe be a long tweet, but the nuance and POV that is so often missing from AI in edu conversations 👏🏻
Big question about LLMs and learning (with a very long and imperfect answer):
⁉️ How do we use LLMs to learn something we don't know much about? ⁉️
Often, people say you should only use Large Language Models if you know enough to evaluate the output. But that's too limiting and assumes perfect transmission with reliable output.
Here's a much longer than anybody wants exposition of what I think is involved. (Note, this genuinely started as 280 character tweet. Once I gave up on that, I gave up on any semblance of brevity.)
Status quo: How we learn from unreliable sources
People learn wrong things from reliable textbooks or tutors, all the time. And in reality the learning inputs are a mixture of reliable but variably comprehensible input, variably reliable prior understanding, and variably reliable feedback from other sources.
Some examples of unreliable inputs into learning:
- prior knowledge and learning that is either insufficient for comprehension of the input or in conflict with it
- a tutor answering a question wrongly (this happens a lot more than we assume in our models)
- a tutor answering a different question because they did not understand the query (this happens all the time!),
- a peer who shared imperfect understanding of the problem (it is rarely just the one learner with the source of information)
- a socially validated but unreliable or misleading source, for instance, peer reviewed research is completely unreliable as a source of basic information and many textbooks are out of date or present biased information or pass on myths prevalent in the discipline or simplify for pedagogic reasons (which may be inappropriate for the need of the learner who is using them),
- commonly held misconceptions that are passed through the media, common discourse or even practitioners (e.g. many doctors still prescribe antibiotics for viral infections)
Yet, it is possible to learn from this mixture of inputs, because learning is a process and there are many opportunities to learn and unlearn over time as we receive feedback. And as we know, just purely from the progress of knowledge, much of what we learn in textbooks is wrong simply because of new discoveries.
Also, much of the accuracy actual information we learn is irrelevant outside very narrow contexts (e.g. historical facts). Most people who would make fun of creationists do not really understand how evolution through natural selection works or what the roles of genes are. If you interrogate their understanding it becomes clear that they have most a Lamarckian notion of the origin of species with a lot of Huxleyan teleology thrown in. Yet, this does not make any difference.
What kind of input into learning do LLMs provide
The popular pedagogical discourse falls into two frames:
1. LLMs are amazing because they can explain hard concepts in a way that the learner can probe
2. LLMs are dangerous because they may be either subtly wrong or completely hallucinate facts, so that somebody who has no prior understanding will be easily misled
The problem is that both of those framings are accurate. But they can apply to human tutors, as well, or as I suggested to printed validated sources.
But even though on the higher level, LLMs are not that different from other inputs into learning, here, as everywhere they are weird. They have a strange mixture of vast knowledge and misconception. They have unexpected blind spots.
They are deeply conceptual but flawed factually. My quip here is: "ChatGPT is not Wikipedia, but it can help you understand Wikipedia."
So, it is legitimate to approach LLMs with a lot of caution as an input into learning. But the answer is not to exclude them nor to focus only on the imperfections. We need to reconceptualise learning to take into account both framings and the actual state of the status quo.
How do we reconceptualise learning in a world with LLMs?
First, we need to start with the acknowledgement that LLMs with all their strengths and imperfections have already become part of the mix of inputs into learning. And their role will only increase.
Second, we need to fully recognize that the current inputs into learning are a mixture of reliable and unreliable and comprehensible and incomprehensible. And one input can be all of them at the same time for different learners (e.g. a textbook may be actively misleading for the purpose of comprehension, or a paper may be actively incomprehensible for the purpose of maintaining accuracy).
Third, we also need to be more purposeful in distinguishing where reliability is important. For example, if I am a researcher in chemistry working on an experiment and need to look up the exact atomic number of an element, I require perfect accuracy. As do I when I am a student studying for an exam. But if I am learner trying to understand the concept of atomic numbers, the accuracy of the numbers is less important and I can always learn the correct number later.
We already have a perfect model for this in Wikipedia. We are perfectly happy to use it for random facts that don't matter (who was in what movie, when a something happened, what's the definition of a concept), yet we would not (should not) rely on that information if perfect accuracy were necessary - we would instead look up the sources or find alternative sources.
Fourth, we need to pay much more attention to the process of learning. I am very sceptical of simplifying this into "we just need to teach students how to learn" - know how to learn needs to emerge along the process of learning something - pure cognition without knowledge is no good. But I would suggest that we need to incorporate reflection on the learning process more deeply into learning. This does not have a history of success.
Partly, this requires undermining the role of the tutor as an arbiter of reliability and, partly, this is not the exciting part. My analogy is some piano tutorials on YouTube that always combine this is what you need to learn with this is how you practice it.
The problem is that most current tutors are selected for their role because they figured out ways of learning and practicing their subject implicitly and their reflections on the nature of their knowledge / skill or the process it took to acquire it are profoundly unreliable. And even worse, the process they could use for learning/practice is very idiosyncratic to their type - somebody good enough to become a tutor without explicit instruction.
And the other problem is that in our conception of learning, we don't have a positive framing of an imperfectly reliable sources - we only think of it as a bad thing that needs to be eliminated (rather than an inevitability, that needs to be recognised). Although we do have many framings that recognise the reality.
And the final obstacte that stands in our way is that we don't actually have (yet) a reliable set of techniques that will work for a variety of learners using LLMs across a variety of subjects.