I just asked Microsoft’s Copilot to write me a 1000-word essay about the normative implications of Quine’s naturalized epistemology, giving it a prompt of fewer than 20 words. It immediately complied and, within a few seconds, generated a coherent essay that could easily earn a decent grade in an undergraduate philosophy course. What I mean is that if a student had written the same essay under closed-book conditions, given three hours at the end of a course, it would clearly have demonstrated a familiarity with the texts studied (Copilot was able to correctly cite the two key texts that I would have expected) and an understanding of the issues involved. The exact grade would of course depend on the level of the course and the standards of the teacher, but the student would certainly have had to attend the class and at least skim the readings to pull it off.
I mention this not to counter those who still insist that AI is not capable of doing their assignments, but to answer those who would have us abandon as meaningless any assignment that an AI can easily do. Keep in mind that my little test used a very low-level model (Copilot is available for free to all staff and students at CBS) and my prompt consisted of a one-sentence question along with the instruction to generate a 1000-word essay (it went over by about 100 words). A sophisticated student, faced with a 5000-word term paper at the end of a course they had not followed very closely, would be able to provide a better model with the course syllabus, learning objectives, and even the actual readings. Given a few hours, and assuming above-average intelligence, they could no doubt cobble something together that would be quite impressive by pre-2022 standards. This ability to fake a semester’s worth of learning over a weekend is the problem we have to face, I think.
In the future, I think universities will have to make students sit for written exams, on-site and off-line, more often. A degree that does not require at least half of a student’s total grades to come from such performances cannot be taken seriously. In fact, transcripts should make it very clear which grades were earned through homework (where AI support should be presumed) and which were earned through invigilated examination. That is, it should be clear whether the graduate of a given program is capable of writing coherently about their subject themselves. Their future employers can use that information as they please.
The simple test that I propose, then, is a 20-word question with no further context than the course that the student has taken. The student is given three hours and up to 1000 words to demonstrate what they have learned by answering the question to the best of their ability. Understanding the question (and its significance) is itself part of the competence being examined. Under these conditions, I am convinced that the instructor who designed and taught the course can easily determine whether the learning objectives have been met, just as a music teacher can evaluate a student’s ability simply by giving them some sheet music and an instrument to play it on, or a drafting teacher can evaluate drawing ability by giving a student a piece of paper and an object as a model. The fact that an AI can also do these things does not make it less impressive when a student can muster their flesh and bones, their brain and their heart, to do it. An education, after all, consists of disciplining the body so as to liberate the mind. It’s important that we show our students what they are capable of.