Bring Back the Blue-Book Exam

46 diodorus 57 8/24/2025, 5:44:55 PM chronicle.com ↗

Comments (57)

Syzygies · 1h ago
One can't have an informed opinion on this without witnessing how grading a college exam takes place, first hand.

Grading a stack of blue books is a "just kill me now" brutal experience. A majority of cognitive effort is just finding the page for the next problem to grade, with finding the answer another delay; any programming language with this kind of access latency would stay a "one bus" language. So of course professors rely on disinterested grad students to help grade. They'll make one pass through the stack, getting the hang of the problem and refining the point system about twenty blue books in, but never going back.

With stapled exams one problem per page one can instead sort into piles for scores 0-6 (if you think your workplace is petty, try to imagine a sincere conversation about whether an answer is worth 14 or 15 points out of 20), and it's easy to review piles.

When I had 200 linear algebra exams to grade at once, I'd scan everything, and use my own software to mark and bin one question at a time, making review passes a pleasure. I could grade a 10 question final in one intense sitting, with far more confidence in the results than team grading ever gave me.

ipcress_file · 1h ago
I think problem number one is that you had 200 exams to grade.
armchairhacker · 46m ago
It doesn’t have to be a book, it can be a cheap laptop without internet.
mlpoknbji · 2h ago
The commenters lamenting this trend presumably have not given a takehome assignment to college students in recent years. The problem is huge and in class tests are basically the only way to test if students are learning. Unfortunately this doesn't solve the problem of AI assisted cheating on in-class exams, which is shockingly prevalent these days at least in STEM settings.
wrp · 1h ago
Survey question: To what extent did blue book exams go away?

When I started out (and the original Van Halen was still together), blue book exams were the norm in humanities classes. I've had narrow experience with American undergrad classes the past 25 years, so I don't have a feeling for how things have evolved.

ipcress_file · 1h ago
I've never stopped using them, with the exception of the pandemic year, when we were forced to run online exams.

Why replace a system that generally works well with one that introduces additional potential problems?

MikeTheGreat · 1h ago
Same here.

Online instruction / learning can work for some people, and that's good.

I don't understand how anyone ever thought that an online exam could be made secure. There's just no way to ensure that the person who registered for the course is the one taking the exam when you don't control anything about the hardware or location, when students have a wide variety of hardware that you must support, and any attempt at remove video monitoring of the exam immediately runs into scalability and privacy issues. Like, even if you're watching a video of the person taking the online exam, how do they prove that they didn't just hook up an extra keyboard, mouse and (mirrored) monitor for person #2 to take the exam for them while they do their best to type and/or mouse in a convincing way?

It also doesn't help that you periodically get students who will try to wheedle, whinge, and weasel their way into an online exam, but then bomb the in-person exam (it's so strange and totally unrelated that they really, really wanted to take an online exam instead of in-person!).

Ok, I'll stop ranting now :)

ipcress_file · 34m ago
I get it. My major concern was that students were cheating online (nearly 50% if I was detecting them all) who I didn't think would have cheated in the classroom. I didn't like the idea that we were creating a situation that enticed students to cheat.

That being said, the whole experience had an impact on my generally optimistic view of human nature.

BrenBarn · 46m ago
I'm not sure, but I used them as recently as 2019. (Well, not a blue book per se, but a printed exam where students handwrote their answers on the provided sheets.) I'm pretty sure some faculty never stopped using them except when forced to by online classes. (Incidentally, increased pressure to offer more online classes is a worrisome combination with increased AI-aided cheating.)
jccalhoun · 1h ago
I was an English major and Math minor in the 90s and I used them for one class - a history general studies requirement. All the essay tests in my other classes were on letter sized paper with blank space for us to write in.
wrp · 57m ago
You raise a good point. I've assumed that "blue book exam" is used as a generic term for in-class tests that consist of just essays, not necessarily using the physical Blue Books that you buy at the campus bookstore.
CompoundEyes · 1h ago
There are schemes happening in interviews too. I've been doing many technical interviews for a senior role with off shore candidates lately. They come to me after passing a challenging online test that has controls to check for browser focus lost, opening new tabs etc. I think it may require a camera to be on too. Even with that more than half of the candidates can't code.

Our interview usually starts with them breathlessly reading from a script out of the corner of their eye. I'm ok with notes to make sure you hit some high points about yourself even in person. Nervousness shouldn't disqualify a talented person. But with the coding part I've gotten exasperated and started asking these senior candidates to share their screen and do a fizz buzz exercise live in a text editor in the first few minutes. If they struggle I politely end the interview on the 15.

One candidate cheated and it was interesting to watch. In the time I sent the message in Zoom and them sharing their screen, just a few seconds, they had either queried or LLM-ed it on their phone or another computer, had someone off screen or in the same room listening and sharing the answer on another monitor or something else. Whatever it was they turned their head slightly to the side, squinted a bit and typed the answer in Java. A few syncopated characters at a time. When asked what modulo was they didn't know and couldn't make any changes to it. It was wacky. In retrospect I think it was them reading the question out loud to an LLM.

I'm waiting for the candidate who has someone behind them with the same shirt on pretending to be their arms.

sarchertech · 2h ago
This is the obvious solution to chatgpt essays. You could also just add more lab sections to CS classes, and force students to do assignments with no access to AI.

When I took physics we had weekly 3.5 hour lab sections. That should be enough for most CS assignments.

SoftTalker · 2h ago
The lab sections were enough time to run the experiments. Writing up the lab report took additional time outside of that, at least that's how my lab sections went in school.
girvo · 23m ago
Yep, absolutely the case for my labs.
neonate · 1h ago
i_dont_know_ · 2h ago
I feel like these kind of things push us as a society to decide what exactly the purpose of school should be.

Currently, it's been a place for acquiring skills but also a sorting mechanism for us to know who the "best" are... I think we've put too much focus on the sorting mechanism aspect, enticing many to cheat without thinking about the fact that in doing so they shortchange themselves of actual skills.

I feel like some of the language here ("securing assessments in response to AI") really feels like they're worried more about sorting than the fact that the kids won't be developing critical thinking skills if they skip that step.

Maybe we can have

viccis · 1h ago
The current system "sorts" the students who "developed critical thinking skills" out from the ones who didn't put in effort. If there's no expectation that they'll be sorted thus, then the vast majority won't (and right now don't) bother with developing or exercising those skills. Usually they'll just put all their effort into the one or two classes they have that actually make them demonstrate mastery of the material.
belinder · 2h ago
Sure school is for acquiring skills, but it's also day care. A place to keep children during the day so their parents can work, especially in a society where more and more the expectation is that both parents work.
bdowling · 2h ago
The article is about college-level education, which is primarily about ranking students in order of who should get the best entry-level jobs. If technology is disrupting the effectiveness of that ordering function, then something needs to change.
xqcgrek2 · 2h ago
How do blue book exams translate to real world problems the students will encounter?

Why not open book + AI use exams, because that's what students will have in their careers?

benbreen · 2h ago
Exactly, this is the reason why I struggle with this sort of solution to the problems we are all facing in education currently. On the surface it seems to make sense, but the blue book exam is entirely artificial and has basically no relationship to real world skills (subconsciously, even the quality of student handwriting handwriting could influence how graders assess blue books). Even leaving AI aside, anyone writing anything nowadays is using Google and Wikipedia and word processors, so why constrain those?

Oxford and Cambridge have a "tutorial" system that is a lot closer to what I would choose in an ideal world. You write an essay at home, over the course of a week, but then you have to read it to your professor, one on one, and they interrupt you as you go, asking clarifying questions, giving suggestions, etc. (This at least is how it worked for history tutorials when I was a visiting student at an Oxford college back in 2004-5 - not sure if it's still like that). It was by far the best education I ever had because you could get realtime expert feedback on your writing in an iterative process. And it is basically AI proof, because the moment they start getting quizzed on their thinking behind a sentence or claim in an essay, anyone who used ChatGPT to write it for them will be outed.

jbreckmckye · 2m ago
Not sure how it is these days, but at Cambridge, supervisions (what Oxford calls tutorials) did not contribute to our examination / tripos scores. They were just a learning aid.
SoftTalker · 2h ago
It really boils down to: are universities trying to teach abstract knowledge, or are they trade schools?

If they are trade schools, yes teach React and Node using LLMs (or whatever the enabling tools of the day are) and get on with it.

lokar · 29m ago
As a CS student I rather enjoyed the blue book essay exams in my classics courses.

And it did teach and evaluate skills I’ve used me entire career.

nxobject · 2h ago
> Even leaving AI aside, anyone writing anything nowadays is using Google and Wikipedia and word processors, so why constrain those?

And the library, and inter-library loan (in my case), and talking to a professor with a draft...

girvo · 21m ago
For the same reason I wasn’t allowed my extremely powerful SAT solver calculator in my maths exams (for high school and first year uni where it would’ve helped):

Because I was demonstrating that I understood the material intrinsically, not just knew how to use tools to answer it.

acbart · 2h ago
There are many subskills that you must be proficient in without tools, before you can learn more interesting skills. You need to know how to do multiplication by hand before you rely on a calculator. If you can't do multiplication with a calculator, you're not going to be able to make sense of the concepts in Algebra.
mlloyd · 2h ago
There are also many subskills not worth learning to some people. Sometimes traversal is what's needed and not understanding. (Though I'm never going to knock gaining more understanding)

Tools allow traversal of poorly understood, but recognized, subskills in a way that will make one effective in their job. An understanding of the entire stack of knowledge for every skill needed is an academic requirement born out of a lack of real world employment experience. For example, I don't need to know how LLMs work to use them effectively in my job or hobby.

We should stop spending so much time teaching kids crap that will ONLY satisfy tests and teachers but has a much reduced usefulness once they leave school.

noosphr · 1h ago
Algebra has nothing to do with long hand multiplication, people who say otherwise can't do either.

We know, because we taught computers how to do both. The first long multiplication algorithm was written for the Colossus about 10 minutes after they got it working.

The first computer algebra system that could manage variable substitution had to wait for Lisp to be invented 10 years later.

superposeur · 2h ago
petra303 · 2h ago
I doubt they’re talking about entry level maths.
bootsmann · 2h ago
Why should other subjects be any different?
sdwr · 2h ago
Are multiplication and long division by hand really necessary skills?

I never need to "fall back" to the principles of multiplication. Multiplying by the 1s column, then the 10s, then the 100s feels more like a mental math trick (like the digits of multiples of 9 adding to 9) than a real foundational concept.

slipperydippery · 1h ago
They’re proof you learned the topic well enough to coherently write a very-few pages about it. That’s what they’re for.

Making them open book + AI would just mean you need “larger” questions to be as effective a test, so you’re adding work for the graders for basically no reason.

parpfish · 2h ago
most exams aren't about solving a problem, it's about demonstrating that you've learned something.
nxobject · 2h ago
Sadly, I think that's what's going to be lost in the switch back to exams: the ability to assess someone's ability to iteratively solve a problem or craft a thesis.
viccis · 1h ago
Would you let a second grader take their spelling test using a word processor with a spell checker?
nine_k · 2h ago
The problem is not that a student can ask AI for answers. The problem is that the student has to have an idea what questions to ask.

For that, the student must have internalized certain concepts, ideas, connections. This is what has to be tested in a connectivity-free environment.

altairprime · 2h ago
Can’t use AI on a date, or at a dinner party, or during a board meeting.

Faking intelligence with AI only works in an online-exclusive modality, and there’s a lot of real world circumstances where being able to speak, reason, and interpret on the fly without resorting to a handheld teleprompter is necessary if you want to be viewed positively. I think a lot of people are going to be enraged when they discover that dependency on AI is unattractive once AI is universally accessible. “But I benefited from that advantage! How dare they hold that against me!”

cyberax · 1h ago
> Can’t use AI on a date, or at a dinner party, or during a board meeting.

Challenge accepted. One possible solution: https://github.com/RonSijm/ButtFish

altairprime · 1h ago
Cheaters are universal no matter what social boundaries are defined, and neither high-bandwidth wireless signals nor onboard acoustic processing can be reliably performed within the human rectum to any reasonable degree of fidelity. If one would externalize basic critical reasoning skills, I encourage finding another location to store one’s supplemental cranium :)
noosphr · 1h ago
>Can’t use AI on a date, or at a dinner party, or during a board meeting.

I get the same "you won't always have a calculator with you" vibes from 90s teachers chiding you to show your work when I hear people say stuff like this.

altairprime · 1h ago
I wouldn’t equate trigonometry, which underpins the classic parabolic example you’re referring to, with critical reasoning in human conversation. One is situationally useful at best; the other is mandatory to prevent exploitation by malicious people. Mental quadratics may be appealing, but the ability to reason is the bare minimum. Besides: if you’re using a calculator or AI in a board meeting, you’re likely unprepared for the board meeting.
noosphr · 1h ago
I'm not sure how you did proofs in trigonometry with a calculator.
altairprime · 58m ago
I ran my school district out of math to teach me in the 80s and ended up focusing on sysadmin in the 90s rather than repeating trigonometry or pursuing formal math at the local college. Sorry I can’t be of more use to your argument! Perhaps someone else will have applicable experiences.
bigstrat2003 · 1h ago
You say that like those teachers were incorrect. They were correct, and still are correct. You don't always have a computer to hand, and you do in fact need to be able to do basic math.
Ekaros · 1h ago
Also LLMs have fairly well proven that even if you have calculator you probably should have ability to do some sanity check on the answer. In case you hit wrong button for example. With LLMs they can be confidently wrong and unless you are able to tell you are out of luck...

Plus all about capability to actually retain whatever you ask from the model...

altairprime · 50m ago
Ironically, America finally automated one of our oldest workplace specialties: grifting! People overconfidently declaring made-up nonsense in board meetings is a classic executive behavior. I suppose it’ll be interesting to see what happens now that everyone has one in their pocket. Will their patter improve from exposure alone, or will they more easily detected because their skills weaken from disuse?
noosphr · 1h ago
If I don't have access to my phone the power grid has been down for at least two days and by that point I've got more pressing issue than showing my work when doing basic math.
lokar · 31m ago
So we just give up on direct reading comprehension and critical thinking?
furyofantares · 2h ago
In fact, why teach the material at all?
philwelch · 1h ago
If there weren’t differences between the educational environment and the workplace, that would be the same as if you just dumped people straight into the workplace with no education. For education to be useful at all it has to be distinct from the “real world” and optimized for learning. One way of optimizing for learning is to make the student solve the problem the hard way, so the student understands the entire process and not just the parts that are harder to optimize.
add-sub-mul-div · 2h ago
Because a tradeoff of relying on calculators means the average person can't do simple math on their own. It would be bad for society if a crutch for general thinking dulls those skills the same way calculators dulled home economics skills. In theory people could pull out a calculator at the grocery store but I've never seen it happen.
nxobject · 2h ago
As someone whose college papers always went through multiple drafts, I genuinely hope that the "joy" the author feels doesn't get in the way of teaching writing skills.

And, as someone who got paid minimum wage to proctor tests in college, I couldn't keep a straight face at this:

> The most cutting-edge educational technology of the future might very well be a stripped-down computer lab, located in a welcoming campus library, where students can complete assignments, by hand or on machines free of access to AI and the internet, in the presence of caring human proctors.

I think the author's leaning heavily on vibes to do the convincing here.

sarchertech · 2h ago
We had a math lab where you could go do your math homework and the majority of the student assistants were great. Same with the undergrad lab assistants in physics labs.
pessimizer · 42m ago
> I think the author's leaning heavily on vibes to do the convincing here.

I have no idea what you're trying to express in your comment, so who's using vibes?*

Were you triggered by the word "caring?" A waiter usually cares that the people they're serving have an enjoyable meal. It doesn't mean that they love them, it means that they think the work of feeding people is purposeful and honest (and theirs.)

-----

[*] It's certainly not in the words; I don't know what made you angry about "joy," I don't know why you think the author does not teach writing skills in "communications," I don't know why the fact that you went through multiple drafts in writing school papers is relevant or different than anyone else's experience. Maybe that's over now. Maybe I actually don't care if you use AI for your second and further drafts, if I know you can write a first draft.