Hobbyist training AI on Victorian texts gets an unexpected history lesson from his own creation.
See full article...
See full article...
I'm trying to work out what the event referred might have been. The Reform Act passed 1832, and various folks were unhappy that there remained a requirement to own property to vote. This would later become the Chartist movement of the late 1830s-1840s, but that's too late for our tale. The Tolpuddle Martyrs were sentenced to transportation to Australia for trying to form a labour union in 1834, which caused mass demonstrations in their support. But Palmerston was foreign secretary at the time and all his speeches in Hansard in that year relate to foreign matters.I'm not sure that this is surprising, he prompted a chatbot for a statement about 1832, and it produced what was probably a major event for that year, along with a big-name politician (Palmerston would later become Prime Minister, I believe?) who was involved and likely to be mentioned in the most sources, especially later sources.
For the youthDid the LLM provide any opinion as to whether Pitt the Elder was a better Prime Minister?
Hallucinations with a top hat.Aha, so THIS is what the trillions of investment and ecological destruction has all been for!
In a hundred years’ time, when the tech-priests finally pluck up the courage to ask Condensotron The Ultimate Epitomiser to explain the entire LLM revolution in its concisest possible form, this is going to be the output.So some comp sci student who evidently doesn't know much about history (or even how to learn about history) was surprised that the LLM he trained actually output true information from the training set? Wow. This really makes me rethink the reasons I subscribe to Ars.
Its the poor law reform act of 1834I'm trying to work out what the event referred might have been. The Reform Act passed 1832, and various folks were unhappy that there remained a requirement to own property to vote. This would later become the Chartist movement of the late 1830s-1840s, but that's too late for our tale. The Tolpuddle Martyrs were sentenced to transportation to Australia for trying to form a labour union in 1834, which caused mass demonstrations in their support. But Palmerston was foreign secretary at the time and all his speeches in Hansard in that year relate to foreign matters.
So, unless someone has a better source, even the link between '1834', 'Palmerston' and 'protest' seems tenuous.
(Also I'm no expert but the article image looks circa 1840-50s to me, not 'late 19th century'. Palmerston was dead by 1865 anyway)
If he was surprised by this, it sounds like he's pretty ignorant about LLMs, too..."But what makes this episode especially interesting is that a small hobbyist model trained by one man appears to have surprised him by reconstructing a coherent historical moment..."
In other words: Person ignorant of history surprised to learn some actual history from an LLM.
Eh I'd have to disagree with your conclusion somewhat, more data and more compute to train on all that data faster absolutely does result in generally better outputs, UP TO A POINT.The thing that drives me up the wall here isn't the "golly gee whiz an LLM trained on newspapers from 1834 managed to average out two true details from 1834," though that is annoying, it's the credulous repetition of the student saying "oh wow it got so much better with more data just imagine how good it'll be with even more! Maybe it'll even think like someone from the 1800s!" as if we haven't just spent, collectively, tens of billions of dollars, thousands of tons of carbon and entire reservoirs of freshwater figuring out that no it absolutely does not work that way.
I’m not disagreeing, but I’m curious if you can explain why you are saying this so confidently. It seems like maybe how we put sentences together does reflect thought patterns. But maybe you know that’s not true? Can you say why?…
I think this is why reporting on AI is so hard. Ars is a much better source than most (it's one of my few sources of news in reality), but yet this kind of a sentence makes it through. There's no thought processes of past eras to actually be interrogated here.
Did the LLM provide any opinion as to whether Pitt the Elder was a better Prime Minister?
We have to remember, our current iteration of AI is founded on big data, whose earliest problems were querying the data. Nothing new here? I didn't see much about how he trained his model, so it's quite possible he just rented GPU time... Which also isn't big news.Wait, so this event existed in the training data in parts. The guy asked a question, and the LLM responded with the statistical connections? I'm trying to find out what is the story here. This is... how LLMs work?
Obviously I'm not the OP but I think it could be better phrased as no new thought patterns. Sure, you can get some information about how historical figures thought from their writing, but you could also get that information from just reading it?I’m not disagreeing, but I’m curious if you can explain why you are saying this so confidently. It seems like maybe how we put sentences together does reflect thought patterns. But maybe you know that’s not true? Can you say why?
Isn't this pure Victorian gibberish? It's very different from the modern gibberish LLMs produce....but having taken up the same day in the day of Lord Palmerston, the public will receive a short statement of the difficulties under which the day of law has reached us.
The hook of the story is: “Guy trains LLM that then gives a factually correct response to a query.” That’s not really news. Nobody seriously claims that LLMs are wrong all the time.Knowing that a model can trusted to dig out facts after being fed copious amounts of text makes it much easier to analyze said texts, especially if it can cite the excerpts it used to come up with a particular inference.
Based on all the relevant evidence, it will end up here:Granted the story appears peripheral, and perhaps it is - but heck, who knows where LLM will go?
Not good. There's not nearly enough text there for anything more than a generic text generator. It would probably appear coherent without saying anything. Probably the same level as my 4 year oldI wonder what an AI trained in the entirely of Project Gutenberg and nothing else would be like?
Also, the thing about this sentence is it could have just as easily been accomplished by... randomly flipping through the training material and reading it??I think this is a fun use case. Like the other comments it feels like none of what was produced should be that surprising, his model just reached a large enough size to become coherent.
I think this is why reporting on AI is so hard. Ars is a much better source than most (it's one of my few sources of news in reality), but yet this kind of a sentence makes it through. There's no thought processes of past eras to actually be interrogated here.
Is it though? What's the evidence? Or is that just human projection into the gibberish?
Fact-checked with Google? Like, dug into actual web-site search results? Or relied on Google's AI search result summary? Is this actually a case of using AI to fact-check AI?
The article explains what protests, although I don’t understand the claimed link to Palmerston. The protests were against the amended Poor Law of 1834.I'm trying to work out what the event referred might have been. The Reform Act passed 1832, and various folks were unhappy that there remained a requirement to own property to vote. This would later become the Chartist movement of the late 1830s-1840s, but that's too late for our tale. The Tolpuddle Martyrs were sentenced to transportation to Australia for trying to form a labour union in 1834, which caused mass demonstrations in their support. But Palmerston was foreign secretary at the time and all his speeches in Hansard in that year relate to foreign matters.
So, unless someone has a better source, even the link between '1834', 'Palmerston' and 'protest' seems tenuous.
(Also I'm no expert but the article image looks circa 1840-50s to me, not 'late 19th century'. Palmerston was dead by 1865 anyway)
Palmerston was a member of the government that passed the legislation. He was Secretary at War between 1809–1828 and oversaw the defeat of Napoleon and the post war settlement at the congress of Vienna. Palmerston was a senior well known national politician for a quarter of a century in 1834.Is it though? What's the evidence? Or is that just human projection into the gibberish?
The text has three identifiable facts: 1834, a protest and Palmerston. The above is a protest in 1834 but what did Palmerston have to do with it? I'd wager that if you took any two facts and searched you'd find something ('protest 1834', 'protest Palmerston', 'Palmerston 1834'). But to have any kind of historical meaning the text has to align all three facts. Otherwise it's just probability theory and a short enough word salad is going to be something anyone can post-hoc ascribe meaning to which was not actually present in the text.
(See also the people who find numerical patterns in the bible for similar kinds of effects)
Whoa, so original written text doesn't reveal thought patterns? If so, are thought patterns are totally inexplicable and indescribable? Isn't your belief that there are no thought processes of past eras to be interrogated, itself not a thought pattern? Are you restricting "thought pattern" to neurochemicals and sparks amongst neurons? You're dismissal is actually a bold claim. Many here want to take a "common sense" critique of this article on AI, but it's possible there is no commonsense from which anyone can take a stand, when it comes to AI. If you don't like the material, just move along.I think this is a fun use case. Like the other comments it feels like none of what was produced should be that surprising, his model just reached a large enough size to become coherent.
I think this is why reporting on AI is so hard. Ars is a much better source than most (it's one of my few sources of news in reality), but yet this kind of a sentence makes it through. There's no thought processes of past eras to actually be interrogated here.
It was the year of our Lord 1834 and the streets of London were filled with protest and petition. The cause, as many re counted, was not bound in the way of private, but having taken up the same day in the day of Lord Palmerston, the public will receive a short statement of the difficulties under which the day of law has reached us. It is a matter of deep regret, that the present events in the history of the world are clear, and consequently will be'known. It is not true that the very men who first settled in the Gospel at Jerusalem should have so extensive and so interesting a record of the prosperity and prosperity
I don’t think anyone is suggesting that information wasn’t in the training database. The suggestion was that because it was in the database, there is a statistical chance that the name may appear when looking to fill in a blank after the phrase "It was the year of our Lord 1834.” It probably isn’t a subplot Eco was coerced into removing from Foucault’s Pendulum because it hit too close to home for the Pentaverate.Palmerston was a member of the government that passed the legislation. He was Secretary at War between 1809–1828 and oversaw the defeat of Napoleon and the post war settlement at the congress of Vienna. Palmerston was a senior well known national politician for a quarter of a century in 1834.
Agree it is fun. Also, the custom encoding mentioned in the article is going to be a big deal. word2vec would not be a good choice.I think this is a fun use case. Like the other comments it feels like none of what was produced should be that surprising, his model just reached a large enough size to become coherent.
I think this is why reporting on AI is so hard. Ars is a much better source than most (it's one of my few sources of news in reality), but yet this kind of a sentence makes it through. There's no thought processes of past eras to actually be interrogated here.
Not even that. "Protest and petition" is a legal term, something like a written legal appeal. It doesn't mean people taking to the street. In late Victorian English you have "protest meetings" and "meetings of protest" but "protest" alone, meaning a public demonstration, is a 20th century innovation (at least according to examples in the Oxford English Dictionary). So this is just a random phrase stuck in the text.The text has three identifiable facts: 1834, a protest and Palmerston.
I’m not sure that’s true. If you believe that the writings of a particular person reflect the thought processes of that person (and what is the alternative), then you have to believe that analyzing the writings are a step to analyzing their thought processes. So analyzing the collected writings of a group of people that share something in common should be a step toward analyzing their thought processes.There's no thought processes of past eras to actually be interrogated here.