Vintage chatbot lives in the past like an elderly relative
A team of AI researchers has developed Talkie, a 13-billion-parameter language model trained exclusively on English-language texts published before 1931, making it unable to reference events or technology after that period. The model aims to help researchers understand AI behavior, test forecasting abilities, and study historical language and cultural context. Despite its novelty, Talkie underperforms compared to modern models due to OCR errors and data limitations, and it occasionally exhibits temporal leakage by referencing post-1930 events. The creators plan to scale the model and improve its training data, including re-OCR efforts and collaboration with historians.
- ▪Talkie is a vintage language model trained only on texts published before the end of 1930, limiting its knowledge to pre-Depression-era information.
- ▪The model was created to study AI cognition, historical language interpretation, and the potential for AI to make scientific predictions based on period-limited data.
- ▪Talkie underperforms modern models in most evaluations due to noise from optical character recognition (OCR) of scanned historical documents.
- ▪The model shows 'temporal leakage,' incorrectly referencing post-1930 figures like FDR, due to imperfect filtering of training data.
- ▪Talkie is publicly available on GitHub and Hugging Face, with a warning that it may generate inaccurate or offensive content reflecting early 20th-century biases.
Opening excerpt (first ~120 words) tap to expand
AI + ML 4 Vintage chatbot lives in the past like an elderly relative 4 Talkie's training data stops at the end of 1930, and its creators hope it'll help us better understand how AI thinks Brandon Vigliarolo Tue 28 Apr 2026 // 17:51 UTC If you're tired of interacting with a bot that spews Nazi propaganda or refers to itself as MechaHitler, you could sign off of Elon Musk's xAI. Or, just to be sure, use an LLM whose training data ends in 1930, three years before the Nazis took power in Germany and nine years before World War II started.
…
Excerpt limited to ~120 words for fair-use compliance. The full article is at The Register.