Models

THE BIG THREE PROPRIETARY FRONTIER MODELS 

OTHER PROPRIETARY MODELS

SCIENCE and MATH MODELS

OPEN SOURCE MODELS

REGIONAL and CULTURALLY-SPECIFIC MODELS

HISTORICAL LLMS

CONSOLIDATORS 

REASONING MODELS: In 2025 models started to process through problems before answering. They do NOT actually reason (although it appears that way) but they have internal instructions that break problems down into steps which (especially when combined with web searching) improves accuracy and allows much more complicated problem solving. You need to use them a little differently (more here): give it something hard to do and note (or ask) how it describes its reasoning. Look at this example. The progress here has been rapid and substantial (read this report about the new o3 from Dec 2024). Most of the above models now have free reasoning (but with different names so look for buttons: Kimi calls this “researcher.”)

CUSTOM BOTS: Each of the big models also has a way to build and then distribute your own fine-tuned applications with your own prompt instructions. There are also GPTs (from OpenAI), Assistants (from HuggingFace), Bots (from Poe). There are also educational platforms, like BoodleBox which allows the teacher to see everything students do–and has lots of other faculty features like “coach mode” which the chat default (and won’t provide students with direct answers. Much more (including how to build them) on the Custom Bot page.

MINI MODELS and EDGE AI: These are smaller, faster and more specialized (often) OPEN SOURCE tools that you customize to live and run on your phone. Note that the ways to make an LLM better are model size (see Frontier models above), data set size and and the amount of training. Since it is not clear that larger more capable models will be cost effective, these faster smaller models (with more training) may end up being more useful. Apple Intelligence will test this idea. More smaller models are coming. 

AGENTS

A chatbot can only chat with you, but an “agent” can plan and execute a series of tasks, like building you a website or finding information on your computer. GenAI can create content, but Agentic AI can work autonomously. Agents can also respond to a trigger–without you having to initiate. Agents code, but that is essentially everything you do on a computer, so don’t think of these just as coding tools. Agents can use multiple tools and know when to switch, so an AI agent can manage a workflow. It is like a contractor rather than a chatbot. Here are details about the “Agent2Agent” (or A2A) or “Model Context Protocol.” MCP is an open-source universal interface for connecting AI to external systems and data. MCP is like a port on your computer (or like being connected to the internet)–you don’t need to know how it works, just that it connects files and tools (like AI and your calendar). MCP was released by Anthropic in Nov 2024 and rapidly adopted. Where RAG is static and one directional (Ai can look for answers in a file) MCP is bi-directional, but they are both ways to give AI context. The distinction between agents and vibe-coding apps is narrowing–the difference is partly workflow and that many of the apps use other foundation models. Stay tuned. There are now lots of demos of agents doing students homework.

AGENIC BROWSERS

First came the browser extensions (with a large cohort of fill in the answer cheating extensions). Google now has its Gemini AI built into its browser(although AI mode is substantially better than the default AI summaries.) Microsoft Edge now has CoPilot integrated. Chrome and other browsers are also starting to integrate, but we now have new ground up AI-browsers with the ability to both ask questions and do things built in.

ROBOTS & MORE

AI is also propelling an advance in robotics including new home robots like Neo, Figure 03, the Walker S2 (from UBTech now actually mass delivered) and robots dogs). Look at the ridable hydrogen-powered Corleo from Kawasaki. There is also AlterEgo, which allows you to have silent (“almost telepathic”) conversations with AI or another human.

EpochAI is an important independent organization that is keeping track of these models, how they compare and where we might be going. They maintain a great dashboardcomparing capabilities of the best models (against their own benchmarks) and also this larger data set of virtual all models. They produce excellent reports about trends including a recent prediction that AI will continue to improve rapidly.

You can find a complete list of AI products (tracked by Ithaka S+R) here

Here is a great AI guide for students.