AI Jargon Dictionary (March 2026 Edition), Recommended to Bookmark

marsbitPublished on 2026-03-11Last updated on 2026-03-11

Abstract

AI Jargon Dictionary (March 2026 Edition): A Practical Guide This article provides a clear glossary of essential AI terminology, perfect for anyone looking to quickly understand the field. It breaks down key concepts into two categories: foundational and advanced terms. Foundational terms (12) include: - **LLM (Large Language Model)**: Deep learning models trained on vast data to understand and generate language. - **AI Agent**: Systems that understand goals, use tools, and execute multi-step tasks. - **Multimodal**: Models that process and generate multiple content types (text, image, audio, video). - **Generative AI (AIGC)**: AI focused on creating new content. - **Token**: The basic unit for processing input/output, affecting cost and performance. - **Context Window**: The amount of data a model can consider at once. - **RAG (Retrieval-Augmented Generation)**: Enhances responses using external, up-to-date data sources. Advanced terms (18) cover: - **Transformer**: The architecture behind modern LLMs. - **Hallucination**: When models generate incorrect but confident responses. - **Agentic Workflow**: Systems that autonomously break down and execute tasks. - **Vibe Coding**: Generating code through natural language commands without manual coding. The guide is designed to help readers confidently navigate AI discussions and implementations.

Author|Golem(@web 3_golem)

Now, if people in the crypto circle don't pay attention to AI, they are easily ridiculed (yes, my friend, think about why you clicked in).

Are you completely clueless about the basic concepts of AI, asking Douban what every abbreviation in a sentence means? Or are you at AI offline events, confused by all the jargon, yet pretending you're still online?

Although it's unrealistic to jump into the AI industry overnight, it's always beneficial to know the high-frequency basic vocabulary of the AI industry. Fortunately, the following article is prepared for you↓ Sincerely recommend you to read it thoroughly and bookmark it.

Basic Vocabulary (12)

LLM (Large Language Model)

The core of an LLM is a deep learning model trained on massive amounts of data, excelling at understanding and generating language. It processes text and is increasingly capable of handling other types of content.

In contrast, there is the SLM (Small Language Model)—typically emphasizing lower cost, lighter deployment, and greater convenience for localization.

AI Agent (AI Intelligent Agent)

An AI Agent refers not just to a "chat model," but to a system that can understand goals, call tools, execute tasks step-by-step, and even perform planning and verification when necessary. Google defines an agent as software that can reason based on multimodal inputs and perform actions on behalf of the user.

Multimodal (Multimodal)

Such AI models don't just process text but can simultaneously handle multiple forms of input and output, such as text, images, audio, and video. Google explicitly defines multimodality as the ability to process and generate different types of content.

Prompt (Prompt)

The instruction input by the user to the model, which is the most basic form of human-machine interaction.

Generative AI (Generative AI / AIGC)

Emphasizes AI's ability to "generate" rather than merely classify or predict. Generative models can generate text, code, images, memes, videos, and other content based on prompts.

Token (Token)

This is one of the concepts in the AI world most similar to a "Gas unit." Models don't process content by "word count" but by tokens. Billing, context length, and response speed are often strongly related to tokens.

Context Window (Context Window / Context Length)

Refers to the total number of tokens a model can "see" and utilize at one time. It can also be described as the number of tokens the model can consider or "remember" during a single processing instance.

Memory (Memory)

Allows the model or Agent to retain user preferences, task context, and historical states.

Training (Training)

The process by which a model learns parameters from data.

Inference (Inference Execution)

Opposite of training, it refers to the process where the model receives input and generates output after deployment. The industry often says, "Training is expensive, but inference is even more costly," because many costs during the actual commercialization phase occur during inference. The distinction between training and inference is also a fundamental framework in discussions about deployment costs among mainstream vendors.

Tool Use / Tool Calling (Tool Calling)

Means the model doesn't just output text but can also call tools such as search, code execution, databases, and external APIs. This is already considered one of the key capabilities of an Agent.

API (Interface)

The infrastructure used when AI products, applications, or Agents connect to third-party services.

Advanced Vocabulary (18)

Transformer (Transformer Architecture)

A model architecture that makes AI better at understanding contextual relationships. It is the technical foundation for most large language models today. Its biggest feature is the ability to simultaneously consider the relationships between every word in an entire segment of content.

Attention (Attention Mechanism)

It is the most critical core mechanism of the Transformer. Its role is to allow the model to automatically determine "which words are most worth focusing on" when reading a sentence.

Agentic / Agentic Workflow (Agentic / Agentic Workflow)

This is a very hot term recently. It means a system is no longer just "question-and-answer" but has a certain degree of autonomy to break down tasks, decide the next steps, and call external capabilities. Many vendors see it as a sign of "moving from Chatbot to an executable system."

Subagents (Sub-agents)

An Agent further breaks down into multiple specialized smaller Agents to handle sub-tasks.

Skills (Reusable Capability Modules)

With the explosion of OpenClaw, this term has become noticeably more common. These are installable, reusable, and composable capability units/operating instructions for AI Agents, but there are also specific warnings about tool misuse and data exposure risks.

Hallucination (Machine Hallucination)

Refers to the model confidently spouting nonsense, "perceiving patterns that do not exist," thereby generating incorrect or absurd outputs. This is the model's seemingly reasonable but actually erroneous overconfident output.

Latency (Latency)

The time it takes for the model to output results after receiving a request. It is one of the most common engineering jargon terms and frequently appears in discussions about implementation and productization.

Guardrails (Guardrails)

Used to restrict what the model/Agent can do, when to stop, and what content cannot be output.

Vibe Coding (Vibe Coding)

This term is also one of the hottest AI jargons today. It means users directly express requirements through conversation, and the AI writes the code, while the user doesn't need to specifically understand how to code.

Parameters (Parameters)

The internal numerical scale of the model used to store capabilities and knowledge. Often used as a rough measure of the model's size. "Billions of parameters" and "trillions of parameters" are among the most common intimidating phrases in the AI circle.

Reasoning Model (Strong Reasoning Model)

It typically refers to models that are better at multi-step reasoning, planning, verification, and executing complex tasks.

MCP (Model Context Protocol)

This is a very hot new jargon in the past year. Its role is similar to establishing a universal interface between the model and external tools/data sources.

Fine-tuning / Tuning (Fine-tuning)

Continuing training on a base model to make it more adapted to specific tasks, styles, or domains. Google's glossary directly lists tuning and fine-tuning as related concepts.

Distillation (Distillation)

Compressing the capabilities of a large model into a smaller model as much as possible, akin to a "teacher" teaching a "student."

RAG (Retrieval-Augmented Generation)

This has almost become a basic configuration for enterprise AI. Microsoft defines it as a "search + LLM"模式 (pattern), using external data to ground answers, solving problems like outdated training data and lack of knowledge about private knowledge bases. The goal is to base answers on real documents and private knowledge, rather than relying solely on the model's own memory.

Grounding (Fact Alignment)

Often appears together with RAG. It means making the model's answers based on external sources like documents, databases, web pages, etc., rather than relying solely on parameter memory for "free creation." Microsoft explicitly lists grounding as a core value in its RAG documentation.

Embedding (Vector Embedding / Semantic Vector)

Encoding content like text, images, and audio into high-dimensional numerical vectors to perform semantic similarity calculations.

Benchmark (Benchmark Test)

An evaluation method that uses a unified set of standards to test model capabilities. It is also the ranking language most loved by various companies to "prove they are strong."

Recommended Reading

Lobster's Key 11 Questions: The Most Easy-to-Understand Breakdown of OpenClaw Principles

Related Questions

QWhat is the core definition of an LLM according to the article?

ALLM (Large Language Model) is a deep learning model trained on massive amounts of data, excelling at understanding and generating language. It primarily processes text but is increasingly capable of handling other types of content.

QWhat key ability is described as essential for an AI Agent beyond just chatting?

AAn AI Agent is defined as a system that can understand goals, call tools, execute tasks step-by-step, and perform planning and verification when necessary. It's definition by Google includes reasoning based on multimodal inputs and acting on behalf of the user.

QWhat does the term 'Multimodal' refer to in the context of AI models?

AMultimodal refers to an AI model's ability to simultaneously process and generate different types of content, such as text, images, audio, and video, not just text alone.

QWhat is the purpose of RAG (Retrieval-Augmented Generation) as explained in the article?

ARAG is a 'search + LLM' model that uses external data to ground answers, addressing issues like outdated training data or lack of private knowledge. It ensures responses are based on real documents and private knowledge rather than the model's memory alone.

QWhat does 'Hallucination' mean in relation to AI models?

AHallucination refers to the phenomenon where an AI model confidently generates incorrect or nonsensical outputs by perceiving patterns that do not exist, resulting in seemingly plausible but factually wrong information.

Related Reads

Huawei Cloud Rejects Token Price War, Zhou Yuefeng Seeks a New Winning Formula for AI Cloud

At the 2026 Huawei Cloud INSPIRE Creator Conference, CEO Zhou Yuefeng outlined Huawei Cloud's distinct strategy in the competitive AI cloud market. Instead of engaging in price wars based on token volume or Maas revenue—a common focus for rivals like Alibaba Cloud and ByteDance's Volcano Engine—Huawei Cloud is shifting the competition towards real-world productivity gains. Zhou highlighted three core differentiators: a fully domestic computing stack (Ascend, Kunpeng), a focus on government and enterprise clients rather than consumer internet, and a deep commitment to open-source ecosystems. To this end, Huawei Cloud launched a suite of new products under the "Agentic Infra" paradigm, including the AICS Lingqu computing cluster, AMS memory storage, and the ModelArts Next platform. These aim to solve enterprise challenges in deploying AI agents, such as latency, memory, scheduling, and security. The strategy further involves creating specialized industry zones ("AI Dream Factories") for sectors like healthcare and embodied intelligence. For example, a smart medical zone developed with Shanghai Ruijin Hospital aims to democratize expert-level diagnostic capabilities. In essence, Huawei Cloud is positioning itself not as a commodity token provider, but as the foundational infrastructure for industrial AI, leveraging its domestic supply chain and hybrid cloud solutions to serve sectors where productivity, not just scale, is the ultimate measure of value.

marsbit7m ago

Huawei Cloud Rejects Token Price War, Zhou Yuefeng Seeks a New Winning Formula for AI Cloud

marsbit7m ago

70% of the Public Opposes AI, Americans Hope the U.S. Loses the AI War

70% of Americans believe AI development is moving too fast, with growing public resistance evolving from online criticism to real-world protests and violence. This widespread anti-AI sentiment stems from fears of job losses, rising utility costs, environmental damage, threats to democracy, and financial instability. Key incidents illustrate the backlash: Google's former CEO Eric Schmidt was loudly booed at a graduation for promoting AI; AI company ads are vandalized; protests and even violent attacks target AI firms and data centers. Polls show deep public pessimism and strong local opposition to data center construction, often surpassing resistance to nuclear power plants. The core grievances are economic and practical: AI is seen as automating jobs, concentrating wealth, and increasing household electricity and water bills due to massive data center resource demands. Environmentalists also oppose AI's high energy use and carbon emissions. This opposition has turned AI into a major political issue in the US. While the Trump administration prioritizes AI innovation for global competition, bipartisan pushback is growing. Democrats and factions within the MAGA movement are forming temporary alliances to support stricter regulations and local bans on new data centers, pressuring the administration to choose between its tech industry backers and its voter base. The situation highlights a profound national divide over AI's future.

marsbit40m ago

70% of the Public Opposes AI, Americans Hope the U.S. Loses the AI War

marsbit40m ago

Agents Take Over Traffic Distribution Power: What Are Tencent, ByteDance, and Alibaba Competing For?

In the race to dominate the AI era's entry point, China's tech giants—Tencent, ByteDance, and Alibaba—are aggressively deploying AI Agents to control the future of traffic distribution. Alibaba is pursuing a dual-track "closed loop + openness" strategy. Its Qianwen app is evolving into a super-Agent integrated across its ecosystem (Taobao, Alipay, etc.) to handle complex tasks like travel planning. Concurrently, it is opening its platform to external brands (Luckin Coffee, KFC) and has launched a B2B Agent platform, "Wukong," targeting enterprise automation. Its other flagship, Quark, aims to be an "AI super search box" for information and tasks. ByteDance is executing an omnipresent "sprawl strategy." Its Doubao app boasts over 300 million monthly active users and is evolving into a default AI entry point for daily life, with plans for paid versions and e-commerce integration. Its core weapon is the Kouzi platform, a visual "AI assembly factory" for developers to build custom Agents. ByteDance is also pushing hardware integration, collaborating on AI phones and developing smart glasses to embed Doubao everywhere. Tencent is playing its long-held "ultimate card" by quietly embedding an AI Agent directly into WeChat. This Agent, accessible via a swipe, can understand user commands and automatically execute tasks by calling upon WeChat's millions of mini-programs (e.g., finding and ordering coffee). This leverages WeChat's unparalleled 1.4-billion-user ecosystem to position the app as an AI-powered "service operating system," a move that could dramatically reshape the competitive landscape. The core battleground is shifting from competing for "user screen time" to competing to be the "default execution layer" for user intent. The business model is evolving from an "attention economy" to an "intent economy," where the Agent that can most efficiently fulfill a user's need gains control over service access and token flow. This represents a fundamental change in how users connect with digital services, making the fight for the Agent入口 (entry point) a pivotal moment for redefining industry leadership in the AI age.

marsbit2h ago

Agents Take Over Traffic Distribution Power: What Are Tencent, ByteDance, and Alibaba Competing For?

marsbit2h ago

Trading

Spot
Futures

Hot Articles

Discussions

Welcome to the HTX Community. Here, you can stay informed about the latest platform developments and gain access to professional market insights. Users' opinions on the price of AI (AI) are presented below.

活动图片