AI Jargon Dictionary (March 2026 Edition), Recommended to Bookmark

Odaily星球日报Опубликовано 2026-03-11Обновлено 2026-03-11

Введение

The "AI Jargon Dictionary (March 2026 Edition)" is a practical guide for those new to the AI field, especially crypto enthusiasts looking to stay relevant. It covers essential and advanced AI terms to help readers understand key concepts and avoid confusion in industry discussions. The dictionary is divided into two parts: **Basic Vocabulary (12 terms):** - Core concepts like LLM (Large Language Model), AI Agent (intelligent systems that execute tasks), Multimodal (handling multiple data types), and Prompt (user instructions). - Key technical terms: Token (processing unit), Context Window (token capacity), Memory (retaining user data), Training vs. Inference (learning vs. execution), and Tool Use (calling external tools). - Generative AI (AIGC) and API (integration interface) are also explained. **Advanced Vocabulary (18 terms):** - Technical foundations: Transformer architecture, Attention mechanism, and Parameters (model scale). - Emerging trends: Agentic Workflow (autonomous systems), Subagents, Skills (reusable modules), and Vibe Coding (AI-assisted programming). - Challenges: Hallucination (incorrect outputs), Latency (response time), Guardrails (safety controls). - Optimization techniques: Fine-tuning, Distillation (model compression), RAG (Retrieval-Augmented Generation), Grounding (fact-based responses), Embedding (vector encoding), and Benchmark (performance evaluation). The article emphasizes practicality, urging readers to learn these term...

Original | Odaily Planet Daily (@OdailyChina)

Author | Golem (@web 3_golem)

Now, if people in the crypto circle don't pay attention to AI, they are easily ridiculed (yes, my friend, think about why you clicked in).

Are you completely clueless about the basic concepts of AI, asking Doubao what every abbreviation in a sentence means? Or are you confused by various technical terms at AI offline events, pretending you haven't lost track?

Although it's unrealistic to jump into the AI industry overnight, knowing the high-frequency basic vocabulary of the AI industry is always beneficial. Fortunately, the following article is prepared for you↓ Sincerely recommend that you read it thoroughly and bookmark it.

Basic Vocabulary (12)

LLM (Large Language Model)

LLM is essentially a deep learning model trained on massive amounts of data, excelling at understanding and generating language. It processes text and is increasingly capable of handling other types of content.

In contrast, there is SLM (Small Language Model)—typically emphasizing lower cost, lighter deployment, and more convenient localization of language models.

AI Agent (AI Intelligent Agent)

AI Agent refers not just to "a model that can chat," but to a system that can understand goals, call tools, execute tasks step by step, and even plan and verify when necessary. Google defines an agent as software that can reason based on multimodal inputs and perform actions on behalf of the user.

Multimodal (Multimodal)

Such AI models don't just read text; they can simultaneously process and generate various types of input and output, including text, images, audio, and video. Google explicitly defines multimodality as the ability to process and generate different types of content.

Prompt (Prompt)

The instruction input by the user to the model, which is the most basic form of human-computer interaction.

Generative AI (Generative AI / AIGC)

Emphasizes AI's ability to "generate" rather than merely classify or predict. Generative models can generate text, code, images, memes, videos, and other content based on prompts.

Token (Token)

This is one of the concepts in the AI circle most similar to a "Gas unit." Models don't understand content by "word count" but process input and output by tokens. Billing, context length, and response speed are often strongly related to tokens.

Context Window (Context Window / Context Length)

Refers to the total number of tokens a model can "see" and utilize at once. It can also be described as the number of tokens the model can consider or "remember" during a single processing instance.

Memory (Memory)

Allows the model or Agent to retain user preferences, task context, and historical states.

Training (Training)

The process by which the model learns parameters from data.

Inference (Inference Execution)

Opposite of training, it refers to the process where the model receives input and generates output after deployment. The industry often says, "Training is expensive, but inference is even more costly," because many costs during the actual commercialization phase occur during inference. The distinction between training and inference is also a fundamental framework in discussions about deployment costs among major vendors.

Tool Use / Tool Calling (Tool Calling)

Means the model doesn't just output text but can also call tools such as search, code execution, databases, and external APIs. This is already considered one of the key capabilities of an Agent.

API (Interface)

The infrastructure used when AI products, applications, or Agents connect to third-party services.

Advanced Vocabulary (18)

Transformer (Transformer Architecture)

A model architecture that enables AI to better understand contextual relationships. It is the technical foundation for most large language models today. Its biggest feature is the ability to simultaneously consider the relationships between every word in an entire segment of content.

Attention (Attention Mechanism)

It is the most critical core mechanism of the Transformer. Its role is to allow the model to automatically determine "which words are most worth focusing on" when reading a sentence.

Agentic / Agentic Workflow (Agentic / Agentic Workflow)

This is a very hot term recently. It means a system is no longer just "question and answer" but has a certain degree of autonomy to break down tasks, decide next steps, and call external capabilities. Many vendors see it as a sign of "moving from Chatbot to an executable system."

Subagents (Sub-agents)

An Agent further breaks down into multiple specialized smaller Agents to handle sub-tasks.

Skills (Reusable Capability Modules)

With the popularity of OpenClaw, this term has become noticeably more common recently. These are installable, reusable, and composable capability units/operation manuals for AI Agents. However, there are specific warnings about tool misuse and data exposure risks.

Hallucination (Machine Hallucination)

Refers to the model speaking nonsense with a straight face, "perceiving patterns that do not exist" and thus generating erroneous or absurd outputs. It is the model's seemingly reasonable but actually wrong overconfident output.

Latency (Latency)

The time it takes for the model to output results after receiving a request. It is one of the most common engineering jargon terms and frequently appears in discussions about implementation and productization.

Guardrails (Guardrails)

Used to restrict what the model/Agent can do, when to stop, and what content cannot be output.

Vibe Coding (Vibe Coding)

This term is also one of the most popular AI jargons today. It means users directly express requirements through conversation, and the AI writes the code, while the user doesn't need to specifically understand how to code.

Parameters (Parameters)

The internal numerical scale of the model used to store capabilities and knowledge. It is often used as a rough measure of the model's size. "Billions of parameters" and "trillions of parameters" are among the most common intimidating phrases in the AI circle.

Reasoning Model (Strong Reasoning Model)

It typically refers to models that are better at multi-step reasoning, planning, verification, and complex task execution.

MCP (Model Context Protocol)

This is a very hot new jargon in the past year. Its role is similar to establishing a universal interface between the model and external tools/data sources.

Fine-tuning / Tuning (Fine-tuning)

Continuing training on a base model to make it more adapted to specific tasks, styles, or domains. Google's terminology list directly treats tuning and fine-tuning as related concepts.

Distillation (Distillation)

Compressing the capabilities of a large model into a smaller model as much as possible, akin to having a "teacher" teach a "student."

RAG (Retrieval-Augmented Generation)

This has almost become a basic configuration for enterprise AI. Microsoft defines it as a "search + LLM" pattern, using external data to ground the answers, solving problems like outdated training data and lack of knowledge about private knowledge bases. The goal is to base answers on real documents and private knowledge, rather than relying solely on the model's own memory.

Grounding (Fact Alignment)

Often appears together with RAG. It means making the model's answers based on external sources like documents, databases, web pages, etc., rather than just relying on parameter memory for "free play." Microsoft explicitly lists grounding as a core value in its RAG documentation.

Embedding (Vector Embedding / Semantic Vector)

Encoding text, images, audio, and other content into high-dimensional numerical vectors to calculate semantic similarity.

Benchmark (Benchmark Test)

A evaluation method that uses a unified set of standards to test model capabilities. It is also the language of leaderboards that various vendors love to use to "prove they are strong."

Связанные с этим вопросы

QWhat is the core function of a Large Language Model (LLM) according to the article?

AThe core function of an LLM is to be a deep learning model trained on massive amounts of data, excelling at understanding and generating language. It primarily processes text and is increasingly capable of handling other types of content.

QWhat key ability defines an AI Agent beyond just being a 'chatty model'?

AAn AI Agent is defined by its ability to understand goals, call upon tools, execute tasks step-by-step, and perform planning and verification when necessary. It's a system that can reason based on multimodal input and perform actions on behalf of the user.

QWhat does the term 'Token' represent in the context of AI models, and what is it compared to?

AA 'Token' is a unit of processing for AI models, similar to how 'Gas' is a unit in some blockchain systems. It's how the model understands content, not by word count. Fees, context length, and response speed are all strongly related to token usage.

QWhat is the purpose of 'RAG' (Retrieval-Augmented Generation) as described in the article?

AThe purpose of RAG is to combine search with a Large Language Model (LLM). It uses external data to ground the model's answers, solving problems like outdated training data and a lack of knowledge about private databases. The goal is to base answers on real documents and private knowledge rather than relying solely on the model's internal memory.

QWhat does the term 'Hallucination' mean in relation to AI models?

A'Hallucination' refers to the phenomenon where an AI model confidently produces incorrect or nonsensical outputs, 'perceiving patterns that do not exist'. It is the model's seemingly reasonable but actually erroneous overconfident output.

Похожее

Google and Amazon Simultaneously Invest Heavily in a Competitor: The Most Absurd Business Logic of the AI Era Is Becoming Reality

In a span of four days, Amazon announced an additional $25 billion investment, and Google pledged up to $40 billion—both direct competitors pouring over $65 billion into the same AI startup, Anthropic. Rather than a typical venture capital move, this signals the latest escalation in the cloud wars. The core of the deal is not equity but compute pre-orders: Anthropic must spend the majority of these funds on AWS and Google Cloud services and chips, effectively locking in massive future compute consumption. This reflects a shift in cloud market dynamics—enterprises now choose cloud providers based on which hosts the best AI models, not just price or stability. With OpenAI deeply tied to Microsoft, Anthropic’s Claude has become the only viable strategic asset for Google and Amazon to remain competitive. Anthropic’s annualized revenue has surged to $30 billion, and it is expanding into verticals like biotech, positioning itself as a cross-industry AI infrastructure layer. However, this funding comes with constraints: Anthropic’s independence is challenged as it balances two rival investors, its safety-first narrative faces pressure from regulatory scrutiny, and its path to IPO introduces new financial pressures. Globally, this accelerates a "tri-polar" closed-loop structure in AI infrastructure, with Microsoft-OpenAI, Google-Anthropic, and Amazon-Anthropic forming exclusive model-cloud alliances. In contrast, China’s landscape differs—investments like Alibaba and Tencent backing open-source model firm DeepSeek reflect a more decoupled approach, though closed-source models from major cloud providers still dominate. The $65 billion bet is ultimately about securing a seat at the table in an AI-defined future—where missing the model layer means losing the cloud war.

marsbit1 ч. назад

Google and Amazon Simultaneously Invest Heavily in a Competitor: The Most Absurd Business Logic of the AI Era Is Becoming Reality

marsbit1 ч. назад

Computing Power Constrained, Why Did DeepSeek-V4 Open Source?

DeepSeek-V4 has been released as a preview open-source model, featuring 1 million tokens of context length as a baseline capability—previously a premium feature locked behind enterprise paywalls by major overseas AI firms. The official announcement, however, openly acknowledges computational constraints, particularly limited service throughput for the high-end DeepSeek-V4-Pro version due to restricted high-end computing power. Rather than competing on pure scale, DeepSeek adopts a pragmatic approach that balances algorithmic innovation with hardware realities in China’s AI ecosystem. The V4-Pro model uses a highly sparse architecture with 1.6T total parameters but only activates 49B during inference. It performs strongly in agentic coding, knowledge-intensive tasks, and STEM reasoning, competing closely with top-tier closed models like Gemini Pro 3.1 and Claude Opus 4.6 in certain scenarios. A key strategic product is the Flash edition, with 284B total parameters but only 13B activated—making it cost-effective and accessible for mid- and low-tier hardware, including domestic AI chips from Huawei (Ascend), Cambricon, and Hygon. This design supports broader adoption across developers and SMEs while stimulating China's domestic semiconductor ecosystem. Despite facing talent outflow and intense competition in user traffic—with rivals like Doubao and Qianwen leading in monthly active users—DeepSeek has maintained technical momentum. The release also comes amid reports of a new funding round targeting a valuation exceeding $10 billion, potentially setting a new record in China’s LLM sector. Ultimately, DeepSeek-V4 represents a shift toward open yet realistic infrastructure development in the constrained compute landscape of Chinese AI, emphasizing engineering efficiency and domestic hardware compatibility over pure model scale.

marsbit1 ч. назад

Computing Power Constrained, Why Did DeepSeek-V4 Open Source?

marsbit1 ч. назад

Торговля

Спот
Фьючерсы

Популярные статьи

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

2025 год — год институциональных инвесторов, в будущем он будет доминировать в приложениях реального времени.

1.8k просмотров всегоОпубликовано 2025.12.16Обновлено 2025.12.16

Неделя обучения по популярным токенам (2): 2026 может стать годом приложений реального времени, сектор AI продолжает оставаться в тренде

Обсуждения

Добро пожаловать в Сообщество HTX. Здесь вы сможете быть в курсе последних новостей о развитии платформы и получить доступ к профессиональной аналитической информации о рынке. Мнения пользователей о цене на AI (AI) представлены ниже.

活动图片