正在加载入行工作台

入行大模型知识中枢 | 入行 365 旗下的「入行之路」

入行365知识资产推荐引擎

探索入行知识中枢

这里沉淀入行365可复用的知识资产。登录并完成评测后，系统会按你的短板和目标重排。

glossary

Agent Sandbox（Isolated Execution Sandbox）

An isolated environment where an AI agent runs code or executes actions with restricted access to the host system. Prevents a compromised or misbehaving agent from touching the filesystem, network, or processes outside its allowed scope. Cr

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AIagent sandbox

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Guardrails（Safety Guardrails）

Validation layers that sit around an LLM to detect and block unsafe inputs or outputs: harmful content, PII leakage, off-topic requests, prompt injection attempts. Can be implemented as input filters, output classifiers, or both. The primar

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AIguardrails

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Prompt Injection（Indirect Prompt Injection）

An attack where malicious instructions are hidden in content the agent reads (a webpage, email, document) and hijack its behavior. For example, a webpage telling the agent to ignore its instructions and exfiltrate data. The SQL injection of

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AIprompt injection

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Grounding（Factual Grounding）

Connecting model outputs to verifiable external sources: search results, databases, real-time APIs: to reduce hallucination and keep answers accurate. RAG is one form of grounding; web search is another.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AIgrounding

开始实战任务

来自: tomerjann/llm-field-notes

glossary

MCP（Model Context Protocol）

Anthropic's open protocol for connecting LLMs to external tools and data sources in a standardized way. Any MCP-compatible server can plug into any MCP-compatible model or IDE.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AImcp

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Tool Use（Function Calling / Tool Use）

The mechanism by which LLMs can invoke external functions or APIs: like running code, searching the web, or querying a database: by outputting structured JSON that the host application executes and feeds back as a result.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AItool use

开始实战任务

来自: tomerjann/llm-field-notes

tool

12000

ChatGPT

OpenAI 的通用 AI 助手，适合问答、写作、代码、数据分析、方案生成和多场景工作流协作。

约 5 分钟

AI & LLMs通用助手写作代码

开始实战任务

来自: 入行工具库

glossary

Agentic Loop（Perceive, Plan, Act, Observe Loop）

The core execution cycle of an AI agent: observe the environment or task, reason about what to do next, call a tool or produce output, then observe the result and repeat. Agents run this loop until a stopping condition is met.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AIagentic loop

开始实战任务

来自: tomerjann/llm-field-notes

glossary

AI Agent（Autonomous AI Agent）

An LLM given access to tools (web search, code execution, APIs) and the ability to reason over multi-step tasks autonomously: perceiving state, planning actions, executing them, and iterating until a goal is achieved.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary智能体 AIai agent

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Structured Output（Constrained / Structured Generation）

Forcing the model to emit output in a specific format: JSON, XML, a fixed schema: rather than freeform text. Critical for building reliable pipelines where downstream code needs to parse the model's response. Often paired with tool use.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文structured output

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Context Window（Context Length / Window）

The maximum number of tokens a model can process in a single call: both input and output combined. Claude 3.5 Sonnet has a 200K token context. Longer contexts enable more complex tasks but increase memory and compute costs.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文context window

开始实战任务

来自: tomerjann/llm-field-notes

glossary

In-context Learning（In-Context Learning (ICL)）

The model's ability to learn new tasks purely from examples in its context window: no weight updates required. You show it examples, and it adapts. The core mechanism behind few-shot prompting and one of the most surprising emergent abiliti

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文in-context learning

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Few-shot（Few-shot and Zero-shot Prompting）

Zero-shot: asking the model to do a task with no examples. Few-shot: providing 2 to 5 input/output examples in the prompt so the model pattern-matches the format. Few-shot is one of the most reliable and underused techniques in prompt engin

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文few-shot

开始实战任务

来自: tomerjann/llm-field-notes

tool

11800

Claude

Anthropic 的通用大模型助手，适合长文本阅读、复杂任务拆解、代码解释、知识工作和结构化写作。

约 5 分钟

AI & LLMs长文本写作代码

开始实战任务

来自: 入行工具库

glossary

ReAct（Reason + Act）

A prompting pattern that interleaves reasoning traces with tool actions: the model thinks out loud ("Thought: I need to search for X"), calls a tool ("Action: search(X)"), observes the result, and repeats. The blueprint behind most modern A

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文react

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Chain of Thought（Chain-of-Thought (CoT) Prompting）

A prompting technique that instructs the model to reason step-by-step before giving a final answer. Dramatically improves performance on complex reasoning tasks. The basis of "thinking" models like Claude's extended thinking mode.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文chain of thought

开始实战任务

来自: tomerjann/llm-field-notes

glossary

System Prompt（System / Developer Prompt）

A hidden instruction block sent at the start of a conversation that sets the model's persona, rules, and behavior before any user message arrives. The primary mechanism operators use to customize LLM behavior for their product.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文system prompt

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Context Engineering（Context Window Engineering）

The broader discipline of deciding what information goes into the model's context window: not just the prompt, but retrieved documents, tool results, memory, conversation history, and how it is all structured and prioritized.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文context engineering

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Prompt Engineering（Prompt Engineering）

The practice of carefully crafting the text inputs to a model to elicit better, more reliable outputs: using techniques like few-shot examples, chain-of-thought, role instructions, and output formatting constraints.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary提示词与上下文prompt engineering

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Dogfooding（Internal Dogfooding）

The practice of using your own LLM or AI-powered tools internally as part of your own development workflow before shipping them to customers. AI labs use their frontier models to write code, generate evals, draft research, and run internal

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary评估与基准dogfooding

开始实战任务

来自: tomerjann/llm-field-notes

tool

11702

DeepSeek

DeepSeek 的通用大模型助手，适合中文问答、代码推理、低成本模型调用和日常知识工作。

约 5 分钟

AI & LLMs通用助手中文代码

开始实战任务

来自: 入行工具库

glossary

Harness Engineering（Eval Harness Engineering）

The craft of building the infrastructure to run evaluations at scale: test runners, dataset pipelines, scoring logic, and result tracking. A well-built eval harness is what makes it possible to iterate on a model safely and quickly.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary评估与基准harness engineering

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Evals（Evaluations / Benchmarks）

Systematic tests used to measure a model's capabilities, accuracy, or safety across specific tasks. Good evals are what separate rigorous AI development from vibes-based iteration. Everything from math benchmarks to red-teaming.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary评估与基准evals

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Distillation（Knowledge Distillation）

Training a smaller "student" model to mimic the outputs of a larger "teacher" model. The student learns not just the correct answers but the teacher's probability distributions, capturing nuanced knowledge. How many efficient small models a

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary训练与对齐distillation

开始实战任务

来自: tomerjann/llm-field-notes

glossary

LoRA（Low-Rank Adaptation）

An efficient fine-tuning technique that freezes the base model weights and adds small trainable "adapter" matrices. Trains in a fraction of the time and memory of full fine-tuning, while achieving comparable results. The dominant fine-tunin

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary训练与对齐lora

开始实战任务

来自: tomerjann/llm-field-notes

glossary

Hallucination（Model Hallucination）

When a model confidently generates factually incorrect or fabricated information. Happens because LLMs are trained to produce plausible-sounding text, not verified facts. RAG and grounding techniques help mitigate this.

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary训练与对齐hallucination

开始实战任务

来自: tomerjann/llm-field-notes

glossary

RLHF（Reinforcement Learning from Human Feedback）

A training technique where human raters rank model outputs, and those preferences train a reward model, which then guides the LLM via reinforcement learning to produce more helpful, harmless, and honest responses. Used by Claude, GPT-4, and

约 3 分钟理解一个 LLM 底层技术术语的工程含义

D1glossary训练与对齐rlhf

开始实战任务

来自: tomerjann/llm-field-notes

工具、Prompt、工作流不是收藏品，要进入你的任务成果和复盘记录。

先按领域、方向和当前场景跑通一次，系统再按身份、短板和目标重排知识资产，并把可执行任务保存到今日工作台。

探索 入行知识中枢

Agent Sandbox（Isolated Execution Sandbox）

Guardrails（Safety Guardrails）

Prompt Injection（Indirect Prompt Injection）

Grounding（Factual Grounding）

MCP（Model Context Protocol）

Tool Use（Function Calling / Tool Use）

ChatGPT

Agentic Loop（Perceive, Plan, Act, Observe Loop）

AI Agent（Autonomous AI Agent）

Structured Output（Constrained / Structured Generation）

Context Window（Context Length / Window）

In-context Learning（In-Context Learning (ICL)）

Few-shot（Few-shot and Zero-shot Prompting）

Claude

ReAct（Reason + Act）

Chain of Thought（Chain-of-Thought (CoT) Prompting）

System Prompt（System / Developer Prompt）

Context Engineering（Context Window Engineering）

Prompt Engineering（Prompt Engineering）

Dogfooding（Internal Dogfooding）

DeepSeek

Harness Engineering（Eval Harness Engineering）

Evals（Evaluations / Benchmarks）

Distillation（Knowledge Distillation）

LoRA（Low-Rank Adaptation）

Hallucination（Model Hallucination）

RLHF（Reinforcement Learning from Human Feedback）

探索入行知识中枢