
What is Agentic AI? Stanford Explains LLM Planning & Problem Solving (2026 Guide)

⏱️ 57-minute video, 5-minute read


manabi AI Standard
Created 2026/5/3, updated 2026/6/1
Stanford Webinar - Agentic AI: A Progression of Language Model Usage

Source: Stanford Online, "Stanford Webinar - Agentic AI: A Progression of Language Model Usage" (published February 5, 2025)

This article is an AI-generated summary of the video, organized into key points, diagrams, and learning points.

⚠️ Because this summary was generated by AI, it may not be fully accurate. Please check important details against the original video.


The Fundamental Shift: From Predictive Text to Instruction Following

Language models (LMs) are machine learning models trained to predict the next word in a sequence from a massive corpus of text. As Insop Song from GitHub Next explains, a language model starts with a pre-training phase, in which it consumes vast amounts of internet text and books and picks up linguistic patterns and world knowledge. A pre-trained model on its own, however, is hard to control and hard to apply to specific tasks.

To bridge this gap, the industry applies post-training techniques, chiefly instruction tuning and Reinforcement Learning from Human Feedback (RLHF). This stage aligns the model with human preferences, teaching it to follow commands rather than merely continue text. Trained on datasets of instructions paired with expected outputs, the model becomes a more versatile tool for applications such as coding assistants and conversational interfaces like ChatGPT.
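
As a rough illustration of "datasets formatted with instructions and expected outputs," the sketch below turns one record into a single training string. The field names and the "### Instruction / ### Input / ### Response" template are assumptions chosen for illustration; the webinar summary does not specify a format, and real datasets vary.

```python
from dataclasses import dataclass


@dataclass
class InstructionExample:
    """One supervised example: what the user asks and what the model should say."""
    instruction: str
    response: str
    context: str = ""  # optional extra input; hypothetical field name


def format_example(ex: InstructionExample) -> str:
    """Render an example with an assumed, illustrative prompt template."""
    parts = [f"### Instruction:\n{ex.instruction}"]
    if ex.context:
        parts.append(f"### Input:\n{ex.context}")
    parts.append(f"### Response:\n{ex.response}")
    return "\n\n".join(parts)


example = InstructionExample(
    instruction="Summarize the text in one sentence.",
    context="Language models are trained to predict the next token.",
    response="Language models learn by predicting upcoming tokens.",
)
text = format_example(example)
print(text)
```

During instruction tuning, the model is trained on many strings of this kind so that it learns to produce the response section when given the instruction section.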

💡Key insight: Pre-training provides the model with 'knowledge,' but post-training provides the model with 'behavioral alignment,' making it practical for real-world utility.

Despite these advancements, developers must understand that an LM is still a probabilistic engine. It produces each token by sampling from a probability distribution over likely continuations, so an ambiguous prompt can steer it toward an unintended answer. Clear communication is the bedrock of effective AI interaction. As we move toward more complex systems, the quality of these foundational models determines the potential of the higher-level agentic structures built upon them.
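
To make the "probabilistic engine" idea concrete, here is a minimal sketch of next-token selection over a toy three-word vocabulary. The vocabulary, the logit values, and the temperature setting are invented for illustration; a real model scores tens of thousands of tokens with a neural network.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Convert raw scores into probabilities; temperature must be > 0."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_next_token(vocab, logits, temperature=1.0, rng=None):
    """Draw one token at random, weighted by its probability."""
    rng = rng or random.Random()
    probs = softmax(logits, temperature)
    return rng.choices(vocab, weights=probs, k=1)[0]


# Toy scores for the continuation of "The capital of France is"
vocab = ["Paris", "London", "banana"]
logits = [4.0, 2.0, -1.0]

probs = softmax(logits)
greedy = vocab[probs.index(max(probs))]  # always pick the most likely token
sampled = sample_next_token(vocab, logits, temperature=0.7, rng=random.Random(0))
print(probs, greedy, sampled)
```

Lowering the temperature concentrates probability on the top-scoring token, while raising it spreads probability across alternatives, which is one reason the same prompt can yield different outputs.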

  1. Pre-training: Learning from massive, unlabelled datasets.
  2. Instruction Tuning: Learning to respond to specific commands.
  3. RLHF: Aligning outputs with human values and preferences.

Training Phase | Primary Objective | Key Output
Pre-training | Next-token prediction | Base world knowledge
Post-training | Task completion | Instruction-following capability
RLHF | Preference alignment | Human-centric safety and utility

Optimizing Performance: The Art of Prompt Engineering and Reasoning

To extract the maximum value from modern LMs, developers must employ strategic prompting techniques. Writing clear, descriptive instructions is non-negotiable; as Insop Song notes, the model cannot read your mind. Detail is your friend. Furthermore, providing 'few-shot' examples—showing the model the exact format and style you expect—significantly boosts the consistency of the output. This is particularly vital in production environments where structured data is required.
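
The few-shot idea can be sketched as a small prompt builder. The "Input:/Output:" layout, the sentiment task, and the example reviews are all assumptions made up for illustration, not a format taken from the webinar.

```python
def build_few_shot_prompt(instruction, examples, query):
    """Assemble an instruction, worked examples, and the new query into one prompt."""
    lines = [instruction.strip(), ""]
    for example_input, example_output in examples:
        lines += [f"Input: {example_input}", f"Output: {example_output}", ""]
    # End on an open "Output:" so the model continues in the demonstrated format.
    lines += [f"Input: {query}", "Output:"]
    return "\n".join(lines)


examples = [
    ("The battery died after a day.", '{"sentiment": "negative"}'),
    ("Setup took two minutes and it just works.", '{"sentiment": "positive"}'),
]
prompt = build_few_shot_prompt(
    "Classify the sentiment of each review. Reply with JSON only.",
    examples,
    "The screen is fine but the speakers crackle.",
)
print(prompt)
```

Because every example ends in the same JSON shape, the model has a concrete pattern to imitate, which is what makes the output easier to parse downstream.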

Another critical technique is 'Chain of Thought' (CoT) prompting. Instead of asking for a final answer immediately, you instruct the model to think step by step. This 'time to think' lets the model write out intermediate reasoning before it commits to an answer, so the answer is conditioned on that reasoning, which often avoids errors that a direct, single-pass response would make. For complex tasks, splitting the work into a chain of simpler prompts, where each prompt handles one sub-task and feeds its output to the next, makes each stage easier to get right and to check.
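
Both techniques can be sketched without committing to any particular model API. In the code below, `model` is any function that maps a prompt string to a completion string; `fake_model` and `recording_model` are hypothetical stand-ins that return canned text, and the "Answer:" convention is an assumption chosen so the final result is easy to extract.

```python
import re
from typing import Callable, List


def build_cot_prompt(question: str) -> str:
    """Ask for step-by-step reasoning, then a clearly marked final line."""
    return (
        f"Question: {question}\n"
        "Think through the problem step by step, then give the result "
        "on a final line formatted as 'Answer: <value>'."
    )


def extract_answer(completion: str) -> str:
    """Return the text after the last 'Answer:' line, or '' if none is found."""
    matches = re.findall(r"^Answer:\s*(.+)$", completion, flags=re.MULTILINE)
    return matches[-1].strip() if matches else ""


def run_chain(model: Callable[[str], str], steps: List[str], text: str) -> str:
    """Run prompts in sequence, feeding each output into the next template."""
    for template in steps:
        text = model(template.format(input=text))
    return text


def fake_model(prompt: str) -> str:
    """Stand-in for a real LM call; returns a canned reasoning trace."""
    return "3 boxes hold 4 pens each, so 3 * 4 = 12.\nAnswer: 12"


cot_answer = extract_answer(
    fake_model(build_cot_prompt("How many pens are in 3 boxes of 4?"))
)

calls: List[str] = []


def recording_model(prompt: str) -> str:
    """Stand-in that logs each prompt so the data flow is visible."""
    calls.append(prompt)
    return f"result-{len(calls)}"


final = run_chain(
    recording_model,
    ["Extract the key facts from: {input}", "Write a one-line summary of: {input}"],
    "raw notes",
)
print(cot_answer, final, calls)
```

Swapping the stand-ins for a real model call leaves the structure unchanged: the reasoning stays in the completion, and only the marked answer or the last stage's output is passed on.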
