Back to Blog 中文 EN

AI Agents ─ The Complete Beginner's Guide
From Using Them to Building Your Own

AI Agents are the hottest concept of 2026 ─ and the easiest to over-hype. This post breaks it down for you ─ what an Agent actually is, how it differs from a ChatBot, how to use Claude Code / Cursor Agent / MCP, 6 projects you can build today, and 4 traps to watch out for.

※ AI tooling moves incredibly fast, so treat everything here as a snapshot from when it was written (May 2026). For exact APIs, features, and pricing, always check the official docs.

First, let's be clear ─ Agent vs ChatBot

A lot of people lump these together, but they're very different.

  • ChatBot: you ask, the AI answers. One round and you're done.
  • Agent: you give it a goal, and the AI plans, executes, and corrects itself until it's finished.

For example ─ "Go through last week's email, find the ones I need to reply to, and draft responses":

  • ChatBot: "Sure, paste the emails here for me"
  • Agent: connects to the Gmail API itself, reads the messages, decides which ones need a reply, drafts them, and drops them in your draft folder ─ fully automatic

The core difference: an Agent can use tools (tool use), reason across multiple steps, and correct itself.


The 4 core capabilities of an Agent

  1. Planning ─ breaking a big goal into small, executable steps
  2. Tool Use ─ calling external APIs, running commands, reading files
  3. Memory ─ remembering the conversation and intermediate results
  4. Reflection ─ checking whether it did things right, and adjusting when it's wrong

Across these 4, today's AI Agents (Claude Code, Cursor Agent, GitHub Copilot Agent) are roughly at this level of maturity ─ planning and tool use are strong, while memory and reflection are still evolving.


3 Agents you can use right now

01 Claude Code (CLI)

Anthropic's terminal-based AI agent.

What it can do:

  • Run in your codebase, read files, edit files
  • Run shell commands (including git)
  • Plan multi-step refactors
  • Read error logs and debug on its own
  • Connect to external tools through MCP

How to use it: run claude in your project directory, give it a goal, and let it run.

02 Cursor Agent

The agent mode built into the Cursor IDE (formerly called Composer).

What it can do:

  • Make changes across multiple files
  • Read the codebase to find relevant context
  • Use @ to reference specific files / docs
  • Run commands and inspect the output

Good for: people working inside an IDE who want to see the diff visually.

03 GitHub Copilot Agent

GitHub's PR-level agent.

What it can do:

  • Pick up an issue and write the PR automatically
  • Run CI, check the results, fix what's broken
  • Engage with reviewer comments

Good for: cases where you want to outsource a well-defined task to AI end to end.


MCP (Model Context Protocol) ─ connecting Agents to your tools

MCP is an open protocol introduced by Anthropic. Its goal is to let AI Agents "talk to your tools / databases / services in a unified way".

The world without MCP:

  • Want to connect to Slack → write a Slack integration
  • Want to connect to GitHub → write a GitHub integration
  • Want to connect to Postgres → write a DB integration
  • And every agent has to redo it all over again

The world with MCP:

  • Someone has already built a Slack MCP server
  • Someone has already built a GitHub MCP server
  • Your agent adds one line of config and can use every MCP server out there

What it's really doing ─ "APIs are to programming" as "MCP is to AI Agents".

There are already plenty of public MCP servers: filesystem, GitHub, Slack, Postgres, Linear, Notion… just grab one and use it.


6 projects you can build today

① Auto-generate your daily standup

Difficulty ⭐⭐ · Tools: Claude Code + GitHub MCP

Every morning: the Agent automatically reads yesterday's GitHub commits, PR comments, and Slack messages, turns them into a standup draft, and posts it to your Notion / DMs.

Value: saves you 10 minutes of writing every day, and you're less likely to miss anything.

② Automated first-pass code review

Difficulty ⭐⭐⭐ · Tools: GitHub Copilot Agent / CodeRabbit

When a PR opens, the Agent runs automatically: catches lint issues, checks for typos, looks for missing tests, and gives initial suggestions.

Value: human reviewers don't waste time nitpicking semicolons and can focus on the architecture discussion.

③ Automated support / FAQ answers

Difficulty ⭐⭐⭐ · Tools: Claude API + your docs

Hand the Agent your product docs, and once it's read them, it can answer common user questions. Complex issues get escalated to a human automatically.

Value: saves a small team a ton of support time.

④ Resume / cover letter generation

Difficulty ⭐⭐ · Tools: Claude API + your background info

Give the Agent your experience, and it tailors a cover letter to each JD and reworks the highlights on your resume.

Value: sending out 100 applications during a job search no longer takes 100 hours.

⑤ Internal knowledge-base Q&A

Difficulty ⭐⭐⭐⭐ · Tools: Claude + Vector DB + your internal data

Turn your company's Notion / Confluence / Slack history into a knowledge base the Agent can query. New hires ask the Agent instead of pinging people.

Value: dramatically shorter onboarding time.

⑥ Automated scraping + data wrangling

Difficulty ⭐⭐⭐ · Tools: Claude Code + Playwright MCP

"Check these 5 competitor sites for me every day and email me a summary whenever something changes" ─ the Agent schedules itself, scrapes, decides whether anything changed, and summarizes it.

Value: automation for market research / competitor tracking.


Build your first Agent ─ in 3 steps

Step 1: Pick a framework / SDK

  • Anthropic SDK ─ uses Claude, with native support for tool use + MCP
  • OpenAI SDK ─ uses GPT, has function calling, large ecosystem
  • LangChain ─ an abstraction layer, works across LLMs, but a steep learning curve
  • Mastra / Vercel AI SDK ─ TypeScript-first, great for building web apps

If you're new, just go with the Anthropic / OpenAI SDK ─ don't rush into LangChain.

Step 2: Define your Tools

Tools are the "external capabilities" an Agent can call. For example:

// Using the Anthropic SDK as an example
const tools = [
  {
    name: "search_email",
    description: "Search the user's Gmail",
    input_schema: {
      type: "object",
      properties: {
        query: { type: "string" }
      }
    }
  },
  {
    name: "send_email",
    description: "Send an email",
    input_schema: {
      type: "object",
      properties: {
        to: { type: "string" },
        subject: { type: "string" },
        body: { type: "string" }
      }
    }
  }
]

Step 3: Write the Agent loop

async function runAgent(userGoal) {
  const messages = [{ role: "user", content: userGoal }]

  while (true) {
    const response = await claude.messages.create({
      model: "claude-sonnet",
      tools,
      messages
    })

    // Agent wants to use a tool
    if (response.stop_reason === "tool_use") {
      const toolCall = response.content.find(c => c.type === "tool_use")
      const result = await executeTool(toolCall.name, toolCall.input)

      messages.push({ role: "assistant", content: response.content })
      messages.push({
        role: "user",
        content: [{ type: "tool_result", tool_use_id: toolCall.id, content: result }]
      })
      continue
    }

    // Agent is done
    return response.content
  }
}

That's the minimal Agent loop ─ call the LLM, run a tool, call the LLM again, until it's done.


4 traps to avoid

01 Treating the Agent like magic

A lot of people assume "Agent = fully automatic" ─ then hand it a complex task and feel let down when it gets it wrong.

The reality ─ Agents are strong at "small, well-defined tasks" and get stuck on "vague, sprawling tasks".

The principle ─ the more specifically you break down the task, the better the Agent performs.

02 Giving it too much access

An Agent that can run arbitrary commands could accidentally wipe your production DB.

The safe approach:

  • Read before write: grant read APIs first, then write once you've confirmed it's safe
  • Develop with read-only credentials, and only switch to write in production
  • Force human confirmation for critical operations (delete, deploy)
  • Log every tool call and review them regularly

03 Not setting a cost ceiling

An Agent loop can "fail to stop" ─ thinking endlessly, calling the API endlessly, and burning through your $$$.

You absolutely have to set:

  • Max iterations (a loop cap, e.g. 20)
  • Max tokens (a per-call cap)
  • Daily budget (a daily spending cap)

04 Using an Agent to replace "things that shouldn't be automated"

Some things suit an Agent, others don't:

  • ✅ Repetitive, rule-based, low impact when wrong ─ suitable
  • ❌ Requires reading context, high stakes, hard to undo when wrong ─ not suitable

Don't hand "talking to customers", "processing payments", or "deleting data" straight to an Agent.


Where AI Agents are headed

Where Agents stand right now (2026):

  • One task, completed automatically by the Agent ✅
  • Multiple Agents collaborating with each other (multi-agent) ⚠️ early days
  • Agents learning on their own and adjusting their strategy ⚠️ still being researched

What we might see within 5 years:

  • Personal Agents ─ like a personal assistant that remembers your preferences long-term
  • Enterprise Agents ─ replacing parts of entry-level ops work
  • Agent marketplaces ─ subscribing to Agents other people have built

But don't let the headlines scare you. The technology curve moves slower than the hype, and it'll take 3-5 years to actually work its way into your workflow.


One last reminder

AI Agents don't replace engineers, they amplify them.
An engineer who knows how to use Agents ─ one person can do the work of three.
An engineer who doesn't ─ will keep falling behind colleagues who do.

This tool isn't as magical as you imagine, and it isn't as simple either. Its value lies in "which workflow you place it into".

Starting this week ─ find the single most repetitive, most boring thing in your job and think about whether an Agent can automate it. Build it, and you'll understand why this is a trend.

Want to build an Agent but don't know where to start?

A 30-minute 1-on-1 consult for NT$1,500 ─ I'll look at the scenario you want to automate and give you concrete tech choices + a first-week implementation SOP.

Book a consult on LINE Subscribe to the newsletter first

Further reading — Engineers in the AI Era series