Build CoreOrdered learning track

Learn Agentic Ai Engineering Part 019 Repository Understanding Agents

[]20 min read3864 words

In This Lesson

1. Kaufman Framing 2. Core Mental Model: Repository as Layered System 3. Repository Understanding Agent Responsibilities

PrevNext

Lesson 1935 lesson track07–19 Build Core

title: Learn Advanced Agentic AI Engineering & Autonomous Software Engineering - Part 019 description: Repository understanding agents for autonomous software engineering: repo mapping, symbol search, build graph discovery, test graph discovery, dependency analysis, ownership, conventions, risk surface, and context packet generation. series: learn-agentic-ai-engineering seriesTitle: Learn Advanced Agentic AI Engineering & Autonomous Software Engineering order: 19 partTitle: Repository Understanding Agents tags:

agentic-ai
autonomous-software-engineering
coding-agent
repository-understanding
code-intelligence
series date: 2026-06-29

Part 019 — Repository Understanding Agents

Target part ini: mampu mendesain repository understanding agent yang bisa membaca codebase sebagai sistem, bukan sebagai kumpulan file acak. Agent harus bisa menemukan module boundary, build/test command, symbol, dependency, ownership, conventions, risk surface, dan context packet yang cukup untuk membuat keputusan engineering yang benar.

Coding agent yang buruk biasanya gagal bukan karena tidak bisa menulis syntax.

Ia gagal karena tidak memahami repository.

Ia salah memilih file. Ia melewatkan test relevan. Ia mengubah abstraction boundary. Ia tidak tahu generated code. Ia tidak tahu module owner. Ia tidak tahu dependency direction. Ia tidak tahu bahwa behavior penting ada di configuration, migration, schema, fixture, atau contract test.

Repository understanding agent adalah komponen yang menjawab pertanyaan:

Given a software repository and an engineering task,
what parts of the system matter,
how do they relate,
what can safely be changed,
and what evidence is required before proposing a patch?

Part ini bukan membahas autocomplete. Part ini membahas codebase intelligence layer untuk autonomous SWE.

1. Kaufman Framing

1.1 Target performance

Setelah part ini, kita ingin mampu:

mendesain repo-map generator untuk codebase besar,
membedakan physical file map, logical module map, symbol map, dependency map, build graph, test graph, runtime graph, dan ownership map,
membuat context packet yang cukup kecil untuk LLM tetapi cukup kaya untuk engineering task,
memilih retrieval strategy berdasarkan jenis task,
mengevaluasi apakah agent benar-benar memahami repository atau hanya melakukan keyword search,
mendeteksi risiko seperti generated code, vendored code, unsafe build script, prompt injection di docs, secret leakage, dan stale index,
membangun repo understanding loop untuk autonomous bug fixing, review, migration, dan onboarding.

Target praktis:

Jika diberi repository unknown, agent harus bisa dalam beberapa menit membuat “engineering map” yang menjawab: cara build, cara test, module utama, entrypoint, public API, dependency direction, test ownership, risky files, dan file/symbol paling relevan untuk task tertentu.

1.2 Deconstruct the skill

Repository understanding terdiri dari subskill:

Inventory discovery — file, directory, manifest, language, framework, package manager.
Build discovery — command, module, profile, environment variable, generated source, cache.
Test discovery — test framework, naming convention, test command, test target, flaky/slow marker.
Symbol extraction — class, function, method, type, endpoint, command handler, job, schema, config key.
Dependency modelling — package dependency, import graph, call graph, module dependency, service dependency.
Semantic mapping — business concepts, domain terms, bounded context, workflows, invariants.
Ownership mapping — CODEOWNERS, recent commit authors, team boundary, review rules.
Risk mapping — security-sensitive files, migration, public contract, critical path, generated/vendored code.
Context packet generation — selecting evidence for a task under token budget.
Continuous refresh — invalidating stale index after code change.

1.3 Learn enough to self-correct

Minimal knowledge agar bisa self-correct:

tahu bahwa repository has multiple graphs, not one tree,
tahu bahwa grep saja tidak cukup,
tahu bahwa embedding search saja tidak cukup,
tahu bahwa build/test command adalah artifact penting,
tahu bahwa context terlalu banyak bisa menurunkan kualitas reasoning,
tahu bahwa code understanding harus menghasilkan evidence, bukan feeling.

1.4 Remove friction

Friction umum:

agent mulai patch sebelum tahu cara menjalankan test,
agent menelan seluruh repo ke context,
agent hanya mencari keyword dari issue,
agent tidak membedakan source code, generated code, fixture, doc, dan vendor,
agent tidak menyimpan map sehingga setiap task mengulang discovery,
agent tidak bisa menjelaskan mengapa file tertentu relevan.

Solusinya:

buat repo map incremental,
simpan build/test manifest,
gunakan search ladder,
representasikan evidence secara structured,
gunakan context packet sebagai kontrak antar stage,
ukur file localization dan test selection.

1.5 Practice loop

Latihan deliberate:

Ambil repository open-source medium-size.
Buat repo map manual.
Minta agent membuat repo map.
Bandingkan missing modules, wrong assumptions, wrong commands.
Beri agent task bug/feature kecil.
Lihat apakah context packet cukup.
Jalankan patch loop.
Ukur apakah file/test yang dipilih tepat.

2. Core Mental Model: Repository as Layered System

Repository bukan folder tree. Repository adalah sistem berlapis.

Setiap layer menjawab pertanyaan berbeda.

Layer	Pertanyaan	Contoh artifact
Physical file tree	File apa saja yang ada?	`src/`, `test/`, `docs/`, `config/`
Language/package map	Teknologi apa yang dipakai?	`pom.xml`, `build.gradle`, `package.json`, `pyproject.toml`
Build graph	Bagaimana artifact dibangun?	Maven modules, Gradle tasks, npm scripts, Makefile targets
Test graph	Test apa yang melindungi behavior?	unit, integration, contract, e2e, snapshot
Symbol graph	Abstraction apa yang tersedia?	classes, functions, endpoints, handlers
Import/call graph	Dependency bergerak ke mana?	imports, call edges, service clients
Runtime map	Apa entrypoint dan execution path?	controllers, CLI commands, jobs, consumers
Domain map	Konsep bisnis apa yang penting?	account, policy, order, case, entitlement
Ownership map	Siapa yang biasanya mengubah area ini?	CODEOWNERS, commit history, reviewers
Risk map	Perubahan mana yang sensitif?	auth, billing, migrations, crypto, compliance

Agent yang hanya melihat file tree akan membuat perubahan dangkal. Agent yang memahami semua layer bisa menjawab: “untuk issue ini, file yang relevan kemungkinan X/Y/Z, test yang harus dijalankan A/B/C, dan risiko utamanya adalah perubahan contract di module M.”

3. Repository Understanding Agent Responsibilities

Repository understanding agent bukan coding agent utama. Ia adalah specialist yang menghasilkan structured repo intelligence.

3.1 Input

Input umum:

repository:
  root: /workspace/repo
  branch: feature/task-123
  commit: abc123

task:
  type: bugfix | feature | refactor | migration | review | onboarding
  description: "..."
  artifacts:
    - issue_body
    - stack_trace
    - logs
    - failing_test
    - screenshot
    - customer_report

constraints:
  max_context_tokens: 12000
  allowed_tools:
    - read_file
    - search_text
    - parse_symbols
    - run_build_readonly
  forbidden_paths:
    - secrets/
    - prod-data/

3.2 Output

Output ideal:

repo_understanding_packet:
  repository_fingerprint:
    languages: [java, typescript]
    build_systems: [gradle, npm]
    test_frameworks: [junit, vitest]
    architecture_style: modular-monolith

  build_manifest:
    primary_commands:
      - ./gradlew test
      - npm test
    module_commands:
      billing-service: ./gradlew :billing-service:test

  relevant_areas:
    - path: billing-service/src/main/java/.../InvoiceCalculator.java
      reason: "contains symbol referenced by stack trace"
      confidence: 0.91
    - path: billing-service/src/test/java/.../InvoiceCalculatorTest.java
      reason: "nearest unit tests for target symbol"
      confidence: 0.87

  dependency_context:
    upstream_callers:
      - BillingController
      - InvoiceJob
    downstream_dependencies:
      - TaxRuleRepository
      - CurrencyRoundingPolicy

  risk_notes:
    - "Invoice rounding affects public billing behavior"
    - "Existing contract tests must be run"

  recommended_next_actions:
    - reproduce_failure
    - inspect_rounding_policy
    - run_targeted_tests

The output should be machine-readable and human-readable.

Machine-readable untuk downstream agent. Human-readable untuk reviewer.

4. Why Naive Repository Understanding Fails

4.1 Keyword search trap

Issue:

Refund amount is incorrect when invoice contains mixed tax rates.

Naive search:

rg "refund amount"

Problem:

code mungkin memakai CreditMemo, bukan Refund,
domain term di UI berbeda dari internal model,
bug mungkin ada di TaxAllocationPolicy, bukan refund module,
behavior mungkin dikontrol config.

Better search:

search domain synonyms,
inspect endpoint/handler for refund flow,
trace from API to domain service,
locate tests around tax allocation,
inspect recent commits touching refund/tax/invoice.

4.2 Context stuffing trap

Naive approach:

Put every relevant file into context.

This often worsens quality.

Risiko:

attention dilution,
conflicting outdated docs,
irrelevant boilerplate,
generated code noise,
hidden prompt injection in documentation,
token budget waste.

Better approach:

Build a compact context packet with evidence, summaries, exact snippets, and explicit uncertainty.

4.3 Embedding-only trap

Embedding search bagus untuk semantic recall, tetapi buruk untuk:

exact symbol lookup,
stack trace line mapping,
import relation,
generated file exclusion,
build/test command discovery,
dependency direction,
security boundary.

Gunakan embedding sebagai salah satu tool, bukan source of truth.

4.4 AST-only trap

AST bagus untuk structure, tetapi tidak selalu menangkap:

runtime configuration,
DI wiring,
reflection,
dynamic imports,
framework conventions,
database migrations,
generated code,
feature flags.

Gunakan AST bersama build graph, runtime graph, and tests.

5. The Repository Map

A repository map is an indexed, queryable representation of the repository.

5.1 Minimum useful repo map

repo_map:
  identity:
    name: payments-platform
    commit: abc123
    generated_at: 2026-06-29T10:00:00+07:00

  languages:
    java:
      files: 1240
      build_system: gradle
    typescript:
      files: 310
      build_system: npm

  modules:
    - name: payment-api
      path: services/payment-api
      type: service
      build_command: ./gradlew :payment-api:build
      test_command: ./gradlew :payment-api:test
      public_entrypoints:
        - PaymentController
        - PaymentEventConsumer

  symbols:
    - name: PaymentAuthorizationService
      kind: class
      path: services/payment-api/src/main/java/.../PaymentAuthorizationService.java
      exports: false
      tests:
        - PaymentAuthorizationServiceTest

  risky_paths:
    - path: services/payment-api/src/main/java/.../AuthPolicy.java
      reason: authorization-critical
    - path: db/migration
      reason: database-migration

5.2 Repo map design principle

A good repo map is:

incremental — refresh changed files, not entire repo,
queryable — supports file, symbol, dependency, owner, test, and risk queries,
explainable — every result has reason and confidence,
bounded — excludes vendor/build artifacts by default,
task-aware — can generate context packet for a specific task,
auditable — records source and timestamp of every inference.

5.3 Repo map should not be a giant summary

Bad repo map:

This repo is a payments app. It has controllers, services, repositories, and tests.

Good repo map:

module: payment-core
responsibility: payment authorization, capture, refund, reversal
entrypoints:
  - PaymentCommandHandler.authorize
  - PaymentEventConsumer.onSettlementEvent
key_invariants:
  - authorization amount must not exceed available balance
  - reversal must be idempotent by transaction id
risk:
  - money movement
  - compliance audit trail
nearest_tests:
  - PaymentAuthorizationServiceTest
  - PaymentReversalIdempotencyIT

6. Discovery Pipeline

6.1 High-level pipeline

6.2 Step 1 — Scan files

Use deterministic file inventory first.

Typical commands:

pwd
git rev-parse HEAD
git status --short
git ls-files
find . -maxdepth 3 -type f | sed 's#^./##' | sort | head -200

Avoid naive recursive traversal that includes:

.git/,
node_modules/,
target/,
build/,
.gradle/,
.venv/,
generated caches,
binary artifacts.

6.3 Step 2 — Classify repository

Look for language/build signals:

Signal	Meaning
`pom.xml`	Maven project
`build.gradle`, `settings.gradle`	Gradle project
`package.json`	Node/JS/TS package
`pnpm-workspace.yaml`	pnpm monorepo
`pyproject.toml`	Python package/project
`go.mod`	Go module
`Cargo.toml`	Rust crate/workspace
`Dockerfile`	containerized runtime/build
`.github/workflows`	CI workflow clues
`CODEOWNERS`	ownership/review boundary

Do not assume one repo equals one app.

A repository may contain:

multiple services,
shared libraries,
frontend/backend,
infra code,
generated API clients,
sample apps,
docs,
test harnesses.

6.4 Step 3 — Discover build and test commands

Build/test discovery should combine:

manifest inspection,
CI workflow inspection,
README instructions,
package scripts,
Makefile targets,
historical commands in docs,
local execution in sandbox.

Example discovery result:

build_manifest:
  confidence: 0.84
  source_evidence:
    - path: .github/workflows/ci.yml
      line: "./gradlew check"
    - path: README.md
      line: "Run ./gradlew test"
  commands:
    full_check:
      command: ./gradlew check
      estimated_cost: high
    unit_tests:
      command: ./gradlew test
      estimated_cost: medium
    module_test:
      command_template: ./gradlew :{module}:test
      estimated_cost: low

Never hide uncertainty.

If no command found:

build_manifest:
  confidence: 0.31
  issue: "No CI workflow or README command found"
  next_action: "inspect package manager manifests and try dry-run commands"

6.5 Step 4 — Extract symbols

Symbol extraction options:

Technique	Strength	Weakness
regex/ctags	fast, simple	shallow, language-specific issues
Tree-sitter	structural, incremental parsing	needs grammar per language
LSP	semantic references, rename, diagnostics	slower, requires environment
compiler index	accurate	expensive setup
custom parser	framework-aware	maintenance cost
embeddings	semantic recall	not exact source of truth

Good agent architecture supports multiple symbol providers.

symbol_providers:
  fast:
    - ripgrep
    - ctags
  structural:
    - tree_sitter
  semantic:
    - language_server
  learned:
    - embedding_index

6.6 Step 5 — Build dependency graph

At minimum:

package dependency graph,
module dependency graph,
import graph,
test-to-production relation,
entrypoint-to-service path,
configuration dependency.

Example:

6.7 Step 6 — Mine conventions

Agent must detect conventions, not impose external style.

Examples:

controller naming,
service naming,
DTO location,
test naming,
package layering,
error handling pattern,
logging style,
validation style,
migration naming,
feature flag convention,
generated file marker.

Convention packet:

conventions:
  tests:
    unit_test_suffix: Test
    integration_test_suffix: IT
    fixture_path: src/test/resources/fixtures
  layering:
    controller_calls_service: true
    service_calls_repository: true
    repository_should_not_call_service: true
  error_handling:
    domain_errors_extend: DomainException
    api_errors_mapped_by: ApiExceptionMapper

6.8 Step 7 — Detect risk surfaces

Risk surfaces include:

authentication/authorization,
payment/money movement,
billing/tax/ledger,
cryptography,
data migration,
schema migration,
concurrency/locking,
distributed transaction,
compliance/audit,
public API contract,
generated client/server code,
infrastructure/deployment,
secrets/configuration.

Risk detector example:

risk_surface:
  path: services/auth/src/main/java/.../PermissionEvaluator.java
  categories:
    - authorization-critical
    - public-api-impact
  required_controls:
    - human_approval
    - security_review
    - targeted_tests
    - regression_tests

7. Search Ladder for Repository Understanding

A good repository understanding agent uses a search ladder.

7.1 Level 1 — Exact search

Use exact search for:

stack trace class names,
error messages,
API paths,
config keys,
feature flags,
database table names,
log messages,
exception names.

Example:

rg "IllegalStateException: invoice already settled"
rg "invoice.already.settled"
rg "POST /api/v1/invoices"

7.2 Level 2 — Symbol search

Use symbol search for:

method/class names,
interface implementations,
endpoint handlers,
event consumers,
command handlers.

Questions:

where is this symbol defined?
who calls it?
who implements it?
which tests cover it?

7.3 Level 3 — Semantic search

Use semantic search for:

domain concept search,
synonyms,
fuzzy requirement mapping,
docs/ADR retrieval,
historical rationale.

But semantic search output must be verified by exact evidence.

7.4 Level 4 — Graph expansion

After locating candidate symbol, expand graph:

callers,
callees,
imports,
tests,
configs,
fixtures,
migrations,
docs.

7.5 Level 5 — Test mapping

For each candidate file, find nearest tests:

same package,
naming convention,
import relation,
coverage data if available,
CI target,
failing test logs.

7.6 Level 6 — History and ownership

Use git history carefully:

git log --oneline -- path/to/file
git blame -L 40,90 path/to/file
git log --grep "refund" --oneline

History can reveal:

why code exists,
previous bug fixes,
risky churn,
owner/reviewer candidates,
stale TODOs.

But history can mislead if project migrated or ownership changed.

8. Context Packet Engineering

Repository understanding agent should output context packets, not random snippets.

8.1 Context packet structure

context_packet:
  task_summary: "Refund is incorrect for mixed tax rates"
  repo_facts:
    - "Project uses Gradle multi-module build"
    - "Refund domain is implemented as CreditMemo internally"
  candidate_files:
    - path: billing/src/main/java/.../CreditMemoService.java
      role: likely_patch_target
      why: "refund workflow maps to credit memo generation"
      evidence:
        - "method createCreditMemoForInvoice"
    - path: billing/src/main/java/.../TaxAllocationPolicy.java
      role: likely_root_cause
      why: "mixed tax rates handled here"
      evidence:
        - "method allocateTaxByLineItem"
  candidate_tests:
    - path: billing/src/test/java/.../CreditMemoServiceTest.java
      command: ./gradlew :billing:test --tests CreditMemoServiceTest
  risk:
    - "money movement and tax calculation"
  unknowns:
    - "No reproduction yet"
  recommended_next_step: reproduce_failure

8.2 Context packet should contain uncertainty

Bad:

The bug is in TaxAllocationPolicy.

Better:

hypothesis:
  statement: "The bug may be in TaxAllocationPolicy allocation of mixed tax rates."
  confidence: 0.68
  evidence:
    - "issue mentions mixed tax rates"
    - "TaxAllocationPolicy contains mixed-rate allocation code"
  missing_evidence:
    - "no failing test reproduced yet"

8.3 Context packet levels

Level	Use case	Content
L0	Triage	repo fingerprint, likely modules, commands
L1	Localization	candidate files, symbols, tests, evidence
L2	Patch planning	relevant snippets, invariants, risk, tests
L3	Review	diff explanation, impacted behavior, verification
L4	Audit	full trace, commands, outputs, approvals