Skip to main content

AI and Agentic Models: Innovation Today and What Comes Next

 


Introduction

Artificial Intelligence (AI) has moved from pattern recognition to goal-directed behaviour. The shift is powered by agentic models—systems that don’t just predict the next token, but perceive, plan, act, and learn in pursuit of objectives. For businesses and researchers, this unlocks workflows that are too dynamic for rules engines and too open-ended for traditional machine learning.

This article explains what agentic AI is, how it differs from earlier waves of AI, where it’s already producing impact, and what to expect over the next 12–24 months.

What Are Agentic Models?

An agentic model combines a large model (often an LLM) with a loop that lets it:

  1. Perceive: read data, files, APIs, or the current state of an environment.

  2. Plan: decompose goals into steps (task planning/tool selection).

  3. Act: call tools, write code, trigger workflows, or interact with users/systems.

  4. Reflect: analyse outcomes, update memory, and iterate.

Typical Architecture

  • Reasoning core: an LLM or multimodal model for language, code, and perception.

  • Tools/skills: API connectors (search, databases, CRMs, trading, RPA, cloud ops).

  • Memory: short-term scratchpads plus vector stores for long-term recall.

  • Planner & critic: sub-agents for step planning, self-verification, and safety checks.

  • Controller/orchestrator: governs autonomy level, throttles actions, logs, and audits.

  • Guardrails: policy filters, PII redaction, jailbreak resistance, and human-in-the-loop.

Why Now?

  • Stronger reasoning: modern models are better at decomposition, tool use, and self-correction.

  • Cheaper inference: falling token costs make continuous agent loops viable.

  • Mature tooling: off-the-shelf frameworks for multi-agent orchestration, evaluation, and governance.

  • Richer integrations: enterprises expose internal systems via APIs, enabling agents to do real work.

Where Agentic AI Is Delivering Value

1) Operations & Back-Office

  • Invoice → payment: classify, reconcile, and post entries; escalate anomalies.

  • Procurement: auto-draft RFQs, compare vendor terms, and schedule approvals.

  • IT service desks: triage tickets, run diagnostic scripts, and close low-risk issues.

2) Compliance & Risk

  • AML/KYB screening: dynamic name matching, adverse-media triage, and risk file assembly with source citations; agents enrich SAR drafts and route cases by materiality.

  • Policy copilots: answer “can I do X?” with rule excerpts, precedents, and rationale.

3) Software & Data

  • Autonomous code changes: open PRs, generate tests, run CI, and request reviews.

  • Data agents: build SQL, check data quality, generate dashboards, and schedule alerts.

4) Customer Experience

  • Resolution bots: handle multi-turn issues across channels, check entitlements, book refunds, and follow up; hand off with full context when confidence is low.

  • Personalised advisory: goal-based planning (finance, education, wellness) with transparent assumptions.

5) Research & Knowledge Work

  • Literature review: gather, cluster, and summarise papers; extract claims with references.

  • Market scanning: track competitors, regulation, and signals; draft briefs with confidence bands.

Quick Wins vs. Moonshots

  • Quick wins (4–8 weeks):

    • Case-file assembly agents for AML/KYC.

    • Support deflection with tool-enabled chat (refunds, status, password reset).

    • Data-question agents for internal analytics (“Explain last week’s churn spike.”).

  • Moonshots (6–12 months):

    • Multi-agent “digital teams” that own end-to-end processes (e.g., vendor onboarding).

    • Continuous controls monitoring across finance, ops, and security.

Measuring Agent Quality

  • Task success rate (objective completion without human help).

  • Autonomy-adjusted throughput (cases/hour at fixed quality).

  • Precision/recall on guarded actions (e.g., fraud blocks, sanctions hits).

  • Self-consistency & citation rate (evidence-backed answers).

  • Human satisfaction (analyst & customer CSAT).

  • Total cost to outcome (TCO per resolved task vs. baseline).

Risks and How to Mitigate Them

  • Hallucinations & overreach → require tool-verified actions; mandate citations; gate high-impact steps behind approvals.

  • Data leakage → PII redaction, confidential routing, and strict tenancy.

  • Model bias → fairness testing on representative datasets; adverse-impact monitoring.

  • Compliance gaps → log every action, keep immutable audit trails, and map tasks to policies.

  • Runaway loops/costs → step budgets, watchdog timers, and kill-switches.

Implementation Playbook (Crawl → Walk → Run)

  1. Crawl: pick one high-volume workflow; add a copilot that drafts but doesn’t execute. Instrument evaluation.

  2. Walk: graduate to low-risk automated actions with human review on exceptions. Add memory and tool use.

  3. Run: multi-agent orchestration; autonomous execution for pre-approved actions; continuous red-teaming and governance.

The Next 12–24 Months: What to Expect

  • Tool-centric models: models trained to call tools first, reducing hallucination and cost.

  • Richer multimodality: agents that read contracts, charts, code, and screens; hands-free RPA.

  • Federated and on-prem agents: privacy-preserving collaboration across organisations.

  • Self-verifying pipelines: built-in critics, unit tests, and proof-of-execution for regulated actions.

  • Domain-tuned small models: cheaper, faster agents fine-tuned for specific processes.

  • Policy-aware autonomy: agents that embed control objectives (e.g., SOX, AML, GDPR) in their planning loop.

Conclusion

Agentic AI marks a step-change: from insight to initiative. When paired with careful governance—evidence, auditability, safety checks, and human oversight—agents can compress cycle times, raise quality, and free teams to focus on judgment instead of drudgery. The organisations that treat agentic AI as process re-design (not just a model swap) will capture the outsized gains.


Optional FAQ (for Blogger readers)

Is this replacing people?
Well-designed agents augment teams: they handle the repetitive scaffolding so humans spend time on nuance, escalation, and ethics.

Where should I start?
Pick a narrow, high-volume workflow with clear success criteria and available tools/data. Ship small, measure, iterate.

Comments

Popular posts from this blog

Visa, MasterCard, and the Rise of Alternative Payment Options in Southeast Asia: A Changing Financial Landscape

Southeast Asia is experiencing a dynamic transformation in its financial ecosystem. As digital payments gain traction across the region, the dominance of traditional card networks like Visa and MasterCard is being challenged by alternative payment methods and regional giants like China UnionPay. This evolving landscape reshapes how consumers and businesses conduct transactions, reflecting broader shifts in the global financial sector. In this blog post, we will explore the current penetration of Visa and MasterCard in Southeast Asia, the influence of China UnionPay, and the emergence of alternative payment options. I will also share how my PhD research in Artificial Intelligence (AI) focuses on the Fintech sector, delving into the implications of these developments for the future of finance in the region. Visa and MasterCard: Established Players in a Competitive Market Visa and MasterCard have long been the global leaders in credit and debit card payments. In Southeast Asia, they have ...

Financial Technology, Regulation, and Artificial Intelligence in China: A Global Fintech Powerhouse

China has emerged as a global leader in financial technology (fintech) and artificial intelligence (AI), demonstrating a unique model of rapid digital innovation supported by strong state involvement, robust infrastructure, and an expansive domestic market. With the rise of tech giants like Ant Group, Tencent, and JD.com, China’s financial sector has undergone a remarkable digital transformation. At the same time, Chinese regulators have implemented sweeping reforms to balance innovation with financial stability and consumer protection. This article explores the regulatory landscape, fintech developments, and AI integration in China's financial services ecosystem. Regulatory Environment China’s fintech landscape is regulated by multiple agencies, including: People’s Bank of China (PBoC) – Central bank and key regulator for monetary policy, payment systems, and financial stability. China Banking and Insurance Regulatory Commission (CBIRC) – Oversees banks and insurance pr...

Artificial Intelligence and Financial Innovation in Southeast Asia: Emerging Trends in Vietnam, Cambodia, and Thailand

The rapid proliferation of artificial intelligence (AI) technologies is transforming financial sectors across the globe. In Southeast Asia (SEA), countries such as Vietnam, Cambodia, and Thailand are increasingly leveraging AI to enhance banking services, streamline regulatory compliance, and improve financial inclusion. These nations, each with distinct economic and technological profiles, present compelling case studies on how AI can catalyze development and reshape financial ecosystems. Vietnam: A Hub of Digital Financial Innovation Vietnam has emerged as a frontrunner in AI adoption within the financial sector. Driven by a burgeoning fintech ecosystem and strong government support, financial institutions are utilizing AI to offer smarter lending solutions, improve credit scoring, and develop personalized banking experiences. According to a report by the World Bank (2021), Vietnam's fintech market is expected to reach USD 18 billion by 2024, powered by mobile banking and AI-driv...