2025 Year in Review: The Quiet Power of Steady Progress
My 2025 AI journey in four numbers: 6, 5, 1, 6. Not because it was glamorous, but because it was grounded. Building GenAI in a bank is like replacing the pl...
First-hand observations on AI Agents in financial institutions, GenAI in production, GraphRAG, Ontology architecture, DevOps × AI, and enterprise AI platform engineering.
My 2025 AI journey in four numbers: 6, 5, 1, 6. Not because it was glamorous, but because it was grounded. Building GenAI in a bank is like replacing the pl...
In May 2026, Anthropic hosted Code with Claude 2026 across San Francisco, London, and Tokyo. The conference introduced no new foundation model, instead delivering a powerful combination of compute infrastructure, agent capabilities, developer tools, and cost optimization — signaling a decisive shift from benchmark competition toward real-world deployment.
This isn't a quiz about RAG or prompts — it asks: in real AI product scenarios, is your judgment ready? A 20-question skills check across 5 core dimensions to map your AI PM readiness.
Anthropic's Cat Wu describes a new PM rhythm in the AI era: roles merging, prototypes over docs, iteration in days not months. Reading it brought back memories of my own undefined role in an enterprise AI team—and Peter Deng's Avengers-style team philosophy.
TSMC's stock surged 137% from ~$164 in April 2025 to $387 in April 2026. This post breaks down how AI chip demand, CoWoS bottlenecks, and NVIDIA dethroning Apple as top customer drove the run.
The 2026 AI race is fundamentally about Harness engineering. This deep dive covers the 12 core modules of a production-grade Agent Harness, leading framework philosophies, and the 7 architectural decisions every AI architect must face.
GPT-5.5 launched April 23, topping 14 benchmarks and cutting token usage 40%. Behind the scenes, Jensen Huang and NVIDIA are betting up to $100B on the compute infrastructure that makes it run.
Ilya says compression is learning. Freedman finds only polynomial-growth monoids are compressible. If Persona can be projected onto a nilpotent substructure, PPV is not just a statistical fit — it's algebraically grounded personality compression.
On April 17, 2026, Anthropic launched Claude Design, a conversational AI visual design tool. Users simply describe what they need, and Claude generates interactive prototypes, slide decks, one-pagers, and more. Powered by Claude Opus 4.7, Anthropic's most capable vision model, the launch sent Figma's stock down 5% on the day.
Most AI Agents forget everything after each session. Hermes Agent is different — it remembers what you teach it and gets better over time. Here's what makes this open-source framework from NousResearch stand out.
Harness Engineering is the execution layer in AI Agent architecture. This post introduces the core design of a Harness: execution control, observability, hooks, tool sandboxing, and state management.
When AI researchers say LLMs are 'human-like,' which humans do they mean? A 2023 Harvard study used 262 cross-cultural survey variables and 94,278 respondents to show ChatGPT's cultural psychology aligns most closely with WEIRD Western democracies (r = -.70).
Can LLMs truly simulate 'you'? From Generative Agents to BehaviorChain, and the RAG-Free Psychometric Persona Vector (PPV) framework, this article compares three leading approaches to AI persona simulation.
Former Tesla AI Director Andrej Karpathy proposes replacing traditional RAG with an LLM-maintained personal Wiki. How does this three-layer architecture compound knowledge like interest? A complete breakdown.
Released April 2026 under Apache 2.0, Gemma 4 comes in four sizes — E2B, E4B, 26B MoE, and 31B Dense. The 31B ranks #3 among all open models globally with 256K context and native agentic workflows. A complete breakdown for AI developers.
In late March 2026, an accidental 59.8MB Source Map in Anthropic's npm release led to a full leak of Claude Code's underlying architecture. Beyond an engineering flaw, this is the first unboxing of enterprise-grade Agent frameworks, multi-layer prompting, and Undercover modes.
AI shopping agents are evolving from demos to real consumer tools. Walmart launched Sparky, Target partnered with Google Gemini, and Shopify released its Agentic Commerce protocol. When AI agents start swiping your card, e-commerce rules are being rewritten.
OpenClaw founder Peter Steinberger turned a weekend hack powered by Anthropic's Claude into a viral AI agent framework. After a trademark dispute forced a rebrand, OpenAI came knocking. In the Age of AI Exploration, even the smallest idea can reshape the world.
In his latest interview, Andrej Karpathy described experiencing 'AI Psychosis'—he hasn't written code himself in months. This article summarizes his core insights from the 'No Priors' podcast, including the concept of 'Claws' and the paradigm shift in software development.
2026 年 3 月 Google 推出了 Stitch 更新。這款由 Gemini 驅動的 AI 原生設計畫布不僅能透過自然語言生成 UI,更新增 Voice Canvas 語音編輯。它將如何徹底顛覆 Figma 與設計師的未來工作流?
OpenClaw showed us that an assistant is an always-on computing layer, not just a chatbot. But its variants (like NanoBot, CoPaw, IronClaw) are even more fascinating. Spanning five distinct paths, they outline the true shape of next-generation AI assistants.
AI Agents sound cool, but building Agent products in enterprise is full of pitfalls. Here are five design traps I've experienced firsthand.
When your boss asks 'Is AI worth the investment?', you need numbers. Here's the four-metric framework I use to prove GenAI value.
Enterprise prompt engineering is nothing like personal ChatGPT use. Structured templates, version control, multi-role design — lessons from the trenches.
Building a RAG system in banking: how to choose your chunk strategy, embedding model, and retrieval pipeline. Lessons from real production experience.
Does an AI PM need to code? A complete skill tree breakdown comparing AI PMs' skills, including Vibe Coding and specialized evaluation, versus traditional PMs.
Deploying AI in a bank isn't just picking a model. Compliance, security, data governance, organizational culture — each hurdle is the necessary path from 1...
Worried your AI feature will be rendered obsolete by the next model update? Learn how to anticipate model evolution and file AI patents to build an uncopyab...
No coding? Think again. The daily routine of an AI PM involves shifting from a traditional PM to a holistic 'Builder', testing prompts, and battling risk.
Same era, same job title — one group is being laid off while another is being hired. What separates them isn't seniority or credentials; it's how fast they'...
Over the past year leading a team, I found that people with genuine curiosity thrive in AI-augmented work. Boris Cherny of Anthropic thinks this gap is exac...
We're building an AI Agent platform that actually ships to real users — want to go from PoC to production? We're looking for full-stack, backend, and GenAI...
I built an AI Browser that records every reasoning step, tracks queries, auto-decides when to screenshot, and compiles everything into a structured investig...
There was a time when saying 'AI' in serious academic circles was a mark against you. Geoffrey Hinton won the Turing Award in 2018 and the Nobel Prize in Ph...
After Google pushed Gemini 3 Pro and Antigravity, I started rethinking the relationship between developers and AI infrastructure — and what 'role elevation'...
Perplexity is caught between being the next-generation search paradigm and facing mounting legal pressure from content publishers. Can they find a deal stru...
At DevFest Taipei 2025 I shared a real production AI coaching platform — multi-agent collaboration, Persona World, Ontology + GraphRAG, delivering 24/7 pers...
On November 30th I'll be presenting real AI Agent team applications running in production at DevFest Taipei 2025, hosted by Google GDG.
GraphRAG replaces flat vector retrieval with graph-structured knowledge, enabling multi-hop reasoning and consistent context — 86% accuracy on RobustQA vs....
All six utility model patents filed at the start of this year have been approved — two dual-filed. Another five submitted last month. This is what real GenA...
How does a bank GenAI Product Manager design an LLM system that automatically builds a knowledge graph from business pain points, and successfully obtain a...
I'll be speaking at DevFest Taipei 2025 on November 30th — AI Agent team applications in production. Free entry, registration required.
The clearest explanation of 'Attention Is All You Need' I've come across — the mechanics, not just the intuition.
Jason Wei's talk gave me a genuine 'aha' moment. The systematic framework he lays out for finding AI use cases is exactly what I wish I'd articulated earlier.
This October at the iThome Hello World Developer Conference, I presented four intensive sessions covering MCP, GraphRAG, Vibe Coding, and Enterprise LLM Gua...
Does setting temperature to 0 give perfectly consistent AI outputs? No — and Thinking Machine Lab found out why. Batch processing is the culprit, and they b...
When a GenAI system queries sensitive data, how do you prevent malicious users from bypassing security? This article details how a bank AI Product Manager d...
The vibe coding landscape has consolidated around three players — OpenAI's Codex, Google's Gemini, and Anthropic's Claude. Each pulls in a different direction.
Tested Gemini 2.5 Flash's image editing with three sequential prompts — suit, smile, tie adjustment. The precision was genuinely impressive.
We're taking financial AI to Southeast Asia and looking for two engineers. DevOps and full-stack roles open in Taipei Xinyi.
Former OpenAI VP of Product Peter Deng details the essence of product, 1-to-100 growth strategies, the five PM archetypes, and the value of invisible AI.
ChatGPT's agent renders its operations in real-time with full verbosity — like watching a capable human assistant work at a workstation on your behalf.
Are LLM deployment costs skyrocketing? This article shares how a bank GenAI Product Manager used modular architecture design to customize AI systems on dema...
Traditional DBAs manage databases on experience, but under high concurrency and complex loads, that's not enough. This article shares how a GenAI Product Ow...
When introducing an AI knowledge base query system in a bank, how do you prevent PII leaks without sacrificing response quality? This article introduces a G...
Stargate is a 24/7 round-the-clock server construction project. When Americans start running shifts like this, you know this is a race they don't intend to...
Relentless. Product after product, packed into 32 minutes. Google I/O felt less like a keynote and more like being underwater with no room to breathe.
Elon told Kobe: imagination matters more than knowledge. Frieren said magic is a world of imagination. To me, Transformers and generative AI are exactly tha...
What are the true pain points of Relationship Managers? How does GenAI help them generate real-time personalized investment advice in conversations? This ar...
Our financial AI team is looking for DevOps and data science professionals passionate about deploying generative AI applications in real production environm...
We challenged intern candidates to build a static website combining LLM and front-end skills in 60 minutes. The results changed how I think about what a hir...
This is the year of AI Agents. Join me at DevOpsDays on June 5–6 for a session on five agent behavior patterns and building the future DevOps ecosystem with AI.
OpenAI showcased four major innovations: Vision Fine-Tuning, Realtime API, Model Distillation, and Prompt Caching — handing more creative control to develop...