DEV Community

James Lee

🚀 Senior LLM Engineer | RAG · AI Agents · LLMOps | Python · AWS · K8s

Work

Senior LLM Engineer — RAG · AI Agents · LLMOps

From 60% to 93%: How We Built a Continuous Evaluation Framework for LLM Systems

Comments
9 min read

Serverless Best Practices: Production Architecture, Stateless Design & Cost Optimization

Comments
10 min read
Serverless Workflows: Orchestrating Multi-Step Pipelines with AWS Step Functions

Comments
5 min read
Event-Driven Automation: Building a Serverless Maintenance Bot with Lambda & EventBridge

Comments
8 min read
Traffic Routing in AWS Lambda: Canary Deployments, Weighted Aliases & Blue/Green

Comments
7 min read
Auto Scaling in AWS Lambda: Concurrency, Throttling & Scale-to-Zero

Comments
7 min read
Lambda Triggers Explained: S3, EventBridge, API Gateway & SQS

Comments
8 min read
Cold Start in AWS Lambda: Causes, Phases & How to Fix It

Comments
7 min read
Kubernetes Data Access Flow: How Network Traffic Moves Inside and Outside the Cluster

Comments
5 min read
Kubernetes Resource Orchestration: How kubelet Prepares Storage, Network & Compute for Every Pod

Comments
5 min read
Kubernetes Resource Scheduling: Filtering, Scoring & Priority Preemption

Comments
4 min read
Kubernetes Control Flow: How Resources Are Created, Deleted, Modified & Queried

Comments
4 min read
Kubernetes Logical Architecture: Control Plane vs Worker Nodes & Why the Control Plane Runs kubelet Too

Comments
3 min read
Kubernetes Architecture Overview: Control Plane, Worker Nodes & the Container Stack

Comments
3 min read
Go Garbage Collection: Tri-Color Mark & Sweep, Write Barriers & STW Optimization

Comments
5 min read
Go Compiler & defer: Bootstrap, Three defer Implementations, panic/recover & Closures

1
Comments
6 min read
Goroutine Scheduling: GMP Model, Schedule Loop, Preemption & Stack Management

Comments
6 min read
Go System Calls & Blocking: syscall Wrapping, Async vs Sync & GMP Separation

Comments
5 min read
Go I/O Optimization: goroutine-per-connection, netpoller & the Reader/Writer Interface

Comments
5 min read
Go Heap Memory Allocation: tcmalloc, Mutator/Allocator & Multi-Level Cache

Comments
5 min read
Go Performance Optimization: pprof, Flame Graphs & Hotspot Profiling

Comments
4 min read
Tekton + Argo CD: Building a Complete GitOps Pipeline End-to-End

Comments
4 min read
Argo CD Application Updates & Rollbacks: GitOps-Driven Version Control in Practice

Comments
5 min read
Argo CD Application Management: Deploy to Multiple Clusters with a Unified View

Comments
6 min read
Argo CD Installation & Configuration: From Zero to Running in 10 Minutes

Comments
5 min read
Argo CD Core Concepts & Architecture: The GitOps CD Engine for Kubernetes

Comments
4 min read
Tekton in Practice: Building a Java CI/CD Pipeline from Scratch

Comments
6 min read
Tekton Concept Model: Steps, Tasks, Pipelines and How They Actually Run

Comments
5 min read
Tekton Components & Resource Objects: A Complete Overview

Comments
4 min read
GitOps Continuous Delivery: Immutable Infrastructure, Pull Pipelines & Observability

Comments
4 min read
GitOps vs DevOps: What's the Difference and How Do They Relate?

Comments
3 min read
What is GitOps? IaC, Git, and the Future of Cloud-Native CD

Comments
5 min read
Python Object Model: How CPython Represents Everything as an Object

Comments
5 min read
Python list Internals: How Dynamic Arrays Work Under the Hood

Comments
5 min read
Python dict Internals: Hash Tables, Collision Resolution, and Hash Attacks

1
Comments 6
6 min read
Python GIL: Why One Lock Rules the Entire Interpreter

1
Comments
5 min read
Python Memory Optimization: How CPython's Memory Pool Works

Comments
6 min read
Building a Production-Grade LLM Customer Service in 8 Weeks: Architecture Decisions, Pitfalls, and Best Practices

3
Comments 2
13 min read
Production Optimization: Inference Cost and Performance Control

Comments
10 min read
Hybrid Knowledge Retrieval: Combining Neo4j Graph Queries, GraphRAG and Vector Search for Enterprise AI Customer Service

Comments
11 min read
Building Safety Guardrails for LLM Customer Service That Actually Work in Production

Comments
9 min read
From Single-Agent to Multi-Agent: Designing and Deploying an Enterprise-Grade Intelligent Customer Service System with LangGraph

Comments
10 min read
Engineering GraphRAG for Production: API Design, Query Optimization, and Service Reliability

Comments
6 min read
Production-Grade GraphRAG Data Pipeline: End-to-End Construction from PDF Parsing to Knowledge Graph

Comments 1
9 min read
From 0 to MVP in 2 Weeks: Building a Production-Grade AI Customer Service System

1
Comments
9 min read
MCP Framework: The "Swiss Army Knife" for AI System Integration — A GraphRAG Case Study

6
Comments
5 min read
OpenManus Architecture Deep Dive: Enterprise AI Agent Development with Real-World Case Studies

13
Comments
8 min read
CrewAI for Marketing Research: Building a Multi-Agent Collaboration System

11
Comments 4
5 min read
Breaking Limitations: Advanced Customization Guide for Dify Platform

8
Comments
8 min read
Multi-Agent Hybrid Knowledge Base Retrieval: Building a High-Precision Legal Case Analysis Platform

7
Comments 1
7 min read
Building LLM-Powered Real Estate Intelligent Agents: Technical Implementation and Business Value

16
Comments 6
8 min read
n8n and AI Agents: Breaking Boundaries in Enterprise-Grade Consulting

12
Comments 1
6 min read
Building a Medical Literature Assistant: RAG System Practice Based on LangChain

15
Comments
14 min read
Build an enterprise-level financial data analysis assistant: multi-source data RAG system practice based on LangChain

9
Comments 1
16 min read
Enterprise-Level Deployment and Optimization of LLM Applications: A Production Practice Guide Based on LangChain

6
Comments 1
15 min read
Design and Implementation of LLM-based Intelligent O&M Agent System

9
Comments 1
5 min read
Building Enterprise-Level Data Analysis Agent: Architecture Design and Implementation

6
Comments
9 min read
Building an Intelligent Customer Service Agent System from Scratch

5
Comments
5 min read
Agent Task Orchestration System: From Design to Production

7
Comments
4 min read
LangGraph State Machines: Managing Complex Agent Task Flows in Production

23
Comments
3 min read