[Tech Series 01] Web Browser, the New Stage for AI: Edge Intelligence
Exploring the concept of Edge Intelligence, which reduces server load and protects data privacy, and the necessity of Web ML.
![[Tech Series 01] Web Browser, the New Stage for AI: Edge Intelligence](/images/blog/edge_intelligence_concept.png)
Web Browser, the New Stage for AI: Edge Intelligence
Artificial Intelligence (AI) technology has made rapid progress over the past few years. In particular, the emergence of Large Language Models (LLM) like ChatGPT has completely changed our daily lives. However, behind this AI revolution, there are important challenges to solve: the structural limitations of the ‘Cloud AI’ approach.
Recently, Edge Intelligence, which runs AI on the user’s device (the ‘Edge’), is attracting attention as a new trend to solve these problems.
Limitations of Cloud AI and the Rise of Edge

Traditional AI services mostly run on central servers equipped with high-performance GPUs. When a user sends data, the server processes it and sends the result back. This method has the following problems:
- Latency: Inevitable delays occur as data travels back and forth across the network. This can be critical for applications requiring real-time responses like autonomous vehicles or AR/VR services.
- Privacy: Sensitive user data (photos, voice, medical information, etc.) must be sent to external servers, creating risks of data leakage.
- Cost: The cost of operating GPU servers to run AI models is astronomical, threatening the sustainability of services.
Edge Intelligence solves these problems fundamentally. By running AI models on the user’s smartphone, laptop, or web browser itself, data doesn’t need to leave the device, providing immediate response speeds regardless of network conditions.
Web ML: The Most Universal Edge AI Platform
Among them, Web ML (Machine Learning on the Web) has the most powerful impact. Web browsers are the most universal platform, already installed on billions of devices worldwide.
The ability to experience cutting-edge AI technology with just a single URL click, without any app installation or complex environment setup, is a unique advantage of Web ML. In the past, only simple models could run due to browser performance limitations, but with the emergence of WebAssembly and WebGPU, we now have an environment that can achieve performance comparable to, or even exceeding, native apps.
Realays aims to provide safe and fast AI experiences using only web technologies, at the forefront of this technological change.
Real-World Applications of Web ML
1. Real-Time Video Processing
Modern web browsers can process real-time video using WebGPU:
- Background Removal: Real-time background blur or replacement during video calls
- Filter Effects: Instagram-style real-time filters
- Motion Tracking: Real-time tracking of user’s hands or body movements
2. Natural Language Processing
- Translation: Instant page translation (without server communication)
- Text Summarization: Automatic summarization of long documents
- Sentiment Analysis: Automatic emotion analysis of user feedback
3. Image Recognition
- Product Search: Find similar products by image
- Document OCR: Instant text extraction from photos
- Face Recognition: Automatic photo album organization
The Future of Edge Intelligence
Technology Trends
- More Powerful Models: GPT-4 level models expected to run in browsers
- Battery Efficiency: Development of lightweight inference engines optimized for power consumption
- Multimodal: Models that integrate text, image, and voice processing
Industry Impact
- Healthcare: Assist in diagnosis while protecting patient data privacy
- Finance: Analyze sensitive financial information without sending to servers
- Education: Provide personalized learning experiences even offline
Getting Started Guide for Developers
Your First Web ML Project
// Simple image classification example
import * as tf from "@tensorflow/tfjs";
async function classifyImage(imageElement) {
// Load MobileNet model
const model = await tf.loadLayersModel(
"https://storage.googleapis.com/tfjs-models/tfjs/mobilenet_v1_0.25_224/model.json",
);
// Convert image to tensor
const tensor = tf.browser
.fromPixels(imageElement)
.resizeNearestNeighbor([224, 224])
.toFloat()
.expandDims();
// Run prediction
const predictions = await model.predict(tensor).data();
// Clean up memory
tensor.dispose();
return predictions;
}
Recommended Learning Resources
- TensorFlow.js Official Docs: https://www.tensorflow.org/js
- WebGPU Tutorial: https://web.dev/gpu/
- Hugging Face Model Hub: Explore pre-trained models
Frequently Asked Questions (FAQ)
Q: Isn’t Web ML slower than server-based AI? A: Initially yes, but thanks to WebGPU and optimization techniques, in many cases it’s actually faster when considering server round-trip time.
Q: Does it work on all browsers? A: TensorFlow.js works on most modern browsers. WebGPU is supported on Chrome 94+, Edge 94+, and other browsers are gradually adding support.
Q: Does it work on mobile devices? A: Yes, Web ML works well on modern mobile browsers. TensorFlow.js Lite enables mobile optimization.
Q: Is privacy really guaranteed? A: Since data doesn’t leave the user’s device, it’s much safer than server-based AI. You can verify network traffic using developer tools.
In the next post, we’ll explore how Web ML technology has evolved, from TensorFlow.js to WebLLM.

![[Tech Series 02] From TensorFlow.js to WebLLM: Evolution of Web ML](/images/blog/web_ml_evolution.png)
![[Tech Series 05] Realays AI Use Cases: FruitsFace & Dalendar](/images/blog/fruitsface_concept.png)