Skip to main content

Command Palette

Search for a command to run...

🧠 AEM Architecture Explained β€” Author, Publish & Dispatcher (Beginner Guide)

Published
β€’4 min read

Chapter 1 of AEM Learning Series β€” Understanding the core architecture used in every production AEM setup.

These 3 layers work together to deliver fast, secure, and scalable websites.

✍️ 1. AUTHOR Instance

AspectDetail
UsersAuthors, marketers, developers
PurposeCreate, edit, approve content
Default URLhttp://localhost:4502
Public Access❌ Never exposed

What happens on Author?

  • Drag & drop page building

  • Upload images/videos to DAM

  • Preview content before publishing

  • Run workflows (review β†’ approve β†’ publish)

  • Developers deploy code here first

Think of Author as the content factory 🏭


🌐 2. PUBLISH Instance

AspectDetail
UsersWebsite visitors
PurposeServe live website
Default URLhttp://localhost:4503
Public Accessβœ… Yes

What happens on Publish?

  • Serves final HTML pages

  • Contains only activated content

  • No editing interface

  • Multiple instances used for scaling

Publishing Flow

Author writes page β†’ Clicks Publish β†’ Replicates to Publish β†’ Users see it

Think of Publish as the live website servers.


⚑ 3. DISPATCHER (Most Important Layer)

AspectDetail
What is it?Apache HTTP module
PurposeCaching + Security + Load balancing
PositionBetween browser and Publish

How Dispatcher Works

Dispatcher’s 3 Superpowers

1️⃣ Caching β†’ Stores rendered HTML
2️⃣ Security β†’ Blocks malicious URLs
3️⃣ Load Balancing β†’ Distributes traffic

πŸ‘‰ Dispatcher is NOT part of AEM β€” it runs on Apache/IIS.

πŸ”„ Real-World Flow Example

Imagine Coca-Cola launches a Summer Sale Page:

  1. Author creates page on Author

  2. Clicks Publish

  3. Content replicates to Publish

  4. User visits website

  5. Dispatcher checks cache

  6. First user β†’ page generated & cached

  7. Next 10,000 users β†’ cached version ⚑

This is how AEM handles massive traffic.

🏭 Production Setup (Enterprise Scale)

                         β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
                         β”‚  AUTHOR  β”‚
                         β””β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”˜
                              β”‚ Replication
           β”Œβ”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”€β”€β”€β”€β”
           β–Ό          β–Ό          β–Ό
          PUBLISH1   PUBLISH2   PUBLISH3
               β”‚          β”‚          β”‚
          DISPATCHER1 DISPATCHER2 DISPATCHER3
                    β””β”€β”€β”€β”€β”€β”€β”¬β”€β”€β”€β”€β”€β”€β”˜
                           β–Ό
                     LOAD BALANCER
                           β–Ό
                        Users

Large enterprises run multiple publish + dispatcher nodes for high availability.


🧾 Key Things to Remember

  • Author β†’ Content creation (private)

  • Publish β†’ Live website (public)

  • Dispatcher β†’ Cache + Security layer

  • Content flows Author β†’ Publish

  • Users NEVER access Author directly


🎯 AEM Interview Questions (30 Q&A)


🟒 Section A β€” Basics

Q1. What is AEM?
AEM is an enterprise CMS used to create, manage and deliver digital experiences.

Q2. AEM technology stack?
OSGi β€’ JCR β€’ Sling β€’ HTL β€’ Java

Q3. Three tiers of AEM?
Author β€’ Publish β€’ Dispatcher

Q4. Author instance?
Content creation environment (port 4502).

Q5. Publish instance?
Live website server (port 4503).

Q6. Dispatcher?
Apache module for caching, security, load balancing.

Q7. Default ports?
Author β†’ 4502 | Publish β†’ 4503

Q8. Author users?
Authors, marketers, developers.

Q9. Publish users?
Website visitors.

Q10. AEM language?
Java.


πŸ”΅ Section B β€” Replication

Q11. Replication?
Author β†’ Publish content transfer.

Q12. Activation vs Deactivation?
Publish vs Unpublish.

Q13. Reverse replication?
Publish β†’ Author data flow.

Q14. Replication agent?
Configuration controlling replication.

Q15. Publish β†’ Author publishing?
Not directly (only reverse replication).


🟣 Section C β€” Dispatcher

Q16. Dispatcher functions?
Caching β€’ Security β€’ Load balancing.

Q17. Where installed?
Apache HTTP / IIS server.

Q18. Cache miss?
Request sent to Publish β†’ cached.

Q19. Cache invalidation?
Flush agent clears cache.

Q20. Personalized content caching?
No β€” use AJAX/client-side.


🟠 Section D β€” Production

Q21. Author instances in prod?
Usually 1 (or primary + standby).

Q22. Publish instances?
3–10+ depending on traffic.

Q23. In front of Dispatcher?
Load balancer or CDN.

Q24. Hosting types?
On-Prem β€’ AMS β€’ Cloud Service.

Q25. Full request flow?
Browser β†’ LB β†’ Dispatcher β†’ Publish β†’ Response.


πŸ”΄ Section E β€” Scenarios

Q26. Users see old content?
Check dispatcher/CDN cache or replication.

Q27. Author down β€” website works?
Yes.

Q28. Publish down β€” website works?
Cached pages still load.

Q29. Wrong content published?
Republish or deactivate.

Q30. Why Author behind VPN?
Security & protection.


πŸŽ‰ Summary

If you understand this architecture, you understand the foundation of AEM.

Everything else in AEM builds on this.