Taking Ownership of a Legacy Project with Just One Page of Documentation
Note: This case study is based on my personal experience, written by me and refined with AI tools for clarity and polish.
Imagine inheriting a critical food delivery system that serves thousands of customers daily—yet, the only documentation you receive is a single page with a few lines about its features. That was the challenge I faced when I took over HungryNaki, an online platform where users order food from over 750 restaurants across Dhaka, Chittagong, and Sylhet.
The platform operates with 1,500+ riders and 25 hubs, ensuring most orders are delivered within an hour. Customers rely on its website and mobile apps to search for restaurants, place orders, and get their food delivered seamlessly. However, the system struggled to handle peak-hour traffic, leading to frequent failures, frustrated customers, and business disruptions.
The Challenge
I was assigned as the lead for this project, a live system with ongoing client dissatisfaction. The system struggled to handle peak-hour traffic, particularly in the evenings when most customers placed orders or when the marketing team launched new discount promotions.
This was not a small system—it included:
- 1 iOS app
- 3 Android apps
- 3 web platforms
- 1 Node.js backend supporting the entire load
- 100+ MongoDB collections
I was leading a team of four junior developers, most of whom were new to the project. The available documentation consisted of just one page, containing only a 4–5 line overview of its features.
My Approach
My first goal was to understand the full lifecycle of a single request—from the client app or web platform to the server and all the components it passed through.
Once I gained an understanding of how the system functioned and the roles of different components, I focused on identifying system bottlenecks using metrics. However, I quickly discovered that we had minimal monitoring in place, forcing me to rely on system usage patterns instead.
I also found that the system was a monolith, running on a single server with a high load on MongoDB. To improve scalability, I planned to transition it into a modular monolith, ensuring that we wouldn’t be limited to a single instance.
Execution
Short-Term Fixes (Immediate Stability & Performance Improvement)
- Fixed DNS Issue: Discovered that our Cloudflare subscription had expired, causing intermittent failures. Renewed the subscription to restore stability.
- Increased System Capacity: Adjusted server resources to temporarily handle peak traffic more efficiently.
- Database Indexing: Analyzed MongoDB queries and added missing indexes to speed up frequent read operations, reducing query execution time.
Long-Term Fixes (Scalability & Architectural Improvements)
- Modular Monolith Refactoring: Planned a transition from a monolithic system to a modular monolith, allowing independent scaling of critical services without full system dependency.
- Advanced Query Optimization & Indexing: Introduced compound indexes and fine-tuned existing ones based on query patterns to improve database efficiency further.
- Load Balancing & Horizontal Scaling: Deployed additional instances behind a load balancer to distribute traffic evenly.
- Asynchronous Processing: Moved non-critical operations (e.g., notifications, analytics updates) to background jobs using a queue system to reduce request-response time.
Results
The immediate system failures during peak hours were mitigated, allowing the business to continue operations.
However, performance was still slow at peak times, requiring further optimization efforts.
Learnings
- The importance of logging and monitoring in a live system.
- Prioritizing business-critical issues first—focusing on the most impactful problems rather than trying to fix everything at once.
- Thorough documentation is essential—future developers can understand the code, but high-level overviews, implementation details, and key considerations are often missing.
Conclusion
Starting work on a new system—especially one with minimal documentation—can be overwhelming. The best approach is to begin with the smallest unit, gradually expand understanding, and prioritize the most critical issues first.
Key Technologies & Tools
- Node.js
- MongoDB
- Redis
- Elasticsearch
- GraphQL
- Java