Module 3: NoSQL and Document-Oriented Databases

Module Overview

Need to deal with Big Data? You may need tools beyond standard SQL approaches. Enter NoSQL and document-oriented databases! In this module, we explore the world of NoSQL databases with a focus on MongoDB, one of the most popular document-oriented databases. We'll learn how to store, retrieve, and query data in a schema-less environment.

Learning Objectives

Setting Up MongoDB in 2025

MongoDB has evolved into a leading cloud database platform, with MongoDB Atlas as the recommended solution for most users. Atlas is a fully managed cloud service that simplifies deployment, scaling, and management. While local installations are still available for on-premises or offline use, Atlas is preferred for its ease of use, global availability, and automatic scaling. Here's what you need to know to get started in 2025:

Modern Setup Process

Current Deployment Options (2025)

Note: Pricing varies by cloud provider (AWS, Azure, Google Cloud), region, and usage (e.g., compute, storage, data transfer). Always verify current rates on MongoDB’s pricing page, as costs may fluctuate.

Key Requirements

Quick Start Guide

To set up a free MongoDB Atlas cluster:

  1. Sign up at MongoDB Atlas.
  2. Choose the Free Tier (M0), select a cloud provider (e.g., AWS), and pick a region (e.g., US East).
  3. Configure your cluster’s security: Add your IP address to the IP whitelist (e.g., 0.0.0.0/0 for access from anywhere, but use caution) and create a database user with a username and password.
  4. Deploy the cluster (takes ~5 minutes). Copy the connection string to connect via your app or MongoDB Compass.
  5. Test your connection using Python: Install pymongo (pip install pymongo) and use the connection string in your code (replace with your user’s password).

Tip: Ensure your IP whitelist includes your current network to avoid connection errors. For production apps, use VPC peering or private endpoints for secure access.

Additional Learning Resources

Enhance your MongoDB skills with these free resources:

Note: As of 2025, MongoDB has deprecated M2/M5 Shared clusters and Serverless instances, transitioning to Flex Clusters for greater flexibility. Atlas is the industry standard for cloud deployments due to its automated management, global scalability, and cost predictability. Local installations (e.g., MongoDB Community Edition) remain an option for on-premises or offline environments but require manual setup and maintenance.

Guided Project

In this guided project, we'll learn how to work with NoSQL databases and build data pipelines between different database types. Open guided-project.md in the GitHub repository below to follow along with the guided project.

The GitHub repository contains valuable resources, examples, and documentation that align with the lecture content and learning objectives. Take time to review these materials as they will help reinforce your understanding of NoSQL databases and MongoDB implementation.

Module Assignment

For this assignment, you'll practice working with MongoDB, creating document-oriented databases, and building data pipelines between SQL and NoSQL systems.

Solution Video

Additional Resources

MongoDB Learning

Documentation