Introduction
Data comes in many shapes—and choosing the right type and file format is a core skill for any data analyst. On Day 3, you’ll learn how data is structured, how it’s stored, and why CSV or JSON is chosen in real business scenarios. This lesson is practical, simple, and directly applicable to work tasks.
Learning Objectives
By the end of this lesson, you will be able to:
-
Distinguish structured vs unstructured data
-
Understand CSV and JSON file formats
-
Decide which file format fits a specific use case
Topic 1: Structured vs Unstructured Data
Structured Data
Structured data:
-
Has a fixed format
-
Is organized into rows and columns
-
Fits neatly into tables or spreadsheets
Examples:
-
Sales records
-
Employee lists
-
Transaction logs
📌 Easy analogy:
Think of structured data as an Excel sheet—everything has its place.
Unstructured Data
Unstructured data:
-
Has no fixed format
-
Is free-form and flexible
-
Often harder to analyze directly
Examples:
-
Emails
-
Chat messages
-
Images, videos, PDFs
📌 Easy analogy:
Unstructured data is like a folder of random notes, photos, and messages.
CSV (Comma-Separated Values)
What it is:
A simple text file where values are separated by commas.
Best for:
-
Tabular data
-
Reporting and analytics
-
Excel, Google Sheets, SQL imports
Pros:
-
Easy to read
-
Lightweight
-
Widely supported
Cons:
-
No nested data
-
Limited structure
JSON (JavaScript Object Notation)
What it is:
A structured text format using key-value pairs and nested objects.
Best for:
-
APIs
-
Web and app data
-
Semi-structured datasets
Pros:
-
Flexible structure
-
Supports nesting
-
Great for applications
Cons:
-
Harder to read manually
-
Not ideal for spreadsheets
Dataset Samples
Sample CSV
customer_id,name,amount
101,Ana,1500
102,Mark,2300
Sample JSON
{
"customer_id": 101,
"name": "Ana",
"orders": [
{"amount": 1500}
]
}
Exercises
Exercise 1: Identify the Data Type
Classify each as Structured or Unstructured:
-
Excel sales report
-
Customer support emails
-
SQL database table
-
Product images
Exercise 2: Choose the Correct File Format
Pick CSV or JSON and explain why:
-
Monthly revenue report
-
API response from a shopping app
-
Employee payroll file
-
Mobile app user activity log
Bonus Challenge
Convert a small CSV file into JSON format (even manually!) to see how structure changes.
What to Expect After Completion of Day 3
After finishing Day 3, you will:
-
Confidently recognize data types in real jobs
-
Know when to use CSV vs JSON
-
Be ready to import data into Excel, SQL, or BI tools
-
Understand data formats used in APIs, dashboards, and reports
📌 You’re now thinking like a data analyst—not just using files, but choosing them strategically.
Conclusion
Understanding data types and file formats is foundational. This knowledge helps you avoid mistakes, work faster, and communicate better with engineers and business teams. Tomorrow, you’ll start using these skills in real analysis workflows.

No comments:
Post a Comment