Final Challenges — Fase 4
Challenge 1 — Quiz
- Beda
&dananddi NumPy/Pandas? - Beda
ilocdanloc? - Apa itu broadcasting?
- Kenapa
df.copy()penting saat slice? - Beda merge inner vs outer?
- Kapan pakai pivot vs pivot_table?
- Beda rolling vs expanding window?
- Beda
applydan vectorized op? Mana lebih cepat? - Beda histogram dan KDE plot?
- Kapan pakai box vs violin plot?
Challenge 2 — Speed Comparison
import numpy as np
import pandas as pd
import time
n = 1_000_000
df = pd.DataFrame({"x": np.random.randn(n), "y": np.random.randn(n)})
# Compare 3 cara hitung x + y
# Method 1: iterrows
# Method 2: apply
# Method 3: vectorized
# Hitung waktu masing-masing
Challenge 3 — Real Project
Bikin kompletkan project EDA dengan:
- 1 dataset besar (>10k rows)
- 20+ visualisasi
- 10+ insight detailed
- Cleaning + feature engineering
- Save processed data
- README + LinkedIn post
Push to GitHub. Ini akan jadi portfolio piece pertama yang serius.
Challenge 4 — Anki Update
Tambah 30 cards baru. Total 120+ cards.
Checklist
- NumPy lancar untuk array manipulation
- Pandas lancar (groupby, merge, pivot)
- Bisa visualisasi dengan matplotlib + seaborn
- Selesai 1 EDA project end-to-end
- Notebook published di GitHub
Selanjutnya: Fase 5 — Machine Learning