🧠 The 7 Stages of Artificial Intelligence

2025. 6. 17. 11:58 · Python(AI)

💡 The 7 Practical Stages of AI Development × Technologies and Languages at a Glance

Stage | Practical term | Key tasks | Technologies & tools | Languages
1️⃣ Problem definition
Practical term: Problem Definition
Key tasks: define the goal to be solved; identify data requirements; set evaluation metrics
Technologies & tools: Notion, Google Docs, Whimsical; collaboration tools: Jira, Trello, Slack
Languages: none (described in natural language)
2️⃣ Data collection
Practical term: Data Collection
Key tasks: collect from databases, APIs, web crawling, and sensors; collect files (CSV, JSON, etc.)
Technologies & tools: SQL (MySQL, PostgreSQL); API: REST, GraphQL; web crawling: BeautifulSoup, Selenium
Languages: Python (requests, bs4), Shell Script, SQL
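As a minimal sketch of the file-collection task in this stage, the snippet below parses CSV and JSON text into one list of records using only Python's standard library. The sample payloads, the field names (`id`, `value`), and the helper names `load_csv_records` and `load_json_records` are hypothetical; in practice the text would come from a file, a database export, or an API response (for example one fetched with `requests`).

```python
import csv
import io
import json

# Hypothetical payloads standing in for a collected CSV file and a JSON API response.
CSV_TEXT = "id,value\n1,10.5\n2,7.25\n"
JSON_TEXT = '[{"id": 3, "value": 4.0}]'


def load_csv_records(text):
    """Parse CSV text into a list of dicts with typed fields."""
    reader = csv.DictReader(io.StringIO(text))
    return [{"id": int(row["id"]), "value": float(row["value"])} for row in reader]


def load_json_records(text):
    """Parse a JSON array of objects into the same record shape."""
    return [{"id": int(item["id"]), "value": float(item["value"])} for item in json.loads(text)]


# Merge both sources into one uniform list for the next stage.
records = load_csv_records(CSV_TEXT) + load_json_records(JSON_TEXT)
print(records)
```

Normalizing every source into the same record shape at collection time keeps the later preprocessing code independent of where the data came from.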
3️⃣ Data preprocessing
Practical term: Data Cleaning
Key tasks: handle missing values; remove outliers; scaling, encoding, etc.
Technologies & tools: Pandas, NumPy; Sklearn: SimpleImputer, StandardScaler; OpenRefine
Languages: Python, R, Excel
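The missing-value and scaling tasks in this stage can be sketched with the two scikit-learn classes named above. The feature matrix is made-up data, and mean imputation is only one possible strategy chosen for illustration; outlier removal and encoding are not shown.

```python
import numpy as np
from sklearn.impute import SimpleImputer
from sklearn.preprocessing import StandardScaler

# Hypothetical feature matrix with one missing value (np.nan) in the second column.
X = np.array([[1.0, 10.0], [2.0, np.nan], [3.0, 30.0]])

# Replace each missing value with the mean of its column.
imputer = SimpleImputer(strategy="mean")
X_imputed = imputer.fit_transform(X)

# Rescale each column to zero mean and unit variance.
scaler = StandardScaler()
X_scaled = scaler.fit_transform(X_imputed)

print(X_imputed)
print(X_scaled.mean(axis=0), X_scaled.std(axis=0))
```

When a separate test set exists, the imputer and scaler would normally be fitted on the training data only and then applied to the test data with `transform`, so that no test-set statistics leak into training.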
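4️⃣ Data exploration
Practical term: EDA (exploratory data analysis)
Key tasks: visualization; checking distributions; correlation analysis
Technologies & tools: Matplotlib, Seaborn, Plotly; Pandas Profiling; Tableau, Power BI
Languages: Python, R, SQL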
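A rough sketch of the distribution check and correlation analysis in this stage, using pandas. The column names (`hours`, `score`, `noise`) and their values are invented for illustration; plotting with Matplotlib or Seaborn is left out so the example stays short.

```python
import pandas as pd

# Hypothetical dataset: study hours, exam score, and an unrelated column.
df = pd.DataFrame({
    "hours": [1, 2, 3, 4, 5],
    "score": [52, 55, 61, 70, 74],
    "noise": [3, 1, 4, 1, 5],
})

# Per-column distribution summary: count, mean, std, min, quartiles, max.
summary = df.describe()

# Pairwise Pearson correlation between numeric columns.
corr = df.corr()

print(summary)
print(corr.round(2))
```

A correlation matrix like `corr` is also a common input for a heatmap, which is one way the visualization task in this stage is often carried out.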
5️⃣ Modeling
Practical term: Model Training
Key tasks: model selection and training; cross-validation and tuning
Technologies & tools: Sklearn, XGBoost, LightGBM; TensorFlow, PyTorch; MLflow
Languages: Python, R, Java/C++ (production environments)
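Cross-validation and tuning could look roughly like the sketch below, which uses scikit-learn. The choice of the bundled iris dataset, logistic regression, 5 folds, and the three candidate values of `C` are all assumptions made for illustration, not recommendations from the original post.

```python
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import GridSearchCV, cross_val_score

# Small dataset that ships with scikit-learn (no download needed).
X, y = load_iris(return_X_y=True)

# 5-fold cross-validation of a baseline model; returns one accuracy per fold.
baseline = LogisticRegression(max_iter=1000)
cv_scores = cross_val_score(baseline, X, y, cv=5)

# Grid search over the regularization strength C, also with 5-fold CV.
search = GridSearchCV(
    LogisticRegression(max_iter=1000),
    param_grid={"C": [0.1, 1.0, 10.0]},
    cv=5,
)
search.fit(X, y)

print(cv_scores.mean())
print(search.best_params_, search.best_score_)
```

After fitting, `search.best_estimator_` holds the model refitted on all the data with the best parameters found, which is the object that would move on to the evaluation stage.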
6️⃣ Prediction and evaluation
Practical term: Inference & Evaluation
Key tasks: predict on test data; measure accuracy, F1, and RMSE
Technologies & tools: Sklearn evaluation functions; Keras/TensorFlow evaluation API; Optuna, GridSearchCV
Languages: Python, Jupyter Notebook
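To make the three metrics in this stage concrete, here is a hand-written sketch of accuracy, F1, and RMSE in plain Python. The function names and the sample labels are hypothetical; in a real project, `sklearn.metrics` functions such as `accuracy_score`, `f1_score`, and `mean_squared_error` would typically be used instead.

```python
import math


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def f1(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    if tp == 0:
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def rmse(y_true, y_pred):
    """Root mean squared error for regression outputs."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


# Hypothetical classification labels and regression targets.
acc_value = accuracy([1, 0, 1, 1, 0, 0], [1, 0, 0, 1, 1, 0])
f1_value = f1([1, 0, 1, 1, 0, 0], [1, 0, 0, 1, 1, 0])
rmse_value = rmse([3.0, 5.0, 2.0], [2.0, 5.0, 4.0])
print(acc_value, f1_value, rmse_value)
```

Accuracy and F1 apply to classification, while RMSE applies to regression, so a single project usually reports only the ones that match its task.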
7️⃣ Serving and improvement
Practical term: Deployment & Feedback
Key tasks: expose the model as an API; monitor it and apply feedback
Technologies & tools: Flask, FastAPI; Docker, Kubernetes; AWS, GCP, MLflow, Airflow
Languages: Python, Bash, YAML, SQL
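Exposing a model as an API might be sketched with Flask as follows. The `/predict` route, the `features` field in the request body, and the `predict_score` function (a stand-in that just averages its inputs in place of a trained model) are all hypothetical names chosen for this example.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def predict_score(features):
    """Hypothetical stand-in for a trained model: returns the mean of the inputs."""
    return sum(features) / len(features)


@app.route("/predict", methods=["POST"])
def predict():
    # Read the JSON body; fall back to an empty dict if it is missing or invalid.
    payload = request.get_json(silent=True) or {}
    features = payload.get("features")
    if not isinstance(features, list) or not features:
        return jsonify({"error": "features must be a non-empty list"}), 400
    return jsonify({"prediction": predict_score(features)})
```

The endpoint can be exercised without starting a server through Flask's built-in test client, for example `app.test_client().post("/predict", json={"features": [1, 2, 3]})`. Starting a real server, containerizing it with Docker, and adding monitoring are deliberately left out of this sketch.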