User Tools

Site Tools


data_science_curriculum

This is an old revision of the document!


Curriculum Design

아래 대 분류 섹션은 각 대학교의 DS 프로그램의 커리큘럼 내용입니다. 큰 그림으로 보면 미국의 이런 프로그램은 대개 “Math와 Stat,” “Comp Sci”의 과목이 주가 되는 듯 싶습니다. 미디어학과의 경우, “미디어” 사용이라는 도메인(혹은 익스퍼트) 지식이 연계된 내용이 포함이 되어야 할 텐데, 이런 예가 많지 않습니다. 아래는 다른 프로그램들에 기초해서 학생들에게 제공할 수 있는 내용을 정리해 본 것입니다.


기초(Foundation)

  1. 기초
  2. 수학
  3. 전산
  4. (사회)심리, 경영, 경제 기초

심화(Advanced)

  • 아래에서 첫단계는 분야입니다. (콘텐츠산업, 금융공학, 등등)
  • 두번째 단계는 세부 분야입니다 (콘텐츠산업 밑에 데이터저널리즘).
  • 세번째 단계는 관련 수업이나 교육제목입니다 (데이터저널리즘을 위해서는 데이터마이닝, 데이터비주얼라이제이선, . . . 등의 수업이 필요함)
  • 두번째와 세번째 단계를 개개인이 콘트리뷰션해 주시면 좋겠습니다. 나중에 필요없다 싶으면 지우고 정리하고 하는 작업이 있으면 되니 부담갖지 마시고 관련 수업이나 내용을 적어 주시면 되겠습니다.
  1. 데이터 시스템 (하둡과 같은 데이터 관리 시스템 지식: 미디어학과와는 거리가 좀 있음)
  2. 금융공학
  3. 콘텐츠 산업관련 서비스
    1. 데이터 저널리즘
      1. 데이터 마이닝
      2. 소셜미디어 분석
      3. 데이터 비주얼라이제이션
    2. 빅 데이터 분석기반 광고
      1. 데이터 마이닝
      2. 소셜미디어 분석
      3. 데이터 비주얼라이제이션
    3. 웹서비스
      1. 웹 아날리틱스
    4. 영상, 음악 서비스
      1. 데이터 분석에 기반한 영상, 음악 추천 서비스
  4. 게임
  5. 에듀테인먼트
  6. 부상, 부각되는 분야
    1. 의료 서비스
    2. 군사 관련 서비스
    3. 투자 관련 서비스

—-

Berkeley

USC

Curriculum:

Total Units: 32

You must take the following required courses (12 units):

CS 570 - Analysis of Algorithms (4)
CS 585 - Database Systems (4)
CS 561 - Foundations of Artificial Intelligence (4)
Group Electives (3 course - minimum of 1 course from each of the two groups, 9-12 units):

Group 1 - Data Systems:

CSCI 548 - Information Integration on the Web (4)
CSCI 572 - Information Retrieval and Web Search Engines (4)
CSCI 586 - Database Systems Interoperability (3)
CSCI 587 - Geospatial Information Management (4)
CSCI 653 - High Performance Computing and Simulation (4)
CSCI 685 - Advanced Topics in Database Systems (4)

Group 2 - Data Analysis:

CSCI 567 - Machine Learning (4)
CSCI 573 - Probabilistic Reasoning (3)
CSCI 686 - Advanced Big Data Analytics (4)
ISE 520 - Optimization: Theory and Algorithms (3)
MATH 467 - Theory and Computational Methods for Optimization (4)
MATH 574 - Applied Matrix Analysis (3)

Additional Electives (8-11 units):

Any 500 or 600 level course in CSCI
MATH 458 - Numerical Methods (4)
MATH 501 - Numerical Analysis and Computation (3)
MATH 502ab - Numerical Analysis (3-3)
MATH 505a - Applied Probability (3)
MATH 601 - Optimization Theory and Techniques (3)
MATH 650 - Seminar in Statistical Consulting (3)
CSCI 598 - Engineering Writing and Communication (1) AND*
ENGR 596 - Engineering Internship (1, max 3)
CSCI 590 - Directed Research (1-4, max 4)
CSCI 591 - Computer Science Research Colloquium (1, max 2)
*CSCI 598 must be taken BEFORE a student can be approved for ENGR 596.

Curriculum - NYU Center for Data Science

NYU Center for Data Science

http://cds.nyu.edu/academics/ms-in-data-science/curriculum/required-courses/

Intro to the MS in DS at NYU video.

DS-GA-1001 Intro to Data Science 3
DS-GA-1002 Statistical and Mathematical Methods for Data Science 3
Data Science Elective 1

DS-GA-1003 Machine Learning and Computational Statistics 3
DS-GA-1004 Big Data 3
Data Science Elective 2 3

DS-GA-1005 Inference and Representation 3
DS-GA-1006 Capstone Project in Data Science 3
Data Science Elective 3 3

Data Science Elective 4 3
Data Science Elective 5 3
Data Science Elective 6 3

Indiana Univ. Bloomington

http://www.soic.indiana.edu/graduate/degrees/data-science/ms-data-science/ms-requirements.html

Sample residential curriculums
Every course may not yet be offered, and a selection of these course are currently offered online.

Example decision-maker curriculum

Year 1 Semester 1:
I590: Topics in Informatics: Big Data Applications and Analytics
I590: Topics in Informatics: Management, Access, and Use of Big and Complex Data
STAT S520 Introduction to Statistics

Year 1 Semester 2:
B661: Database Theory and System Design
Z637: Information Visualization
B669: Scientific Data Management and Preservation

Year 1 Summer:
Z605: Internship in Library and Information Science

Year 2 Semester 1:
Z604: Data Curation
I525: Organizational Informatics and Economics of Security
I590: Topics in Informatics: Big Data Open Source Software and Projects

Example technical curriculum

Year 1 Semester 1:
B503: Analysis of Algorithm;
B561: Advanced Database Concepts
S520: Introduction to Statistics

Year 1 Semester 2:
B649: Cloud Computing
Z534: Search
B555: Machine Learning

Year 1 Summer:
Z605: Internship in Library and Information Science

Year 2 Semester 3:
B565: Data Mining
I520: Security For Networked Systems
Z637: Information Visualization

Example: Computational and Analytic Data Science Track

B649: Cloud Computing
I590: Topics: Projects on Big Data Software
I590: Topics: Data Science for Drug Discover
I590: Topics: Perspectives in Data Science
Z636: Data Semantics
Z637: Information Visualization

Univ. of Virginia

Course requirements for the MSDS program

Summer Term (approximately 6 weeks, starting mid-July):

  • CS 5010: Programming and Systems for Data Science
  • STAT 6430: Statistical Computing for Data Science

Fall Term:

  • STAT 6021: Linear Models for Data Science
  • CS 5012: Foundations of Computer Science
  • SYS 6018: Data Mining
  • DS 6001: Practice and Application of Data Science
  • DS 6002a: Ethics of Big Data
  • DS 6003a: Capstone Project

Spring Term:

  • SYS 6016: Machine Learning
  • DS 6002b: Ethics of Big Data
  • DS 6003b: Capstone Project
  • Elective 1
  • Elective 2

Selection of elective courses is done in consultation with the program director. There are a variety of possible electives available, including the following:

  • CS 6501: Special Topics in Computer Science
  • CE 6400: Traffic Operations
  • STAT 6130: Applied Multivariate Statistics
  • STAT 5390: Exploratory Data Analysis
  • SYS 6001: Introduction to Systems Engineering
  • SYS 6003: Optimization I
  • SYS 6005: Stochastic Systems I
  • CS 6750: Database Systems
  • STAT 5170: Applied Time Series
  • STAT 5340: Bootstrap and Other Resampling Methods
  • MATH 5110: Stochastic Processes

Minnesota

Statistics Track 6
Algorithmics Track 6
Infrastructure Track 6
Elective Credits 6
Capstone Credits 6
Colloquium Credits 6


2 sem design

CSCI 5523 - Introduction to Data Mining 3
CSCI 5707 - Principles of Database Systems 3
STAT 5302 - Applied Regression Analysis 3
Capstone Project (First Half) 3

CSCI 5451 - Introduction to Parallel Computing: Architectures, Algorithms, and Programming 3
EE 5239 - Introduction to Nonlinear Optimization 3
STAT 5401 - Applied Multivariate Methods 3
Elective 3
Capstone Project (Second Half) 3

The Open Source Data Science Curriculum

http://datasciencemasters.org
Foundation

  • Intro to Data Science UW / Coursera:
  • Data Science / Harvard Video Archive & Course:
  • Data Science with Open Source Tools Book $27:

Math

  • Linear Algebra & Programming
  • Statistics
  • Differential Equations & Calculus
  • Problem Solving (Problem-Solving Heuristics “How To Solve It”)

Computing

  • Algorithms
  • Distributed Computing Paradigms
  • Databases
  • Data Mining
  • Machine Learning
  • Probabilistic Modeling
  • Deep Learning (Neural Networks)
  • Social Network & Graph Analysis
  • Natural Language Processing
  • Analysis

Data Design

  • Visualization
  • Data Journalism

Python (Libraries)

  • Data Structures & Analysis Packages
  • Machine Learning Packages
  • Networks Packages
  • Statistical Packages
  • Natural Language Processing & Understanding
  • Live Data Packages
  • Visualization Packages
  • iPython Data Science Notebooks

Data Science as a Profession
Capstone Project

Udacity

data_science_curriculum.1458269341.txt.gz · Last modified: 2016/03/18 11:19 by hkimscil

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki