100年第1學期-4849 高維度資料分析 課程資訊

評分方式

評分項目 配分比例 說明
作業 30
學期研究報告(Term paper) 40 大型資料分析或程式建立(software-development)。
期末考 30

選課分析

本課程名額為 70人,已有6 人選讀,尚餘名額64人。


登入後可進行最愛課程追蹤 [按此登入]。

授課教師

呂恒輝

教育目標

本課程介紹多個創新多變量統計方法(例如: 反切迴歸、Principal Hessian Directions與MAVE等),用以達到資料縮減(data reduction)的目的。

課程概述

The reduction of dimension is an issue that can arise in every scientific field. Generally speaking, the difficulty lies on how to visualize a high dimensional function or data set. People often ask: How do they look?, What structures are there?, What model should be used? Aside from the differences that underlie the various scientific contexts, such kinds of questions do have a common root in Statistics. This is the driving force for the study of high dimensional data analysis. This course will discuss several statistical methodologies useful for exploring voluminous data. They include principal component analysis, clustering and classification, survival analysis and other recent developed sufficient dimension reduction (SDR) methods. Sliced inverse regression (SIR) and principal Hessian direction (PHD) are two novel SDR methods, useful for the extraction of geometric information underlying noisy data of several dimensions. The theories of several SDR methods will be discussed in depth. They will be used as the backbone for the entire course. Examples from various application areas will be given. They include social/economic problems like unemployment rates, biostatistics problems like clinic trials with censoring, machine learning problems like handwritten digital recognition, biomedical problems like functional Magnet Resonance Imaging, and bioinformatics problems like micro-array gene expression etc.

課程資訊

參考書目

No textbook. Lecture notes and selected papers will be available.

開課紀錄

您可查詢過去本課程開課紀錄。 高維度資料分析歷史開課紀錄查詢