BahVideo.com
Sample Complexity of Testing the Manifold Hypothesis

The hypothesis that high-dimensional data tends to lie in the vicinity of a low-dimensional manifold is the basis of a collection of methodologies termed Manifold Learning. In this paper, we study statistical aspects of the question of fitting a manifold with a nearly optimal least squared error. Given upper bounds on the dimension, volume, and curvature, we show that Empirical Risk Minimization can produce a nearly optimal manifold using a number of random samples that is independent of the ambient dimension of the space in which the data lie. We obtain an upper bound on the required number of samples that depends polynomially on the curvature, exponentially on the intrinsic dimension, and linearly on the intrinsic volume. For constant error, we prove a matching minimax lower bound on the sample complexity, which shows that this dependence on intrinsic dimension, volume, and curvature is unavoidable. Whether the known lower bound for the sample complexity of Empirical Risk Minimization on k-means, applied to data in a unit ball of arbitrary dimension, is tight has been an open question since 1997. Here ε is the desired bound on the error and δ is a bound on the probability of failure. We improve the best currently known upper bound. Based on these results, we devise a simple algorithm for k-means and another that uses a family of convex programs to fit a piecewise linear curve of a specified length to high-dimensional data, where the sample complexity is independent of the ambient dimension.
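To make the k-means setting concrete, here is a minimal sketch of the empirical risk (the mean squared distance from each sample to its nearest center) on data that lie on a one-dimensional curve inside a higher-dimensional unit ball. It is an illustration only, not the algorithm from the paper: Lloyd's heuristic stands in for exact Empirical Risk Minimization, and the names `sample_circle`, `lloyd`, and `empirical_risk`, along with the choice of a circle of radius 0.5, are assumptions made for this example.

```python
import math
import random


def sq_dist(x, y):
    """Squared Euclidean distance between two points given as lists."""
    return sum((a - b) ** 2 for a, b in zip(x, y))


def empirical_risk(points, centers):
    """Mean squared distance from each point to its nearest center."""
    return sum(min(sq_dist(p, c) for c in centers) for p in points) / len(points)


def lloyd(points, k, iters=20, seed=0):
    """Lloyd's heuristic: a local search that never increases the empirical risk.

    Assumption: used here as a stand-in for exact ERM, which it does not guarantee.
    """
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # start from k distinct data points
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: sq_dist(p, centers[i]))
            clusters[nearest].append(p)
        for j, cluster in enumerate(clusters):
            if cluster:  # keep the old center if a cluster is empty
                centers[j] = [sum(col) / len(cluster) for col in zip(*cluster)]
    return centers


def sample_circle(n, ambient_dim, seed=0):
    """Hypothetical data: n points on a circle of radius 0.5 embedded in R^ambient_dim."""
    rng = random.Random(seed)
    points = []
    for _ in range(n):
        t = rng.uniform(0.0, 2.0 * math.pi)
        points.append([0.5 * math.cos(t), 0.5 * math.sin(t)] + [0.0] * (ambient_dim - 2))
    return points


if __name__ == "__main__":
    for dim in (5, 50):
        data = sample_circle(200, dim)
        fitted = lloyd(data, k=4)
        print(dim, empirical_risk(data, fitted))
```

Because the samples occupy the same circle regardless of `ambient_dim`, the fitted risk in this toy setup does not change with the ambient dimension. That is only a trivial special case of the dimension-independence the abstract describes, not evidence for the general result.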
Channel: VideoLectures
Date Found: March 26, 2011
Category: Educational
Date Produced: March 25, 2011