Candidates vs. Noises Estimation for Large Multi-Class Classification Problem

48 mins 27 secs,  705.31 MB,  MPEG-4 Video  640x360,  29.97 fps,  44100 Hz,  1.94 Mbits/sec
Share this media item:
Embed this media item:


About this item
Image inherited from collection
Description: Zhang, T
Thursday 28th June 2018 - 09:00 to 09:45
 
Created: 2018-06-29 12:53
Collection: Statistical scalability
Publisher: Isaac Newton Institute
Copyright: Zhang, T
Language: eng (English)
Distribution: World     (downloadable)
Explicit content: No
Aspect Ratio: 16:9
Screencast: No
Bumper: UCS Default
Trailer: UCS Default
 
Abstract: In practice, there has been sigificant interest in multi-class classification problems where the number of classes is large. Computationally such applications require statistical methods with run time sublinear in the number of classes. A number of methods such as Noise-Contrastive Estimation (NCE) and variations have been proposed in recent years to address this problem. However, the existing methods are not statistically efficient compared to multi-class logistic regression, which is the maximum likelihood estimate. In this talk, I will describe a new method called Candidate v.s. Noises Estimation (CANE) that selects a small subset of candidate classes and samples the remaining classes. We show that CANE is always consistent and computationally efficient. Moreover, the resulting estimator has low statistical variance approaching that of the maximum likelihood estimator, when the observed label belongs to the selected candidates with high probability. Extensive experimental results show that CANE achieves better prediction accuracy over a number of the state-of-the-art tree classifiers, while it gains significant speedup compared to standard multi-class logistic regression.
Available Formats
Format Quality Bitrate Size
MPEG-4 Video * 640x360    1.94 Mbits/sec 705.31 MB View Download
WebM 640x360    458.69 kbits/sec 162.83 MB View Download
iPod Video 480x270    522.25 kbits/sec 185.33 MB View Download
MP3 44100 Hz 249.8 kbits/sec 88.74 MB Listen Download
Auto (Allows browser to choose a format it supports)