<?xml version="1.0" encoding="utf-8"?>
<?xml-stylesheet type="text/xsl" href="../assets/xml/rss.xsl" media="all"?><rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Asymptotic Labs (Posts about sparsity)</title><link>http://asymptoticlabs.com/</link><description></description><atom:link href="http://asymptoticlabs.com/categories/sparsity.xml" rel="self" type="application/rss+xml"></atom:link><language>en</language><copyright>Contents © 2022 &lt;a href="mailto:quidditymaster@gmail.com"&gt;Tim Anderton&lt;/a&gt; </copyright><lastBuildDate>Wed, 31 Aug 2022 21:28:20 GMT</lastBuildDate><generator>Nikola (getnikola.com)</generator><docs>http://blogs.law.harvard.edu/tech/rss</docs><item><title>Low Rank Approximation On Sparsely Observed Data</title><link>http://asymptoticlabs.com/posts/slra_sparse_obs.html</link><dc:creator>Tim Anderton</dc:creator><description>&lt;div tabindex="-1" id="notebook" class="border-box-sizing"&gt;
    &lt;div class="container" id="notebook-container"&gt;

&lt;div class="cell border-box-sizing text_cell rendered"&gt;&lt;div class="prompt input_prompt"&gt;
&lt;/div&gt;
&lt;div class="inner_cell"&gt;
&lt;div class="text_cell_render border-box-sizing rendered_html"&gt;
&lt;h2 id="Intermezzo:-Sparsely-Observed-Data"&gt;Intermezzo: Sparsely Observed Data&lt;a class="anchor-link" href="http://asymptoticlabs.com/posts/slra_sparse_obs.html#Intermezzo:-Sparsely-Observed-Data"&gt;¶&lt;/a&gt;&lt;/h2&gt;&lt;p&gt;In the post on &lt;a href="http://asymptoticlabs.com/posts/other_use_for_PCA_part2.html"&gt;using PCA for data imputation&lt;/a&gt; we used a weight for each of our data points. By assigning a weight of 0 to missing data and a weight of 1 to the rest of our data we managed to be able to get a reasonably good approximation to what we would find using PCA on the dataset without any data missing.&lt;/p&gt;
&lt;p&gt;This is fine when evaluating a dense model for our data matrix is not too much computational overhead. However when our input data are sparsely observed, that is to say most of our data consists of missing values then evaluating the model densely is a tremendous waste of computational resources. &lt;/p&gt;&lt;p&gt;&lt;a href="http://asymptoticlabs.com/posts/slra_sparse_obs.html"&gt;Read more…&lt;/a&gt; (23 min remaining to read)&lt;/p&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/div&gt;&lt;/body&gt;&lt;/html&gt;
</description><category>machine learning</category><category>mathjax</category><category>PCA</category><category>reccommender systems</category><category>sparsity</category><guid>http://asymptoticlabs.com/posts/slra_sparse_obs.html</guid><pubDate>Thu, 26 Oct 2017 06:00:00 GMT</pubDate></item></channel></rss>