<br />
<b>Deprecated</b>:  Function create_function() is deprecated in <b>/home/hidefide/public_html/blog/wp-content/plugins/wr-pagebuilder/core/core.php</b> on line <b>127</b><br />
{"id":14,"date":"2016-03-20T17:30:05","date_gmt":"2016-03-20T17:30:05","guid":{"rendered":"https:\/\/icaruspressblog.wordpress.com\/?p=14"},"modified":"2019-04-17T12:28:44","modified_gmt":"2019-04-17T12:28:44","slug":"blekko-identifies-over-a-million-domains-as-spam","status":"publish","type":"post","link":"https:\/\/hidefideas.com\/blog\/2016\/03\/20\/blekko-identifies-over-a-million-domains-as-spam\/","title":{"rendered":"Blekko Identifies Over a Million Domains as Spam"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone wp-image-15\" style=\"float: right; margin-left: 10px;\" src=\"http:\/\/hidefideas.com\/blog\/wp-content\/uploads\/2016\/03\/spam2_2.jpg\" alt=\"Spam2_2\" width=\"384\" height=\"256\" srcset=\"https:\/\/hidefideas.com\/blog\/wp-content\/uploads\/2016\/03\/spam2_2.jpg 600w, https:\/\/hidefideas.com\/blog\/wp-content\/uploads\/2016\/03\/spam2_2-300x200.jpg 300w, https:\/\/hidefideas.com\/blog\/wp-content\/uploads\/2016\/03\/spam2_2-225x150.jpg 225w\" sizes=\"auto, (max-width: 384px) 100vw, 384px\" \/>In what appears to be a hot and nasty brawl brewing between David and Goliath, a tale of two search engines is getting significant press about respective plans to fight spam by removing spam-laden sites from search results. It has all the trappings of a prize-fight: in this corner, the search engine behemoth Google, weighing in at several billion dollars; in that corner Blekko, the relatively unknown challenger, <!--more-->new to the scene but poised to take on its opponent with what Blekko calls, \u201cthe first search algorithm ever created to find spam rather than rank results.\u201d<\/p>\n<p>Blekko, the nascent search engine that launched last November, <a href=\"http:\/\/blog.blekko.com\/2011\/03\/09\/an-algorithm-for-finding-killing-spam\/\">announced last week<\/a> that it has identified over a million web domains as spam and blocked them from its search results. Utilizing a technology that Blekko calls its AdSpam algorithm, the move could have tremendous implications, at least for the users of Blekko, which reports a million queries a day and about a half million users each month. Rather than adopting Google\u2019s method of lowering the rank of suspicious sites in its search results, AdSpam instead takes a scorched earth policy by identifying sites that are laden with ads and light on content, and blocking them altogether<\/p>\n<p>The move of blocking 1.1 million domains has the direct effect of removing potentially hundreds of millions of spam pages, an achievement of which Blekko CEO Rich Skrenta is tremendously proud. \u201cDomains with low quality content plus keyword ads are \u2018machines that print money,\u2019 Skrenta has been <a href=\"http:\/\/bits.blogs.nytimes.com\/2011\/03\/09\/blekko-blocks-1-1-million-web-sites-from-its-search-engine\/\">quoted<\/a>. \u201cIf you make a machine to print money, people will exploit it.\u201d<\/p>\n<p>According to <a href=\"http:\/\/blog.blekko.com\/2011\/03\/09\/an-algorithm-for-finding-killing-spam\/\">Blekko<\/a>, AdSpam is \u201ca machine-learning algorithm that examines pages for a specific spam signals \u2014 the presence of multiple display ad positions on a single page and thin to zero content. Unlike algorithms used by other search engines, AdSpam is being used in conjunction with human curation to detect [Spam and] continue the War on Spam.\u201d<\/p>\n<p>What makes Blekko unique is its search method utilizing <a href=\"http:\/\/blekko.com\/ws\/+\/about\">slashtags<\/a> to pinpoint search and minimize spam results. By targeting content farms that push spam (like eHow.com and answerbag.com), Blekko has managed to provide what it feels is the path to \u201cbetter search results\u2026by using an algorithm that was created to kill spam, not just crawl it.\u201d<\/p>\n<p>This latest development is just another foray in a war that both Google and Blekko have committed to fighting. \u201cIn the past, our efforts to clean-up search have included our partnership with the Stack Overflow community,\u201d states the <a href=\"http:\/\/blog.blekko.com\/2011\/03\/09\/an-algorithm-for-finding-killing-spam\/\">Blekko blog<\/a>, \u201cand our public banning of the <a href=\"http:\/\/www.webpronews.com\/blekko-bans-ehow-and-other-content-farms-2011-02\">top 20 sites<\/a> most users marked as spam at Blekko.\u201d What remains to be seen is what both engines have up their sleeves. According to <a href=\"http:\/\/bits.blogs.nytimes.com\/2011\/03\/09\/blekko-blocks-1-1-million-web-sites-from-its-search-engine\/\">The New York Times<\/a>, Skrenta hasn\u2019t been squeamish about calling out Google. \u201cGoogle didn\u2019t actually take anyone out, they just reshuffled the deck. Instead of demoting these sites to No. 5 or No. 7, we\u2019re just throwing them out.\u201d<\/p>\n<p>It should be stressed that Googoliath hasn\u2019t exactly been sitting on its hands. In the past several months there has been a public backlash on the deteriorating quality of Google\u2019s search results. The company has responded with series of remedies, including updated search algorithms and the penalizing of low quality sites like content farms. In fact, <a href=\"http:\/\/unplugged.rcrwireless.com\/index.php\/20110310\/news\/7418\/googles-newest-anti-spam-weapon-full-site-blocking\/\">RCR Unplugged<\/a> reports that the recent \u2018Panda\u2019 update to Google\u2019s algorithm caused such a swing in page rankings that how-to site Mahalo had to lay off staff almost immediately.<\/p>\n<p><a href=\"http:\/\/googleblog.blogspot.com\/2011\/03\/hide-sites-to-find-more-of-what-you.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+blogspot%2FMKuf+%28Official+Google+Blog%29&amp;utm_content=Google+Reader\">Most recently<\/a>, Google reports that it\u2019s adding a \u2018Block All Results\u2019 option that will live right next to the \u2018Cached\u2019 and \u2018Similar\u2019 buttons, so that users can choose to weed out the spam that seems to have mastered the art of worming its way into the top ranks of Google. Even though Google\u2019s blog talks about this functionality as if it\u2019s already here, there\u2019s no indication if or when it will become active \u2013 a search performed while this article was written revealed no \u2018Block\u2019 link \u2013 please feel free to leave a comment if you\u2019ve seen it in the wild.<\/p>\n<p>Admittedly, there are inherent problems with Google\u2019s proposed solutions. First, it\u2019s difficult to identify an entire site as spam simply from search results. Also, sites like Mahalo will suffer from algorithms that box up their criteria in a way that may misidentify legitimate sites. For example, Google\u2019s new algorithms are based on the consistency of content \u2013 a site that focuses on one topic, like healthcare, will probably not be flagged whereas a generalized site with content based on a variety of topics may suffer the wrath of the giant G. By its own admission, Google states \u201cgenerally low quality\u201d of content as a reason to block something. For sites which rely on user-generated content by nonprofessional writers, this could end up being a troubling trend.<\/p>\n<p>So who has the right formula? Bing and Yahoo haven\u2019t really entered the fray as of yet, perhaps waiting to see what Google has to say on the whole matter of spam sites. Blekko, on the other hand, has chosen to lead and not to follow, a move that could greatly benefit the company as searchers seek alternatives to the millions of results being passed back to the end user. By being proactive, they certainly seem to be taking the war to the content farms and the unending battle between search engine and spam.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>In what appears to be a hot and nasty brawl brewing between David and Goliath, a tale of two search engines is getting significant press&hellip; <\/p>\n","protected":false},"author":3,"featured_media":15,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[13,14],"tags":[11,9,10,8,7],"class_list":["post-14","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-security","category-spam","tag-allspammedup","tag-bot","tag-botnet","tag-malware","tag-spam","jsn-master"],"_links":{"self":[{"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/posts\/14","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/comments?post=14"}],"version-history":[{"count":3,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/posts\/14\/revisions"}],"predecessor-version":[{"id":1865,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/posts\/14\/revisions\/1865"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/media\/15"}],"wp:attachment":[{"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/media?parent=14"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/categories?post=14"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/hidefideas.com\/blog\/wp-json\/wp\/v2\/tags?post=14"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}