Difference between revisions of "Search engine optimization" - New World Encyclopedia

From New World Encyclopedia
 
(21 intermediate revisions by 3 users not shown)
Line 10: Line 10:
  
 
==History==
 
 +
[[Webmaster]]s and content providers began optimizing sites for search engines in the mid-1990s, as the first search engines were cataloging the early [[World Wide Web|Web]]. Initially, all a webmaster needed to do was submit a page, or [[Uniform Resource Locator|URL]], to the various engines, which would send a [[Web crawler|spider]] to "crawl" that page, extract links to other pages from it, and return information found on the page to be [[Index (search engine)|indexed]]. The process involves a search engine spider downloading a page and storing it on the search engine's own server, where a second program, known as an [[search engine indexing|indexer]], extracts various information about the page, such as the words it contains and where they are located, any weight given to specific words, and all links the page contains, which are then placed into a scheduler for crawling at a later date.
  
[[Webmaster]]s and content providers began optimizing sites for search engines in the mid-1990s, as the first search engines were cataloging the early [[World Wide Web|Web]]. Initially, all a webmaster needed to do was submit a page, or [[Uniform Resource Locator|URL]], to the various engines which would send a [[Web crawler|spider]] to "crawl" that page, extract links to other pages from it, and return information found on the page to be [[Index (search engine)|indexed]].<ref>Brian Pinkerton, [http://www.webir.org/resources/phd/pinkerton_2000.pdf Finding What People Want: Experiences with the WebCrawler]. 2007-05-07 ''The Second International WWW Conference Chicago, USA, October 17–20, 1994''. Retrieved November 15, 2008.</ref> The process involves a search engine spider downloading a page and storing it on the search engine's own server, where a second program, known as an [[search engine indexing|indexer]], extracts various information about the page, such as the words it contains and where these are located, as well as any weight for specific words, as well as any and all links the page contains, which are then placed into a scheduler for crawling at a later date.
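The crawl-and-index process described above can be illustrated with a minimal Python sketch. It is an illustration only, not the design of any particular search engine: the names <code>PageParser</code>, <code>index_page</code>, <code>index</code>, and <code>schedule</code> are hypothetical, the page is supplied as a stored string rather than downloaded, and word weighting is omitted.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageParser(HTMLParser):
    """Collects the words (in document order) and outgoing links of one page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.words = []  # lowercased words in the order they appear
        self.links = []  # absolute URLs taken from <a href="...">

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))

    def handle_data(self, data):
        self.words.extend(word.lower() for word in data.split())


def index_page(url, html, index, schedule):
    """Indexer step: record where each word occurs, then queue the page's links."""
    parser = PageParser(url)
    parser.feed(html)
    for position, word in enumerate(parser.words):
        index.setdefault(word, []).append((url, position))
    schedule.extend(parser.links)  # to be crawled at a later date


index = {}          # word -> list of (url, position) pairs
schedule = deque()  # URLs waiting to be crawled

# Stands in for a page the spider has already downloaded and stored.
stored_page = '<p>Early web search</p><a href="/about.html">About search</a>'
index_page("http://example.com/", stored_page, index, schedule)
```

After the call, <code>index</code> maps each word to the pages and positions where it occurs, and <code>schedule</code> holds the discovered link for a later crawl.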
+
Site owners started to recognize the value of having their sites highly ranked and visible in search engine results, creating an opportunity for both [[white hat]] and [[black hat|black hat]] SEO practitioners. According to industry analyst [[Danny Sullivan (technologist)|Danny Sullivan]], the earliest known use of the phrase ''search engine optimization'' was in 1997.<ref>Danny Sullivan, [https://web.archive.org/web/20100423051708/http://forums.searchenginewatch.com/showpost.php?p=2119 Who Invented the Term "Search Engine Optimization"?]. ''Search Engine Watch'', June 14, 2004. See [http://groups.google.com/group/alt.current-events.net-abuse.spam/browse_thread/thread/6fee2777dc17b8ab/3858bff94e56aff3?lnk=st&q=%22search+engine+optimization%22&rnum=1#3858bff94e56aff3 Google groups thread]. Retrieved March 27, 2020.</ref>
  
Site owners started to recognize the value of having their sites highly ranked and visible in search engine results, creating an opportunity for both [[white hat]] and [[black hat|black hat]] SEO practitioners. According to industry analyst [[Danny Sullivan (technologist)|Danny Sullivan]], the earliest known use of the phrase ''search engine optimization'' was a spam message posted on Usenet on July 26, 1997.<ref>Danny Sullivan, [http://forums.searchenginewatch.com/showpost.php?p=2119&postcount=10 Who Invented the Term "Search Engine Optimization"?]. [[Search Engine Watch]], June 14, 2004. 2007-05-14; See [http://groups.google.com/group/alt.current-events.net-abuse.spam/browse_thread/thread/6fee2777dc17b8ab/3858bff94e56aff3?lnk=st&q=%22search+engine+optimization%22&rnum=1#3858bff94e56aff3 Google groups thread]. Retrieved November 15, 2008.</ref>
+
Early versions of search [[algorithm]]s relied on webmaster-provided information such as the keyword [[meta tag]], or index files in engines like [[Aliweb|ALIWEB]]. Meta tags provided a guide to each page's content. But using metadata to index pages was found to be less than reliable because the keywords a webmaster listed in the meta tag were often not truly relevant to the site's actual content. Inaccurate, incomplete, and inconsistent data in meta tags caused pages to rank for irrelevant searches. Web content providers also manipulated a number of attributes within the HTML source of a page in an attempt to rank well in search engines.<ref>Glen Pringle, Lloyd Allison, and David L. Dowe, [http://www.csse.monash.edu.au/~lloyd/tilde/InterNet/Search/1998_WWW7.html What is a tall poppy among web pages?]. ''Proc. 7th Int. World Wide Web Conference'', Brisbane, April 1998. Retrieved March 27, 2020.</ref>
 
 
Early versions of search [[algorithm]]s relied on webmaster-provided information such as the keyword [[meta tag]], or index files in engines like [[Aliweb|ALIWEB]]. Meta tags provided a guide to each page's content. But using meta data to index pages was found to be less than reliable because the webmaster's account of keywords in the meta tag were not truly relevant to the site's actual keywords. Inaccurate, incomplete, and inconsistent data in meta tags caused pages to rank for irrelevant searches.<ref>[[Cory Doctorow]], [http://www.e-learningguru.com/articles/metacrap.htm|title=Metacrap: Putting the torch to seven straw-men of the meta-utopia]. August 26, 2001, ''e-LearningGuru'' 2007-05-08. Retrieved November 15, 2008.</ref> Web content providers also manipulated a number of attributes within the HTML source of a page in an attempt to rank well in search engines.<ref>G. Pringle, L. Allison, and D. Dowe, [http://www.csse.monash.edu.au/~lloyd/tilde/InterNet/Search/1998_WWW7.html What is a tall poppy among web pages?]. (April 1998) ''Proc. 7th Int. World Wide Web Conference''. 2007-05-08.  Retrieved November 15, 2008.</ref>
 
  
 
By relying so much on factors exclusively within a webmaster's control, early search engines suffered from abuse and ranking manipulation. To provide better results to their users, search engines had to adapt to ensure their [[SERP|results pages]] showed the most relevant search results, rather than unrelated pages stuffed with numerous keywords by unscrupulous webmasters. Because the success and popularity of a search engine are determined by its ability to produce the most relevant results for any given search, allowing poor or irrelevant results would drive users to other search sources. Search engines responded by developing more complex ranking [[algorithm]]s, taking into account additional factors that were more difficult for webmasters to manipulate.
 
  
While graduate students at [[Stanford University]], [[Larry Page]] and [[Sergey Brin]] developed "backrub," a search engine that relied on a mathematical [[algorithm]] to rate the prominence of web pages. The number calculated by the algorithm, [[PageRank]], is a function of the quantity and strength of [[inbound link]]s.<ref name="lgscalehyptxt">Sergey Brin, and Larry Page, [http://www-db.stanford.edu/~backrub/google.html The Anatomy of a Large-Scale Hypertextual Web Search Engine] (Proceedings of the seventh international conference on World Wide Web. 1998), 107–117. 2007-05-08 Retrieved November 15, 2008.</ref> PageRank estimates the likelihood that a given page will be reached by a web user who randomly surfs the web, and follows links from one page to another. In effect, this means that some links are stronger than others, as a higher PageRank page is more likely to be reached by the random surfer.
+
While graduate students at [[Stanford University]], [[Larry Page]] and [[Sergey Brin]] developed "backrub," a search engine that relied on a mathematical [[algorithm]] to rate the prominence of web pages. The number calculated by the algorithm, [[PageRank]], is a function of the quantity and strength of [[inbound link]]s.<ref>Sergey Brin and Larry Page, [http://www-db.stanford.edu/~backrub/google.html The Anatomy of a Large-Scale Hypertextual Web Search Engine] Proceedings of the seventh international conference on World Wide Web, 1998. Retrieved March 27, 2020.</ref> PageRank estimates the likelihood that a given page will be reached by a web user who randomly surfs the web, and follows links from one page to another. In effect, this means that some links are stronger than others, as a higher PageRank page is more likely to be reached by the random surfer.
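The random-surfer idea can be sketched as a short power iteration in Python. This is a simplified illustration rather than Google's production algorithm: the function name <code>pagerank</code>, the four-page example graph, and the iteration count are assumptions, the damping factor of 0.85 follows the value commonly quoted from the Brin and Page paper, and the ranks are normalized to sum to 1.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Estimate PageRank for a graph given as {page: [pages it links to]}."""
    pages = list(links)
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}  # start from a uniform guess

    for _ in range(iterations):
        # With probability (1 - damping) the surfer jumps to a random page.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page, outlinks in links.items():
            if outlinks:
                # Otherwise the surfer follows one of the page's links at random.
                share = damping * rank[page] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its rank evenly.
                for target in pages:
                    new_rank[target] += damping * rank[page] / n
        rank = new_rank
    return rank


# Hypothetical graph: "c" receives links from three pages, "d" from none.
graph = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(graph)
```

In this graph, page <code>c</code> ends up with the highest score because it has the most inbound links, and a link from <code>c</code> passes more rank to <code>a</code> than a link from the unlinked page <code>d</code> would.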
[[Image:Google_Campus2_cropped.jpg|right|thumb|250px|[[Googleplex|Google headquarters]]]]
+
[[Image:Google_Campus2_cropped.jpg|right|thumb|300px|[[Googleplex|Google headquarters]]]]
Page and Brin founded [[Google]] in 1998. Google attracted a loyal following among the growing number of Internet users, who liked its simple design.<ref name="bbc-1">Bill Thompson, December 19, 2003, [http://news.bbc.co.uk/1/hi/technology/3334531.stm Is Google good for you?]. [[BBC News]], Retrieved November 15, 2008.</ref> Off-page factors (such as PageRank and hyperlink analysis) were considered as well as on-page factors (such as keyword frequency, [[meta tags]], headings, links and site structure) to enable Google to avoid the kind of manipulation seen in search engines that only considered on-page factors for their rankings. Although PageRank was more difficult to game, webmasters had already developed link building tools and schemes to influence the [[Inktomi]] search engine, and these methods proved similarly applicable to gaining PageRank. Many sites focused on exchanging, buying, and selling links, often on a massive scale. Some of these schemes, or [[link farm]]s, involved the creation of thousands of sites for the sole purpose of [[spamdexing|link spamming]].<ref>Zoltan Gyongyi and Hector Garcia-Molina|url=[http://infolab.stanford.edu/~zoltan/publications/gyongyi2005link.pdf]. PDF Link Spam Alliances]. Proceedings of the 31st VLDB Conference, Trondheim, Norway. 2005. Retrieved November 15, 2008.</ref> In recent years major search engines have begun to rely more heavily on [[off-web factors]] such as the age, sex, location, and search history of people conducting searches in order to further refine results.
+
Page and Brin founded [[Google]] in 1998. Google attracted a loyal following among the growing number of Internet users, who liked its simple design.<ref>Bill Thompson, [http://news.bbc.co.uk/1/hi/technology/3334531.stm Is Google good for you?] ''BBC News'', December 19, 2003. Retrieved March 27, 2020.</ref> Off-page factors (such as PageRank and hyperlink analysis) were considered as well as on-page factors (such as keyword frequency, [[meta tags]], headings, links and site structure) to enable Google to avoid the kind of manipulation seen in search engines that only considered on-page factors for their rankings. Although PageRank was more difficult to game, webmasters had already developed link building tools and schemes to influence the [[Inktomi]] search engine, and these methods proved similarly applicable to gaining PageRank. Many sites focused on exchanging, buying, and selling links, often on a massive scale. Some of these schemes, or [[link farm]]s, involved the creation of thousands of sites for the sole purpose of [[spamdexing|link spamming]].<ref>Zoltan Gyongyi and Hector Garcia-Molina, [http://infolab.stanford.edu/~zoltan/publications/gyongyi2005link.pdf Link Spam Alliances]. ''Proceedings of the 31st VLDB Conference'', Trondheim, Norway. 2005. Retrieved March 27, 2020.</ref> In recent years major search engines have begun to rely more heavily on [[off-web factors]] such as the age, sex, location, and search history of people conducting searches in order to further refine results.
  
By 2007, search engines had incorporated a wide range of undisclosed factors in their ranking algorithms to reduce the impact of link manipulation. Google says it ranks sites using more than 200 different signals.<ref name="nyt0607">2007-06-06[http://www.nytimes.com/2007/06/03/business/yourmoney/03google.html Google Keeps Tweaking Its Search Engine]. [[New York Times]] June 3, 2007}} Retrieved November 15, 2008.</ref> The three leading search engines, Google, [[Yahoo]] and [[Microsoft]]'s [[Live Search]], do not disclose the algorithms they use to rank pages. Notable SEOs, such as Rand Fishkin, [[Barry Schwartz (technologist)|Barry Schwartz]], [[Aaron Wall]] and [[Jill Whalen]], have studied different approaches to search engine optimization, and have published their opinions in online forums and blogs.<ref>[[Danny Sullivan (technologist)|Danny Sullivan]] [http://blog.searchenginewatch.com/blog/050929-072711 Rundown On Search Ranking Factors]. [[Search Engine Watch]] September 29, 2005  Retrieved November 15, 2008.</ref><ref>[http://www.seomoz.org/article/search-ranking-factors Search Engine Ranking Factors V2] ''SEOmoz.org'', April 2, 2007. Retrieved November 15, 2008.</ref>  SEO practitioners may also study patents held by various search engines to gain insight into the algorithms.<ref>Christine Churchill [http://searchenginewatch.com/showPage.html?page=3564261 Understanding Search Engine Patents]. [[Search Engine Watch]] November 23, 2005. Retrieved November 15, 2008.</ref>
+
By 2007, search engines had incorporated a wide range of undisclosed factors in their ranking algorithms to reduce the impact of link manipulation. Google says it ranks sites using more than 200 different signals.<ref>Saul Hansell, [http://www.nytimes.com/2007/06/03/business/yourmoney/03google.html Google Keeps Tweaking Its Search Engine]. ''The New York Times'', June 3, 2007. Retrieved March 27, 2020.</ref> The three leading search engines, Google, [[Yahoo]] and [[Microsoft]]'s [[Live Search]], do not disclose the algorithms they use to rank pages. Notable SEOs, such as Rand Fishkin, [[Barry Schwartz (technologist)|Barry Schwartz]], [[Aaron Wall]] and [[Jill Whalen]], have studied different approaches to search engine optimization, and have published their opinions in online forums and blogs.<ref>[https://moz.com/search-ranking-factors Search Engine Ranking Factors 2015] ''Moz''. Retrieved March 27, 2020.</ref>
  
 
==Webmasters and search engines==
 
By 1997 search engines recognized that webmasters were making efforts to rank well in their search engines, and that some webmasters were even manipulating their rankings in search results by stuffing pages with excessive or irrelevant keywords. Early search engines, such as [[Infoseek]], adjusted their algorithms in an effort to prevent webmasters from manipulating rankings.<ref name="infoseeknyt">Laurie J. Flynn, November 11, 1996 [http://query.nytimes.com/gst/fullpage.html?res=940DE0DF123BF932A25752C1A960958260 Desperately Seeking Surfers]. [[New York Times]]. Retrieved November 15, 2008.</ref>
+
By 1997 search engines recognized that webmasters were making efforts to rank well in their search engines, and that some webmasters were even manipulating their rankings in search results by stuffing pages with excessive or irrelevant keywords. Early search engines, such as [[Infoseek]], adjusted their algorithms in an effort to prevent webmasters from manipulating rankings.<ref>Laurie J. Flynn, [http://query.nytimes.com/gst/fullpage.html?res=940DE0DF123BF932A25752C1A960958260 Desperately Seeking Surfers]. ''The New York Times'', November 11, 1996. Retrieved March 27, 2020.</ref>
  
Due to the high marketing value of targeted search results, there is potential for an adversarial relationship between search engines and SEOs. In 2005, an annual conference, AIRWeb, Adversarial Information Retrieval on the Web,<ref name="airweb">[http://airweb.cse.lehigh.edu/ AIRWeb]. Adversarial Information Retrieval on the Web, annual conference, 2007-05-09. Retrieved November 15, 2008.</ref> was created to discuss and minimize the damaging effects of aggressive web content providers.
+
Due to the high marketing value of targeted search results, there is potential for an adversarial relationship between search engines and SEOs. In 2005, an annual conference, AIRWeb, Adversarial Information Retrieval on the Web,<ref>[http://airweb.cse.lehigh.edu/ AIRWeb]. Adversarial Information Retrieval on the Web. Retrieved March 27, 2020.</ref> was created to discuss and minimize the damaging effects of aggressive web content providers.
  
SEO companies that employ overly aggressive techniques can get their client websites banned from the search results. In 2005, the [[Wall Street Journal]] reported on a company, [[Traffic Power]], which allegedly used high-risk techniques and failed to disclose those risks to its clients.<ref>''Wall Street Journal''[http://online.wsj.com/article/SB112714166978744925.html?apl=y&r=947596
+
SEO companies that employ overly aggressive techniques can get their client websites banned from the search results. In 2005, the ''[[Wall Street Journal]]'' reported on a company, [[Traffic Power]], which allegedly used high-risk techniques and failed to disclose those risks to its clients.<ref>David Kesmodel, [http://online.wsj.com/article/SB112714166978744925.html?apl=y&r=947596 Sites Get Dropped by Search Engines After Trying to 'Optimize' Rankings]. ''Wall Street Journal'', September 22, 2005. Retrieved March 27, 2020.</ref> Google's [[Matt Cutts]] later confirmed that Google did in fact ban Traffic Power and some of its clients.<ref>Matt Cutts, [http://www.mattcutts.com/blog/confirming-a-penalty/ Confirming a penalty]. ''Gadgets, Google, and SEO'', February 2, 2006. Retrieved March 27, 2020.</ref>
Sites Get Dropped by Search Engines After Trying to 'Optimize' Rankings]. David Kesmodel, September 22, 2005. Retrieved November 15, 2008.</ref> [[Wired Magazine|Wired]] magazine reported that the same company sued blogger [[Aaron Wall]] for writing about the ban.<ref name="wired09082005">Wired Magazine[http://www.wired.com/news/culture/0,1284,68799,00.html Legal Showdown in Search Fracas]. September 8, 2005. Adam L. Penenberg. Retrieved November 15, 2008.</ref> Google's [[Matt Cutts]] later confirmed that Google did in fact ban Traffic Power and some of its clients.<ref> ''mattcutts.com/blog'' [[Matt Cutts]][http://www.mattcutts.com/blog/confirming-a-penalty/ Confirming a penalty]. February 2, 2006. Retrieved November 15, 2008.</ref>
 
  
Some search engines have also reached out to the SEO industry, and are frequent sponsors and guests at SEO conferences, chats, and seminars. In fact, with the advent of paid inclusion, some search engines now have a vested interest in the health of the optimization community. Major search engines provide information and guidelines to help with site optimization.<ref name="g-wmguide" /><ref name="ms-wmguide" /><ref name="y-wmguide" /> Google has a [[Sitemaps]] program<ref name="googlesitemaps">[http://www.google.com/webmasters/sitemaps/login Google Webmaster Tools]. ''google.com'' Retrieved November 15, 2008.</ref> to help webmasters learn if Google is having any problems indexing their website and also provides data on Google traffic to the website. [[Google guidelines]] are a list of suggested practices Google has provided as guidance to webmasters. [[Yahoo! Site Explorer]] provides a way for webmasters to submit URLs, determine how many pages are in the Yahoo! index and view link information.<ref> [http://siteexplorer.search.yahoo.com Yahoo! Site Explorer]. ''yahoo.com''  Retrieved November 15, 2008.</ref>
+
Some search engines have also reached out to the SEO industry, and are frequent sponsors and guests at SEO conferences, chats, and seminars. In fact, with the advent of paid inclusion, some search engines now have a vested interest in the health of the optimization community. Major search engines provide information and guidelines to help with site optimization.<ref name="g-wmguide" /><ref name="y-wmguide" />  
  
 
===Getting indexed===
 
The leading search engines, Google, Yahoo! and Microsoft, use [[Web crawler|crawlers]] to find pages for their algorithmic search results. Pages that are linked from other search engine indexed pages do not need to be submitted because they are found automatically. Some search engines, notably Yahoo!, operate a paid submission service that guarantee crawling for either a set fee or [[Pay per click|cost per click]].<ref>[http://searchenginewatch.com/showPage.html?page=2167871 Submitting To Search Crawlers: Google, Yahoo, Ask & Microsoft's Live Search]. 2007-03-12. ''[[Search Engine Watch]]'' Retrieved November 15, 2008.</ref> Such programs usually guarantee inclusion in the database, but do not guarantee specific ranking within the search results.<ref> [http://searchmarketing.yahoo.com/srchsb/index.php Search Submit]. ''searchmarketing.yahoo.com''. 2007-05-09. Retrieved November 15, 2008.</ref> Yahoo's paid inclusion program has drawn criticism from advertisers and competitors.<ref>[http://www.washingtonpost.com/ac2/wp-dyn/A48042-2004Mar10 Questionable Results at Revamped Yahoo]. [[Washington Post]], 2004-03-11.  Retrieved November 15, 2008.</ref> Two major directories, the Yahoo Directory and the [[Open Directory Project]] both require manual submission and human editorial review.<ref>[http://searchenginewatch.com/showPage.html?page=2167881 Submitting To Directories: Yahoo & The Open Directory]. 2007-03-12. [[Search Engine Watch]] Retrieved November 15, 2008.</ref> Google offers [[Google Webmaster Tools]], for which an XML [[Sitemap]] feed can be created and submitted for free to ensure that all pages are found, especially pages that aren't discoverable by automatically following links.<ref>[http://www.google.com/support/webmasters/bin/answer.py?answer=40318&topic=8514 What is a Sitemap file and why should I have one?]. ''google.com''. Retrieved November 15, 2008.</ref>
+
The leading search engines, Google, Yahoo! and Microsoft, use [[Web crawler|crawlers]] to find pages for their algorithmic search results. Pages that are linked from other pages already in a search engine's index do not need to be submitted because they are found automatically.
  
[[Web search engine|Search engine]] crawlers may look at a number of different factors when [[Web crawler|crawling]] a site. Not every page is indexed by the search engines. Distance of pages from the root directory of a site may also be a factor in whether or not pages get crawled.<ref name="cho">J. Cho, H. Garcia-Molina, 1998.   [http://dbpubs.stanford.edu:8090/pub/1998-51 Efficient crawling through URL ordering]. ''Proceedings of the seventh conference on World Wide Web, Brisbane, Australia''. 2007-05-09. Retrieved November 15, 2008.</ref>
+
Two major directories, the Yahoo Directory and the [[Open Directory Project]], both require manual submission and human editorial review.<ref>[https://searchenginewatch.com/sew/news/2065394/submitting-to-directories-yahoo-the-open-directory Submitting To Directories: Yahoo & The Open Directory]. ''Search Engine Watch'', March 12, 2007. Retrieved March 27, 2020.</ref> Google offers [[Google Webmaster Tools]], for which an XML [[Sitemap]] feed can be created and submitted for free to ensure that all pages are found, especially pages that aren't discoverable by automatically following links.<ref>[https://support.google.com/webmasters/answer/156184?topic=8514&visit_id=0-636662498395291355-1533726943&rd=1 Learn about sitemaps]. ''google.com''. Retrieved March 27, 2020.</ref>
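As an illustration of what such a feed contains, the following Python sketch builds a minimal XML Sitemap with the standard library. The function name <code>build_sitemap</code> and the example URLs are hypothetical; only the required <code>urlset</code>, <code>url</code>, and <code>loc</code> elements of the Sitemap protocol are produced, and optional fields such as last-modified dates are left out.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the Sitemap protocol (sitemaps.org).
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a minimal Sitemap document listing the given page URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url  # the page's absolute URL
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical pages, including one that no other page links to.
xml_text = build_sitemap([
    "http://example.com/",
    "http://example.com/orphan-page.html",
])
```

The resulting text would typically be saved as a file on the site and its location submitted to the search engine.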
 +
 
 +
[[Web search engine|Search engine]] crawlers may look at a number of different factors when [[Web crawler|crawling]] a site. Not every page is indexed by the search engines. The distance of a page from the root directory of a site may also be a factor in whether or not it gets crawled.<ref>J. Cho, H. Garcia-Molina, and L. Page, [http://ilpubs.stanford.edu:8090/347/ Efficient crawling through URL ordering]. ''Proceedings of the seventh conference on World Wide Web, Brisbane, Australia'', April 14–18, 1998. Retrieved March 27, 2020.</ref>
  
 
===Preventing indexing===
 
{{main|Robots Exclusion Standard}}
+
To avoid undesirable content in the search indexes, webmasters can instruct spiders not to crawl certain files or directories through the standard [[robots.txt]] file in the root directory of the domain. Additionally, a page can be explicitly excluded from a search engine's database by using a [[meta tag]] specific to robots. When a search engine visits a site, the robots.txt located in the [[root directory]] is the first file crawled. The robots.txt file is then parsed, and will instruct the robot as to which pages are not to be crawled. As a search engine crawler may keep a cached copy of this file, it may on occasion crawl pages a webmaster does not wish crawled. Pages typically prevented from being crawled include login-specific pages such as shopping carts and user-specific content such as search results from internal searches. In March 2007, Google warned webmasters that they should prevent indexing of internal search results because those pages are considered search spam.<ref>Danny Sullivan, [https://searchengineland.com/newspapers-amok-new-york-times-spamming-google-la-times-hijacking-carscom-11169 Newspapers Amok! New York Times Spamming Google? LA Times Hijacking Cars.com?] ''Search Engine Land'', May 8, 2007. Retrieved March 27, 2020.</ref>
To avoid undesirable content in the search indexes, webmasters can instruct spiders not to crawl certain files or directories through the standard [[robots.txt]] file in the root directory of the domain. Additionally, a page can be explicitly excluded from a search engine's database by using a [[meta tag]] specific to robots. When a search engine visits a site, the robots.txt located in the [[root directory]] is the first file crawled. The robots.txt file is then parsed, and will instruct the robot as to which pages are not to be crawled. As a search engine crawler may keep a cached copy of this file, it may on occasion crawl pages a webmaster does not wish crawled. Pages typically prevented from being crawled include login specific pages such as shopping carts and user-specific content such as search results from internal searches. In March 2007, Google warned webmasters that they should prevent indexing of internal search results because those pages are considered search spam.<ref>[http://searchengineland.com/070508-165231.php|title=Newspapers Amok! New York Times Spamming Google? LA Times Hijacking Cars.com?]. [[Search Engine Land]], May 8, 2007. Retrieved November 15, 2008.</ref>
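How a well-behaved crawler might consult these rules can be sketched with Python's standard <code>urllib.robotparser</code> module. The robots.txt content, the crawler name <code>ExampleBot</code>, and the helper <code>may_crawl</code> are hypothetical; the rules shown block a shopping cart and internal search results, the kinds of pages mentioned above.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt, as it might be served from the site's root directory.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cart/
Disallow: /search
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse the rules before crawling anything


def may_crawl(url, agent="ExampleBot"):
    """Return True if the parsed rules allow this crawler to fetch the URL."""
    return parser.can_fetch(agent, url)


allowed = may_crawl("http://example.com/products/shoes.html")
blocked_cart = may_crawl("http://example.com/cart/checkout")
blocked_search = may_crawl("http://example.com/search?q=shoes")
```

Here the product page is allowed while the cart and internal search URLs are refused. Compliance is voluntary: the file only advises crawlers that choose to honor it.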
 
  
 
==White hat versus black hat==
 
SEO techniques can be classified into two broad categories: techniques that search engines recommend as part of good design, and those techniques of which search engines do not approve. The search engines attempt to minimize the effect of the latter, among them [[spamdexing]]. Industry commentators have classified these methods, and the practitioners who employ them, as either [[white hat]] SEO, or [[black hat]] SEO.<ref>Andrew Goodman, SearchEngineWatch [http://searchenginewatch.com/showPage.html?page=3483941 Search Engine Showdown: Black hats vs. White hats at SES]. Retrieved November 15, 2008.</ref> White hats tend to produce results that last a long time, whereas black hats anticipate that their sites may eventually be banned either temporarily or permanently once the search engines discover what they are doing.<ref>[[Jill Whalen]], November 16, 2004. [http://www.searchengineguide.com/whalen/2004/1116_jw1.html Black Hat/White Hat Search Engine Optimization]. ''searchengineguide.com''Retrieved November 15, 2008.</ref>
+
SEO techniques can be classified into two broad categories: techniques that search engines recommend as part of good design, and those techniques of which search engines do not approve. The search engines attempt to minimize the effect of the latter, among them [[spamdexing]]. Industry commentators have classified these methods, and the practitioners who employ them, as either [[white hat]] SEO, or [[black hat]] SEO. White hats tend to produce results that last a long time, whereas black hats anticipate that their sites may eventually be banned either temporarily or permanently once the search engines discover what they are doing.<ref>Jill Whalen, Black Hat/White Hat Search Engine Optimization. ''Search Engine Guide'', November 16, 2004. </ref>
  
An SEO technique is considered white hat if it conforms to the search engines' guidelines and involves no deception. As the search engine guidelines<ref>[http://www.google.com/webmasters/seo.html What's an SEO? Does Google recommend working with companies that offer to make my site Google-friendly?]. ''google.com'' |
+
An SEO technique is considered white hat if it conforms to the search engines' guidelines and involves no deception. As the search engine guidelines<ref>[https://support.google.com/webmasters/answer/35291 Do you need an SEO?]. ''google.com''. Retrieved March 27, 2020.</ref><ref name="g-wmguide">[https://support.google.com/webmasters/answer/35769 Google's Webmaster Guidelines]. ''google.com''. Retrieved March 27, 2020.</ref><ref name="y-wmguide">[https://help.yahoo.com/kb/search-for-desktop/SLN2245.html?impressions=true Yahoo! Content Quality Guidelines]. ''help.yahoo.com''. Retrieved March 27, 2020.</ref> are not written as a series of rules or commandments, this is an important distinction to note. White hat SEO is not just about following guidelines, but is about ensuring that the content a search engine indexes and subsequently ranks is the same content a user will see. White hat advice is generally summed up as creating content for users, not for search engines, and then making that content easily accessible to the spiders, rather than attempting to trick the algorithm away from its intended purpose. White hat SEO is in many ways similar to web development that promotes accessibility,<ref>Andy Hagans, [http://alistapart.com/article/accessibilityseo High Accessibility Is Effective Search Engine Optimization]. ''A List Apart'', November 8, 2005. Retrieved March 27, 2020.</ref> although the two are not identical.
Retrieved November 15, 2008.</ref><ref name="g-wmguide">[http://www.google.com/webmasters/guidelines.html Google's Guidelines on Site Design]. 2007-04-18 ''google.com''. Retrieved November 15, 2008.</ref><ref name="ms-wmguide">[http://search.msn.com/docs/siteowner.aspx?t=SEARCH_WEBMASTER_REF_GuidelinesforOptimizingSite.htm Site Owner Help: MSN Search Web Crawler and Site Indexing]. ''msn.com''. 2007-04-18. Retrieved November 15, 2008.</ref><ref name="y-wmguide">[http://help.yahoo.com/l/us/yahoo/search/basics/basics-18.html Yahoo! Search Content Quality Guidelines]. ''help.yahoo.com''. Retrieved November 15, 2008.</ref> are not written as a series of rules or commandments, this is an important distinction to note. White hat SEO is not just about following guidelines, but is about ensuring that the content a search engine indexes and subsequently ranks is the same content a user will see. White hat advice is generally summed up as creating content for users, not for search engines, and then making that content easily accessible to the spiders, rather than attempting to trick the algorithm from its intended purpose. White hat SEO is in many ways similar to web development that promotes accessibility,<ref>Andy Hagans, [[A List Apart]][http://alistapart.com/articles/accessibilityseo High Accessibility Is Effective Search Engine Optimization]. November 8, 2005 Retrieved November 15, 2008.</ref> although the two are not identical.
 
  
 
[[spamdexing|Black hat SEO]] attempts to improve rankings in ways that are disapproved of by the search engines, or involve deception. One black hat technique uses text that is hidden, either as text colored similar to the background, in an invisible [[Span and div|div]], or positioned off screen. Another method gives a different page depending on whether the page is being requested by a human visitor or a search engine, a technique known as [[cloaking]].
  
Search engines may penalize sites they discover using black hat methods, either by reducing their rankings or eliminating their listings from their databases altogether. Such penalties can be applied either automatically by the search engines' algorithms, or by a manual site review. One infamous example was the February 2006 Google removal of both [[BMW]] Germany and [[Ricoh]] Germany for use of deceptive practices.<ref name="intwebspam">[http://www.mattcutts.com/blog/ramping-up-on-international-webspam/]. ''mattcutts.com/blog'' Ramping up on international webspam]. ''mattcutts.com/blog'' [[Matt Cutts]] February 4, 2006. Retrieved November 15, 2008.</ref> Both companies, however, quickly apologized, fixed the offending pages, and were restored to Google's list.<ref>[http://www.mattcutts.com/blog/recent-reinclusions/ ''mattcutts.com/blog'' Recent reinclusions]. [[Matt Cutts]], February 7, 2006. Retrieved November 15, 2008.</ref>
Search engines may penalize sites they discover using black hat methods, either by reducing their rankings or eliminating their listings from their databases altogether. Such penalties can be applied either automatically by the search engines' algorithms, or by a manual site review. One infamous example was the February 2006 Google removal of both [[BMW]] Germany and [[Ricoh]] Germany for use of deceptive practices.<ref>Matt Cutts, [http://www.mattcutts.com/blog/ramping-up-on-international-webspam/ Ramping up on international webspam]. ''Gadgets, Google, and SEO'', February 4, 2006. Retrieved March 27, 2020.</ref> Both companies, however, quickly apologized, fixed the offending pages, and were restored to Google's list.<ref>Matt Cutts, [http://www.mattcutts.com/blog/recent-reinclusions/ Recent reinclusions]. ''Gadgets, Google, and SEO'', February 7, 2006. Retrieved March 27, 2020.</ref>
  
 
==As a marketing strategy==
[[Image:2008-07-03-Inteligent life.png|thumb|200px|Wiki search engine in action.]]
Placement at or near the top of the rankings increases the number of searchers who will visit a site. However, more search engine referrals does not guarantee more sales. SEO is not necessarily an appropriate strategy for every website, and other Internet marketing strategies can be much more effective, depending on the site operator's goals. A successful Internet marketing campaign may drive organic traffic to web pages, but it also may involve the use of paid advertising on search engines and other pages, building high quality web pages to engage and persuade, addressing technical issues that may keep search engines from crawling and indexing those sites, setting up analytics programs to enable site owners to measure their successes, and improving a site's [[conversion rate]].<ref>Melissa Burdon,  [https://www.h2desk.com/blog/battle-search-engine-optimization-conversion-wins/ The Battle Between Search Engine Optimization and Conversion: Who Wins?]. ''H2 Desk'', March 31, 2007 . Retrieved March 27, 2020.</ref>
  
Eye tracking studies have shown that searchers scan a search results page from top to bottom and left to right (for left to right languages), looking for a relevant result. Placement at or near the top of the rankings therefore increases the number of searchers who will visit a site.<ref>[[Search Engine Watch]] [http://searchenginewatch.com/showPage.html?page=3488076 A New F-Word for Google Search Results].  March 8, 2005, Retrieved November 15, 2008.</ref> However, more search engine referrals does not guarantee more sales. SEO is not necessarily an appropriate strategy for every website, and other Internet marketing strategies can be much more effective, depending on the site operator's goals.<ref> [http://blog.v7n.com/2006/06/24/what-seo-isnt/|publisher=blog.v7n.com What SEO Isn't] June 24, 2006, Retrieved November 15, 2008.</ref> A successful Internet marketing campaign may drive organic traffic to web pages, but it also may involve the use of paid advertising on search engines and other pages, building high quality web pages to engage and persuade, addressing technical issues that may keep search engines from crawling and indexing those sites, setting up analytics programs to enable site owners to measure their successes, and improving a site's [[conversion rate]].<ref>Melissa Burdon,  [http://www.grokdotcom.com/2007/03/13/the-battle-between-search-engine-optimization-and-conversion-who-wins/ The Battle Between Search Engine Optimization and Conversion: Who Wins?]. ''Grok.com''. 2007-05-09, March 13, 2007. Retrieved November 15, 2008.</ref>
SEO may generate a [[return on investment]]. However, search engines are not paid for organic search traffic, their algorithms change, and there are no guarantees of continued referrals. Due to this lack of guarantees and certainty, a business that relies heavily on search engine traffic can suffer major losses if the search engines stop sending visitors.<ref>Andy Greenberg, [https://www.forbes.com/2007/04/29/sanar-google-skyfacet-tech-cx_ag_0430googhell.html#3adbd52e639b Condemned To Google Hell]. ''Forbes'', April 30, 2007. Retrieved March 27, 2020.</ref> It is considered wise business practice for website operators to liberate themselves from dependence on search engine traffic.<ref>Jakob Nielsen, [https://www.nngroup.com/articles/search-engines-as-leeches-on-the-web/ Search Engines as Leeches on the Web]. ''Nielsen Norman Group'',  January 9, 2006. Retrieved March 27, 2020.</ref> A top-ranked SEO blog reported, "Search marketers, in a twist of irony, receive a very small share of their traffic from search engines."<ref> [https://www.searchenginejournal.com/seomoz-best-seo-blog-of-2006/4195/ SEOmoz: Best SEO Blog of 2006]. ''Search Engine Journal'', January 3, 2007. Retrieved March 27, 2020.</ref> Instead, their main sources of traffic are links from other websites.
 
 
SEO may generate a [[return on investment]]. However, search engines are not paid for organic search traffic, their algorithms change, and there are no guarantees of continued referrals. Due to this lack of guarantees and certainty, a business that relies heavily on search engine traffic can suffer major losses if the search engines stop sending visitors.<ref>Andy Greenberg, April 30, 2007[[Forbes]] [http://www.forbes.com/technology/2007/04/29/sanar-google-skyfacet-tech-cx_ag_0430googhell.html?partner=rss Condemned To Google Hell]. Retrieved November 15, 2008.</ref> It is considered wise business practice for website operators to liberate themselves from dependence on search engine traffic.<ref>[[Jakob Nielsen (usability consultant)|Jakob Nielsen]], January 9, 2006. [http://www.useit.com/alertbox/search_engines.html Search Engines as Leeches on the Web]. ''useit.com''. Retrieved November 15, 2008.</ref> A top-ranked SEO blog Seomoz.org<ref> [http://www.searchenginejournal.com/seomoz-best-seo-blog-of-2006/4195/ SEOmoz: Best SEO Blog of 2006]. ''searchenginejournal.com''. January 3, 2007. Retrieved November 15, 2008.</ref> has reported, "Search marketers, in a twist of irony, receive a very small share of their traffic from search engines." Instead, their main sources of traffic are links from other websites.<ref> 2007-05-31 [http://www.seomoz.org/article/search-blog-stats#4 A survey of 25 blogs in the search space comparing external metrics to visitor tracking data]. ''seomoz.org''. Retrieved November 15, 2008.</ref>
 
  
 
==International Markets==
The search engines' market shares vary from market to market, as does competition.
The search engines' market shares vary from market to market, as does competition. In 2003, [[Danny Sullivan (technologist)|Danny Sullivan]] stated that Google represented about 75 percent of all searches.<ref>Jefferson Graham, [https://usatoday30.usatoday.com/tech/news/2003-08-25-google_x.htm The search engine that could]. ''USA Today'', August 26, 2003. Retrieved March 27, 2020.</ref> In markets outside the United States, Google's share is often larger, as much as 90 percent.<ref>[http://gs.statcounter.com/search-engine-market-share Search Engine Market Share Worldwide] ''Stat Counter'', Global Stats. Retrieved March 27, 2020.</ref>  
In 2003, [[Danny Sullivan (technologist)|Danny Sullivan]] stated that Google represented about 75 percent of all searches.<ref>[http://www.usatoday.com/tech/news/2003-08-25-google_x.htm The search engine that could]. ''USA Today'', 2003-08-26. Retrieved November 15, 2008.</ref> In markets outside the United States, Google's share is often larger, and Google remains the dominant search engine worldwide as of 2007.<ref>Greg Jarboe, [http://searchenginewatch.com/showPage.html?page=3625072 Stats Show Google Dominates the International Search Landscape]. [[Search Engine Watch]], 2007-02-22.  Retrieved November 15, 2008.</ref> As of 2006, Google held about 40 percent of the market in the United States, but Google had an 85-90 percent market share in Germany.<ref name="grehan-1">Mike Grehan, April 3, 2006 [http://www.clickz.com/showPage.html?page=3595926 Search Engine Optimizing for Europe]. ''Click''. Retrieved November 15, 2008.</ref> While there were hundreds of SEO firms in the US at that time, there were only about five in Germany.<ref name="grehan-1" />  
 
  
In Russia the situation is reversed. Local search engine [[Yandex]] controls 50 percent of the paid advertising revenue, while Google has less than 9 percent.<ref>Eric Pfanner, December 18, 2006, ''New York Times'' [http://www.nytimes.com/2006/12/18/technology/18google.html?ex=1179374400&en=0da5cb873cb45b2e&ei=5070 New to Russia, Google Struggles to Find Its Footing]. Retrieved November 15, 2008.</ref> In China, [[Baidu]] continues to lead in market share, although Google has been gaining share as of 2007.<ref>[http://searchengineland.com/070306-144912.php Google Gaining, But Baidu Still Dominates In China]. March 6, 2007. ''Search Engine Land''. Retrieved November 15, 2008.</ref>
Successful search optimization for international markets may require professional [[language translation|translation]] of web pages, registration of a domain name with a [[top level domain]] in the target market, and [[web hosting]] that provides a local [[IP address]]. Otherwise, the fundamental elements of search optimization are essentially the same, regardless of language.
 
 
Successful search optimization for international markets may require professional [[language translation|translation]] of web pages, registration of a domain name with a [[top level domain]] in the target market, and [[web hosting]] that provides a local [[IP address]]. Otherwise, the fundamental elements of search optimization are essentially the same, regardless of language.<ref name="grehan-1" />
 
  
 
==Legal precedents==
On October 17, 2002, [[SearchKing]] filed suit in the United States District Court, Western District of Oklahoma, against the search engine Google. SearchKing's claim was that Google's tactics to prevent [[spamdexing]] constituted a tortious interference with contractual relations. <!-- This may be compared to lawsuits that email spammers have filed against spam-fighters, as in various cases against MAPS and other [[DNSBL]]s. —> On May 27, 2003, the court granted Google's motion to dismiss the complaint because SearchKing "failed to state a claim upon which relief may be granted."<ref>[http://www.docstoc.com/docs/618281/Order-(Granting-Googles-Motion-to-Dismiss-Search-Kings-Complaint)]. PDF ''docstoc.com''. Search King, Inc. v. Google Technology, Inc., CIV-02-1457-M. May 27, 2003, Retrieved November 15, 2008.</ref><ref>Stefanie Olsen, May 30, 2003, [http://news.com.com/2100-1032_3-1011740.html Judge dismisses suit against Google]. [[CNET]] Retrieved November 15, 2008.</ref>
On October 17, 2002, [[SearchKing]] filed suit in the United States District Court, Western District of Oklahoma, against the search engine Google. SearchKing's claim was that Google's tactics to prevent [[spamdexing]] constituted a tortious interference with contractual relations. On January 13, 2003, the court granted Google's motion to dismiss the complaint because ''Google's Page Ranks are entitled to First Amendment protection'' and further that SearchKing "failed to show that Google's actions caused it irreparable injury, as the damages arising from its reduced ranking were too speculative."<ref> Martin Samson, [http://www.internetlibrary.com/cases/lib_case337.cfm Search King, Inc. v. Google Technology, Inc.] ''Internet Library of Law and Court Decisions''. Retrieved March 27, 2020.</ref>
  
In March 2006, [[KinderStart]] filed a lawsuit against Google over search engine rankings. Kinderstart's web site was removed from Google's index prior to the lawsuit and the amount of traffic to the site dropped by 70 percent. On March 16, 2007 the [[United States District Court for the Northern District of California]] ([[San Jose, California|San Jose]] Division) dismissed KinderStart's complaint without leave to amend, and partially granted Google's motion for [[Federal Rules of Civil Procedure#Chapter III - Pleadings and Motions|Rule 11]] sanctions against KinderStart's attorney, requiring him to pay part of Google's legal expenses.
In March 2006, [[KinderStart]] filed a lawsuit against Google over search engine rankings. Kinderstart's web site was removed from Google's index prior to the lawsuit and the amount of traffic to the site dropped by 70 percent. On March 16, 2007 the [[United States District Court for the Northern District of California]] ([[San Jose, California|San Jose]] Division) dismissed KinderStart's complaint without leave to amend, and partially granted Google's motion for [[Federal Rules of Civil Procedure#Chapter III - Pleadings and Motions|Rule 11]] sanctions against KinderStart's attorney, requiring him to pay part of Google's legal expenses.<ref>Eric Goldman, [https://blog.ericgoldman.org/archives/2007/03/kinderstart_v_g_2.htm KinderStart v. Google Dismissed–With Sanctions Against KinderStart’s Counsel] Technology & Marketing Law Blog, March 20, 2007. Retrieved March 27, 2020.</ref>
<ref>Eric Goldman, [http://www.blog.ericgoldman.org/archives/2007/03/kinderstart_v_g_2.htm KinderStart v. Google Dismissed&mdash;With Sanctions Against KinderStart's Counsel]. ''Technology & Marketing Law Blog'', ''blog.ericgoldman.org''. 2008-06-23. Retrieved November 15, 2008.</ref>
 
  
 
== Notes==
  
 
==References==
*Brin, Sergey, and Lawrence Page. [http://www-db.stanford.edu/~backrub/google.html The Anatomy of a Large-Scale Hypertextual Web Search Engine] ''Proceedings of the seventh international conference on World Wide Web'', 1998. Retrieved June 29, 2018.
 
*Cho, J., H. Garcia-Molina, and L. Page. [http://dbpubs.stanford.edu:8090/pub/1998-51 Efficient crawling through URL ordering] ''Proceedings of the seventh conference on World Wide Web'', Brisbane, Australia, April 14-18, 1998. Retrieved June 29, 2018.
 
*Cutts, Matt. [http://www.mattcutts.com/blog/confirming-a-penalty/ Confirming a penalty] February 2, 2006. Retrieved June 29, 2018.
 
*Cutts, Matt. [http://www.mattcutts.com/blog/ramping-up-on-international-webspam/ Ramping up on international webspam] February 4, 2006. Retrieved June 29, 2018.
 
*Cutts, Matt. [http://www.mattcutts.com/blog/recent-reinclusions/ Recent reinclusions]  February 7, 2006. Retrieved June 29, 2018.
 
* Flynn, Laurie J. [https://www.nytimes.com/1996/11/11/business/desperately-seeking-surfers.html Desperately Seeking Surfers] ''The New York Times'', November 11, 1996. Retrieved June 29, 2018.
 
*Graham, Jefferson. [http://www.usatoday.com/tech/news/2003-08-25-google_x.htm The search engine that could] ''USA Today'', August 26, 2003. Retrieved June 29, 2018.
 
 
*Grappone, Jennifer, and Gradiva Couzin. ''Search Engine Optimization: An Hour a Day.'' San Francisco, CA: Sybex, 2006. ISBN 978-0471787532
* Gyongyi, Zoltan, and Hector Garcia-Molina. [http://infolab.stanford.edu/~zoltan/publications/gyongyi2005link.pdf Link Spam Alliances] ''Proceedings of the 31st VLDB Conference'', Trondheim, Norway, 2005. Retrieved June 29, 2018.
 
*Hagans, Andy. [http://alistapart.com/articles/accessibilityseo High Accessibility Is Effective Search Engine Optimization] ''A List Apart'', November 8, 2005. Retrieved June 29, 2018.
 
*Kesmodel, David. [https://www.wsj.com/articles/SB112714166978744925?apl=y&r=947596 Sites Get Dropped by Search Engines After Trying to 'Optimize' Rankings] ''Wall Street Journal'', September 22, 2005. Retrieved June 29, 2018.
 
 
*Konia, Brad S. ''Search Engine Optimization with WebPosition Gold 2. Wordware web programming/development library.'' Plano, TX: Wordware Pub, 2002. ISBN 978-0585428475
 
*Ledford, Jerri L. ''SEO: Search Engine Optimization Bible.'' Hoboken, NJ: Wiley, 2008. ISBN 978-0470175002
* Nielsen, Jakob. [https://www.nngroup.com/articles/search-engines-as-leeches-on-the-web/ Search Engines as Leeches on the Web] ''Nielsen Norman Group'', January 9, 2006. Retrieved June 29, 2018.
*Pfanner, Eric. [https://www.nytimes.com/2006/12/18/technology/18google.html?ex=1179374400&en=0da5cb873cb45b2e&ei=5070 New to Russia, Google Struggles to Find Its Footing] ''The New York Times'', December 18, 2006. Retrieved March 27, 2020.
 
 
*Potts, Kevin. ''Web Design and Marketing Solutions for Business Websites.'' Berkeley, CA: Friends of Ed, 2007. ISBN 978-1590598399
*Pringle, Glen, Lloyd Allison, and David L. Dowe. [http://www.csse.monash.edu.au/~lloyd/tilde/InterNet/Search/1998_WWW7.html What is a tall poppy among web pages?] Proc. 7th Int. World Wide Web Conference, Brisbane, April 1998. Retrieved June 29, 2018.
 
 
*Siskind, Gregory H., Deborah McMurray, and Richard P. Klau. ''The Lawyer's Guide to Marketing on the Internet.'' Chicago, IL: American Bar Association, 2007. ISBN 978-1590318768
*Thompson, Bill. [http://news.bbc.co.uk/1/hi/technology/3334531.stm Is Google good for you?] ''BBC News'', December 19, 2003. Retrieved June 29, 2018.
 
*Whalen, Jill. [http://www.searchengineguide.com/whalen/2004/1116_jw1.html Black Hat/White Hat Search Engine Optimization] ''Search Engine Guide'', November 16, 2004. Retrieved June 29, 2018.
 
  
 
==External links==
All links retrieved January 25, 2023.
 
*[http://www.google.com/support/webmasters/bin/answer.py?hl=en&answer=35769 Google Webmaster Guidelines]
 
*[http://help.yahoo.com/l/us/yahoo/search/basics/basics-18.html Yahoo! Webmaster Guidelines]
 
*[https://seotribunal.com/blog/stats-to-understand-seo/ 72 Stats To Understand SEO In 2018 (Infographic)]
*[https://logomaker.org/seo-statistics/ 36 Vital SEO Statistics and Facts to Keep in Mind for 2020]
  
 
[[Category:Library and information science]]
  
 
{{credits|Search_engine_optimization|251848614}}

Latest revision as of 02:44, 21 April 2023

PageRank, a link analysis algorithm by Google

Search engine optimization (SEO) is the process of improving the volume and quality of traffic to a web site from search engines via "natural" ("organic" or "algorithmic") search results. Usually, the earlier a site is presented in the search results, or the higher it "ranks," the more searchers will visit that site. SEO can also target different kinds of search, including image search, local search, and industry-specific vertical search engines.

As an Internet marketing strategy, SEO considers how search engines work and what people search for. Optimizing a website primarily involves editing its content and HTML coding both to increase its relevance to specific keywords and to remove barriers to the indexing activities of search engines.

The acronym "SEO" can also refer to "search engine optimizers," a term adopted by an industry of consultants who carry out optimization projects on behalf of clients and by employees who perform SEO services in-house. Search engine optimizers may offer SEO as a stand-alone service or as a part of a broader marketing campaign. Because effective SEO may require changes to the HTML source code of a site, SEO tactics may be incorporated into web site development and design. The term "search engine friendly" may be used to describe web site designs, menus, content management systems and shopping carts that are easy to optimize.

Another class of techniques, known as black hat SEO or spamdexing, uses methods such as link farms and keyword stuffing that degrade both the relevance of search results and the user experience of search engines. Search engines look for sites that use these techniques in order to remove them from their indices.

History

Webmasters and content providers began optimizing sites for search engines in the mid-1990s, as the first search engines were cataloging the early Web. Initially, all a webmaster needed to do was submit a page, or URL, to the various engines, which would send a spider to "crawl" that page, extract links to other pages from it, and return information found on the page to be indexed. The process involves a search engine spider downloading a page and storing it on the search engine's own server, where a second program, known as an indexer, extracts information about the page: the words it contains, where they are located, and any weight given to specific words. The indexer also extracts all the links the page contains, which are then placed into a scheduler for crawling at a later date.
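
The crawl-and-index cycle described above can be illustrated with a rough sketch. It is not how any particular search engine works: the PAGES dictionary is a hypothetical stand-in for the Web, and the names PageParser and crawl_and_index are invented for this example. The spider "downloads" a page, the indexer records each word and its position, and newly found links go into a queue that plays the role of the scheduler.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin


class PageParser(HTMLParser):
    """Collects outgoing links and words from one HTML page."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []
        self.words = []

    def handle_starttag(self, tag, attrs):
        # Resolve each anchor's href against the page URL.
        if tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(urljoin(self.base_url, href))

    def handle_data(self, data):
        self.words.extend(word.lower() for word in data.split())


def crawl_and_index(pages, seed):
    """pages is a hypothetical URL -> HTML mapping standing in for the Web."""
    index = {}                # word -> {url: [word positions]}
    frontier = deque([seed])  # the "scheduler" of URLs to crawl later
    seen = {seed}
    while frontier:
        url = frontier.popleft()
        html = pages.get(url)
        if html is None:      # page could not be "downloaded"
            continue
        parser = PageParser(url)
        parser.feed(html)
        parser.close()
        for position, word in enumerate(parser.words):
            index.setdefault(word, {}).setdefault(url, []).append(position)
        for link in parser.links:
            if link not in seen:
                seen.add(link)
                frontier.append(link)
    return index


PAGES = {
    "http://example.com/": '<p>Search engines</p><a href="/about">about us</a>',
    "http://example.com/about": "<p>About search</p>",
}
INDEX = crawl_and_index(PAGES, "http://example.com/")
```

Only the seed URL is submitted here; the second page is discovered through the link on the first, which mirrors how a single submitted URL could lead a spider to the rest of a site.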

Site owners started to recognize the value of having their sites highly ranked and visible in search engine results, creating an opportunity for both white hat and black hat SEO practitioners. According to industry analyst Danny Sullivan, the earliest known use of the phrase search engine optimization was in 1997.[1]

Early versions of search algorithms relied on webmaster-provided information such as the keyword meta tag, or index files in engines like ALIWEB. Meta tags provided a guide to each page's content. But using meta data to index pages was found to be less than reliable because the keywords a webmaster listed in the meta tag did not always reflect the page's actual content. Inaccurate, incomplete, and inconsistent data in meta tags caused pages to rank for irrelevant searches. Web content providers also manipulated a number of attributes within the HTML source of a page in an attempt to rank well in search engines.[2]

By relying so much on factors exclusively within a webmaster's control, early search engines suffered from abuse and ranking manipulation. To provide better results to their users, search engines had to adapt to ensure their results pages showed the most relevant search results, rather than unrelated pages stuffed with numerous keywords by unscrupulous webmasters. Since the success and popularity of a search engine are determined by its ability to produce the most relevant results for any given search, allowing those results to be false would drive users to other search sources. Search engines responded by developing more complex ranking algorithms, taking into account additional factors that were more difficult for webmasters to manipulate.

While graduate students at Stanford University, Larry Page and Sergey Brin developed "BackRub," a search engine that relied on a mathematical algorithm to rate the prominence of web pages. The number calculated by the algorithm, PageRank, is a function of the quantity and strength of inbound links.[3] PageRank estimates the likelihood that a given page will be reached by a web user who randomly surfs the web and follows links from one page to another. In effect, this means that some links are stronger than others, as a higher PageRank page is more likely to be reached by the random surfer.
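
The random-surfer idea can be sketched as a simple iterative calculation. This is an illustrative simplification, not Google's production algorithm: the damping value of 0.85, the fixed number of iterations, the treatment of pages without outgoing links, and the four-page GRAPH are all assumptions made for the example.

```python
def pagerank(links, damping=0.85, iterations=50):
    """links maps each page to the list of pages it links to."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {page: 1.0 / n for page in pages}
    for _ in range(iterations):
        # With probability (1 - damping) the surfer jumps to a random page.
        new_rank = {page: (1.0 - damping) / n for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Otherwise the surfer follows one outgoing link at random.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # Assumption: a page with no links sends the surfer anywhere.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical four-page web: "c" receives links from "a", "b", and "d".
GRAPH = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
RANKS = pagerank(GRAPH)
```

In this toy graph, page "c" ends up with the highest score because it has the most inbound links, and page "a" scores well with only one inbound link because that link comes from the highly ranked "c". Page "d", which nothing links to, receives only the random-jump share.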

Google headquarters

Page and Brin founded Google in 1998. Google attracted a loyal following among the growing number of Internet users, who liked its simple design.[4] Off-page factors (such as PageRank and hyperlink analysis) were considered as well as on-page factors (such as keyword frequency, meta tags, headings, links and site structure) to enable Google to avoid the kind of manipulation seen in search engines that only considered on-page factors for their rankings. Although PageRank was more difficult to game, webmasters had already developed link building tools and schemes to influence the Inktomi search engine, and these methods proved similarly applicable to gaining PageRank. Many sites focused on exchanging, buying, and selling links, often on a massive scale. Some of these schemes, or link farms, involved the creation of thousands of sites for the sole purpose of link spamming.[5] In recent years major search engines have begun to rely more heavily on off-web factors such as the age, sex, location, and search history of people conducting searches in order to further refine results.

By 2007, search engines had incorporated a wide range of undisclosed factors in their ranking algorithms to reduce the impact of link manipulation. Google says it ranks sites using more than 200 different signals.[6] The three leading search engines, Google, Yahoo and Microsoft's Live Search, do not disclose the algorithms they use to rank pages. Notable SEOs, such as Rand Fishkin, Barry Schwartz, Aaron Wall and Jill Whalen, have studied different approaches to search engine optimization, and have published their opinions in online forums and blogs.[7]

Webmasters and search engines

By 1997 search engines recognized that webmasters were making efforts to rank well in their search engines, and that some webmasters were even manipulating their rankings in search results by stuffing pages with excessive or irrelevant keywords. Early search engines, such as Infoseek, adjusted their algorithms in an effort to prevent webmasters from manipulating rankings.[8]

Due to the high marketing value of targeted search results, there is potential for an adversarial relationship between search engines and SEOs. In 2005, an annual conference, AIRWeb, Adversarial Information Retrieval on the Web,[9] was created to discuss and minimize the damaging effects of aggressive web content providers.

SEO companies that employ overly aggressive techniques can get their client websites banned from the search results. In 2005, the Wall Street Journal reported on a company, Traffic Power, which allegedly used high-risk techniques and failed to disclose those risks to its clients.[10] Google's Matt Cutts later confirmed that Google did in fact ban Traffic Power and some of its clients.[11]

Some search engines have also reached out to the SEO industry, and are frequent sponsors and guests at SEO conferences, chats, and seminars. In fact, with the advent of paid inclusion, some search engines now have a vested interest in the health of the optimization community. Major search engines provide information and guidelines to help with site optimization.[12][13]

Getting indexed

The leading search engines, Google, Yahoo! and Microsoft, use crawlers to find pages for their algorithmic search results. Pages that are linked from other search engine indexed pages do not need to be submitted because they are found automatically.

Two major directories, the Yahoo Directory and the Open Directory Project, both require manual submission and human editorial review.[14] Google offers Google Webmaster Tools, for which an XML Sitemap feed can be created and submitted for free to ensure that all pages are found, especially pages that aren't discoverable by automatically following links.[15]
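
As a rough illustration of what such an XML Sitemap feed contains, the sketch below builds a minimal one with Python's standard library. The namespace is the one published by the sitemaps.org protocol; the function name build_sitemap and the example URLs are hypothetical, and optional fields such as last-modified dates are left out.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = "http://www.sitemaps.org/schemas/sitemap/0.9"


def build_sitemap(urls):
    """Return a minimal sitemap document listing the given page URLs."""
    urlset = ET.Element("urlset", xmlns=SITEMAP_NS)
    for url in urls:
        entry = ET.SubElement(urlset, "url")
        ET.SubElement(entry, "loc").text = url
    return ET.tostring(urlset, encoding="unicode")


# Hypothetical pages, including one that no other page links to.
SITE_URLS = [
    "http://example.com/",
    "http://example.com/products",
    "http://example.com/orphan-page",
]
SITEMAP_XML = build_sitemap(SITE_URLS)
```

Listing a page here does not guarantee that it will be indexed; the feed only tells the crawler that the page exists.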

Search engine crawlers may look at a number of different factors when crawling a site. Not every page is indexed by the search engines. Distance of pages from the root directory of a site may also be a factor in whether or not pages get crawled.[16]

Preventing indexing

To avoid undesirable content in the search indexes, webmasters can instruct spiders not to crawl certain files or directories through the standard robots.txt file in the root directory of the domain. Additionally, a page can be explicitly excluded from a search engine's database by using a meta tag specific to robots. When a search engine visits a site, the robots.txt located in the root directory is the first file crawled. The robots.txt file is then parsed, and will instruct the robot as to which pages are not to be crawled. As a search engine crawler may keep a cached copy of this file, it may on occasion crawl pages a webmaster does not wish crawled. Pages typically prevented from being crawled include login-specific pages such as shopping carts and user-specific content such as search results from internal searches. In March 2007, Google warned webmasters that they should prevent indexing of internal search results because those pages are considered search spam.[17]
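
A short sketch of how a well-behaved crawler might consult these rules, using the robots.txt parser in Python's standard library. The rules, the paths, and the crawler name "ExampleBot" are hypothetical. For the per-page alternative, the robots meta tag usually takes the form <meta name="robots" content="noindex"> in the page's head.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt keeping crawlers out of the shopping cart
# and out of internal search results.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cart/
Disallow: /search
"""


def make_checker(robots_txt):
    """Parse robots.txt text that has already been downloaded."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


CHECKER = make_checker(ROBOTS_TXT)
# A product page is not covered by any Disallow rule.
ALLOWED = CHECKER.can_fetch("ExampleBot", "http://example.com/products/widget")
# Internal search results fall under the "/search" prefix.
BLOCKED = CHECKER.can_fetch("ExampleBot", "http://example.com/search?q=widgets")
```

Here ALLOWED is True and BLOCKED is False. Compliance is voluntary on the crawler's side, so robots.txt is a request rather than an access control.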

White hat versus black hat

SEO techniques can be classified into two broad categories: techniques that search engines recommend as part of good design, and those techniques of which search engines do not approve. The search engines attempt to minimize the effect of the latter, among them spamdexing. Industry commentators have classified these methods, and the practitioners who employ them, as either white hat SEO, or black hat SEO. White hats tend to produce results that last a long time, whereas black hats anticipate that their sites may eventually be banned either temporarily or permanently once the search engines discover what they are doing.[18]

An SEO technique is considered white hat if it conforms to the search engines' guidelines and involves no deception. As the search engine guidelines[19][12][13] are not written as a series of rules or commandments, this is an important distinction to note. White hat SEO is not just about following guidelines, but is about ensuring that the content a search engine indexes and subsequently ranks is the same content a user will see. White hat advice is generally summed up as creating content for users, not for search engines, and then making that content easily accessible to the spiders, rather than attempting to divert the algorithm from its intended purpose. White hat SEO is in many ways similar to web development that promotes accessibility,[20] although the two are not identical.

Black hat SEO attempts to improve rankings in ways that the search engines disapprove of or that involve deception. One black hat technique uses hidden text, either colored similarly to the background, placed in an invisible div, or positioned off screen. Another method serves a different page depending on whether the page is being requested by a human visitor or a search engine, a technique known as cloaking.

Search engines may penalize sites they discover using black hat methods, either by reducing their rankings or eliminating their listings from their databases altogether. Such penalties can be applied either automatically by the search engines' algorithms, or by a manual site review. One infamous example was the February 2006 Google removal of both BMW Germany and Ricoh Germany for use of deceptive practices.[21] Both companies, however, quickly apologized, fixed the offending pages, and were restored to Google's list.[22]

As a marketing strategy

Placement at or near the top of the rankings increases the number of searchers who will visit a site. However, more search engine referrals do not guarantee more sales. SEO is not necessarily an appropriate strategy for every website, and other Internet marketing strategies can be much more effective, depending on the site operator's goals. A successful Internet marketing campaign may drive organic traffic to web pages, but it also may involve the use of paid advertising on search engines and other pages, building high-quality web pages to engage and persuade, addressing technical issues that may keep search engines from crawling and indexing those sites, setting up analytics programs to enable site owners to measure their successes, and improving a site's conversion rate.[23]

SEO may generate a return on investment. However, search engines are not paid for organic search traffic, their algorithms change, and there are no guarantees of continued referrals. Due to this lack of guarantees and certainty, a business that relies heavily on search engine traffic can suffer major losses if the search engines stop sending visitors.[24] It is considered wise business practice for website operators to liberate themselves from dependence on search engine traffic.[25] A top-ranked SEO blog reported, "Search marketers, in a twist of irony, receive a very small share of their traffic from search engines."[26] Instead, their main sources of traffic are links from other websites.

International markets

The search engines' market shares vary from market to market, as does competition. In 2003, Danny Sullivan stated that Google represented about 75 percent of all searches.[27] In markets outside the United States, Google's share is often larger, as much as 90 percent.[28]

Successful search optimization for international markets may require professional translation of web pages, registration of a domain name with a top level domain in the target market, and web hosting that provides a local IP address. Otherwise, the fundamental elements of search optimization are essentially the same, regardless of language.

Legal precedents

On October 17, 2002, SearchKing filed suit in the United States District Court, Western District of Oklahoma, against the search engine Google. SearchKing's claim was that Google's tactics to prevent spamdexing constituted a tortious interference with contractual relations. On January 13, 2003, the court granted Google's motion to dismiss the complaint on the grounds that Google's PageRanks are entitled to First Amendment protection and that SearchKing "failed to show that Google's actions caused it irreparable injury, as the damages arising from its reduced ranking were too speculative."[29]

In March 2006, KinderStart filed a lawsuit against Google over search engine rankings. KinderStart's website was removed from Google's index prior to the lawsuit, and the amount of traffic to the site dropped by 70 percent. On March 16, 2007, the United States District Court for the Northern District of California (San Jose Division) dismissed KinderStart's complaint without leave to amend, and partially granted Google's motion for Rule 11 sanctions against KinderStart's attorney, requiring him to pay part of Google's legal expenses.[30]

Notes

  1. Danny Sullivan, Who Invented the Term "Search Engine Optimization"?. Search Engine Watch, June 14, 2004. See Google groups thread. Retrieved March 27, 2020.
  2. Glen Pringle, Lloyd Allison, and David L. Dowe, What is a tall poppy among web pages?. Proc. 7th Int. World Wide Web Conference, Brisbane, April 1998. Retrieved March 27, 2020.
  3. Sergey Brin and Larry Page, The Anatomy of a Large-Scale Hypertextual Web Search Engine Proceedings of the seventh international conference on World Wide Web, 1998. Retrieved March 27, 2020.
  4. Bill Thompson, Is Google good for you? BBC News, December 19, 2003. Retrieved March 27, 2020.
  5. Zoltan Gyongyi and Hector Garcia-Molina, Link Spam Alliances. Proceedings of the 31st VLDB Conference, Trondheim, Norway. 2005. Retrieved March 27, 2020.
  6. Saul Hansell, Google Keeps Tweaking Its Search Engine. The New York Times, June 3, 2007. Retrieved March 27, 2020.
  7. Search Engine Ranking Factors 2015 Moz. Retrieved March 27, 2020.
  8. Laurie J. Flynn, Desperately Seeking Surfers. The New York Times, November 11, 1996. Retrieved March 27, 2020.
  9. AIRWeb. Adversarial Information Retrieval on the Web. Retrieved March 27, 2020.
  10. David Kesmodel, Sites Get Dropped by Search Engines After Trying to 'Optimize' Rankings. Wall Street Journal, September 22, 2005. Retrieved March 27, 2020.
  11. Matt Cutts, Confirming a penalty. Gadgets, Google, and Seo, February 2, 2006. Retrieved March 27, 2020.
  12. Google's Webmaster Guidelines. google.com. Retrieved March 27, 2020.
  13. Yahoo! Content Quality Guidelines. help.yahoo.com. Retrieved March 27, 2020.
  14. Submitting To Directories: Yahoo & The Open Directory. Search Engine Watch, March 12, 2007. Retrieved March 27, 2020.
  15. Learn about sitemaps. google.com. Retrieved March 27, 2020.
  16. J. Cho, H. Garcia-Molina, and L. Page, Efficient crawling through URL ordering. Proceedings of the seventh conference on World Wide Web, Brisbane, Australia, April 14-18, 1998. Retrieved March 27, 2020.
  17. Danny Sullivan, Newspapers Amok! New York Times Spamming Google? LA Times Hijacking Cars.com? Search Engine Land, May 8, 2007. Retrieved March 27, 2020.
  18. Jill Whalen, Black Hat/White Hat Search Engine Optimization. Search Engine Guide, November 16, 2004.
  19. Do you need an SEO?. google.com. Retrieved March 27, 2020.
  20. Andy Hagans, High Accessibility Is Effective Search Engine Optimization. A List Apart, November 8, 2005. Retrieved March 27, 2020.
  21. Matt Cutts, Ramping up on international webspam. Gadgets, Google, and SEO, February 4, 2006. Retrieved March 27, 2020.
  22. Matt Cutts, Recent reinclusions. Gadgets, Google, and SEO, February 7, 2006. Retrieved March 27, 2020.
  23. Melissa Burdon, The Battle Between Search Engine Optimization and Conversion: Who Wins?. H2 Desk, March 31, 2007. Retrieved March 27, 2020.
  24. Andy Greenberg, Condemned To Google Hell. Forbes, April 30, 2007. Retrieved March 27, 2020.
  25. Jakob Nielsen, Search Engines as Leeches on the Web. Nielsen Norman Group, January 9, 2006. Retrieved March 27, 2020.
  26. SEOmoz: Best SEO Blog of 2006. Search Engine Journal, January 3, 2007. Retrieved March 27, 2020.
  27. Jefferson Graham, The search engine that could. USA Today, August 26, 2003. Retrieved March 27, 2020.
  28. Search Engine Market Share Worldwide Stat Counter, Global Stats. Retrieved March 27, 2020.
  29. Martin Samson, Search King, Inc. v. Google Technology, Inc. Internet Library of Law and Court Decisions. Retrieved March 27, 2020.
  30. Eric Goldman, KinderStart v. Google Dismissed–With Sanctions Against KinderStart’s Counsel Technology & Marketing Law Blog, March 20, 2007. Retrieved March 27, 2020.

References

  • Grappone, Jennifer, and Gradiva Couzin. Search Engine Optimization: An Hour a Day. San Francisco, CA: Sybex, 2006. ISBN 978-0471787532
  • Konia, Brad S. Search Engine Optimization with WebPosition Gold 2. Wordware web programming/development library. Plano, TX: Wordware Pub, 2002. ISBN 978-0585428475
  • Ledford, Jerri L. SEO: Search Engine Optimization Bible. Hoboken, NJ: Wiley, 2008. ISBN 978-0470175002
  • Pfanner, Eric. New to Russia, Google Struggles to Find Its Footing The New York Times, December 18, 2006. Retrieved March 27, 2020.
  • Potts, Kevin. Web Design and Marketing Solutions for Business Websites. Berkeley, CA: Friends of Ed, 2007. ISBN 978-1590598399
  • Siskind, Gregory H., Deborah McMurray, and Richard P. Klau. The Lawyer's Guide to Marketing on the Internet. Chicago, IL: American Bar Association, 2007. ISBN 978-1590318768

External links

All links retrieved January 25, 2023.

Credits

New World Encyclopedia writers and editors rewrote and completed the Wikipedia article in accordance with New World Encyclopedia standards. This article abides by terms of the Creative Commons CC-by-sa 3.0 License (CC-by-sa), which may be used and disseminated with proper attribution. Credit is due under the terms of this license that can reference both the New World Encyclopedia contributors and the selfless volunteer contributors of the Wikimedia Foundation.

Note: Some restrictions may apply to use of individual images which are separately licensed.