Difference between revisions of "Citation index" - New World Encyclopedia

From New World Encyclopedia
 
(16 intermediate revisions by 4 users not shown)
Line 1: Line 1:
A '''citation index''' is an index of [[citation]]s between publications, allowing the user to easily establish which later documents cite which earlier documents.  
+
{{Submitted}} [[category:image wanted]]{{approved}}{{copyedited}}
 +
A '''citation index''' is an index of [[citation]]s between publications, allowing the user to easily discern which later documents cite which earlier documents.  
  
The first citation indices were legal [[citator]]s such as [[Shepard's Citations]] (1873). In 1960, [[Eugene Garfield]]'s [[Institute for Scientific Information]] (ISI) introduced the first citation index for papers published in [[academic journal]]s, starting with the ''[[Science Citation Index]]'' (SCI), and later expanding to produce the ''[[Social Sciences Citation Index]]'' (SSCI) and the ''[[Arts and Humanities Citation Index]]'' (AHCI). As of 2006, there are other sources of such data, such as [[Google Scholar]].
+
The first citation indices were legal [[citator]]s such as ''[[Shepard's Citations]]'' (1873). In 1960, [[Eugene Garfield]]'s [[Institute for Scientific Information]] (ISI) introduced the first citation index for papers published in [[academic journal]]s, starting with the ''[[Science Citation Index]]'' (SCI), and later expanding to produce the ''[[Social Sciences Citation Index]]'' (SSCI) and the ''[[Arts and Humanities Citation Index]]'' (AHCI). As of 2006, there are other sources of such data, such as [[Google Scholar]].
 +
{{toc}}
 +
Citations are used as a measure of importance or relative value of the information source, such as an individual [[journal]] article, [[book]], and others. For example, if an article is frequently cited by other journal articles and books in the discipline, it may indicate the relative importance of a work. Citation impact analysis is called bibliometrics in [[library and information science]] and has a wide range of applications.
 +
 
 +
==Citation analysis==
 +
A '''citation''' is the act of acknowledging or citing the author, year, title, and locus of publication (journal, book, or other) of a source used in a published work. Such [[citations]] can be counted as measures of the [[usage]] and [[impact]] of the cited work. This is called [[citation analysis]] or [[bibliometrics]] (see below). Among the measures that have emerged from citation analysis are the citation count for
 +
* An individual article (how often it was cited)
 +
* An author (total citations, or average citation count per article)
 +
* For a journal ([[journal]] [[impact factor]], or the average citation count for the articles in the journal)
 +
 
 +
Citation counts are [http://www.ariadne.ac.uk/issue35/harnad/ correlated]<ref>Stevan Harnad, Les Carr, Tim Brody, and Charles Oppenheim, [http://www.ariadne.ac.uk/issue35/harnad/ Mandated online RAE CVs linked to University eprint archives: Enhancing UK research impact and assessment,] Ariadne Issue 35. April 30 2003. Retrieved November 17, 2008.</ref> with other measures of scholarly/scientific performance and impact and can in some cases be enhanced by making a work [[open access]] by [[self-archiving]] the complete article on the web, publishing it in an [[open access journal]], or publishing it as an Open access article in one of the [[Hybrid open access journal]]s.
 +
 
 +
There also exists an [[H-index]] measure of an individual scientist's impact and citation record.
  
 
==Major current citation indexing services==
 
==Major current citation indexing services==
 
There are two publishers of general-purpose academic citation indexes, available to libraries by subscription:
 
There are two publishers of general-purpose academic citation indexes, available to libraries by subscription:
*ISI is now part of [[Thomson Scientific & Healthcare|Thomson Scientific]]. Though the ISI citation indexes are still published in print and [[Compact Disc|compact disc]], they are now generally accessed through the Web under the name '' [[Web of Science]]'', which is in turn part of the group of databases in ''[[Web of Knowledge|WoK]].''   
+
*ISI is now part of [[Thomson Scientific & Healthcare|Thomson Scientific]]. Though the ISI citation indexes are still published in print and [[Compact Disc|compact disc]], they are now generally accessed through the Web under the name ''[[Web of Science]],'' which is in turn part of the group of databases in ''[[Web of Knowledge|WoK]].''   
 
*[[Elsevier]] publishes [[Scopus]], available online only, which similarly combines subject searching with citation browsing and tracking in the sciences and [[social sciences]].  
 
*[[Elsevier]] publishes [[Scopus]], available online only, which similarly combines subject searching with citation browsing and tracking in the sciences and [[social sciences]].  
  
 
There are a number of other indexes, more readily available. Some of the currently notable ones are:
 
There are a number of other indexes, more readily available. Some of the currently notable ones are:
*The [[CiteSeer]] system provides citation and other searching of scientific literature, primarily in the fields of computer and information science<ref>[http://citeseerx.ist.psu.edu/about/site About CiteSeerX] on [[CiteSeerX]] website. Retrieved on November 15, 2008.</ref>
+
*The [[CiteSeer]] system provides citation and other searching of scientific literature, primarily in the fields of computer and information science.<ref>CiteSeerX, [http://citeseerx.ist.psu.edu/about/site About CiteSeerX.] Retrieved on November 15, 2008.</ref>
 
*[[RePec]] provides this in economics, and other discipline-specific indexes have also begun to include it in their indexes. Even journal publishers often supply the facility to link to late citations, at least from the journals they publish.  
 
*[[RePec]] provides this in economics, and other discipline-specific indexes have also begun to include it in their indexes. Even journal publishers often supply the facility to link to late citations, at least from the journals they publish.  
*[[Google Scholar]] (GS) has citation functionality, limited to the recent articles that are included. There is already discussion about the possibility that GS may in the future have sufficient capabilities to make the commercial products unnecessary.
+
*[[Google Scholar]] (GS) has citation functionality, limited to the recent articles that are included. There is already discussion about the possibility that GS may in the future have sufficient capabilities to make the commercial products unnecessary.
  
 
Each of these products offer an index of citations between publications and a mechanism to establish which documents cite which other documents. The different products offer different ways to access the citation list and also display their citation index differently.  
 
Each of these products offer an index of citations between publications and a mechanism to establish which documents cite which other documents. The different products offer different ways to access the citation list and also display their citation index differently.  
Line 18: Line 31:
 
==Citation analysis==
 
==Citation analysis==
 
While citation indexes were originally designed for [[information retrieval]] purposes, they are increasingly used for [[bibliometrics]] and other studies involving research evaluation. Citation data is also the basis of the popular [[journal impact factor]].
 
While citation indexes were originally designed for [[information retrieval]] purposes, they are increasingly used for [[bibliometrics]] and other studies involving research evaluation. Citation data is also the basis of the popular [[journal impact factor]].
There is large body of literature on citation analysis, sometimes called [[scientometrics]], a term invented by [[Vasily Nalimov]], or more specifically [[bibliometrics]]. The field blossomed with the advent of the [[Science Citation Index]], which now covers source literature from 1900 on. The leading journals of the field are ''Scientometrics'' and the ''[[Journal of the American Society of Information Science and Technology]]''. [[ASIST]] also hosts an [[electronic mailing list]] called SIGMETRICS at  ASIST<ref>{{cite web | title=The American Society for Information Science & Technology | work=The Information Society for the Information Age | url=http://www.asis.org| =2006-05-21}} Retrieved November 17, 2008. </ref>. This method is undergoing a resurgence based on the wide dissemination of the Web of Science and Scopus subscription databases in many universities, and the universally-available free citation tools such as  [[CiteBase]], [[CiteSeer]], [[Google Scholar]], and [[Windows Live Academic]].
+
There is large body of literature on citation analysis, sometimes called [[scientometrics]], a term invented by [[Vasily Nalimov]], or more specifically [[bibliometrics]]. The field blossomed with the advent of the [[Science Citation Index]], which now covers source literature from 1900 on. The leading journals of the field are ''Scientometrics'' and the ''[[Journal of the American Society of Information Science and Technology]]''. [[ASIST]] also hosts an [[electronic mailing list]] called SIGMETRICS at  ASIST.<ref>The American Society for Information Science & Technology, [http://www.asis.org The Information Society for the Information Age.] Retrieved November 17, 2008. </ref>. This method is undergoing a resurgence based on the wide dissemination of the Web of Science and Scopus subscription databases in many universities, and the universally-available free citation tools such as  [[CiteBase]], [[CiteSeer]], [[Google Scholar]], and [[Windows Live Academic]].
  
 
===Bibliometrics===
 
===Bibliometrics===
'''Bibliometrics''' is a set of methods used to study or measure texts and information. [[Citation analysis]] and [[content analysis]] are commonly used bibliometric methods. While bibliometric methods are most often used in the field of [[library and information science]], bibliometrics have wide applications in other areas. In fact, many research fields use bibliometric methods to explore the impact of their field, the impact of a set of researchers, or the impact of a particular paper.
+
'''Bibliometrics''' is a set of methods used to study or measure texts and information. [[Citation analysis]] and [[content analysis]] are commonly used bibliometric methods. While bibliometric methods are most often used in the field of [[library and information science]], bibliometrics have wide applications in other areas. In fact, many research fields use bibliometric methods to explore the impact of their field, a set of researchers, or a particular paper.
  
 
Historically bibliometric methods have been used to trace relationships amongst academic journal citations. Citation analysis, which involves examining an item's referring documents, is used in searching for materials and analyzing their merit. [[Citation index|Citation indices]], such as [[Institute for Scientific Information]]'s [[Web of Science]], allow users to search forward in time from a known article to more recent publications which cite the known item.
 
Historically bibliometric methods have been used to trace relationships amongst academic journal citations. Citation analysis, which involves examining an item's referring documents, is used in searching for materials and analyzing their merit. [[Citation index|Citation indices]], such as [[Institute for Scientific Information]]'s [[Web of Science]], allow users to search forward in time from a known article to more recent publications which cite the known item.
  
Data from citation indexes can be analyzed to determine the popularity and impact of specific articles, authors, and publications. Using citation analysis to gauge the importance of one's work, for example, is a significant part of the [[tenure]] review process. Information scientists also use citation analysis to quantitatively assess the core journal titles and watershed publications in particular disciplines; interrelationships between authors from different institutions and schools of thought; and related data about the sociology of academia. Some more pragmatic applications of this information includes the planning of retrospective bibliographies, "giving some indication both of the age of material used in a discipline, and of the extent to which more recent publications supersede the older ones;" indicating through high frequency of citation which documents should be archived; comparing the coverage of secondary services which can help publishers gauge their achievements and competition, and can aid librarians in evaluating "the effectiveness of their stock"<ref>Nicholas, David and Maureen Ritchie. ''Literature and Bibliometrics'' London: Clive Bingley: 1978. (12-28).</ref> There are also some limitations to the value of citation data. They are often incomplete or biased; data has been largely collected by hand (which is expensive), though citation indexes can also be used; incorrect citing of sources occurs continually; thus, further investigation is required to truly understand the rationale behind citing to allow it to be confidently applied<ref>Nicholas, David and Maureen Ritchie. ''Literature and Bibliometrics'' London: Clive Bingley: 1978. (28-29).</ref>
+
Data from citation indexes can be analyzed to determine the popularity and impact of specific articles, authors, and publications. Using citation analysis to gauge the importance of one's work, for example, is a significant part of the [[tenure]] review process. Information scientists also use citation analysis to quantitatively assess the core journal titles and watershed publications in particular disciplines; interrelationships between authors from different institutions and schools of thought; and related data about the sociology of academia. Some more pragmatic applications of this information includes the planning of retrospective bibliographies, "giving some indication both of the age of material used in a discipline, and of the extent to which more recent publications supersede the older ones;" indicating through high frequency of citation which documents should be archived; comparing the coverage of secondary services which can help publishers gauge their achievements and competition, and can aid librarians in evaluating "the effectiveness of their stock."<ref>David Nicholas and Maureen Ritchie, ''Literature and Bibliometrics'' (London: Clive Bingley, 1978), 12-28.</ref> There are also some limitations to the value of citation data. They are often incomplete or biased; data has been largely collected by hand (which is expensive), though citation indexes can also be used; incorrect citing of sources occurs continually; thus, further investigation is required to truly understand the rationale behind citing to allow it to be confidently applied.<ref>Nicholas, David and Maureen Ritchie. ''Literature and Bibliometrics'' London: Clive Bingley: 1978. (28-29).</ref>
  
 
Although citation analysis is nothing new (the [[Science Citation Index]] began publication in 1961), it was all done manually and thus really couldn't scale. Automated algorithms are making it much more useful, versatile, and widespread. Google's [[PageRank]] is based on the principle of  citation analysis. Patent citation maps are also based upon citation analysis (in this case, the citation of one patent by another).
 
Although citation analysis is nothing new (the [[Science Citation Index]] began publication in 1961), it was all done manually and thus really couldn't scale. Automated algorithms are making it much more useful, versatile, and widespread. Google's [[PageRank]] is based on the principle of  citation analysis. Patent citation maps are also based upon citation analysis (in this case, the citation of one patent by another).
Line 31: Line 44:
 
Other bibliometrics applications include: creating thesauri; measuring term frequencies; exploring [[grammar|grammatical]] and [[syntax|syntactical]] structures of texts; measuring usage by readers.
 
Other bibliometrics applications include: creating thesauri; measuring term frequencies; exploring [[grammar|grammatical]] and [[syntax|syntactical]] structures of texts; measuring usage by readers.
  
In 2003 [[Charles Murray (author)|Charles Murray]] published the results of a vast bibliometric study supposed to reveal the 'significant figures' in the arts and sciences. Some 4002 people are ranked in his lists compiled for 12 domains (8 scientific disciplines, literature, philosophy, arts).
+
In 2003 [[Charles Murray (author)|Charles Murray]] published the results of a vast bibliometric study supposed to reveal the "significant figures" in the arts and sciences. Some 4002 people are ranked in his lists compiled for 12 domains (8 scientific disciplines, literature, philosophy, arts).
  
 
==History==
 
==History==
In a classic 1965 paper, [[Derek J. de Solla Price]] described the inherent linking characteristic of the SCI as "Networks of Scientific Papers" <ref>{{cite journal | author=Derek J. de Solla Price | title=Networks of Scientific Papers | journal=[[Science (journal)|SCIENCE]] | date=July 30, 1965 | volume=149 | issue=3683| pages=510&ndash;515 | url=http://garfield.library.upenn.edu/papers/pricenetworks1965.pdf | pmid=14325149 | doi=10.1126/science.149.3683.510|format=PDF}} Retrieved November 17, 2008. </ref>. The links between citing and cited papers became dynamic when the SCI began to be published online. The [[Social Sciences Citation Index]] became one of the first databases to be mounted on the [[Dialog]] system <ref>[http://www.dialog.com/about/ Dialog and the invention of online information services],  Dialog. Retrieved November 17, 2008.</ref> in 1972. With the advent of the [[CD-ROM]] edition, linking became even easier and enabled the use of [[bibliographic coupling]] (M. M. Kessler) for finding related records. In 1973 [[Henry Small]] published his classic work on [[Co-Citation analysis]] which became a [[Self-organization|self-organizing]] classification system that led to [[document clustering]] experiments and eventually an [[Atlas of Science]] later called [[Research Reviews]].
+
In a classic 1965 paper, [[Derek J. de Solla Price]] described the inherent linking characteristic of the SCI as "Networks of Scientific Papers."<ref>Derek J. de Solla Price, Networks of Scientific Papers, ''SCIENCE'' 149 (3683): 510&ndash;515.</ref> The links between citing and cited papers became dynamic when the SCI began to be published online. The [[Social Sciences Citation Index]] became one of the first databases to be mounted on the [[Dialog]] system<ref>Dialog, [http://www.dialog.com/about/ Dialog and the invention of online information services.] Retrieved November 17, 2008.</ref> in 1972. With the advent of the [[CD-ROM]] edition, linking became even easier and enabled the use of [[bibliographic coupling]] (M. M. Kessler) for finding related records. In 1973, [[Henry Small]] published his classic work on [[Co-Citation analysis]] which became a [[Self-organization|self-organizing]] classification system that led to [[document clustering]] experiments and eventually an [[Atlas of Science]] later called [[Research Reviews]].
  
 
ISI also published [[Current Contents]], a paper publication reproducing journal title pages, widely used at the time for keeping up with the current literature, a technique known as [[selective dissemination of information]] (SDI), periodic updates of literature searches based on [[user profile]]s. The combination with SCI permitted the first use in 1965 of earlier cited references as a factor in the selection, in a product called Automatic Subject Citation Alert. This continues in electronic form as the ISI Personal Alert;  this feature  is now almost universally available in any bibliometric database and for most [[electronic journal]]s. In the case of SCI/SSCI profiles contained not only traditional [[natural language]] search terms, but also terms for cited references and cited authors, though this too is now a part of most such systems. Thus, a user can be alerted to any new works which cited the author, paper or book in question. Using journal names in a similar way, customized contents pages could also be provided.
 
ISI also published [[Current Contents]], a paper publication reproducing journal title pages, widely used at the time for keeping up with the current literature, a technique known as [[selective dissemination of information]] (SDI), periodic updates of literature searches based on [[user profile]]s. The combination with SCI permitted the first use in 1965 of earlier cited references as a factor in the selection, in a product called Automatic Subject Citation Alert. This continues in electronic form as the ISI Personal Alert;  this feature  is now almost universally available in any bibliometric database and for most [[electronic journal]]s. In the case of SCI/SSCI profiles contained not only traditional [[natural language]] search terms, but also terms for cited references and cited authors, though this too is now a part of most such systems. Thus, a user can be alerted to any new works which cited the author, paper or book in question. Using journal names in a similar way, customized contents pages could also be provided.
The inherent topological and graphical nature of the worldwide citation network which is an inherent property of the [[scientific literature]] was described by [[Ralph Garner]] at [[Drexel University]] in 1965.<ref>Garner, Ralph, Lois Lunin, Lois Baker. [http://www.garfield.library.upenn.edu/rgarner.pdf THREE DREXEL INFORMATION SCIENCE RESEARCH STUDIES]. Drexel Press, 1967. Retrieved November 17, 2008.</ref> The use of [[citation count]]s to rank journals was a technique used in the early part of the nineteenth century but the systematic ongoing measurement of ths counts for scientific journals was initiated by Eugene Garfield at the Institute for Scientific Information who also pioneered the use of these counts  to rank [[journal author|authors]] and [[academic paper|papers]]. In a landmark paper of 1965 he and [[Irving Sher]] showed the correlation between citation frequency and eminence in demonstrating that [[Nobel Prize]] winners published five times the average number of papers while their work was cited 30 to 50 times the average. In a long series of essays on the Nobel and other prizes Garfield reported this phenomenon. The usual summary measure is known as [[impact factor]], the number of citations to a journal for the previous two years, divided by the number of articles published in those years. It is widely used, both for appropriate and inappropriate purposes—in particular, the use of this measure alone for ranking authors and papers is therefore [[Impact factor#|quite controversial.]]
+
The inherent topological and graphical nature of the worldwide citation network which is an inherent property of the [[scientific literature]] was described by [[Ralph Garner]] at [[Drexel University]] in 1965.<ref>Ralph Garner, Lois Lunin, Lois Baker, [http://www.garfield.library.upenn.edu/rgarner.pdf THREE DREXEL INFORMATION SCIENCE RESEARCH STUDIES,] Drexel Press, 1967. Retrieved November 17, 2008.</ref> The use of [[citation count]]s to rank journals was a technique used in the early part of the nineteenth century but the systematic ongoing measurement of ths counts for scientific journals was initiated by Eugene Garfield at the Institute for Scientific Information who also pioneered the use of these counts  to rank [[journal author|authors]] and [[academic paper|papers]]. In a landmark paper of 1965, he and [[Irving Sher]] showed the correlation between citation frequency and eminence in demonstrating that [[Nobel Prize]] winners published five times the average number of papers while their work was cited 30 to 50 times the average. In a long series of essays on the Nobel and other prizes Garfield reported this phenomenon. The usual summary measure is known as [[impact factor]], the number of citations to a journal for the previous two years, divided by the number of articles published in those years. It is widely used, both for appropriate and inappropriate purposes—in particular, the use of this measure alone for ranking authors and papers is [[Impact factor#|quite controversial.]]
  
In an early study in 1964 of the use of Citation Analysis in writing the history of [[DNA]], Garfield and Sher demonstrated the potential for generating [[historiograph]]s, [[topological map]]s of the most important steps in the history of scientific topics. This work was later automated by E. Garfield, [[A. I. Pudovkin]] of the [[Institute of Marine Biology]], [[Russian Academy of Sciences]] and [[V. S. Istomin]] of [[Center for Teaching, Learning, and Technology]], [[Washington State University]] and led to the creation of the HistCite<ref>Garfield, Eugene, A. I. Pudovkin, V. S. Istomin. 2002. [http://www.garfield.library.upenn.edu/papers/asis2002/asis2002presentation.html Algorithmic Citation-Linked Historiography—Mapping the Literature of Science]. Presented the ASIS&T 2002: Information, Connections and Community. 65th Annual Meeting of ASIST in Philadelphia, PA. November 18-21, 2002. Retrieved November 17, 2008.</ref> software around 2002.
+
In an early study in 1964 of the use of Citation Analysis in writing the history of [[DNA]], Garfield and Sher demonstrated the potential for generating [[historiograph]]s, [[topological map]]s of the most important steps in the history of scientific topics. This work was later automated by E. Garfield, [[A.I. Pudovkin]] of the [[Institute of Marine Biology]], [[Russian Academy of Sciences]] and [[V. S. Istomin]] of [[Center for Teaching, Learning, and Technology]], [[Washington State University]] and led to the creation of the HistCite<ref>Eugene Garfield, A. I. Pudovkin, V. S. Istomin, [http://www.garfield.library.upenn.edu/papers/asis2002/asis2002presentation.html Algorithmic Citation-Linked Historiography—Mapping the Literature of Science,] Presented the ASIS&T 2002: Information, Connections and Community. 65th Annual Meeting of ASIST in Philadelphia, PA. November 18-21, 2002. Retrieved November 17, 2008.</ref> software around 2002.
Autonomous citation indexing was introduced in 1998 by Giles, Lawrence and Bollacker and enabled automatic algorithmic extraction and grouping of citations for any digital academic and scientific document. Where previous citation extraction was a manual process, citation measures could now be computed for any scholarly and scientific field and document venue, not just those selected by organizations such as ISI. This led to the creation of new systems for public and automated citation indexing, the first being [[CiteSeer]], soon followed by Cora (recently reborn as [http://rexa.info Rexa]), which focused primarily on the field of computer and [[information science]]. These were later followed by large scale academic domain citation systems such as the Google Scholar and more recently Microsoft Academic. Such autonomous citation indexing is not yet perfect in citation extraction or citation clustering with an error rate estimated by some at 10%. This has resulted in such authors as [[Ann Arbor, Michigan|Ann Arbor]], [[Milton Keynes]], and [[Walton Hall, Milton Keynes|Walton Hall]] being credited with extensive academic output.<ref>Postellon DC. 2008. "Hall and Keynes Join Arbor in the Citation Indexes". ''Nature''. 452, no. 7185.</ref> It should be noted that SCI has also been prepared through purely programmatic methods, and the older record particularly have a similar magnitude of error.
+
Autonomous citation indexing was introduced in 1998 by Giles, Lawrence and Bollacker and enabled automatic algorithmic extraction and grouping of citations for any digital academic and scientific document. Where previous citation extraction was a manual process, citation measures could now be computed for any scholarly and scientific field and document venue, not just those selected by organizations such as ISI. This led to the creation of new systems for public and automated citation indexing, the first being [[CiteSeer]], soon followed by Cora (recently reborn as [http://rexa.info Rexa]), which focused primarily on the field of computer and [[information science]]. These were later followed by large scale academic domain citation systems such as the Google Scholar and more recently Microsoft Academic. Such autonomous citation indexing is not yet perfect in citation extraction or citation clustering with an error rate estimated by some at 10%. This has resulted in such authors as [[Ann Arbor, Michigan|Ann Arbor]], [[Milton Keynes]], and [[Walton Hall, Milton Keynes|Walton Hall]] being credited with extensive academic output.<ref>DC Postellon, "Hall and Keynes Join Arbor in the Citation Indexes," ''Nature'' 452 (7185).</ref> It should be noted that SCI has also been prepared through purely programmatic methods, and the older record particularly have a similar magnitude of error.
  
 
==See also==
 
==See also==
* [[Index (publishing)]]
+
*[[Index (publishing)]]
* [[Citation]]
+
*[[Impact factor]]
 +
*[[Citation]]
  
 
==Notes==
 
==Notes==
{{Reflist}}
+
<references/>
 +
 
 
==References==
 
==References==
 
+
*The American Society for Information Science & Technology. [http://www.asis.org/ The Information Society for the Information Age.] Retrieved November 17, 2008.
*[http://citeseerx.ist.psu.edu/about/site CiteSeerX], CiteSeerX website. Retrieved on November 15, 2008.  
+
*CiteSeerX, [http://citeseerx.ist.psu.edu/about/site CiteSeerX.] Retrieved on November 15, 2008.  
*Derek J. de Solla Price (July 30, 1965). [http://garfield.library.upenn.edu/papers/pricenetworks1965.pdf Networks of Scientific Papers]. SCIENCE 149 (3683): 510–515. Retrieved November 17, 2008.
+
*Derek J. de Solla Price. [http://garfield.library.upenn.edu/papers/pricenetworks1965.pdf Networks of Scientific Papers.] SCIENCE 149 (3683): 510–515. Retrieved November 17, 2008.
*[http://www.dialog.com/about/ Dialog and the invention of online information services],  Dialog. Retrieved November 17, 2008.
+
*Dialog. [http://www.dialog.com/about/ Dialog and the invention of online information services.] Retrieved November 17, 2008.
* Garfield, Eugene, A. I. Pudovkin, V. S. Istomin. 2002. [http://www.garfield.library.upenn.edu/papers/asis2002/asis2002presentation.html Algorithmic Citation-Linked Historiography—Mapping the Literature of Science]. Presented the ASIS&T 2002: Information, Connections and Community. 65th Annual Meeting of ASIST in Philadelphia, PA. November 18-21, 2002. Retrieved November 17, 2008.
+
*Garfield, Eugene, A. I. Pudovkin, V. S. Istomin. 2002. [http://www.garfield.library.upenn.edu/papers/asis2002/asis2002presentation.html Algorithmic Citation-Linked Historiography—Mapping the Literature of Science.] Presented the ASIS&T 2002: Information, Connections and Community. 65th Annual Meeting of ASIST in Philadelphia, PA. November 18-21, 2002. Retrieved November 17, 2008.
*Garner, Ralph, Lois Lunin, Lois Baker. [http://www.garfield.library.upenn.edu/rgarner.pdf THREE DREXEL INFORMATION SCIENCE RESEARCH STUDIES]. Drexel Press, 1967. Retrieved November 17, 2008.
+
*Garner, Ralph, Lois Lunin, Lois Baker. [http://www.garfield.library.upenn.edu/rgarner.pdf THREE DREXEL INFORMATION SCIENCE RESEARCH STUDIES.] Drexel Press, 1967. Retrieved November 17, 2008.
*Murray, Charles A. ''Human Accomplishment: The Pursuit of Excellence in the Arts and Sciences, 800 B.C.E. to 1950''. New York: HarperCollins, 2003. ISBN 9780060192471
+
*Harnad, Stevan, Les Carr, Tim Brody, and Charles Oppenheim. [http://www.ariadne.ac.uk/issue35/harnad/ Mandated online RAE CVs linked to University eprint archives: Enhancing UK research impact and assessment.] ''Ariadne'' Issue 35 . April 30 2003. Retrieved November 17, 2008.
 +
*Murray, Charles A. ''Human Accomplishment: The Pursuit of Excellence in the Arts and Sciences, 800 B.C.E..E. to 1950''. New York: HarperCollins, 2003. ISBN 9780060192471.
 
*Nicholas, David and Maureen Ritchie. ''Literature and Bibliometrics'' London: Clive Bingley: 1978.  
 
*Nicholas, David and Maureen Ritchie. ''Literature and Bibliometrics'' London: Clive Bingley: 1978.  
*Postellon DC. 2008. "Hall and Keynes Join Arbor in the Citation Indexes". ''Nature''. 452, no. 7185.
+
*Postellon DC. 2008. "Hall and Keynes Join Arbor in the Citation Indexes." ''Nature'' 452 (7185).  
*The American Society for Information Science & Technology. [http://www.asis.org/ The Information Society for the Information Age]. Retrieved November 17, 2008.
 
  
 
==External links ==
 
==External links ==
All links retrieved November 17, 2008.
+
All links retrieved December 10, 2023.
* [http://www.isinet.com ISI website].
+
 
* [http://scholar.google.com Google Scholar] - an academic document [[search engine]] from Google
 
* [http://academic.live.com Windows Live Academic] - an academic document search engine from Microsoft
 
* [http://www.scopus.com Scopus] - an academic document [[bibliographic database]] from Elsevier.
 
* [http://citeseer.ist.psu.edu CiteSeer] - autonomous citation indexing for computer and information science.
 
* [http://www.librijournal.org/2005-4toc.html Google Scholar: The New Generation of Citation Indexes]
 
* [http://www.atlasofscience.net/ Atlas of Science: Mapping Science by means of citation relations]
 
 
* [http://www.dlib.org/dlib/september05/bauer/09bauer.html An Examination of Citation Counts in a New Scholarly Communication Environment]
 
* [http://www.dlib.org/dlib/september05/bauer/09bauer.html An Examination of Citation Counts in a New Scholarly Communication Environment]
* [http://www.harzing.com/pop.htm Publish or Perish] calculates various statistics, including the h-index and the [[g-index]] using [[Google Scholar]] data
+
* [http://www.harzing.com/pop.htm Publish or Perish] calculates various statistics, including the h-index and the g-index using Google Scholar data
  
 
[[Category:Politics and social sciences]]
 
[[Category:Politics and social sciences]]
 
[[Category:library and information science]]
 
[[Category:library and information science]]
  
{{credits|Citation_index|250430633|Bibliometrics|246113016}}
+
{{credits|Citation_index|250430633|Bibliometrics|246113016|Citation_impact|237029791}}

Latest revision as of 22:25, 10 December 2023

A citation index is an index of citations between publications, allowing the user to easily discern which later documents cite which earlier documents.

The first citation indices were legal citators such as Shepard's Citations (1873). In 1960, Eugene Garfield's Institute for Scientific Information (ISI) introduced the first citation index for papers published in academic journals, starting with the Science Citation Index (SCI), and later expanding to produce the Social Sciences Citation Index (SSCI) and the Arts and Humanities Citation Index (AHCI). As of 2006, there are other sources of such data, such as Google Scholar.

Citations are used as a measure of importance or relative value of the information source, such as an individual journal article, book, and others. For example, if an article is frequently cited by other journal articles and books in the discipline, it may indicate the relative importance of a work. Citation impact analysis is called bibliometrics in library and information science and has a wide range of applications.

Citation analysis

A citation is the act of acknowledging or citing the author, year, title, and locus of publication (journal, book, or other) of a source used in a published work. Such citations can be counted as measures of the usage and impact of the cited work. This is called citation analysis or bibliometrics (see below). Among the measures that have emerged from citation analysis are the citation count for

  • An individual article (how often it was cited)
  • An author (total citations, or average citation count per article)
  • For a journal (journal impact factor, or the average citation count for the articles in the journal)

Citation counts are correlated[1] with other measures of scholarly/scientific performance and impact and can in some cases be enhanced by making a work open access by self-archiving the complete article on the web, publishing it in an open access journal, or publishing it as an Open access article in one of the Hybrid open access journals.

There also exists an H-index measure of an individual scientist's impact and citation record.

Major current citation indexing services

There are two publishers of general-purpose academic citation indexes, available to libraries by subscription:

  • ISI is now part of Thomson Scientific. Though the ISI citation indexes are still published in print and compact disc, they are now generally accessed through the Web under the name Web of Science, which is in turn part of the group of databases in WoK.
  • Elsevier publishes Scopus, available online only, which similarly combines subject searching with citation browsing and tracking in the sciences and social sciences.

There are a number of other indexes, more readily available. Some of the currently notable ones are:

  • The CiteSeer system provides citation and other searching of scientific literature, primarily in the fields of computer and information science.[2]
  • RePec provides this in economics, and other discipline-specific indexes have also begun to include it in their indexes. Even journal publishers often supply the facility to link to late citations, at least from the journals they publish.
  • Google Scholar (GS) has citation functionality, limited to the recent articles that are included. There is already discussion about the possibility that GS may in the future have sufficient capabilities to make the commercial products unnecessary.

Each of these products offer an index of citations between publications and a mechanism to establish which documents cite which other documents. The different products offer different ways to access the citation list and also display their citation index differently. They differ widely in cost: WOK and Scopus are among the highest-cost subscription databases; the others mentioned are free.

Citation analysis

While citation indexes were originally designed for information retrieval purposes, they are increasingly used for bibliometrics and other studies involving research evaluation. Citation data is also the basis of the popular journal impact factor. There is large body of literature on citation analysis, sometimes called scientometrics, a term invented by Vasily Nalimov, or more specifically bibliometrics. The field blossomed with the advent of the Science Citation Index, which now covers source literature from 1900 on. The leading journals of the field are Scientometrics and the Journal of the American Society of Information Science and Technology. ASIST also hosts an electronic mailing list called SIGMETRICS at ASIST.[3]. This method is undergoing a resurgence based on the wide dissemination of the Web of Science and Scopus subscription databases in many universities, and the universally-available free citation tools such as CiteBase, CiteSeer, Google Scholar, and Windows Live Academic.

Bibliometrics

Bibliometrics is a set of methods used to study or measure texts and information. Citation analysis and content analysis are commonly used bibliometric methods. While bibliometric methods are most often used in the field of library and information science, bibliometrics have wide applications in other areas. In fact, many research fields use bibliometric methods to explore the impact of their field, a set of researchers, or a particular paper.

Historically bibliometric methods have been used to trace relationships amongst academic journal citations. Citation analysis, which involves examining an item's referring documents, is used in searching for materials and analyzing their merit. Citation indices, such as Institute for Scientific Information's Web of Science, allow users to search forward in time from a known article to more recent publications which cite the known item.

Data from citation indexes can be analyzed to determine the popularity and impact of specific articles, authors, and publications. Using citation analysis to gauge the importance of one's work, for example, is a significant part of the tenure review process. Information scientists also use citation analysis to quantitatively assess the core journal titles and watershed publications in particular disciplines; interrelationships between authors from different institutions and schools of thought; and related data about the sociology of academia. Some more pragmatic applications of this information includes the planning of retrospective bibliographies, "giving some indication both of the age of material used in a discipline, and of the extent to which more recent publications supersede the older ones;" indicating through high frequency of citation which documents should be archived; comparing the coverage of secondary services which can help publishers gauge their achievements and competition, and can aid librarians in evaluating "the effectiveness of their stock."[4] There are also some limitations to the value of citation data. They are often incomplete or biased; data has been largely collected by hand (which is expensive), though citation indexes can also be used; incorrect citing of sources occurs continually; thus, further investigation is required to truly understand the rationale behind citing to allow it to be confidently applied.[5]

Although citation analysis is nothing new (the Science Citation Index began publication in 1961), it was all done manually and thus really couldn't scale. Automated algorithms are making it much more useful, versatile, and widespread. Google's PageRank is based on the principle of citation analysis. Patent citation maps are also based upon citation analysis (in this case, the citation of one patent by another).

Other bibliometrics applications include: creating thesauri; measuring term frequencies; exploring grammatical and syntactical structures of texts; measuring usage by readers.

In 2003 Charles Murray published the results of a vast bibliometric study supposed to reveal the "significant figures" in the arts and sciences. Some 4002 people are ranked in his lists compiled for 12 domains (8 scientific disciplines, literature, philosophy, arts).

History

In a classic 1965 paper, Derek J. de Solla Price described the inherent linking characteristic of the SCI as "Networks of Scientific Papers."[6] The links between citing and cited papers became dynamic when the SCI began to be published online. The Social Sciences Citation Index became one of the first databases to be mounted on the Dialog system[7] in 1972. With the advent of the CD-ROM edition, linking became even easier and enabled the use of bibliographic coupling (M. M. Kessler) for finding related records. In 1973, Henry Small published his classic work on Co-Citation analysis which became a self-organizing classification system that led to document clustering experiments and eventually an Atlas of Science later called Research Reviews.

ISI also published Current Contents, a paper publication reproducing journal title pages, widely used at the time for keeping up with the current literature, a technique known as selective dissemination of information (SDI), periodic updates of literature searches based on user profiles. The combination with SCI permitted the first use in 1965 of earlier cited references as a factor in the selection, in a product called Automatic Subject Citation Alert. This continues in electronic form as the ISI Personal Alert; this feature is now almost universally available in any bibliometric database and for most electronic journals. In the case of SCI/SSCI profiles contained not only traditional natural language search terms, but also terms for cited references and cited authors, though this too is now a part of most such systems. Thus, a user can be alerted to any new works which cited the author, paper or book in question. Using journal names in a similar way, customized contents pages could also be provided. The inherent topological and graphical nature of the worldwide citation network which is an inherent property of the scientific literature was described by Ralph Garner at Drexel University in 1965.[8] The use of citation counts to rank journals was a technique used in the early part of the nineteenth century but the systematic ongoing measurement of ths counts for scientific journals was initiated by Eugene Garfield at the Institute for Scientific Information who also pioneered the use of these counts to rank authors and papers. In a landmark paper of 1965, he and Irving Sher showed the correlation between citation frequency and eminence in demonstrating that Nobel Prize winners published five times the average number of papers while their work was cited 30 to 50 times the average. In a long series of essays on the Nobel and other prizes Garfield reported this phenomenon. The usual summary measure is known as impact factor, the number of citations to a journal for the previous two years, divided by the number of articles published in those years. It is widely used, both for appropriate and inappropriate purposes—in particular, the use of this measure alone for ranking authors and papers is quite controversial.

In an early study in 1964 of the use of Citation Analysis in writing the history of DNA, Garfield and Sher demonstrated the potential for generating historiographs, topological maps of the most important steps in the history of scientific topics. This work was later automated by E. Garfield, A.I. Pudovkin of the Institute of Marine Biology, Russian Academy of Sciences and V. S. Istomin of Center for Teaching, Learning, and Technology, Washington State University and led to the creation of the HistCite[9] software around 2002. Autonomous citation indexing was introduced in 1998 by Giles, Lawrence and Bollacker and enabled automatic algorithmic extraction and grouping of citations for any digital academic and scientific document. Where previous citation extraction was a manual process, citation measures could now be computed for any scholarly and scientific field and document venue, not just those selected by organizations such as ISI. This led to the creation of new systems for public and automated citation indexing, the first being CiteSeer, soon followed by Cora (recently reborn as Rexa), which focused primarily on the field of computer and information science. These were later followed by large scale academic domain citation systems such as the Google Scholar and more recently Microsoft Academic. Such autonomous citation indexing is not yet perfect in citation extraction or citation clustering with an error rate estimated by some at 10%. This has resulted in such authors as Ann Arbor, Milton Keynes, and Walton Hall being credited with extensive academic output.[10] It should be noted that SCI has also been prepared through purely programmatic methods, and the older record particularly have a similar magnitude of error.

See also

Notes

  1. Stevan Harnad, Les Carr, Tim Brody, and Charles Oppenheim, Mandated online RAE CVs linked to University eprint archives: Enhancing UK research impact and assessment, Ariadne Issue 35. April 30 2003. Retrieved November 17, 2008.
  2. CiteSeerX, About CiteSeerX. Retrieved on November 15, 2008.
  3. The American Society for Information Science & Technology, The Information Society for the Information Age. Retrieved November 17, 2008.
  4. David Nicholas and Maureen Ritchie, Literature and Bibliometrics (London: Clive Bingley, 1978), 12-28.
  5. Nicholas, David and Maureen Ritchie. Literature and Bibliometrics London: Clive Bingley: 1978. (28-29).
  6. Derek J. de Solla Price, Networks of Scientific Papers, SCIENCE 149 (3683): 510–515.
  7. Dialog, Dialog and the invention of online information services. Retrieved November 17, 2008.
  8. Ralph Garner, Lois Lunin, Lois Baker, THREE DREXEL INFORMATION SCIENCE RESEARCH STUDIES, Drexel Press, 1967. Retrieved November 17, 2008.
  9. Eugene Garfield, A. I. Pudovkin, V. S. Istomin, Algorithmic Citation-Linked Historiography—Mapping the Literature of Science, Presented the ASIS&T 2002: Information, Connections and Community. 65th Annual Meeting of ASIST in Philadelphia, PA. November 18-21, 2002. Retrieved November 17, 2008.
  10. DC Postellon, "Hall and Keynes Join Arbor in the Citation Indexes," Nature 452 (7185).

References
ISBN links support NWE through referral fees

External links

All links retrieved December 10, 2023.

Credits

New World Encyclopedia writers and editors rewrote and completed the Wikipedia article in accordance with New World Encyclopedia standards. This article abides by terms of the Creative Commons CC-by-sa 3.0 License (CC-by-sa), which may be used and disseminated with proper attribution. Credit is due under the terms of this license that can reference both the New World Encyclopedia contributors and the selfless volunteer contributors of the Wikimedia Foundation. To cite this article click here for a list of acceptable citing formats.The history of earlier contributions by wikipedians is accessible to researchers here:

The history of this article since it was imported to New World Encyclopedia:

Note: Some restrictions may apply to use of individual images which are separately licensed.