| Title: | Analysis of Scientific Publication Data with R |
|---|---|
| Description: | Tools for quantitative research in scientometrics and bibliometrics. This package provides routines for importing bibliographic data from Clarivate Web of Science (<https://www.webofscience.com/wos/>) and performing bibliometric analysis. |
| Authors: | Ruben Fernandez-Casal [aut, cre] (ORCID: <https://orcid.org/0000-0002-5785-3739>), Borja Lafuente-Rego [aut] (ORCID: <https://orcid.org/0000-0002-2443-3084>), Maria Jose Lombardia [aut] (ORCID: <https://orcid.org/0000-0001-9452-9818>), Javier Tarrio-Saavedra [aut] (ORCID: <https://orcid.org/0000-0002-9584-127X>), Julian Costa-Bouzas [aut] (ORCID: <https://orcid.org/0000-0001-9760-9581>), Yesica Fernandez-Ramos [ctb], Guillermo Lopez-Taboada [ctb] |
| Maintainer: | Ruben Fernandez-Casal <[email protected]> |
| License: | GPL (>= 2) |
| Version: | 1.2.0 |
| Built: | 2026-05-16 07:01:32 UTC |
| Source: | https://github.com/rubenfcasal/scimetr |
Tools for quantitative research in scientometrics and bibliometrics. This package provides routines for importing bibliographic data from Clarivate Web of Science (https://www.webofscience.com/wos/) and performing bibliometric analysis. For more information visit https://rubenfcasal.github.io/scimetr/articles/scimetr.html.
Maintainer: Ruben Fernandez-Casal [email protected] (ORCID)
Authors:
Borja Lafuente-Rego [email protected] (ORCID)
Maria Jose Lombardia [email protected] (ORCID)
Javier Tarrio-Saavedra [email protected] (ORCID)
Julian Costa-Bouzas [email protected] (ORCID)
Other contributors:
Yesica Fernandez-Ramos [email protected] [contributor]
Guillermo Lopez-Taboada [email protected] [contributor]
Useful links:
Report bugs at https://github.com/rubenfcasal/scimetr/issues/
Extends the bibliographic database by adding JCR metrics to sources, per year and WoS category.
add_jcr(db, jcrdb)

## S3 method for class 'wos.jcr'
plot(
  x,
  filter,
  plot = TRUE,
  all = FALSE,
  ask = plot && all && interactive(),
  ...
)
db |
a bibliographic database (a wos.db object; typically the
output of the function |
jcrdb |
a JCR database (a jcr.db object; typically the
output of the function |
x |
a wos.jcr object. |
filter |
vector of document identifiers (usually a result of |
plot |
logical; if |
all |
logical; if |
ask |
logical; if |
... |
further arguments passed to or from other methods. |
An S3 object of class wos.jcr.
A wos.db object with additional components JCRSour and JCRCatSour.
db_bib.wos.data() converts a data.frame with WoS variables
(as returned by import_wos; see wosdf) into
a relational database (a list of data.frames).
db_bib(data, ...)

## S3 method for class 'wos.data'
db_bib(data, label = "", progress = interactive(), verbose = FALSE, ...)

## S3 method for class 'wos.db'
print(x, ...)

## S3 method for class 'wos.db'
plot(
  x,
  filter,
  which = 1:3,
  plot = TRUE,
  warning = FALSE,
  ask = plot && (length(which) > 1) && interactive(),
  ...
)
data |
a |
... |
further arguments passed to or from other methods. |
label |
character string describing the data. |
progress |
logical; if |
verbose |
logical; if |
x |
an object used to select a method. |
filter |
vector of document identifiers (usually a result of |
which |
if a subset of the plots is required,
specify a subset of the numbers |
plot |
logical; if |
warning |
logical; if |
ask |
logical; if |
An S3 object of class wos.db.
A list with the following components:
Docs: document-level records (year, type, citations, identifiers).
Authors and AutDoc: author dictionary and document-author links.
OI and OIDoc: ORCID identifiers and their linkage to documents.
RI and RIDoc: ResearcherID identifiers and their linkage to documents.
Affiliations and AffDoc: affiliation dictionary and document links.
Addresses and AddAutDoc: address and document-author-address links.
Sources: document sources (journals, books, proceedings, ...).
Categories, Areas, CatSour and AreaSour: thematic classifications
and their linkage to sources.
WSIndex and SourWSI: Web of Science indexes and their linkage to
sources.
label and date: metadata that help track the identity of the dataset
over time.
This object has specialized print and plot methods.
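To make the "list of data.frames" structure concrete, here is a minimal base R sketch of a toy relational database with a document table, an author dictionary and a link table between them. It does not use the package and is not its implementation: all contents and all column names except `idd` are invented for illustration.

```r
# Toy relational database: a list of data.frames linked by integer keys.
# Column names other than `idd` are hypothetical.
toy_db <- list(
  Docs = data.frame(idd = 1:3, year = c(2018L, 2019L, 2019L), cites = c(10, 4, 0)),
  Authors = data.frame(ida = 1:2, name = c("Author A", "Author B"),
                       stringsAsFactors = FALSE),
  # Link table: one row per (document, author) pair
  AutDoc = data.frame(idd = c(1L, 2L, 2L, 3L), ida = c(1L, 1L, 2L, 2L))
)

# Join documents to authors through the link table
doc_authors <- merge(
  merge(toy_db$Docs, toy_db$AutDoc, by = "idd"),
  toy_db$Authors,
  by = "ida"
)

# Number of documents per author
docs_per_author <- table(doc_authors$name)
print(docs_per_author)
```

Storing each entity once and connecting entities through link tables avoids repeating author or source details on every document row.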
plot.summary.wos.db, plot.summary.year.wos
db <- db_bib(wosdf)
print(db)
summary(db)
Reads JCR data from Excel files downloaded from WoS and generates a relational database (a list of data.frames). It is assumed that the file name format is JCR_[WE]_[PY], where [WE] is the WoS index (SCIE, SSCI, ...) and [PY] the JCR year.
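The file naming convention can be illustrated with a small base R sketch that extracts the index and the year from file names. This is only an illustration, not the package's own parsing code: the file names are hypothetical, and the regular expression assumes that the index consists of letters only and the year of four digits.

```r
# Hypothetical file names; the last one does not follow the convention
files <- c("JCR_SCIE_2021.xlsx", "JCR_SSCI_2021.xlsx",
           "JCR_SCIE_2022.xlsx", "notes.xlsx")

# "JCR_", then the index (letters), "_", a four-digit year, ".xlsx"
pattern <- "^JCR_([A-Za-z]+)_([0-9]{4})\\.xlsx$"
ok <- grepl(pattern, files)

# Keep only matching files and split the name into its two parts
jcr_files <- data.frame(
  file = files[ok],
  index = sub(pattern, "\\1", files[ok]),
  year = as.integer(sub(pattern, "\\2", files[ok])),
  stringsAsFactors = FALSE
)
print(jcr_files)
```

A check of this kind before calling the import function can reveal misnamed files early.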
db_jcr(path = ".", files = dir(path, pattern = "*.xlsx"), verbose = TRUE)
path |
character; path to the directory containing the files. Defaults to the working directory. |
files |
character vector with the file names. Defaults to filenames in
directory |
verbose |
logical; indicating whether the name of the file being processed
is printed. Defaults to |
A subscription to Web of Science is required to download JCR data.
For successful execution, Excel files must retain the original header fields
from Clarivate exports:
Title20, ISO_ABBREV, TITLE, ISSN, EISSN, ISSUES/YEAR, COUNTRY,
LANGUAGE, 1ST_YR_PUB, categories, TOT_CITES, CITES_JCR_YR, CITES_JCR_YR1,
CITES_JCR_YR2, IF_NUMERATOR, CITES_JCR_YR3, CITES_JCR_YR4, CITES_JCR_YR5,
5YR_IF_NUMERATOR, ITEMS_JCR_YR, ITEMS_JCR_YR1, ITEMS_JCR_YR2, IF_DENOMINATOR,
ITEMS_JCR_YR3, ITEMS_JCR_YR4, ITEMS_JCR_YR5, 5YR_IF_DENOMINATOR, IMPACT_FACTOR,
IMMEDIACY_INDEX, CITED_HALF_LIFE, 5YR_IMPACT_FACTOR, EIGENFACTOR,
NORM_EIGENFACTOR, ARTL_INFLUENCE, FREQUENCY, PUBCODE, PUBLISHER_NAME,
CATEGORY_CODE, CATEGORY_DESCRIPTION, CATEGORY_RANKING, QUARTILE_RANK, JIF_PERCENTILE.
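One way to verify that an export still carries the expected header fields is to compare the header row against the required names. The base R sketch below uses only a subset of the fields listed above and a made-up header row in which one field has been renamed. It is a hypothetical pre-check, not something the package is known to provide.

```r
# A subset of the required header names (shortened for brevity)
required <- c("TITLE", "ISSN", "EISSN", "IMPACT_FACTOR",
              "QUARTILE_RANK", "JIF_PERCENTILE")

# Hypothetical header row from an export in which "EISSN" was renamed
found <- c("TITLE", "ISSN", "E-ISSN", "IMPACT_FACTOR",
           "QUARTILE_RANK", "JIF_PERCENTILE")

# Required fields that do not appear in the header row
missing_fields <- setdiff(required, found)
if (length(missing_fields) > 0) {
  message("Missing header fields: ", paste(missing_fields, collapse = ", "))
}
```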
An S3 object of class jcr.db. A list with components:
Sources, Categories, JCRSour and JCRCatSour.
Bibliographic database with JCR metrics (a wos.jcr S3 object)
corresponding to a WoS search by the Affiliation field of Universidade da Coruña (UDC)
in the research area "Mathematics" during the years 2018–2023
(generated from the wosdf data set, using the functions db_bib, db_jcr
and add_jcr).
dbjcr
An object of class wos.jcr (inherits from wos.db) of length 22.
Retrieve internal identifiers (entity keys) for the corresponding relational
table (authors, sources, categories, areas, ...). Any field in the target
table can be used as a condition, and multiple conditions are combined with &
(only IDs for which all conditions evaluate to TRUE are returned).
Tidy evaluation can be used to construct the logical expressions,
see dplyr::filter().
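The effect of combining conditions with & can be sketched in base R on a toy table. The table and its column names are invented for illustration and do not correspond to the package's actual tables. Only rows for which every condition is TRUE contribute an identifier.

```r
# Toy author table; column names are hypothetical
authors <- data.frame(
  ida = 1:4,
  name = c("Garcia, A", "Lopez, B", "Garcia, C", "Perez, D"),
  ndocs = c(5, 1, 2, 7),
  stringsAsFactors = FALSE
)

# Two conditions on fields of the table
cond1 <- grepl("^Garcia", authors$name)
cond2 <- authors$ndocs >= 3

# An identifier is returned only when both conditions hold
ids <- authors$ida[cond1 & cond2]
print(ids)
```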
get_id_docs(
  db,
  ...,
  id_sources,
  id_authors,
  id_addresses,
  id_oi,
  id_ri,
  id_affiliations
)

get_id_authors(db, ...)

get_id_addresses(db, ...)

get_id_areas(db, ...)

get_id_categories(db, ...)

get_id_wsi(db, ...)

get_id_sources(db, ..., id_categories, id_areas, id_wsi)

get_id_oi(db, ...)

get_id_ri(db, ...)

get_id_affiliations(db, ...)
db |
|
... |
Logical predicates. Multiple conditions are combined with |
id_sources |
optional; source identifiers. |
id_authors |
optional; author identifiers (values of |
id_addresses |
optional; address identifiers. |
id_oi |
optional; ORCID identifiers or codes (values of |
id_ri |
optional; RI identifiers or codes (values of |
id_affiliations |
optional; affiliations identifiers or names (values of |
id_categories |
optional; categories identifiers or names (values of |
id_areas |
optional; research area identifiers or names (values of |
id_wsi |
optional; WoS Index identifiers or names (values of |
An integer vector of identifiers.
For instance, get_id_docs returns the document identifiers (values of
the db$Docs$idd variable) corresponding to identifiers of authors,
categories, areas, addresses and/or sources.
Logical expressions defined in terms of the variables in db$Docs can
also be used as arguments.
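The idea of translating author identifiers into document identifiers, optionally restricted by a condition on the document table, can be sketched in base R as follows. The toy tables are invented, and the sketch assumes that the different criteria are intersected. The package's actual behaviour may differ in its details.

```r
# Toy tables; all contents and the `year` column are hypothetical
docs <- data.frame(idd = 1:4, year = c(2018L, 2019L, 2020L, 2020L))
aut_doc <- data.frame(idd = c(1L, 2L, 3L, 4L, 4L),
                      ida = c(1L, 2L, 1L, 2L, 3L))

# Author identifiers of interest
id_authors <- c(1L, 3L)

# Documents linked to any of the selected authors
from_authors <- unique(aut_doc$idd[aut_doc$ida %in% id_authors])

# Documents satisfying a condition on the document table
from_cond <- docs$idd[docs$year >= 2020L]

# Keep only the documents that meet both criteria
id_docs <- intersect(from_authors, from_cond)
print(id_docs)
```

The resulting vector plays the role of the filter argument accepted by the summary and plot methods.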
get_jcr() combines document indexes with their source JCR metrics per year.
get_jcr_cat() combines document indexes with their source JCR metrics per year
and WoS category.
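Conceptually, attaching source metrics to documents per year is a join on the source identifier and the year. The base R sketch below shows this with toy data. The column names (`ids`, `year`, `impact_factor`) and the values are assumptions made for the example, not the package's actual schema.

```r
# Toy document table: each document has a source identifier and a year
docs <- data.frame(idd = 1:3,
                   ids = c(10L, 10L, 20L),
                   year = c(2020L, 2021L, 2021L))

# Toy JCR table: one row per source and year
jcr <- data.frame(ids = c(10L, 10L, 20L),
                  year = c(2020L, 2021L, 2021L),
                  impact_factor = c(1.5, 1.8, 3.2))

# Left join: every document keeps its row, even without a JCR match
docs_jcr <- merge(docs, jcr, by = c("ids", "year"), all.x = TRUE)
docs_jcr <- docs_jcr[order(docs_jcr$idd), ]
print(docs_jcr)
```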
get_jcr(db, filter)

get_jcr_cat(db, filter, best = TRUE)
db |
a bibliographic database with JCR information (a wos.jcr object;
typically the output of the function |
filter |
vector of document identifiers (usually a result of |
best |
logical; if |
A data.frame with document indexes and their source JCR metrics.
add_jcr(), db_jcr(), db_bib().
head(get_jcr(dbjcr))
head(get_jcr_cat(dbjcr, best = TRUE))
Reads bibliography entries from UTF-8 encoded Tab-delimited files containing "Full Record and Cited References" (see wosdf and vignette Downloading data from the Web of Science).
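As a rough illustration of what reading such files involves, the base R sketch below writes a tiny UTF-8 tab-delimited file to a temporary directory and reads every matching file into one data.frame. It is not the package's implementation. The file name, the three field tags and the records are made up for the example.

```r
# Write a tiny tab-delimited file to a temporary directory
tmp_dir <- tempfile("wos_")
dir.create(tmp_dir)
writeLines(
  c("AU\tTI\tPY",
    "Author A\tA first title\t2019",
    "Author B\tA second title\t2021"),
  file.path(tmp_dir, "savedrecs.txt")
)

# List the .txt files and stack their contents row-wise
files <- list.files(tmp_dir, pattern = "\\.txt$", full.names = TRUE)
read_one <- function(f) {
  # quote = "" keeps quotation marks inside titles from breaking fields
  read.delim(f, quote = "", fileEncoding = "UTF-8", stringsAsFactors = FALSE)
}
records <- do.call(rbind, lapply(files, read_one))
print(records)
```

The pattern `\\.txt$` used here is a stricter regular expression than a glob-style pattern, since it only matches names ending in `.txt`.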
import_wos(path = ".", pattern = "*.txt", all = TRUE, progress = NULL)
path |
character; path to the directory containing the files. |
pattern |
regular expression; only matching files will be loaded.
Defaults to |
all |
logical; indicating whether sources without ISSN are included
( |
progress |
logical; indicating whether a progress bar should be displayed.
Defaults to |
A subscription to Web of Science is required to download bibliometric data.
A data.frame with rows corresponding to sources and columns to
WoS variables.
The summary_year methods return summaries by year of a bibliometric database.
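A per-year summary can be pictured as grouping documents by publication year and computing counts and citation totals within each group. The base R sketch below does this for a toy document table. The data and column names are invented, and the statistics shown are only examples of what a yearly summary might contain.

```r
# Toy document table: publication year and times cited
docs <- data.frame(year = c(2018, 2018, 2019, 2020, 2020, 2020),
                   cites = c(3, 0, 5, 1, 2, 9))

# One row per year: number of documents and total citations
by_year <- data.frame(
  year = sort(unique(docs$year)),
  n_docs = as.vector(table(docs$year)),
  total_cites = as.vector(tapply(docs$cites, docs$year, sum))
)

# Average number of citations per document in each year
by_year$mean_cites <- by_year$total_cites / by_year$n_docs
print(by_year)
```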
summary_year(object, ...)

## S3 method for class 'wos.db'
summary_year(object, filter, ...)

## S3 method for class 'summary.year.wos'
print(x, ...)

## S3 method for class 'summary.year.wos'
plot(
  x,
  which = 1:3,
  boxplot = FALSE,
  plot = TRUE,
  ask = plot && length(which) > 1 && interactive(),
  ...
)

## S3 method for class 'wos.jcr'
summary_year(object, filter, all = FALSE, ...)

## S3 method for class 'summary.year.jcr'
print(x, digits = 3, all = TRUE, ...)

## S3 method for class 'summary.year.jcr'
plot(x, plot = TRUE, all = FALSE, ask = plot && interactive(), ...)
object |
an object for which a summary is desired. |
... |
further arguments passed to or from other methods. |
filter |
vector of document identifiers (optional, usually a result of get_id_docs). |
x |
an object used to select a method. |
which |
if a subset of the plots is required,
specify a subset of the numbers |
boxplot |
logical; if |
plot |
logical; if |
ask |
logical; if |
all |
logical; if |
digits |
minimal number of significant digits, see print.default. |
A list of summary statistics (an object of class summary.year.wos
and/or summary.year.jcr), which has specialized print() and plot() methods.
# Bibliographic database
db <- db_bib(wosdf)
summary_year(db)

# Bibliographic database with JCR metrics
summary_year(dbjcr)
summary() methods for bibliometric databases.
## S3 method for class 'wos.db'
summary(object, filter, index = c("H", "G"), nmax = 10, ...)

## S3 method for class 'summary.wos.db'
print(x, ...)

author_metrics(db, id_authors)

## S3 method for class 'summary.wos.db'
plot(
  x,
  which = 1:5,
  pie = FALSE,
  others = !pie,
  accuracy = 0.1,
  plot = TRUE,
  ask = length(which) > 1 && interactive(),
  ...
)

## S3 method for class 'wos.jcr'
summary(object, filter, all = FALSE, ...)

## S3 method for class 'summary.wos.jcr'
print(x, digits = 2, all = TRUE, ...)

## S3 method for class 'summary.wos.jcr'
plot(x, plot = TRUE, all = FALSE, ask = plot && all && interactive(), ...)
object |
an object for which a summary is desired. |
filter |
vector of document identifiers
(optional, usually a result of |
index |
citation indexes. |
nmax |
number of top levels. |
... |
further arguments passed to or from other methods. |
x |
an object used to select a method. |
db |
|
id_authors |
optional; author identifiers |
which |
if a subset of the plots is required,
specify a subset of the numbers |
pie |
logical; if |
others |
logical; if |
accuracy |
numerical scalar; see |
plot |
logical; if |
ask |
logical; if |
all |
logical; if |
digits |
minimal number of significant digits, see print.default. |
A list of summary statistics (an object of class summary.wos.db
and/or summary.wos.jcr), which has specialized print() and plot() methods.
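Assuming that the values "H" and "G" of the index argument refer to the usual h-index and g-index, the following base R sketch computes both from a vector of citation counts using their standard definitions. The helper functions `h_index` and `g_index` are written for this example and are not part of the package, whose own computation may differ in its details.

```r
# h-index: largest h such that h documents have at least h citations each
h_index <- function(cites) {
  cites <- sort(cites, decreasing = TRUE)
  sum(cites >= seq_along(cites))
}

# g-index: largest g such that the g most cited documents together
# have at least g^2 citations (bounded here by the number of documents)
g_index <- function(cites) {
  cites <- sort(cites, decreasing = TRUE)
  sum(cumsum(cites) >= seq_along(cites)^2)
}

# Hypothetical citation counts for six documents
cites <- c(10, 8, 5, 4, 3, 0)
h <- h_index(cites)
g <- g_index(cites)
print(c(H = h, G = g))
```

Because the counts are sorted in decreasing order, both comparisons are TRUE up to the index value and FALSE afterwards, so summing the logical vector gives the index directly.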
# Bibliographic database
db <- db_bib(wosdf)
summary(db)

# Bibliographic database with JCR metrics
summary(dbjcr)
The data set consists of 293 publications corresponding to a WoS search by
the Affiliation field of Universidade da Coruña (UDC) (Affiliation:
OG = Universidade da Coruna) in the research area "Mathematics" (SC = Mathematics),
indexed in the Web of Science Core Collection:
Science Citation Index Expanded (SCI-EXPANDED).
Social Sciences Citation Index (SSCI).
Arts & Humanities Citation Index (A&HCI).
in the years 2018-2023 (generated using the import_wos function).
wosdf
A data frame with 293 rows and 48 columns:
Publication Type
Author
Author Full Name
Article Title
Source Title
Book Series Title
Book Series Subtitle
Language
Document Type
Address
Affiliation
Researcher Ids
ORCID
Cited Reference Count
Times Cited, WoS Core
Times Cited, All Databases
180 Day Usage Count
Since 2013 Usage Count
Publisher
Publisher City
Publisher Address
ISSN
eISSN
ISBN
Journal Abbreviation
Journal ISO Abbreviation
Publication Date
Publication Year
Volume
Issue
Part Number
Supplement
Special Issue
Meeting Abstract
Start Page
End Page
Article Number
DOI
Book DOI
Early Access Date
Number of Pages
WoS Category
Web of Science Index
Research Area
Highly Cited Status
Hot Paper Status
Date of Export
Unique WOS ID
Clarivate Web of Science: https://www.webofscience.com/wos/.