This readme.txt file was generated on 2022-12-07 by the authors and updated on 2023-05-23.

GENERAL INFORMATION
-------------------

1. Title of Dataset: Dataset of the academic impact of the Camino de Santiago

2. Authorship:

   Name: Silvia Díaz de la Fuente
   Institution: Departamento de Ingeniería de Organización, Universidad de Burgos
   Email: sddelafuente@ubu.es
   ORCID: https://orcid.org/0000-0002-5961-3368

   Name: Virginia Ahedo
   Institution: Departamento de Ingeniería de Organización, Universidad de Burgos
   Email: vahedo@ubu.es
   ORCID: https://orcid.org/0000-0002-9812-388X

   Name: María Pilar Alonso Abad
   Institution: Departamento de Historia, Geografía y Comunicación, Universidad de Burgos
   Email: mpaabad@ubu.es
   ORCID: https://orcid.org/0000-0002-6268-9443

   Name: José Manuel Galán
   Institution: Departamento de Ingeniería de Organización, Universidad de Burgos
   Email: jmgalan@ubu.es
   ORCID: https://orcid.org/0000-0003-3360-7602

DESCRIPTION
-----------

1. Dataset language: English and Spanish

2. Abstract: The present dataset contains two complementary databases for analysing the scientific impact of the Camino de Santiago in the academic literature. The first is a database extracted from Scopus of indexed manuscripts directly related to the Camino de Santiago. The second is a database of international doctoral theses, also directly related to the Camino de Santiago. The dataset includes not only the final processed databases but also the raw file from the Scopus search. The keywords, keyword+ terms and institutions in the original query results have been harmonised, and the files used for the harmonisation are also included in the dataset.

3. Keywords: Camino de Santiago; scientific mapping; bibliometric analysis; academic impact; pilgrimage; doctoral research.

4. Date of data collection: 2022 (second version: 2023)

5. Date of dataset publication: 2022-12-07

6. Funding: The authors are grateful for the support and funding from the Ministry of Science and Innovation through its networks of excellence (HAR2017-90883-REDC and RED2018-102518-T) and the project PID2020118906GB-I00, from the Junta de Castilla y León - Consejería de Educación (BDNS 425389), and from the FWO-WOG (W001220N). This work has also been partially funded by the European Social Fund through Silvia Díaz de la Fuente's predoctoral contract, awarded through the Consejería de Educación de la Junta de Castilla y León.

ACCESS INFORMATION
------------------

1. Dataset Creative Commons License: CC BY-NC

2. Dataset DOI: 10.36443/10259/7166

3. Related publication: The related article is currently under review

METHODOLOGICAL INFORMATION
--------------------------

The following search filter was used for both databases:

   TITLE-ABS-KEY ( "Camino de Santiago" OR "Caminos de Santiago" OR "Camiño de Santiago" OR "Jacobeo" OR "Xacobeo" OR "Saint James Path" OR "Way to Santiago" OR "Way of Saint James" OR "Saint-Jacques-de-Compostelle" OR "Saint Jacques de Compostelle" OR "Way of St. James" OR "Camino to Santiago" OR "route to Santiago" OR "pilgrimage to Santiago" OR "pilgrimage Santiago" OR “Camino Lebaniego” OR “way of santiago” OR "Santiago de Compostela pilgrimage")
   OR TITLE-ABS-KEY (pilgrim AND Camino)
   OR KEY ( pilgrimage AND santiago AND de AND compostela )
   OR KEY ( camino AND pilgrimage )
   OR TITLE-ABS-KEY ( santiago AND due AND composted AND sacred AND places )
   OR TITLE ( pilgrimage AND way )
   OR TITLE ( pilgrimage AND Camino )
   OR KEY ( Camino de Santiago de Compostella )
   OR KEY (Camino de Santiago)
   OR ABS (Santiago de Compostela Camino)
   OR TITLE-ABS-KEY ( "Santiago Ways" )
   AND ( EXCLUDE ( PUBYEAR , 2023 ) )
For the PhD database, the information sources used were:

-TESEO https://www.educacion.gob.es/teseo
-OATD https://oatd.org/
-DART-Europe E-theses Portal https://www.dart-europe.org/
-NDLTD https://ndltd.org/

The timespan covers records from 1979 to 2021.

For the indexed manuscripts, the information source was Scopus, with the information retrieved on 2022-10-27 (2023-05-11 in the second version).

Both databases have been processed to eliminate duplicates and false positives not directly related to the Camino de Santiago.

For Scopus, an additional filter was used:

   NOT TITLE ( "Arba'een" OR "Jiuhua" OR “Mecca” OR “Walsingham” OR “John Muir Trail” OR “Adomnán” OR “Imvros” OR “Shikoku” OR {Pilgrimage tourism to Palestine} OR {Landscapes and destinations} OR {Faculty of Pharmacy} OR {José de Anchieta} OR {Camino Real de Tierra Adentro} OR {High hurdles} OR {Giovenale of Orvieto} OR {Social remarks on the history of Spanish} OR {William of Aquitaine} OR {international symposia on the history of anaesthesia} OR {The way to Monte Carmine} OR {El camino de los ayes} OR {Pilgrims for progress. El Camino Hospital} OR {A new approach towards town-country relations in Galicia})

For the PhD dissertations, the filtering was conducted by manually analysing the abstract of each thesis.

Harmonisation and clustering.

When the UNESCO codes of a PhD thesis were not included in the retrieved record, they were obtained by contacting the thesis author or supervisors directly, in order to get the most detailed and complete information possible.

Where the keywords, keyword+, authors and institutions were inconsistent, we processed the records to harmonise the information. To help identify similar terms, we used several similarity-based string clustering algorithms: the Jaro-Winkler algorithm, the Damerau-Levenshtein algorithm and the Optimal String Alignment algorithm.
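As an illustration of how one of the string-similarity measures named above can flag candidate terms for harmonisation, here is a minimal Python sketch of the Optimal String Alignment distance combined with a simple greedy grouping. It is not the authors' implementation: the function names osa_distance and group_similar, the distance threshold of 2, the greedy grouping strategy and the sample keywords are all assumptions made for this example.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal String Alignment distance (restricted Damerau-Levenshtein).

    Counts insertions, deletions, substitutions and transpositions of
    adjacent characters, where no substring is edited more than once.
    """
    rows, cols = len(a) + 1, len(b) + 1
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all characters of a[:i]
    for j in range(cols):
        d[0][j] = j  # insert all characters of b[:j]
    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
            # Transposition of two adjacent characters.
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[rows - 1][cols - 1]


def group_similar(terms, max_distance=2):
    """Greedy grouping (hypothetical helper): each term joins the first
    cluster whose representative (its first member) is within
    max_distance, compared case-insensitively; otherwise it starts a
    new cluster."""
    clusters = []
    for term in terms:
        key = term.strip().casefold()
        for cluster in clusters:
            if osa_distance(key, cluster[0].strip().casefold()) <= max_distance:
                cluster.append(term)
                break
        else:
            clusters.append([term])
    return clusters


if __name__ == "__main__":
    # Made-up sample keywords, not taken from the dataset.
    sample = ["pilgrimage", "Pilgrimage", "pilgirmage", "tourism", "toursim", "heritage"]
    for cluster in group_similar(sample):
        print(cluster)
```

Groups produced this way are only candidates; a manual review of each proposed cluster is still needed before assigning a harmonised term.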
The clusters identified to harmonise affiliations, K+ (keyword+) and KA (author keywords) are also provided in this dataset in the following files:

Affiliation clusters.ods
K+ clusters.ods
KA clusters.ods

FILE OVERVIEW
-------------

1. Readme.txt
   This text file, with the general information about the dataset

2. Dataset Manuscripts Raw Data- Camino Santiago.bib
   Unprocessed database obtained directly from Scopus through the queries

3. Affiliation clusters.ods
   File including the affiliation in the original database and the assigned harmonised (cluster) institution

4. K+ clusters.ods
   File including the keyword+ in the original database and the assigned harmonised (cluster) keyword+

5. KA clusters.ods
   File including the author keyword in the original database and the assigned harmonised (cluster) author keyword

6. Dataset Manuscripts- Camino Santiago.ods
   File with the processed records of the indexed manuscripts retrieved from Scopus through the queries

7. Dataset Tesis - Camino Santiago.ods
   File with the records of the PhD dissertations about the Camino retrieved from the different thesis repositories through the queries

DATA-SPECIFIC INFORMATION
-------------------------

Dataset Manuscripts- Camino Santiago.ods

Each row corresponds to an indexed manuscript; the contents of the columns are specified below:

AU: Author(s)
DE: Keywords or terms describing the paper's subject matter
ID: Keywords associated by ISI or SCOPUS database (keyword+)
C1: Corresponding author address
CR: Cited References
JI: ISO Source Abbreviation
AB: Abstract or summary of the paper's content
PA: Physical address of the publisher or journal
AR: Area or field of study relevant to the paper
chemicals_cas: Chemical compounds mentioned in the document, identified by their CAS (Chemical Abstracts Service) numbers
coden: Coden (a shorthand code identifying a specific scientific journal)
RP: Reprint Address
DT: Document type (e.g. "article," "review," etc.)
DI: Digital object identifier (DOI) for the paper
BE: Beginning Page
FU: Funding Agency and Grant Number
BN: International Standard Book Number (ISBN) for the journal or publication in which the paper appeared
SN: International Standard Serial Number (ISSN) for the journal or publication in which the paper appeared
SO: Publication Name (or Source)
LA: Language in which the paper was written
manufacturers: Companies or organisations mentioned in the paper as manufacturers of products or equipment used in the research
TC: Number of times the paper has been cited in other publications
PN: Part Number
page_count: Number of pages in the paper
PP: Name of the publisher
PU: Place of publication
PM: PubMedID
DB: Database(s)
sponsors: Companies or organisations that provided funding or support
TI: Title of the paper
tradenames: Trade names of products or equipment mentioned in the paper
url: Web address (URL) for the paper or journal in which it appeared
VL: Volume of the journal in which the paper appeared
PY: Year in which the paper was published
FX: Funding Text
AU_UN: Author's Affiliations (disambiguated)
AU1_UN: Corresponding Author's Affiliation (disambiguated)
AU_UN_NR: Not Recognised Affiliations
SR_FULL: Short Full-Reference
SR: Short Reference

Dataset Tesis - Camino Santiago.ods

Each row corresponds to a PhD dissertation; the contents of the columns are specified below:

ID: Identifier for the dissertation
THESIS: Title of the dissertation
UNIVERSITY: University at which the dissertation was completed
LOCATION: Location of the university
CITY: City in which the university is located
PROVINCE: Province or state in which the university is located
REGION/STATE: The region or state in which the university is located
COUNTRY: Country in which the university is located
YEAR: Year in which the dissertation was completed
KEYWORDS: Keywords or terms describing the dissertation's subject matter
PhD Supervisor: Name(s) of the dissertation supervisor(s)
Committee: Names of the members of the dissertation evaluation committee
UNESCO DESCRIPTOR SPANISH: Descriptor (e.g. subject area) for the dissertation according to the UNESCO (United Nations Educational, Scientific and Cultural Organization) classification system in the Spanish language
UNESCO CODE: Code for the dissertation according to the UNESCO classification system (6-digit nomenclature)
UNESCO CODE DISCIPLINE: Code for the discipline or field of study of the dissertation according to the UNESCO classification system (4-digit nomenclature)
UNESCO CODE ENGLISH (Discipline): Code for the discipline or field of study of the dissertation in English, according to the UNESCO classification system
UNESCO DESCRIPTOR ENGLISH: Descriptor for the dissertation in English, according to the UNESCO classification system (6-digit nomenclature)
Link: Web address (URL) for the dissertation or other relevant information
Summary: Summary or abstract of the dissertation's content.