Eye movement research involves eye movement analysis and eye tracking techniques [1]. Eye movement analysis refers to the analysis of gaze data, which are considered an outward manifestation of visual/cognitive processing [2], while eye tracking techniques refer to the methods of capturing gaze. Eye movement research emerged almost a century ago and has contributed much to reading psychology, educational psychology, consumer psychology, sports psychology, and traffic psychology [3], as well as neuroscience, industrial engineering, human factors, and computer science [8]. Cartography research has used eye movement methods since the 1970s. For example, users’ map-reading behaviors have been explored to improve map design and map legibility [10], and differences in users’ performance can represent differences in spatial cognition ability [12]. Spatial cognition has long attracted cartographers’ attention. Robinson’s 1952 publication, The look of maps: An examination of cartographic design, is considered seminal in cognitive map research [14]. Robinson called for cognitive cartographers to systematically observe, collect, and explore data on how people look at and interpret maps, which led to the development of empirical approaches. One of the most important empirical approaches is eye movement research. With the development of eye trackers and eye tracking techniques, eye movement research has been widely used in research on animated maps [15], map interaction [17], web mapping [19], and way-finding with mobile eye-trackers [21].
The scientific literature of a research field reflects the dynamic development of that field. However, when the number of scientific publications is substantial, it is difficult for researchers to quickly understand the evolution and trends of their field. Scientometrics provides effective bibliometric analysis methods for analyzing scientific literature and can help researchers efficiently explore their specialty knowledge domain. These methods have already been applied in many research fields, for example, regenerative medicine [23], schizophrenia research [24], recommendation system research [25], bioenergy research [26], and geographic information systems (GIS) [27]. However, there has been no comparable research on cartography, especially on emerging trends such as eye movement research in cartography. Moreover, the visual representations produced directly by bibliometric analysis tools are not intuitively understandable. Therefore, the purpose of this paper is to analyze and visualize the intellectual structure of eye movement research in cartography with bibliometric analysis methods and multiple visual metaphors. The term “intellectual structure” used here includes classic literature, research theme clusters, research hotspots, and collaboration patterns indicating authorities in this research field. In addition to bibliometric analysis methods, we also used a geovisualization method for scientific collaboration analysis, which can efficiently represent the spatial distribution of scientific power. Although the results may not fully describe the whole knowledge domain, they can help researchers who are new to eye movement research in cartography to quickly explore the achievements and new trends in this field.
This paper is organized as follows: Section 2 describes the current bibliometric analysis methods and tools, Section 3 presents the data and workflow, Section 4 illustrates the analysis results, and Sections 5 and 6 give the discussion and conclusions of our work.
2. Bibliometric Analysis Methods and Tools
The most widely used bibliometric analysis methods include co-citation analysis, bibliographic coupling analysis, and co-occurrence analysis (e.g., co-citation analysis to explore the structure and evolution of a research field [28], bibliographic coupling analysis for patent grouping [29], co-occurrence analysis of authors to detect research groups and author productivity [30], and co-occurrence analysis of keywords for research hot spots [31]). Details of these methods are described below.
1. Co-Citation Analysis
Co-citation, introduced by Small and Griffith [32], is defined as the frequency with which two documents are cited together. If two scientific documents are cited by the same document, there is a co-citation relationship between them. The more frequently the two documents are cited together, the closer the relationship between them. Co-citation can be used not only for literature analysis (called “document co-citation”), but also for author co-citation or journal co-citation [33]. A specialty has been conceptualized as a time-variant duality between two fundamental concepts: research fronts and intellectual bases. Research fronts are defined as emergent and transient groupings of concepts and underlying research issues; the publications cited by research fronts comprise the intellectual bases. Document co-citation analysis has been used by many researchers to study intellectual bases, which allows the identification of key works [35]. It is worth emphasizing that, because document co-citation depends on the citing literature, its patterns can change over time.
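The definition above can be illustrated with a minimal sketch in Python. The function name `cocitation_counts` and the toy records (citing papers P1 to P3, cited documents A to C) are hypothetical and are not taken from the data set used in this paper.

```python
from collections import Counter
from itertools import combinations


def cocitation_counts(reference_lists):
    """Count how often each pair of cited documents appears in the same reference list."""
    counts = Counter()
    for refs in reference_lists:
        # Deduplicate and sort so that each unordered pair has one canonical key.
        for pair in combinations(sorted(set(refs)), 2):
            counts[pair] += 1
    return counts


# Hypothetical citing papers and the documents they cite.
citing_papers = {
    "P1": ["A", "B", "C"],
    "P2": ["A", "B"],
    "P3": ["B", "C"],
}
cocitations = cocitation_counts(citing_papers.values())
print(cocitations.most_common())
```

Because the counts are derived from the citing papers, adding a newly published citing paper to `citing_papers` can change them, which is why co-citation patterns shift over time.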
2. Bibliographic Coupling Analysis
It has been found that the more similar two papers’ research interests are, the more cited references they share; this relationship between citing papers is defined as the bibliographic coupling relationship. If two papers cite the same paper, they are coupled papers. Coupling strength is the number of shared cited papers; higher coupling strength indicates greater similarity in research theme. Furthermore, we can cluster the bibliographic coupling network to visualize its theme communities. Generally, bibliographic coupling analysis is used to identify sets of recent papers [38]. It differs from co-citation analysis because a paper’s reference list cannot be modified after it is published; therefore, the bibliographic coupling relationship is fixed and permanent. In addition to bibliographic coupling of documents, author coupling and journal coupling are also effective ways to explore the similarity of author interests or journal themes.
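Coupling strength as described above can be sketched in a few lines of Python. The function names and the toy reference lists are hypothetical; the sketch only counts shared references and does not perform the clustering step.

```python
from itertools import combinations


def coupling_strength(refs_a, refs_b):
    """Number of cited papers shared by two reference lists."""
    return len(set(refs_a) & set(refs_b))


def coupling_network(papers):
    """Return {(paper_1, paper_2): strength} for every pair sharing at least one reference."""
    edges = {}
    for (p, refs_p), (q, refs_q) in combinations(sorted(papers.items()), 2):
        strength = coupling_strength(refs_p, refs_q)
        if strength > 0:
            edges[(p, q)] = strength
    return edges


# Hypothetical citing papers and their reference lists.
papers = {
    "P1": ["A", "B", "C"],
    "P2": ["B", "C", "D"],
    "P3": ["E"],
}
coupling_edges = coupling_network(papers)
print(coupling_edges)
```

Since the reference lists never change after publication, rerunning this on the same papers always yields the same edges, in contrast to co-citation counts.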
3. Co-Occurrence Analysis
Co-occurrence analysis provides a quantitative method to obtain concurrence information from any information carrier [39]. Concurrence is a linguistics term; co-occurrence analysis can detect either concurrence or the above-chance frequent occurrence of two terms in a text corpus. Based on co-occurrence analysis, co-word analysis is a content analysis method that analyzes the co-occurrence of paired items (i.e., keywords or noun phrases) in a text corpus to detect the relationships between ideas within the subject areas presented in these texts [40]. Co-word analysis seeks to extract the themes and explore the linkages among them within the scientific literature; as a result, it can be used to reflect both research topics and evolving frontiers [41].
Co-occurrence analysis can be broadened to co-author analysis, co-institution analysis, and co-country/territory analysis, which can reveal scientific collaboration patterns. The generated co-occurrence networks provide a graphic visualization of the relationships between terms, authors, institutions, or other objects.
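One simple way to make “above-chance” co-occurrence concrete is to compare the observed number of documents containing both keywords with the number expected if the two keywords were independent. The sketch below uses this ratio (often called lift); the choice of measure, the function name `coword_lift`, and the keyword lists are assumptions made for illustration, not the procedure used by any particular tool.

```python
from collections import Counter
from itertools import combinations


def coword_lift(keyword_lists):
    """Ratio of observed to expected co-occurrence for every keyword pair.

    A value above 1 means the pair co-occurs more often than expected
    if the two keywords were assigned to documents independently.
    """
    n_docs = len(keyword_lists)
    term_freq = Counter()
    pair_freq = Counter()
    for keywords in keyword_lists:
        # Normalize case and whitespace, and count each keyword once per document.
        terms = sorted({k.strip().lower() for k in keywords})
        term_freq.update(terms)
        pair_freq.update(combinations(terms, 2))
    return {
        pair: (n * n_docs) / (term_freq[pair[0]] * term_freq[pair[1]])
        for pair, n in pair_freq.items()
    }


# Hypothetical author keywords of four documents.
docs = [
    ["Eye tracking", "map design"],
    ["eye tracking", "map design"],
    ["eye tracking", "usability"],
    ["attention", "usability"],
]
lift = coword_lift(docs)
print(lift)
```

The same counting applies to authors, institutions, or countries by replacing the keyword lists with the corresponding lists per paper.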
Many tools have been developed to facilitate interpretation of bibliometric analysis results, including CiteSpace, Bibexcel, the Science of Science (Sci2) Tool, and VOSViewer [42]. Among them, CiteSpace is an out-of-the-box, user-friendly, and powerful application. It is a free, Java-based application developed by Chen for mapping scientific knowledge, and it has been continuously updated [34]; the version used in this paper is 4.0. CiteSpace can read various bibliographic source formats, such as Web of Science (WOS), PubMed, Scopus, ADS, arXiv, NSF, and some Chinese database formats (e.g., Chinese National Knowledge Infrastructure [CNKI] and Chinese Social Sciences Citation Index [CSSCI]). It can generate and visualize networks comprising many nodes and edges, and can prune networks using a minimum spanning tree algorithm or pathfinder algorithm. It provides three views to display the network: cluster view, timeline view, and time zone view. For the cluster view, either the static form or the time-slice form can be chosen; the latter splits the network by time interval. The timeline and time zone views show the nodes and edges as a time series, which makes it possible to explore the evolution of scientific literature.
Another useful functionality of CiteSpace is its use of a cluster detection algorithm to divide a network into subgroups [34]. After clustering, CiteSpace can label each cluster with terms extracted from document titles, keywords, or abstracts. The terms, which are usually noun phrases, can be ranked by three algorithms provided by CiteSpace: tf*idf (term frequency-inverse document frequency), the LLR (log-likelihood ratio) test, and MI (mutual information) [43]. Tf*idf multiplies two quantities, tf and idf, and is a metric that reflects how important a word is to a document in a corpus [44]; the LLR test is a statistical test that compares two models’ goodness of fit based on their likelihood ratio [45]; MI measures how much one random variable tells us about another, that is, the reduction in uncertainty about one variable given the other [46]. Terms selected by tf*idf tend to reflect a cluster’s most salient aspect, while the other two algorithms tend to reflect a unique aspect of a cluster [43].
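For reference, the three ranking measures can be written in their common textbook forms. The notation is an assumption introduced here: N is the number of documents, df(t) is the number of documents containing term t, L_0 and L_1 are the maximized likelihoods of a restricted model and a more general model, and p denotes the joint and marginal distributions of two random variables X and Y. The exact variants implemented in CiteSpace may differ.

```latex
\begin{align}
\mathrm{tfidf}(t, d) &= \mathrm{tf}(t, d) \cdot \log\frac{N}{\mathrm{df}(t)} \\
\mathrm{LLR} &= -2 \ln\frac{L_0}{L_1} \\
I(X; Y) &= \sum_{x}\sum_{y} p(x, y)\,\log\frac{p(x, y)}{p(x)\,p(y)}
\end{align}
```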
Although CiteSpace is powerful for bibliometric analysis, its visualization output is not very satisfactory and the software lacks geovisualization functionality. Therefore, in addition to CiteSpace, we used other visualization tools to achieve better representations; these are described in the next section.
3. Data and Methodology
The data used in this paper were obtained from WOS, which is considered one of the most comprehensive and high-quality online bibliographic sources. The WOS core collection citation indexes include Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Arts and Humanities Citation Index (A&HCI), Conference Proceedings Citation Index-Science (CPCI-S), Conference Proceedings Citation Index-Social Science and Humanities (CPCI-SSH), and Emerging Sources Citation Index (ESCI).
It is important to note that the search strategy directly affects the results; the search terms “eye tracking” or “eye movement” generate 40,601 records. As eye movement research in cartography is just one application of eye tracking technology, a narrower search scope is needed. Therefore, we further refined the results with search terms such as “cartographic”, “cartography”, “map design”, “map symbol”, “map reading”, “map display”, “map usability”, “map perception”, “spatial cognition”, “geovisualization”, “spatial visualization”, “web map”, and “GIS”, and restricted the document type to article, book chapter, and proceedings paper. Our purpose is to obtain the research achievements in cartographic problem solving based on eye movement analysis, rather than in the eye tracking technique itself. Finally, we obtained 209 bibliographic records with 7355 citations from the publication years 1984–2015. These publications appear mainly in Experimental Brain Research, Cartographic Journal, Cartography and Geographic Information Science, International Journal of Geographical Information Science, Journal of Eye Movement Research, and GeoConference on Informatics, Geoinformatics and Remote Sensing. The data were retrieved on 10 January 2016 and updated on 2 August 2016. Since eye movement research in cartography is a burgeoning new area, we believe that this sample of bibliographic records is adequate.
We chose WOS because of its authority and high quality. However, WOS does not index all scientific publications, especially some workshops (e.g., the International Workshop on Eye Tracking for Spatial Research), so we manually collected that workshop’s papers and wrote the corresponding information in WOS format. Although the refining terms may not cover all aspects of cartography research, the method used in this paper is generally applicable [27]; researchers can restrict or expand the search scope according to their research interest.
The workflow of our analysis process is shown in Figure 1. First, we extracted bibliographic records from WOS using the appropriate search terms and stored the data in text format. We then conducted the following analyses: we used co-citation analysis to explore classic literature; next, we performed bibliographic coupling analysis to detect research theme clusters; finally, we employed co-occurrence analysis to identify research hotspots and generated collaboration networks at the author level and institution level, as well as a geo-collaboration network based on the geovisualization method. Some skill is needed to manipulate the networks for better interpretation, as discussed in Section 5.
In this process, geovisualization is a key step for exploring the spatial distribution and connection of scientific power. Geovisualization comprises the theory, methods, and tools for visual exploration, analysis, synthesis, and presentation of geospatial data. It draws on and integrates approaches from visualization in scientific computing, information visualization, cartography, image analysis, exploratory data analysis, and GIS [47]. Although the original bibliographic records are not geospatial data, we can extract location-based information from the text-format bibliographic records. For example, for data captured from WOS, the location-based information is stored in the C1 tag as an address string (e.g., BOSTON UNIV, CTR ADAPT SYST, 111 CUMMINGTON ST, BOSTON, MA 02215). To construct the city collaboration network, we extract the city name (BOSTON) from the address string. As a paper may have multiple authors, there may be several addresses; therefore, duplicated city names should be removed. The city names can then be parsed by a geocoding service to obtain the longitude and latitude used to construct points. If two cities are associated with one paper, a line connecting the two points is generated to represent the collaboration between the two cities. The relationship can be mapped with a graph Gc = (Vc, Ec), in which Vc are the city nodes and Ec are the edges representing collaboration between cities. The process of constructing a geo-collaboration network is shown in Figure 2, and the pseudocode is presented in the Appendix.
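The steps described above can be sketched as follows. This is an illustrative sketch rather than the pseudocode referred to in the text, and it rests on several assumptions: the rule in `extract_city` (take the second-to-last comma-separated field) fits the example address but not every WOS address layout; the second address is invented; the edge weight (number of papers linking two cities) is an addition for illustration; and the coordinate table is a hand-entered, approximate stand-in for a real geocoding service.

```python
from itertools import combinations


def extract_city(address):
    """Return the city from a C1-style address string.

    Assumption: the city is the second-to-last comma-separated field,
    as in the example address; real address layouts vary.
    """
    parts = [p.strip() for p in address.split(",")]
    return parts[-2].upper()


def city_collaboration_edges(papers):
    """Build the edge set Ec as {(city_1, city_2): number of shared papers}."""
    edges = {}
    for addresses in papers:
        # Remove duplicated city names within one paper.
        cities = sorted({extract_city(a) for a in addresses})
        for pair in combinations(cities, 2):
            edges[pair] = edges.get(pair, 0) + 1
    return edges


boston = "BOSTON UNIV, CTR ADAPT SYST, 111 CUMMINGTON ST, BOSTON, MA 02215"
zurich = "EXAMPLE UNIV, DEPT GEOG, ZURICH, SWITZERLAND"  # hypothetical address

# Each paper is represented by the list of its author addresses.
papers = [[boston, zurich], [boston, boston], [zurich, boston]]
edges = city_collaboration_edges(papers)

# Approximate (latitude, longitude) pairs standing in for a geocoding service.
coordinates = {"BOSTON": (42.36, -71.06), "ZURICH": (47.38, 8.54)}
lines = [(coordinates[a], coordinates[b], w) for (a, b), w in edges.items()]
print(lines)
```

Each element of `lines` holds the two end points and a weight, which is the information a mapping tool needs to draw one collaboration line.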
3.3. Visualization Tools
Gephi is open-source software for data analysts and scientists keen to explore and visualize graphs and networks [48]. It can produce better graphical output than CiteSpace and provides many network layout styles, for example, Force Atlas, Fruchterman Reingold, Yifan Hu, and Geo Layout. Since graphs created by CiteSpace may overlap (and, therefore, are sometimes hard to understand), we prefer to display networks in Gephi by reading the exchange files exported from CiteSpace.
In addition to its powerful graphical representation, Gephi is useful for exploratory data analysis. It can also detect communities and calculate the shortest path, degree centrality, betweenness centrality, clustering coefficients, and other properties of the network.
CartoDB is an easy-to-use online geovisualization tool that allows the creation of beautiful visualizations of geographic data [49]. CartoDB can read users’ data files or connect with Google Drive, Dropbox, or Twitter. It can create maps in seconds, and there is no need for users to install any additional software or have map-making experience. CartoDB APIs can provide more data processing and spatial analysis functionalities for developers.
5.1. Discussion Related to Analysis Methods
By introducing bibliometric analysis methods to eye movement research in cartography, this paper demonstrates an effective and efficient way to visualize the intellectual structure of this knowledge domain, helping researchers quickly discover the main structure of this burgeoning research field. In addition, we refined current methods to facilitate interpretation from a professional perspective, which contributes to a better understanding of the results. Four facets of the work warrant discussion.
First, the search strategy is key to the results, so an appropriate search strategy should be discussed with experts in the field and may need to be modified several times based on evaluation of the results. In addition, because WOS does not index all scientific publications, it is difficult to fully encompass the research scope of this field. Therefore, we manually added some workshop papers to ensure the effectiveness of the results, but this is time consuming. An automatic mechanism to translate user-defined bibliographic database formats into WOS format is needed, but this is challenging because different databases follow different standards.
Second, the selection criteria for network construction are key to controlling the scope of the network model. For example, there are several ways to set selection criteria for each time slice, such as Top N, Top N%, and threshold levels of c, cc, and ccv (i.e., citation threshold, co-citation threshold, and co-citation coefficient threshold) [34]. Considering the quantity of publications and the year span of our bibliographic records, we chose the Top N method: Top 10 with a five-year time interval for co-citation analysis, Top 10 for bibliographic coupling analysis, and Top 50 per year for the co-occurrence of keywords, authors, and institutions. Large data sets permit larger N values.
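The Top N criterion can be illustrated with a small sketch that keeps the N most cited items in each time slice. The function `top_n_per_slice`, its tie-breaking rule, and the toy records are hypothetical; the sketch illustrates the idea only and is not the selection code of any tool.

```python
from collections import defaultdict


def top_n_per_slice(records, n, slice_years, start_year):
    """Keep the n most cited items in each time slice.

    records: iterable of (item, year, citation_count) tuples,
    with every year assumed to be >= start_year.
    Returns {(first_year, last_year): [items ordered by citations]}.
    """
    slices = defaultdict(list)
    for item, year, cites in records:
        slices[(year - start_year) // slice_years].append((cites, item))
    selected = {}
    for index, items in slices.items():
        # Most cited first; ties are broken alphabetically (an arbitrary choice).
        items.sort(key=lambda entry: (-entry[0], entry[1]))
        first = start_year + index * slice_years
        selected[(first, first + slice_years - 1)] = [item for _, item in items[:n]]
    return selected


# Hypothetical cited items: (identifier, publication year, citation count).
records = [("A", 1984, 10), ("B", 1985, 3), ("C", 1988, 7), ("D", 1990, 5), ("E", 1992, 9)]
selected = top_n_per_slice(records, n=2, slice_years=5, start_year=1984)
print(selected)
```

Raising `n` or shortening `slice_years` enlarges the resulting network, which is why the values should be matched to the size of the data set.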
Third, the network should be manipulated after generation to achieve better interpretation. Take the co-occurrence of keywords as an example. If the largest nodes “eye movement” and “eye tracking”, which are definitely prominent, are not excluded from the network, other nodes may appear very small and be difficult to identify. In some cases, nodes should be merged because of different spellings of the same object, such as institution or author names.
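Both manipulations, excluding dominant nodes and merging spelling variants, can be sketched on a weighted edge list. The function `clean_network`, the edge weights, and the institution name variants are hypothetical examples.

```python
def clean_network(edges, aliases=None, exclude=()):
    """Merge node name variants and drop edges that touch excluded nodes.

    edges: {(node_1, node_2): weight}
    aliases: {variant_name: canonical_name}
    exclude: nodes to remove, e.g. dominant keywords
    """
    aliases = aliases or {}
    exclude = set(exclude)
    cleaned = {}
    for (u, v), weight in edges.items():
        u, v = aliases.get(u, u), aliases.get(v, v)
        # Skip self-loops created by merging and edges of excluded nodes.
        if u == v or u in exclude or v in exclude:
            continue
        key = tuple(sorted((u, v)))
        cleaned[key] = cleaned.get(key, 0) + weight
    return cleaned


# Hypothetical keyword co-occurrence edges dominated by two large nodes.
keyword_edges = {
    ("attention", "eye movement"): 4,
    ("attention", "eye tracking"): 3,
    ("attention", "map design"): 2,
    ("map design", "usability"): 1,
}
keywords_cleaned = clean_network(keyword_edges, exclude={"eye movement", "eye tracking"})

# Hypothetical institution edges with two spellings of the same institution.
institution_edges = {("Ghent Univ", "Univ Zurich"): 2, ("Univ Ghent", "Univ Zurich"): 1}
institutions_cleaned = clean_network(institution_edges, aliases={"Univ Ghent": "Ghent Univ"})
print(keywords_cleaned, institutions_cleaned)
```

Merging sums the weights of the variant edges, so the merged node reflects the combined collaboration or co-occurrence strength.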
Finally, since the visualization outputs generated by bibliometric analysis tools are not very satisfactory, it is better to present the analysis results with other visualization tools, such as Gephi, based on exchange files. Additionally, by constructing a geo-collaboration network, the distribution of scientific power was represented at a macro level. This allowed us to extract location-based information from bibliographic records and display it spatially and intuitively. Furthermore, additional GIS functionalities, such as spatial clustering analysis, can then be applied.
5.2. Discussion Related to Current Trends
In addition to the WOS literature, some workshops have contributed much to the development of eye movement research in cartography, for instance, the pre-conference workshop on eye tracking sponsored by the ICA (International Cartographic Association) in 2013, and the 1st and 2nd International Workshops on Eye Tracking for Spatial Research held in 2013 and 2014. During the ICA eye tracking workshop, in addition to applications of eye movement research, new measurements and GIS tools were introduced to analyze eye movement data (e.g., a method to automatically identify users’ different activities on maps [72], and the use of a space-time cube to display and analyze eye movement recordings [73]). The ICA Commission on Cognitive Issues in Geographic Information Visualization also lists some tools for eye movement data analysis on its official website; EyeMMV (Eye Movements Metrics and Visualizations) and the Saliency Toolbox are representative examples. EyeMMV is an open source MATLAB toolbox designed for post-experimental eye movement analysis, which supports all eye tracking metrics and visualization techniques [75]. The project of saliency-based visual attention was started in the laboratory of Prof. Christof Koch at Caltech. The Saliency Toolbox is also a MATLAB toolbox, used for computing the saliency map of an image [56]. Other useful tools include iMap4 [78] and DynAOI [79]. The International Workshop on Eye Tracking for Spatial Research launched a wide-ranging discussion about eye movement research that is not limited to cartography. For example, with mobile eye trackers, both indoor and outdoor way-finding have been further discussed [80].
Recent technological developments in the area of eye movement have opened up new perspectives for cartographers in spatial cognition research. Cartographers have made much progress in studying navigation behaviors with eye tracking techniques. By comparing 2D maps with photorealistic 3D representations for pedestrian navigation, Dong and Liao [84] found that the advantages and disadvantages of 3D representations are task dependent: 3D representations performed less effectively and efficiently in the process of spatial knowledge acquisition, but more efficiently in self-positioning and orientation. A similar experiment was conducted by Lei et al. [86], using 2D and 3D electronic maps for way-finding. The results showed that people carried out a wider-ranging search with shorter viewing time on the 2D electronic map, while the 3D electronic map provided more information about the environment. Additionally, mobile eye trackers have been adopted to evaluate landmark identification and recall on maps [87]. With respect to map reading and map perception, some special user groups (e.g., users with color vision deficiency) have been investigated [89]. Furthermore, compared with 2D static maps, dynamic map symbols [90], dynamic interactive applications [92], and panoramic maps [95] have attracted much more attention from cartographers. In the future, eye tracking techniques might make great contributions to cartography in the usability research of VR (virtual reality) [96], AR (augmented reality) [97], emotional recognition [98].
6. Conclusions
This paper investigated and visualized the classic literature, research theme clusters, research hotspots, and collaboration patterns of eye movement research in cartography using multiple visual metaphors. In addition, a geovisualization method was used to represent the spatial distribution of scientific power. As a result, we discovered some interesting characteristics of this knowledge domain.
Co-citation analysis revealed the classic literature that would be most helpful for novice researchers. The results showed that eye movement research in cartography is an interdisciplinary field that encompasses areas such as psychology, cognitive science, usability engineering, and computer science. Particularly at the early stage of its development, the most cited literature is from the field of psychology. Since the 1970s, some cartographers have explored relationships between map design and map reading using eye tracking experiments, and there has been much research since the 1980s, especially in the last two decades. The co-word analysis results showed that cartographers have focused on attention and spatial cognition, and bibliographic coupling analysis identified some trends in usability research. In addition to focusing on classical problems in traditional cartography, such as map label placement or map legend layout, usability-oriented eye movement research in cartography has embraced several emerging techniques, such as web mapping, mobile mapping, animated mapping, and volunteered geographic information (VGI).
This paper also explored scientific collaboration from a micro level to a macro level; this helped to reveal the authorities and scientific power distribution of this research field. We noted that most of the authors had only one publication; that the most productive authors are mainly from Palacky University, Zurich University, and Ghent University; and that highly productive authors always have more collaboration relationships. In addition, the geo-collaboration network showed that Europe and the USA form two clusters of eye movement research in cartography, and that Europe is the international collaboration center.
A picture is worth a thousand words and the method proposed in this paper may help the investigation of knowledge domains. We hope that the method will not only assist researchers in quickly grasping the evolution and trends of their research field, but will also become a novel method by which to merge geovisualization with knowledge visualization.