Reconstructing sessions from data discovery and access logs to build a semantic knowledge base for improving data discovery

18Citations
Citations of this article
29Readers
Mendeley users who have this article in their library.

Abstract

Big geospatial data are archived and made available through online web discovery and access. However, finding the right data for scientific research and application development is still a challenge. This paper aims to improve the data discovery by mining the user knowledge from log files. Specifically, user web session reconstruction is focused upon in this paper as a critical step for extracting usage patterns. However, reconstructing user sessions from raw web logs has always been difficult, as a session identifier tends to be missing in most data portals. To address this problem, we propose two session identification methods, including time-clustering-based and time-referrer-based methods. We also present the workflow of session reconstruction and discuss the approach of selecting appropriate thresholds for relevant steps in the workflow. The proposed session identification methods and workflow are proven to be able to extract data access patterns for further pattern analyses of user behavior and improvement of data discovery for more relevancy data ranking, suggestion, and navigation.

References Powered by Scopus

Data Preparation for Mining World Wide Web Browsing Patterns

0
1119Citations
N/AReaders
Get full text

Mixtools: An R package for analyzing finite mixture models

987Citations
N/AReaders
Get full text

Data mining in course management systems: Moodle case study and tutorial

750Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Big Data and cloud computing: innovation opportunities and challenges

617Citations
N/AReaders
Get full text

Utilizing Cloud Computing to address big geospatial data challenges

152Citations
N/AReaders
Get full text

Towards intelligent geospatial data discovery: a machine learning framework for search ranking

21Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Jiang, Y., Li, Y., Yang, C., Armstrong, E. M., Huang, T., & Moroni, D. (2016). Reconstructing sessions from data discovery and access logs to build a semantic knowledge base for improving data discovery. ISPRS International Journal of Geo-Information, 5(5). https://doi.org/10.3390/ijgi5050054

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 21

88%

Researcher 2

8%

Lecturer / Post doc 1

4%

Readers' Discipline

Tooltip

Computer Science 12

63%

Earth and Planetary Sciences 5

26%

Neuroscience 1

5%

Social Sciences 1

5%

Save time finding and organizing research with Mendeley

Sign up for free