| | | | | |
Select: All | None | Toggle Preview: Open All | Close All |
2007
|
| |  | Groth, Dennis P. | Tracking and Organizing Visual Exploration Activities across Systems and Tools read moreAbstract: Modern knowledge discovery activities occur in highly dynamic environments. Specific activities may involve multiple tools, techniques, systems, individuals, and locations. In addition to these complexities, the span of time involved with discovery may vary from short to long, as well as being contiguous or disjoint. This paper presents a framework for tracking the history, or provenance, of the discovery process across applications, systems, and users. The resulting capabilities provide fine-grained provenance information relative to the discovered information. Along with the provenance framework, a prototype system is used to demonstrate the main concepts of the proposed approach. | 2007 |
2006
|
| |  | Mothe, Josiane | Combining mining and visualization tools to discover the geographic structure of a domain read moreAbstract: Science monitoring is a core issue in the new world of business and research. Companies and institutes need to monitor the activities of their competitors, get information on the market, changing technologies or government policies. This paper presents the Tétralogie platform that is aimed at allowing a user to interactively discover trends in scientific research and communities from large textual collections that include information about geographical location. Tétralogie consists of several agents that communicate with each other on users’ demands in order to deliver results to them. Metadata and document content are extracted before being mined. Results are displayed in the form of histograms, networks and geographical maps; these complementary types of presentations increase the possibilities of analysis compared to the use of these tools separately. We illustrate the overall process through a case study of scientific literature analysis and show how the different agents can be combined to discover the structure of a domain. The system correctly predicts the country contribution to a field in future years and allows exploration of the relationships between countries. | 2006 |
| |  | Robinson, A. C. | Re-Visualization: Interactive Visualization of the Process of Visual Analysis read moreAbstract: Sorry no abstract available for this article | 2006 |
2005
|
| |  | Amar, Robert m. | Low-Level Components of Analytic Activity in Information Visualization read moreAbstract: Existing system-level taxonomies of visualization tasks are geared more towards the design of particular representations than the facilitation of user analytic activity. We present a set of ten low-level analysis tasks that largely capture people��?s activities while employing information visualization tools for understanding data. To help develop these tasks, we collected nearly 200 sample questions from students about how they would analyze five particular data sets from different domains. The questions, while not being totally comprehensive, illustrated the sheer variety of analytic questions typically posed by users when employing information visualization systems. We hope that the presented set of tasks is useful for information visualization system designers as a kind of common substrate to discuss the relative analytic capabilities of the systems. Further, the tasks may provide a form of checklist for system designers | 2005 |
| |  | Amar, Robert m. | Low-Level Components of Analytic Activity in Information Visualization read moreAbstract: Existing system-level taxonomies of visualization tasks are geared more towards the design of particular representations than the facilitation of user analytic activity. We present a set of ten low-level analysis tasks that largely capture people��?s activities while employing information visualization tools for understanding data. To help develop these tasks, we collected nearly 200 sample questions from students about how they would analyze five particular data sets from different domains. The questions, while not being totally comprehensive, illustrated the sheer variety of analytic questions typically posed by users when employing information visualization systems. We hope that the presented set of tasks is useful for information visualization system designers as a kind of common substrate to discuss the relative analytic capabilities of the systems. Further, the tasks may provide a form of checklist for system designers | 2005 |
| |  | Gahegan, Mark | Beyond Tools: Visual Support for the Entire Process of GIScience read moreAbstract: Sorry no abstract available for this article | 2005 |
| |  | Butler, A. R. | Three dogmas of metadata and undiscovered knowledge read moreAbstract: The prevalence of metadata and technologies supporting metadata have failed to achieve their anticipated objectives (shared data, loose coupling, productivity, intelligence) because they represent an unrealistic, or at least incomplete, picture of the concept "metadata". Three emergent assumptions typically adopted by designers using metadata are identified, assumptions that contain the seeds of the inevitable failure of such designs. We also suggest a number of methodologies to extricate the architect (and more critically, partner architects) from such dogma. | 2005 |
| |  | Harry,, Beth | Mapping the Process: An Exemplar of Process and Challenge in Grounded Theory Analysis read moreAbstract: This article responds to recent calls for greater clarity and transparency regarding methods in qualitative research. On the basis of a 3-year ethnographic study of the overrepresentation of minorities in special education, the authors address key tenets of grounded theory and attempt to reconcile some of the methodological challenges inherent in naturalistic inquiry. They discuss theoretical considerations and use a visual model to illustrate how they applied grounded theory to this complex and sensitive topic. Emphasizing the social nature of decision making in special education, the authors point to the appropriateness of qualitative methods to the investigation of such issues. | 2005 |
| |  | Groth, D. P. | Tracking Personal Histories for Knowledge Discovery Tasks read moreAbstract: Interactive visualizations provide an ideal setting for
exploring the use and exploitation of personal histories.
Even though visualizations leverage innate human
capabilities for recognizing interesting aspects of data, it is
unlikely that two users will follow the exact process for
discovery. This results in an inability to effectively
recreate the exact conditions of the discovery process,
which we call the knowledge rediscovery problem.
Because we cannot expect a user to fully document each of
their interactions, there is a need for visualization systems
to maintain user trace data in a way that enhances a user's
ability to communicate what they found to be interesting,
as well as how they found it. This project presents a model
for representing user interactions that articulates with a
corresponding set of annotations, or observations that are
made during the exploration. This problem is only made
more challenging when pervasive computing and
corresponding interactions across devices is factored in. | 2005 |
2002
|
| |  | Gahegan, Mark | Introducing GeoVISTA Studio: an integrated suite of visualization and computational methods for exploration and knowledge construction in geography read moreAbstract: One barrier to the uptake of geocomputation is that, unlike GIS, it has no system or toolbox that provides easy access to useful functionality. This paper describes an experimental environment, GeoVISTA Studio, that attempts to address this shortcoming. Studio is a Java-based, visual programming environment that allows for the rapid development of complex data exploration and knowledge construction applications to support geographic analysis. It achieves this by leveraging advances in geocomputation, software engineering, visualization and machine learning. At the time of writing, Studio contains full 3D rendering capability and has the following functionality: interactive parallel coordinate plots, scatterplot, visual classifier, 2D map and image viewer, sophisticated colour selection (including Munsell colour-space), spreadsheet, statistics package, and supervised and unsupervised neural networks. Through examples of Studio at work, this paper demonstrates the roles that geocomputation and visualization can play throughout the scientific cycle of knowledge creation, emphasising their supportive and mutually beneficial relationship. A brief overview of different types of inference used in such knowledge creation activities is given, and related to the exploratory analysis tools described. By way of results, a detailed account of the use of these tools is presented, and various findings and insights generated are pointed out. The domain of application is the process of uncovering useful categories by which a taxonomy of landuse/landcover can be created. The proposed categories are then evaluated using a combination of neural and visual methods, to ensure their viability.
| 2002 |
2001
|
| |  | Roddick, J. F. | Paradigms for spatial and spatio-temporal data mining read moreAbstract: With some significant exceptions, current applications for data mining are either in those areas for which
there is little accepted discovery methodology or are being used within a knowledge discovery process
that does not expect authoritative results but finds the discovered rules useful none-the-less. This is in
contrast to its application in the fields applicable to spatial or spatio-temporal discovery which possess a
rich history of methodological discovery and result evaluation. | 2001 |
2000
|
| |  | Buttenfield, Barbara P. | Geospatial data mining and knowledge discovery read moreAbstract: The advent of remote sensing and survey technologies over the last decade has
dramatically enhanced our capabilities to collect terabytes of geographic data on a daily
basis. However, the wealth of geographic data cannot be fully realized when information
implicit in data is difficult to discern. This confronts GIScientists with an urgent need for
new methods and tools that can intelligently and automatically transform geographic data
into information and, furthermore, synthesize geographic knowledge. It calls for new
approaches in geographic representation, query processing, spatial analysis, and data
visualization (Yuan 1998, Miller and Han 2000; Gahegan, 2000). Information scientists
face the same challenge as a result of the digital revolution that expedites the production
of terabytes of data from credit card transactions, medical examinations, telephone calls,
stock values, and other numerous human activities. Collaborative efforts in artificial
intelligence, statistics, and databases communities have been the underpinning
technologies of knowledge discovery in databases to extract useful information from
massive amounts of data in support of decision-making (Gardner 1996, Bhandari et al.
1997, Hedberg 1996). | 2000 |
| |  | Buttenfield, Barbara P. | Geospatial data mining and knowledge discovery read moreAbstract: The advent of remote sensing and survey technologies over the last decade has
dramatically enhanced our capabilities to collect terabytes of geographic data on a daily
basis. However, the wealth of geographic data cannot be fully realized when information
implicit in data is difficult to discern. This confronts GIScientists with an urgent need for
new methods and tools that can intelligently and automatically transform geographic data
into information and, furthermore, synthesize geographic knowledge. It calls for new
approaches in geographic representation, query processing, spatial analysis, and data
visualization (Yuan 1998, Miller and Han 2000; Gahegan, 2000). Information scientists
face the same challenge as a result of the digital revolution that expedites the production
of terabytes of data from credit card transactions, medical examinations, telephone calls,
stock values, and other numerous human activities. Collaborative efforts in artificial
intelligence, statistics, and databases communities have been the underpinning
technologies of knowledge discovery in databases to extract useful information from
massive amounts of data in support of decision-making (Gardner 1996, Bhandari et al.
1997, Hedberg 1996). | 2000 |
| |  | Bollacker, K. D. | Discovering relevant scientific literature on the Web read moreAbstract: Scientific literature on the Web makes up a massive, noisy, disorganized database. Unlike large, single-source databases such as a corporate customer database, the Web database draws from many sources, each with its own organization. Also, owing to its diversity, most records in this database are irrelevant to an individual researcher. Furthermore, the database is constantly growing in content and changing in organization. All these characteristics make the Web a difficult domain for knowledge discovery. To quickly and easily gather useful knowledge from such a database, users need the help of an information filtering system that automatically extracts only relevant records as they appear in a stream of incoming records. To this end, we have developed the CiteSeer. CiteSeer is an automatic generator of digital libraries of scientific literature. It uses sophisticated acquisition, parsing, and presentation methods to eliminate most of the manual effort of finding useful publications on the Web | 2000 |
1998
|
| |  | Montebello, M. | Information overload-an IR problem? read moreAbstract: Information overload on the World Wide Web (WWW) is a well recognised problem. Research to subdue this problem and extract maximum benefit from the Internet is still in its infancy. With huge amounts of information connected to the Internet, efficient and effective discovery of resources and knowledge has become an imminent research issue. A vast array of network services is growing up around the Internet and a massive amount of information is added everyday. Despite the potential benefits of existing indexing, retrieving and searching techniques in assisting users in the browsing process, little has been done to ensure that the information presented is of a high recall and precision standard. Therefore, search for specific information on this massive and exploding information resource base becomes highly critical. The author discusses the issues involved in resolving the information overload over the WWW and argues that this is solely an information retrieval problem. As a contribution to the field he proposes a general architecture to subdue information overload and describes how this architecture has been instantiated in a functional system he developed | 1998 |
1997
|
| |  | | Using MineSet for knowledge discovery read moreAbstract: Sorry no abstract available for this article | 1997 |
1992
|
| |  | Frawley, W. J. | Knowledge Discovery in Databases: an Overview read moreAbstract: This article presents an overview of the state
of the art in research on knowledge discovery
in databases. We analyze Knowledge Discovery
and define it as the nontrivial extraction of
implicit, previously unknown, and potentially
useful information from data. We then
compare and contrast database, machine
learning, and other approaches to discovery
in data. We present a framework for knowledge
discovery and examine problems in
dealing with large, noisy databases, the use
of domain knowledge, the role of the user in
the discovery process, discovery methods, and
the form and uses of discovered knowledge.
We also discuss application issues, including
the variety of existing applications and
propriety of discovery in social databases. We
present criteria for selecting an application
in a corporate environment. In conclusion,
we argue that discovery in databases is both
feasible and practical and outline directions
for future research, which include better use
of domain knowledge, efficient and incremental
algorithms, interactive systems, and
integration on multiple levels. | 1992 |