Wider research context
In many domains, such as biology, chemistry, medicine, and the humanities, large amounts of data exist. Visual exploratory analysis of these data is often not practicable due to their size and their unstructured nature. Traditional machine learning (ML) requires large-scale labeled training data and a clear target definition, which is typically not available when exploring unknown data. For such large-scale, unstructured, open-ended, and domain-specific problems, we need an interactive approach combining the strengths of ML and human analytical skills into a unified process that helps users to "detect the expected and discover the unexpected".
Hypotheses
We hypothesize that humans and machines can learn jointly from the data and from each other during exploratory data analysis. We further hypothesize that this joint learning enables a new visual analytics approach that reveals how users' incrementally growing insights fit the data, which will foster questioning and reframing.
Approach
We integrate interactive ML and interactive visualization to learn about data and from data in a joint fashion. To this end, we propose a data-agnostic joint human-machine data exploration (JDE) framework that supports users in the exploratory analysis and the discovery of meaningful structures in the data. In contrast to existing approaches, we investigate data exploration from a new perspective that focuses on the discovery and definition of complex structural information from the data rather than primarily on the model (as in ML) or on the data itself (as in visualization).
Innovation
First, the conceptual framework of JDE introduces a novel knowledge modeling approach for visual analytics based on interactive ML that incrementally captures potentially complex, yet interpretable concepts that users expect or have learned from the data. Second, it proposes an intelligent agent that elicits information fitting the users' expectations and discovers what may be unexpected for the users. Third, it relies on a new visualization approach focusing on how the large-scale data fits the users' knowledge and expectations, rather than solely the data. Fourth, this leads to novel exploratory data analysis techniques -- an interactive interplay between knowledge externalization, machine-guided data inspection, questioning, and reframing.
Primary researchers involved
The project is a joint collaboration between researchers from TU Wien (Manuela Waldner) and the University of Applied Sciences St. Pölten (Matthias Zeppelzauer), Austria, who contribute and join their complementary expertise on information visualization, visual analytics, and interactive ML.
FWF Stand-alone project P 36453
DOI: 10.55776/P36453
News
Our poster WebGPU for Scalable Client-Side Aggregate Visualization has won the Best Poster Award at EuroVis 2023!
Funding
- FWF Fonds zur Förderung der wissenschaftlichen Forschung (FWF)
Project Partner
Team
News
- posted on
Research Areas
- In this research area, our focus lies on novel visual encodings and interaction techniques to explore a large amount of abstract data, often in combination with analytical reasoning.
Publications
Image | Bib Reference | Publication Type |
---|---|---|
2024 | ||
Dominik Wolf Joint Human-Machine Data Exploration Sandbox [report] |
Student Project | |
Matthias Matt, Matthias Zeppelzauer, Manuela Waldner cVIL: Class-Centric Visual Interactive Labeling In Eurographics Proceedings. May 2024. [paper] |
Conference Paper | |
2023 | ||
Judith Louis-Alexandre Dit Petit-Frere, Manuela Waldner Visual Exploration of Indirect Bias in Language Models In EuroVis 2023 - Short Papers. June 2023. [paper] [video] [online demo] |
Conference Paper | |
Gerald Kimmersdorfer, Dominik Wolf, Manuela Waldner WebGPU for Scalable Client-Side Aggregate Visualization Poster shown at 25th EG Conference on Visualization (EuroVis 2023) (12. June 2023-16. June 2023) In EuroVis 2023 - Posters , pages 105-107. [extended abstract] [poster] [Climate Change Explorer] |
Poster |