Skip to content

InTaVia/milestone-3

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 

Repository files navigation

InTaVia Milestone-3

InTaVia Milestone 3 Documentation - Basic Operational System

Individual components of the system operational with connectivity partially established and ready for evaluation by WP7.

Backend (WP2 & 3)

Datamodel InTaVia IDM-RDF

The repository https://github.com/InTaVia/idm-rdf contains the IDM-RDF at its current state (including ongoing discussions in the issues).

Ingestion workflows / source datasets

The repository https://github.com/InTaVia/source-dataset-conversion contains the conversion scripts and the datasets in the IDM data model at their current state for the following prosopographical source datasets:

JSON API

The repository https://github.com/InTaVia/grlc contains the grlc software, that is used for providing the JSON API, at its current state.

The repository https://github.com/InTaVia/grlc_sparql contains the JSON API definitions at their current state, namely the following APIs:

  • Generic entity search
  • Person history

The test version of the API is available under https://grlc.acdh-dev.oeaw.ac.at/api/InTaVia/grlc_sparql#/

Researchspace for internal usage

In addition to the resources mentioned above we set up a researchspace instance that is meant for internal work on and exploration of the knowledge graph. This service is currently for internal use only.

Prefect workflow component

We deployed prefect within the ACDH-CH kubernetes cluster to run conversion, enrichment and ingestion scripts on the cluster. Given that the open-source version of this software solution doesnt come with authentication built in, this component is currently only reachable from within ACDH-CH subnet. Given some delay in our original planning the scripts are currently still executed locally instead of running within prefect.

WP4

NLP Abbreviations

We include the code to train an abbreviation identification classifier and to run an abbreviation expansion generator, both based on pre-trained transformer language models. The code can be found here. This is the preliminary version, where we test the concept on a small gold-standard slovene dataset (for now, a separate request for the dataset is required). More thorough evaluation and expansion of the experiments to Dutch and German coming soon.

NLP Visualization

The milestone 3 version of our interactive text mining environment can be found here. It consists of two connected components, Performancer and AnnoXplorer, providing an overview and detail view on the data respectively; brushing over texts in the former displays them in the latter.

Frontend (WP5 & WP6)

The Milestone 3 Prototype (v0.1.0) of the InTaVia web client (frontend) is available as a permanent release: https://github.com/InTaVia/web/releases/tag/v0.1.0.

The current prototype is available online: https://intavia.acdh-dev.oeaw.ac.at/.

The data shown in the application are automatically generated mock data including person and place entities. As soon as the connection between the backend and the frontend is established, the mock data will be replaced. Note: if below linked visualizations do not show results, please first do a search on the search page and then switch back to the visualization.

The prototype implements aspects of the three top-level components (Data Curation Lab, DC lab; Visual Analytics Studio, VA studio; Visual Storytelling Suite, ST suite) in a single application on separate pages. The components implemented are:

DC lab

VA studio

Data views:

Coordinated views:

  • Multiple views integrated on a single page are shown here: https://intavia.acdh-dev.oeaw.ac.at/coordination
  • The example coordinates an entity list view, a timeline and a map showing persons and their life events via mouseover highlighting (red colour).
  • Currently, applies only to the map, a dropdown menu allows to filter the depicted events. If more than one event type is selected, the localized events are connected with lines in chronological order.

Visual querying:

ST suite = ST creator + ST viewer

  • The ST creator implements an interactive user interface allowing to generate and import slide-based stories. The prototype can depict persons’ life-events on a map and provides annotation capabilities (i.e., images and text): https://intavia.acdh-dev.oeaw.ac.at/storycreator

    • Stories Overview (create and delete stories)
    • Story Flow (create, layout and delete slides per drag and drop)
    • Slide Editor (create, edit, layout and delete map, images and text per drag and drop)
    • Text Mode (edit or upload the whole story via textarea) (accessible through the clipboard icon in the top right corner)
  • The single event slides for the "Life of Paolo Vergerio" can be imported through the text editor using the content of json this file.

  • A clickable Adobe XD prototype of the ST viewer is available online: ST Viewer Prototype

    • The prototype tells the story "Life of Paolo Vergerio" primarily based on a map/timeline visualization featuring annotations, images, quiz and timebased visualizations.
    • All screens including a selection of desktop views are available as PNG: Screens ST viewer [.zip]

About

InTaVia Milestone 3 Documentation

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published