Data visualization of genetic networks

Data visualization of genetic networks for the project "Gustave Roud, Œuvres complètes" (Université de Lausanne) in collaboration with Density Design Lab (Politecnico di Milano).

Documentation

"Gustave Roud, Œuvres complètes": project page. The project includes the development of a web application to access the texts and the archive of Gustave Roud. The visualizations of the genetic networks will be included in the web application.
Christen, A., & Spadini, E. (2019). "Modeling genetic networks. Gustave Roud’s œuvre, from diary to poetry collections". Umanistica Digitale, 3(7). https://doi.org/10.6092/issn.2532-8816/9063. This article presents the data model used in the project to organize the genetic relationships between the manuscripts and the publications. Some aspects of the data model have evolved since the publication of the article (e.g., we do not take into account anymore the order of the avant-textes inside a genetic dossier), but the main concepts remain the same.
Web application development: https://github.com/gustaveroudproject/roud-oeuvres-app. Documentation: https://github.com/gustaveroudproject/roud-oeuvres-app/blob/master/DOCUMENTATION.md.
ARKs are persistent identifier in the form of URL pointing to our edition or to a generic interface.
Slides of a presentation about the collections (recueils) in the work of Gustave Roud: https://gustaveroudproject.github.io/talks/20180524_cuso_recueil.html#/ (hover to see details in some of the slides).

Model

Classes and properties

The modelling of the genetic networks of Gustave Roud's works are complexe, especially because each document or publication might be composed of multiple parts or sections.

A simplified model of the genetic networks, without taking into accounts parts, is available in a human-readable rendering or a Turtle file. This is the suggested starting point to read the model.

The complete model, including parts, is available in a human-readable rendering or a Turtle file.

Visual rendering

Summary of all possible genetic relationships
Visual rendering of the simplified specification, without parts (through WebVOWL; for a better visualisation, scroll for 'Max label width' in the 'Options' menu). This link is also given in the specification itself.
Visual rendering of the complete specification (through WebVOWL; for a better visualisation, scroll for 'Max label width' in the 'Options' menu). This link is also given in the specification itself.

Some info

The project ontology is build on top of the knora-base ontology, that is a general purpose schema for data in the humanities and social sciences, compliant with the framework Knora. Our ontology aims to model archival objects, bibliographical records, persons, places, events and other information related to Roud's works. Here we removed the knora-base layer and list only the classes (nodes) and properties (edges) of the ontology that are relevant for the modelling of the genetic relationships. For instance, properties for titles and dates are not listed in the specification, but present in the data.

In our work, we created a generic model for genetic data, GENO. Our implementation in the project uses most of GENO's classes and properties, but not all of them; and adds classes and properties that do not exist in GENO, namely decomposing publications, manuscripts and dossiers in parts.

Technical note: classes and properties can be understood as nodes and edges. The domain of a property indicates the category of the starting point of an edge and the range indicates the category of its ending point. The properties are predicates in a triple of subject-predicate-object. Both classes and properties can have sub-classes (which inherit the properties) and sub-properties (which share the object of the property). The value of an object-type property is a node in the graph, while the value of a data-type property is a string, or literal. One can use both the human-readable label of a class or property (e.g. part of a manuscript), and its IRI (the last part of the IRI, e.g. MsPart) to identify it. Unfortunately, in this specification there are a lot of inconsistencies in the naming of classes and properties.

About the visualization

What to show, make evident or consider in the visualisation for each node:

type
if type is :Publication and subtype :Book => title and date
if type is :Publication and subtype :PeriodicalArticle => title, date and periodical
if type is :Publication => if it has photo or not (default is not)
if type is :Manuscript => title, archive, shelfmark (id of the manuscript), editorial set (for Diary only), genetic stage (not for diary), date
if type is :MsPart or :PubPart => title and number indicating the order of the parts
if type is :GeneticDossier or :GeneticDossierPart, no need to visualize properties

Data

Data for each work by Gustave Roud can be found in the folder data. The works are:

Adieu, Au Verseau, 1927
Feuillets, Mermod, 1929
Petit traité de la marche en plaine suivi de Lettres, dialogues et morceaux, Mermod, 1932
Essai pour un paradis, Mermod, 1933
Pour un moissonneur, Mermod, 1941
Air de la solitude, Mermod, 1945
Haut-Jorat, Éditions des Terreaux, 1949
Le Repos du cavalier, Bibliothèque des Arts, 1958
Requiem, Payot, 1967
Campagne perdue, Bibliothèque des Arts, 1972

Name		Name	Last commit message	Last commit date
Latest commit History 43 Commits
data		data
doc		doc
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data visualization of genetic networks

Documentation

Model

Classes and properties

Visual rendering

Some info

About the visualization

Data

About

Releases

Packages

vpallacci1/geneticNetworksDataViz

Folders and files

Latest commit

History

Repository files navigation

Data visualization of genetic networks

Documentation

Model

Classes and properties

Visual rendering

Some info

About the visualization

Data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages