-
Notifications
You must be signed in to change notification settings - Fork 0
Expand file tree
/
Copy pathindex.html
More file actions
160 lines (141 loc) · 14.8 KB
/
index.html
File metadata and controls
160 lines (141 loc) · 14.8 KB
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
<!DOCTYPE html>
<html>
<head>
<title>Introduction to automatic collation</title>
<link rel="stylesheet" href="css/index.css" />
<meta charset="UTF-8" />
<meta name="description" content="Introduction to automatic collation: workshop materials" />
<meta name="keywords" content="collation, digital philology" />
<meta name="author" content="Elena Spadini, Elisa Nury, Helena Bermúdez Sabel" />
<meta name="viewport" content="width=device-width, initial-scale=1.0" />
</head>
<body>
<h1>Introduction to automatic collation</h1>
<img src="img/header.png" alt="color gradiant" class="header" />
<p>This is the home page of the online workshop <em>Introduction to automatic collation</em>
that will take place on the 24th and 25th of September. This course is offered by the
<a href="https://www.unil.ch/doc-digitalstudies/home.html">Programme Doctoral en Études Numériques</a> of the University of Lausanne (UNIL).</p>
<p> From this home
page you can access:</p>
<ul>
<li>Information about the workshop <a href="#logistics">logistics</a>.</li>
<li>The <a href="#program">program of the workshop</a> with links to the slides for each
session.</li>
<li>A <a
href="https://bimestriel.framapad.org/p/automaticcollationlausanne2020-9j0d?lang=en"
target="_blank">discussion environment</a>: a Framapad document where you can
post questions or resources, and chat.</li>
<li>The <a href="https://github.com/automaticCollationLausanne2020/Materials"
target="_blank">Github repository</a> that contains all the materials of the
workshop.</li>
<li>The installation instructions of the software we will use. There is NO need to install software to follow the course, because we will be using a virtual environment and explain how to use it during the course. If, nevertheless, you want to have your own installation, please follow <a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/installation.md" target="_blank">these instructions</a>; should you have any problem with the installation, please contact the instructors <u>before</u> the course.</li>
<li>Basic information about the <a href="#tech">technological framework</a> we will be using during the workshop: Github, Binder and Jupyter Notebooks. <u>Optionally, you can read this section before the workshop to familiarize yourself with the terms used and follow the Markdown tutorial indicated below (max 30 mins)</u>. Do not worry if it is a lot of new information! We will go through everything together during the workshop and you will find out that the practice is easier than the theory ;)</li>
<li><a href="#credits">Credits and license</a>.</li>
</ul>
<p>Please forward any doubts, problems or suggestions to the course instructors: <a
href="mailto:elena.spadini@unil.ch">Elena Spadini</a> (UNIL) - <a
href="mailto:elisa.nury@unige.ch">Elisa Nury</a> (UNIGE/UNIL) - <a
href="mailto:helena.bermudezsabel@unil.ch">Helena Bermúdez Sabel</a> (UNIL)
</p>
<div>
<h2 id="logistics">Logistics</h2>
<p>The course is held online on the Zoom platform hosted at UNIL. Participants will receive the room ID and password by email shortly before the beginning of the course. There is no need to have a Zoom account to join the room. You can access the room via the browser or the Zoom desktop app.</p>
</div>
<div>
<h2 id="program">Program outline</h2>
<h3>Day 1</h3>
<ul>
<li><span>9h15</span> Welcome / Accueil. Setting up the working environment / Préparer l'environnement de travail
<ul>
<li>Follow the instructions at the bottom of <a href="https://github.com/automaticCollationLausanne2020/Materials" target="_blank">this page</a> (Github repo)</li>
</ul></li>
<li><span>9h45</span> Explore your computer <ul>
<li>
<span>9h45-10h30</span>
<a href="session1a.html">Session 1a</a>: how to work on the command line</li>
<li>
<span>10h35-11h</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session1/session1b_JupyterNotebooks.ipynb">Session 1b</a>: what is a Jupyter Notebook [open in Binder]</li>
<li><span>11h-11h30</span> Break / pause</li>
<li>
<span>11h30-12h30</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session2/Session02_python_introduction.ipynb">Session 2</a>: gentle introduction to the Python programming language [open in Binder]</li>
</ul>
</li>
<li><span>14h</span> Theory of automatic collation and reflection about the use of
digital tools for textual criticism <ul>
<li><span>14h-14h15</span>
<a href="https://github.com/enury/collation-viz">Session 3a</a> (showcase): the Calpurnius app</li>
<li><span>14h15-15h15</span>
<a href="session3b.html">Session 3b</a>: manual and automatic collation</li>
<li><span>15h15-15h45</span> Break / Pause</li>
<li><span>15h45-16h45</span>
<a href="session4a.html">Session 4a</a>: the Gothenburg model of automatic collation</li>
<li><span>16h45-17h</span>
<a href="https://hal.archives-ouvertes.fr/hal-02268348" target="_blank">Session 4b</a> (showcase): the Falcon package</li>
</ul></li>
<li><span>17h15-18h30</span> Keynote: <a href="https://pure.knaw.nl/portal/en/persons/elli-bleeker">Elli Bleeker</a>, <em>What has been left out: The perks
and pitfalls of XML-aware collation</em> [<a href="https://docs.google.com/presentation/d/12-rY3DnMbGMiW5B847xJ32urKvdfQGBDySawmi_jW2Q/edit#slide=id.p">slides</a>]</li>
</ul>
<h3>Day 2</h3>
<ul>
<li><span>9h30</span> Automatic collation practise: environment configuration, making a
simple collation (part I)<ul>
<li><span>9h30-10h15</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session5/Session05_PlainTextCollation.ipynb">Session 5a</a>: collate with CollateX, plain texts [open in Binder]</li>
<li><span>10h15-11h</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session5/Session05b_collateFiles.ipynb">Session 5b</a>: collate with CollateX, reading from files [open in Binder]</li>
<li><span>11h-11h30</span> Break / pause</li>
<li><span>11h30-12h10</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session6/collation-outputs.ipynb">Session 6a</a>: collate with CollateX, output options [open in Binder]</li>
<li><span>12h10-12h30</span> Session 6b: guest speaker <a href="https://unil.ch/fra/fra/rudolfmahrer">Rudolf
Mahrer</a>, <i><a href="http://variance.ch/">Variance</a>. Éditer la genèse post-éditoriale des œuvres</i> [<a href="img/Variance_automatic_collation.pptx">slides</a>]</li>
</ul></li>
<li><span>14h</span> Automatic collation practise: environment configuration, making a
not so simple collation (part II)<ul>
<li><span>14h-14h35</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session7/Session07_CollateXInputFormats.ipynb">Session 7a</a>: collate with CollateX, input options [open in Binder]</li>
<li><span>14h35-15h15</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session7/Session07b_Normalization.ipynb">Session 7b</a>: collate with CollateX, normalization options [open in Binder]</li>
<li><span>15h15-15h45</span> Break / Pause</li>
<li><span>15h45-17h</span>
<a href="https://github.com/automaticCollationLausanne2020/Materials/blob/master/session8/Session08_Practice.ipynb">Session 8</a>: time to discuss your project and to exercise</li>
</ul>
</li>
</ul>
</div>
<div>
<h2 id="tech">Basic info about ...</h2>
<h3>... Github</h3>
<p>
<b><a href="https://github.com/">Github</a></b> is a platform to host, share and review code. It is used by developers and researchers to collaborate on the development of software for any kind of project, open-source or not. A Github repository, or repo, is the place where all the files of a project are hosted; a Github repo is "physically" a directory (a folder) that can be cloned or downloaded to your computer. Github uses the <a href="https://git-scm.com/">git</a> distributed version control software, that is a way to save changes over time without overwriting previous versions of a file: this is very useful for collaboration.
<br/>Examples of a Github repo? The <a href="https://dig-ed-cat.acdh.oeaw.ac.at/">Catalogue of Digital Editions</a>' data <a href="https://github.com/gfranzini/digEds_cat">repo</a>; the <a href="https://github.com/automaticCollationLausanne2020/Materials" target="_blank">repo</a> that contains all the materials of this workshop; the repo of <a href="https://github.com/automaticCollationLausanne2020/automaticCollationLausanne2020.github.io">this website</a>.</p>
<p>How will we use Github during the workshop? As a place to host the workshop materials. We will download them together at the beginning of the workshop, you don't have to do anything before. They will remain available in Github after the workshop as well.
</p>
<h3>... Binder</h3>
<p>
<b>Binder</b> makes possible to create a computing environment that can be shared and used by many remote users: each user access a virtual computer (or virtual machine) ready to be used, where all the required programs have been installed and data are available.
In our case, we use <a href="https://mybinder.org/">mybinder.org</a>, a service which works with Github repositories: we just give to it the URL of our Github repo and Binder creates a virtual computer for each participant, where s/he can access and execute all the files provided in the workshop materials. We should be aware that all the changes we make are lost when we close the virtual computer created by Binder, for example when we turn off our computer at the end of the day. But we can export everything that we've done!</p>
<p>How we will use Binder during the workshop? We will launch Binder together at the beginning of the workshop, by clicking the <img src="https://mybinder.org/static/images/badge_logo.svg?v=117793ab76524046ef44e2d2d5af220c"></img> button in our <a href="https://github.com/automaticCollationLausanne2020/Materials">Github repo</a>. You don't have to do anything before.</p>
<h3>... Jupyter Notebooks</h3>
<p>
The workshop materials are mainly Jupyter Notebooks. A <a>Jupyter Notebook</a> is first of all a file, like a normal file on your computer, a <code>.pdf</code> or a <code>.docx</code> file: its extension is <code>.ipynb</code> (if you don't see file extensions in your computer, this is the right moment to fix it! Just search the web for "how to make extension visible" in your operating system –Windows, MacOS or Linux– and you will find plenty of very easy instructions). To open a Jupyter Notebook you will need the right environment, just as to open a <code>.docx</code> file you need Microsoft Word or another program capable of doing it. The good news is that the right environment to open a Jupyter Notebook is available in the virtual computer created by Binder, as we've seen above (and should be available on your computer if you chose to follow the optional installation instructions). A Jupyter Notebook is a special sort of file, because it can contain text (to be read) and code (to be run, or executed): the text is written in the <a href="https://www.markdownguide.org/">Markdown</a> markup language and the code in the <a href="https://www.python.org/">Python</a> programming language.</p>
<p>How we will use Jupyter Notebooks during the workshop? We will use them for all the practical parts of the workshop. We will see together how a Jupyter Notebook works in session 1 and introduce the Python language in session 2. <u>If you've never used the Markdown language before, you might have a look at <a href="https://www.markdownguide.org/cheat-sheet">this cheat sheet</a> and follow <a href="https://www.markdowntutorial.com/">this tutorial</a> before the workshop</u> (max 30 mins; we suggest to use the English version, because the others contain translation errors); but again, this is not compulsory.</p>
</div>
<div>
<h2 id="credits">Credits</h2>
<div>Some of the materials collected here have been prepared by the instructors specially for this course. Others reuse tutorials from
<ul>
<li>the <i><a href="https://github.com/elespdn/CollateX_tutorial/blob/master/workshopLausanne201904/INTRO.ipynb">Formation Textes et éditions numériques</a></i> held at the University of Lausanne in April 2019,</li>
<li>the <i>DiXiT Workshop Code and Collation: Training Textual Scholars</i> held in Amsterdam in 2016 (<a href="https://sites.google.com/site/dixitcodingcollation/">website</a> | <a href="https://github.com/DiXiT-eu/collatex-tutorial">Github repo</a>),</li>
<li>the <i><a href="http://collatex.obdurodon.org/">CollateX Workshop</a></i> held in Sydney during DH 2016.</li>
</ul>
Many persons over the years have contributed to these materials, among which Ronald Haentjens Dekker (the main developer of CollateX) and, in alphabetical order, Tara Andrews, Helena Bermúdez Sabel, David Birnbaum, Elli Bleeker, Elisa Nury, Leif-Jöran Olsson, Elena Spadini, Catherine Smith, Joris Van Zundert.
</div>
</div>
<div>
<h2 id="license">License</h2>
<p>The materials are released under a <a href="https://www.gnu.org/licenses/gpl-3.0.en.html">GPL 3.0</a> license.</p>
</div>
</body>
</html>