-
Notifications
You must be signed in to change notification settings - Fork 2
/
handout.tex
255 lines (206 loc) · 12.2 KB
/
handout.tex
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
\documentclass[a3, ruledsections, 8pt]{sciposter}
\usepackage{lipsum}
\usepackage{epsfig}
\usepackage{amsmath}
\usepackage{amssymb}
\usepackage{multicol}
\usepackage{graphicx,url}
\usepackage[german, english]{babel}
\usepackage[utf8]{inputenc}
\usepackage{pgfpages}
\pgfpagesuselayout{resize to}[a4paper,border shrink=5mm]
\renewcommand{\titlesize}{\LARGE}
\renewcommand{\authorsize}{\small}
\renewcommand{\instsize}{\tiny}
%\usepackage{fancybullets}
\newtheorem{Def}{Definition}
\definecolor{SectionCol}{rgb}{1.0,0.6,0.0} %orange
\title{W4 CAA Scripting Languages Hackathon I \\ \Huge\textcolor{SectionCol}{Can you code this?}}
%Título do projeto
\author{Clemens Schmid$^{1}$, Martin Hinz$^{2}$, Carolin Tietze$^{3}$}
%nome dos autores
\institute
{$^{1}$ Römisch-Germanisches Zentralmuseum Leibniz-Forschungsinstitut für Archäologie: Mainz, Rheinland-Pfalz; $^{2}$ Institut für Archäologisches Wissenschaften, Universität Bern; $^{3}$ Institut für Klassische Altertumskunde, Christian-Albrechts-Universität zu Kiel
}
%Nome e endereço da Instituição
% Onde você coloca os emails dos integrantes
%\date is unused by the current \maketitle
\rightlogo[1]{images/front_20180624.png}
\leftlogo[1]{images/49449374.png}
% Exibe os logos (direita e esquerda)
% Procure usar arquivos png ou jpg, e de preferencia mantenha na mesma pasta do .tex
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
%%% Begin of Document
\begin{document}
%define conference poster is presented at (appears as footer)
\conference{{\bf CAA 2019} Check Object Integrity - 47th Computer Applications and Quantitative Methods in Archaeology - 23-27 April 2019}
%\LEFTSIDEfootlogo
% Uncomment to put footer logo on left side, and
% conference name on right side of footer
% Some examples of caption control (remove % to check result)
%\renewcommand{\algorithmname}{Algoritme} % for Dutch
%\renewcommand{\mastercapstartstyle}[1]{\textit{\textbf{#1}}}
%\renewcommand{\algcapstartstyle}[1]{\textsc{\textbf{#1}}}
%\renewcommand{\algcapbodystyle}{\bfseries}
%\renewcommand{\thealgorithm}{\Roman{algorithm}}
\renewcommand{\maketitle}{%
\ifthenelse{\boolean{useleftlogo}}{%
\begin{minipage}[c]{\logowidth}
\printleftlogo
\end{minipage}
}{}
\begin{minipage}[c]{\titlewidth}
\begin{center}
\renewcommand{\baselinestretch}{2.3}\normalsize
{\titlesize \bf \@title}\par
\renewcommand{\baselinestretch}{1.0}\normalsize
\vspace{0.4\titleskip}
{\authorsize {\bf\@author} \par}
{\instsize
\vspace{0.2\titleskip}
\theinstitute \par
\ifthenelse{\equal{\printemail}{}}{%nothing
}{%
\vspace{0.2\titleskip}
\texttt{\printemail}
}
}
\end{center}
\end{minipage}%}}
\ifthenelse{\boolean{userightlogo}}{%
\begin{minipage}[c]{\logowidth}
\printrightlogo
\end{minipage}
}{}
\vspace{\titleskip}
}
\maketitle
%%% Begin of Multicols-Enviroment
%\begin{multicols}{1}
%%% Abstract
%\begin{abstract}
%No modelo que a Danielle postou não tinha resumo, então confirmem para saber se precisa ou não.
%Morphological pattern spectra computed from granulometries are frequently used
%to classify the size classes of details in textures and images. An extension
%of this technique, which retains information on the spatial
%distribution of the details in each size class is developed. Algorithms for
%computation of these spatial pattern spectra for a large number of
%granulometries on binary images are presented.
%\end{abstract}
%%% Introduction
\section{General Information}
\begin{itemize}
\item A \textbf{repository} with all information and data for this workshop is available at \newline \url{https://github.com/sslarch/caa2019_hackathon}
\item 2-4 \textbf {groups} are formed on site according to framework preference and skill levels (\textit{Unconference style}).
\item We have until \textbf{4pm} to work together. Breaks can be taken as you wish, but a joint lunch break at 12:30pm might be a good idea.
\item It is not necessary to complete all possible tasks. Work as far as you can go. Most tasks are independent of each other, so you can skip boring ones. This is \textbf{NOT} an exam.
\item All results must be submitted in one \textbf{reproducible report} with all code and plots. This can be rendered from IPython Notebook, Rmarkdown, Latex, etc. or compiled manually. Ideally the report is submitted as a Pull Request to the hackathon repository on github. The file(s) should be added to the \verb|reports| directory.
\item The organizers of this workshop are available for questions and advice. They are able to assist you with problems as far as they are familiar with your toolset.
\end{itemize}
\section{Dataset}
The data for this excercise --- \verb|Michelsberg| --- are taken from the R package \verb|archdata| (Carlson/Roth 2018). It is a features by types table of abundance data on vessel types in archaeological features of the Younger Neolithic Michelsberg Culture from Belgium, France and Germany by Birgit Höhn (2002). The 109 observations/lines represent individual features (pits, ditches, etc.) from sites. For each feature we have information about 42 variables/columns. These include identifiers, classified pottery type counts, phase attribution and site coordinates.
\bigskip
\textit{From} \verb|?archdata::Michelsberg:|
\begin{multicols}{3}
\begin{small}
\begin{itemize}
\item id: Unique identifier (categorical, integer)
\item site\_name: Name of site (categorical, character)
\item catalogue\_nr: Number in catalogue of Höhn (2002) (categorical, integer)
\item feature\_nr: Number of the archaeological feature (categorical, numeric)
\item to3: Pot/vessel type 3 count
\item f4: Bottle type 4 count
\item b2: Beaker type 2 count
\item to2: Pot/vessel type 2 count
\item b3: Beaker type 3 count
\item b7: Beaker type 7 count
\item kw5: Carinated bowl type 5 count
\item vg1: Storage vessel type 1 count
\item vg2: Storage vessel type 2 count
\item t4a: Tulip beaker type 4a count
\item kw2: Carinated bowl type 2 count
\item kw4: Carinated bowl type 4 count
\item b5: Beaker type 5 count
\item t3b: Tulip beaker type 3b count
\item f3: Bottle type 3 count
\item kw3: Carinated bowl type 3 count
\item kw1: Carinated bowl type 1 count
\item b6: Beaker type 6 count
\item to1: Pot/vessel type 1 count
\item b1: Beaker type 1 count
\item t3a: Tulip beaker type 3a count
\item vg4: Storage vessel type 4 count
\item ks2: Conical bowl type 2 count
\item ks1: Conical bowl type 1 count
\item t2b: Tulip beaker type 2b count
\item f2: Bottle type 2 count
\item bs3: Globular bowl type 3 count
\item t2a: Tulip beaker type 2a count
\item bs2: Globular bowl type 2 count
\item b4: Beaker type 4 count
\item bs1: Globular bowl type 1 count
\item f1: Bottle type 1 count
\item t1b: Tulip beaker type 1b count
\item vg3: Storage vessel type 3 count
\item t1a: Tulip beaker type 1a count
\item mbk\_phase: MBK phase according to Lüning (1967) as an ordered factor with levels $<$ I/II $<$ II $<$ II/III $<$ III $<$ III-V $<$ III/IV $<$ IV $<$ IV/V $<$ Munz $<$ V
\item x\_utm32n: x coordinate in m; projection UTM WGS 84, zone 32 nord
\item y\_utm32n: y coordinate in m; projection UTM WGS 84, zone 32 nord
\end{itemize}
\end{small}
\end{multicols}
\medskip
Höhn (2002) recorded pottery vessel shapes from 108 archaeological features (pits, ditches etc.) from 69 sites of the Central European Younger Neolithic Michelsberg Culture (MBK; 4350-3500 BC) following Lüning's (1967) typology. Her correspondence analysis of the abundance data (columns 5 to 39) exhibits a pronounced Guttman effect or arch, suggesting the data set is structured by a time gradient. Recently Mischka et al. (2015) projected an 109th Michelsberg assemblage, Flintbek LA48, a pit with Michelsberg pottery from a North German site of the Funnel Beaker Culture (TRB), as a supplementary row into the existing chronology thereby connecting the relative chronologies of TRB and MBK. The data frame contains as attributes the references for the data, a typological key and the map projection. Note that ambiguous fragments of conical bowls (ks1 and ks2) are assigned as 0.5 to each of the two types resulting also in positive entries suitable to analysis by CA.
\medskip
Höhn, B. 2002. Die Michelsberger Kultur in der Wetterau. Universitätsforschungen zur prähistorischen Archäologie 87. Bonn: Habelt.
Mischka, D., Roth, G. and K. Struckmeyer 2015. Michelsberg and Oxie in contact next to the Baltic Sea. In: Neolithic Diversities. Perspectives from a conference in Lund, Sweden. Acta Archaeologica Lundensia Ser. 8, No. 65, edited by Kr. Brink et al., pp 241–250.
Lüning, J. 1967. Die Michelsberger Kultur: Ihre Funde in zeitlicher und räumlicher Gliederung. Berichte der Römisch-Germanischen Kommission 48, 1-350.
%\pagebreak
\section{Tasks}
All these tasks focus on the technical implementation of data analysis, not the interpretation of the results in the context of archaeological research questions. But feel free to observe and document interesting patterns.
\begin{enumerate}
\item Create a table with information about the sites (\verb|site_name|) and the amount of features (\verb|feature_name|) documented per sites. Based on this table: Who many sites are represented in the table by more than one feature?
\item Create a table with information about the total material occurrence: How many individual objects of \verb|to3|, \verb|f4|, ..., \verb|t1a| do we have in total across all sites? The table should not just contain the shortend type ids (e.g. \verb|to3|), but also a human readable type name (eg. \verb|Pot/vessel type 3|). Plot an ordered bar chart based on this table.
\item How much artefacts are documented per MBK phase (\verb|mbk_phases|) in total? Visualise the grouped counts with a time series plot. \verb|mbk_phases| is an ordinally scaled time variable. Reduce your data selection to the 6 sites with the most features (cf. Task 1) and draw the same time series for each of these variables (ideally in a plot matrix).
\item Plot a spatial map of sites based on the coordinate information. The map can be interactive. Add a meaningful background layer. The size and colour of the site markers should reflect the occurence of \verb|Carinated bowl type| (\verb|kw3|) at the site.
\item Run a Correspondence Analysis (CA) of features and pottery type variables (\verb|to3|, \verb|f4|, ..., \verb|t1a|). Remove very rare types and features with almost no finds beforehand. Prepare two result scatter plots -- 2D: Dimension1~Dimension2 and point colour according to the MBK phase (\verb|mbk_phases|). 3D: Dimension1~Dimension2~Dimension3.
\item Run a Correspondence Analysis (CA) of sites (!) and pottery type variables. Use only binary presence-absence information for the CA, not object count. Plot the resulting rank order along Dimension1 or along a fitted principal curve on a spatial map.
\item Calculate cultural distance between features (Euclidian distance, Chi-square distance, ...) based on the pottery type variables and visualize the distance network.
\item \textbf{Make your own task!} What else comes to mind with this kind of data?
\end{enumerate}
%%% References
%% Note: use of BibTeX als works!!
%\bibliographystyle{plain}
%\begin{thebibliography}{1}
%
%\bibitem{Flusser:Suk:93}
%J.~Flusser and T.~Suk.
%\newblock Pattern recognition by affine moment invariants.
%\newblock {\em Pattern Recognition}, 26:167--174, 1993.
%
%\bibitem{Hu:62}
%M.~K. Hu.
%\newblock Visual pattern recognition by moment invariants.
%\newblock {\em IRE Transactions on Information Theory}, IT-8:179--187, 1962.
%
%\bibitem{maragos89:_patter}
%P.~Maragos.
%\newblock Pattern spectrum and multiscale shape representation.
%\newblock {\em IEEE Trans. Patt. Anal. Mach. Intell.}, 11:701--715, 1989.
%
%\bibitem{Meijster:Wilkinson:PAMI}
%A.~Meijster and M.~H.~F. Wilkinson.
%\newblock A comparison of algorithms for connected set openings and closings.
%\newblock {\em IEEE Trans. Patt. Anal. Mach. Intell.}, 24(4):484--494, 2002.
%
%\bibitem{Nacken:thesis}
%P.~F.~M. Nacken.
%\newblock {\em Image Analysis Methods Based on Hierarchies of Graphs and
% Multi-Scale Mathematical Morphology}.
%\newblock PhD thesis, University of Amsterdam, Amsterdam, The Netherlands,
% 1994.
%
%\end{thebibliography}
%\end{multicols}
\end{document}