diff --git a/Sources/04RoboCoopsource.tex b/Sources/04RoboCoopsource.tex
new file mode 100644
index 0000000..53cc1a7
--- /dev/null
+++ b/Sources/04RoboCoopsource.tex
@@ -0,0 +1,415 @@
+\documentclass[letterpaper]{article}
+\usepackage[utf8]{inputenc}
+\usepackage[T1]{fontenc}
+\usepackage{natbib,alifeconf}  %% The order is important
+
+\usepackage{graphicx}
+\usepackage{hyperref}
+\usepackage{amsmath}
+
+
+% *****************
+%  Requirements:
+% *****************
+%
+% - All pages sized consistently at 8.5 x 11 inches (US letter size).
+% - PDF length <= 8 pages for full papers, <=2 pages for extended
+%    abstracts (not including citations).
+% - Abstract length <= 250 words.
+% - No visible crop marks.
+% - Images at no greater than 300 dpi, scaled at 100%.
+% - Embedded open type fonts only.
+% - All layers flattened.
+% - No attachments.
+% - All desired links active in the files.
+
+% Note that the PDF file must not exceed 5 MB if it is to be indexed
+% by Google Scholar. Additional information about Google Scholar
+% can be found here:
+% http://www.google.com/intl/en/scholar/inclusion.html.
+
+
+% If your system does not generate letter format documents by default,
+% you can use the following workflow:
+% latex example
+% bibtex example
+% latex example ; latex example
+% dvips -o example.ps -t letterSize example.dvi
+% ps2pdf example.ps example.pdf
+
+
+% For pdflatex users:
+% The alifeconf style file loads the "graphicx" package, and
+% this may lead some users of pdflatex to experience problems.
+% These can be fixed by editing the alifeconf.sty file to specify:
+% \usepackage[pdftex]{graphicx}
+%   instead of
+% \usepackage{graphicx}.
+% The PDF output generated by pdflatex should match the required
+% specifications and obviously the dvips and ps2pdf steps become
+% unnecessary.
+
+
+% Note:  Some laser printers have a serious problem printing TeX
+% output. The use of ps type I fonts should avoid this problem.
+
+
+\title{The challenges of cooperation for a swarm of heterogeneous robots}
+\author{Paul Ecoffet$^1$, Jean-Baptiste André$^2$, Nicolas Bredeche$^1$ \\
+\mbox{}\\
+$^1$Institut des Systèmes Intelligents et Robotique, Sorbonne Université, Paris \\
+$^2$Institut Jean-Nicod, École Normale Supérieure, Paris \\
+\href{mailto:ecoffet@sorbonne-universite.fr}{ecoffet@sorbonne-universite.fr}} % email of corresponding author
+
+% For several authors from the same institution use the same number to
+% refer to one address.
+%
+% If the names do not fit well on one line use
+%         Author 1, Author 2 ... \\ {\Large\bf Author n} ...\\ ...
+%
+% If the title and author information do not fit in the area
+% allocated, place \setlength\titlebox{<new height>} after the
+% \documentclass line where <new height> is 2.25in
+
+
+
+\begin{document}
+\maketitle
+
+\begin{abstract}
+% Abstract length should not exceed 250 words
+  
+\end{abstract}
+
+\section{Introduction}
+
+- Cooperation avec clonal Floreano
+- Cooperation swarm partner choice (no evo) \cite{Aktipis2011}
+
+
+In collective robotics, the accomplishment of a task for a population is often not aligned with the individual objective of each robot. Thus, a population of robots maximizing their individual gains may interfere with the execution of the collective task. How can we then align the individual goals of the agents to this collective success? This problem has been extensively studied in game theory and evolutionary biology \citep{Axelrod1981}. Several mechanisms have been identified that allow this alignment to take place \citep{West2007a}. Among these mechanisms, partner choice is an efficient mechanism. Each individual seeks to maximize his own gain and must interact with another partner. If individuals have the ability to select their partner, then it is in their interest to find the best possible partner as quickly as possible. Thus, individuals, in order to be chosen as a partner, have an interest in being more cooperative than the individually optimal action is. There is pressure to be cooperative in order to be chosen as a partner. Theoretical results have shown that for partner selection to be effective, the time spent searching for a partner compared to the time spent interacting with partners should be as short as possible \citep{Debove2015b}. Let $\beta$ be the meeting probability per time step for an individual, and $\tau$ be the cessation probability per time step of two partners after an interaction, so $\beta / \tau$ must be large for partner selection to be effective. Our goal is to identify robotic environments where partner choice is effective. To do so, we have built a pseudo-realistic environment where robots meet on patches and can interact together. We modulate our environment according to different parameters and study the emergence of partner choice behavior and the appearance of cooperation behavior under these different conditions. We have shown that for partner choice to be possible, constraints are very strong. The robot population must be very dense in order to have a very high $\beta$ encounter probability. Moreover, the interactions between two robots must be very long (very low $\tau$) in order for the search time to be small enough compared to the interaction time.
+
+\section{Methods}
+
+\subsection{Environment}
+
+We define a collective forage task where $N$ robots move and consume resources in pairs in a circular arena. The resources are spread randomly throughout the arena (see Fig.~\ref{fig:env}). Resources can be seen by robots and are surrounded by patches. Robots must move on the patches to consume the resource and increase their scores. A robot alone cannot exploit a resource. When two robots are on the same patch, they can collaborate to exploit a resource. Each robot receives a payoff based on its own investment and that of its partner. Depending on its investment, a robot can act either by cheating (it invests to maximize its own gain) or by cooperating (it invests to maximize the gain of the pair). 
+
+\begin{figure}
+    \begin{center}
+        \includegraphics[width=2.5in]{media/wander_env.png}
+        \vskip 0.25cm
+        \caption{The environment. Each blue dot is a robot. Each green dot is a resource and the light green circle around it is the patch. Robots can see the resources, and when two robots walk on a patch, they can interact together.
+        }
+    \label{fig:env}
+    \end{center}
+\end{figure}
+
+When two robots are on the same patch, they can choose to interact together and exploit the resource. First, each robot accesses the action that its partner intends to do, then it decides whether or not to accept the interaction. If one of the robots choose not to interact, then the resource disappears and the robots continue their course. If both robots accept, the resource disappears and they play the announced investments to get their payoffs. The robots switch then to a wandering behaviour for a certain period of time. It represents the amount of time the robots interact with each other, or a digestion period. Each robot has a probability $\tau$ of returning to the game at each iteration. The expected duration of an interaction for an agent is therefore $1/\tau$. Two robots that have interacted together may not come back to partner seeking behaviour at the same iteration. When a resource disappears, a new resource appears in the arena at the next iteration.
+
+\subsection{Cooperation Market} \label{sec:market}
+According to the theoretical results on partner choice \citep{Debove2015b}, the efficiency of this strategy depends on the meeting probability of an agent ($\beta$) and the split probability of an interaction ($\tau$). If the meeting probability is big compared to the split probability, that is $\beta/\tau$ is large, then partner choice is a viable strategy and can emerge. Indeed, for partner choice to be effective, when an agent refuses to interact with a partner, it must do so because its expectation of gain in finding a better partner outweighs the gain missed by rejecting the interaction with the wrong partner and the implied cost paid by looking for a new partner. Thus, if search time is short compared to interaction time, it is profitable to spend more time searching for a good partner than interacting with more uncooperative partners.
+
+The $\beta$ parameter is determined by the ability of the robots to meet on a patch and varies as the robots evolve, but also depending on the density of robots in the arena, and especially the robots that are also seeking for partner. In our model, the split probability $\tau$ parameter is chosen experimentally.
+
+\subsection{Objective function}
+
+When two robots interact with each other, they earn a gain determined by the investment of the two agents. The gain of an agent $a_i$ investing $x_i$ with its partner $a_j$ investing $x_j$ is determined by the function $P(x_i, x_j)$ described in the equation~\ref{eq:payoff}.
+
+\begin{align}
+PG(x_i, x_j) &= \frac{a}{2} (x_i + x_j) \\
+PD(x_j) &= \frac{b}{2} (x_j) \\
+C(x_i) &= \frac{1}{2} x_i^2 \\
+P(x_i, x_j)& = PG(x_i, x_j) + PD(x_j) - C(x_i) \label{eq:payoff}
+\end{align}
+
+This function is a mixture of a public good ($PG$, modulated by $a$) and a prisoner's dilemma ($PD$, modulated by $b$) and a quadratic cost $C$. For $a_i$ to maximize its individual gain ($P(x_i, x_j)$), the optimal investment is $x_d = \frac{a}{2}$, which correspond to the defective behaviour. For the group to maximize their total gain, both agents must invest $\hat{x} = a + \frac{b}{2}$, which correspond to the cooperative behaviour. The Figure~\ref{fig:payoff} is a plot of the payoff function with different partner's investment values.
+
+
+
+\begin{figure}[htpb]
+    \centering
+    \includegraphics[width=\columnwidth]{media/payoff.pdf}
+    \caption{Payoff function with different partner's investment value. The individually optimal investment is $x_d = \frac{a}{2}$ whatever the constant value the partner invests, which correspond to a defective behaviour. If both robots invest the same value, then the socially optimal investment is $\hat{x} = a + \frac{b}{2}$, which correspond to a cooperative behaviour.}
+    \label{fig:payoff}
+\end{figure}
+
+\subsection{Controller}
+
+The robot control system is composed of the investment value ($x \in [0, 10]$) during interaction and two decision modules: The movement module and the partner choice module. The robot always invests the same value and the modules remain fixed throughout the task. 
+
+The movement module is an artificial neural network with 1 hidden layer of 10 neurons. All the nodes have a $\tanh$ activation function. The input of the network is the detailed information from the 8 sensors of the robot. The network gives as output the speed of translation and rotation between $]-1, 1[$. These values are then resized to match the maximum translation and rotation speeds of the robot.
+
+The partner choice module is also a artificial neural network. It is activated only when an agent is with another agent on the same patch. This network receives as inputs the investment level of the robot as well as the investment level of its partner. It is composed of 1 hidden layer of 3 neurons and has a $\tanh$ activation function. It gives as output a value ($a \in ]-1, 1[$), which correspond to the response to the partner. If the output is greater than 0, then the robot accepts the interaction, otherwise it refuses it and the interaction does not take place.  The details of the inputs of each network are given in the Table~\ref{tab:ann_params}.
+
+All neural network weights are bounded in the range $]-10, 10[$. In total, the two neural networks consist of 368 weights.
+
+\begin{table}
+    \centering
+    \begin{tabular}{cc}
+        \hline
+        \textbf{Input} & \textbf{Value}  \\
+        \hline
+        \textbf{Movement module} & \\
+        \textit{Per sensor ($\times 8$)}& \\
+        Distance to Robot &  $]0, 1[$ if in range else 1 \\
+        Distance to Wall & $]0, 1[$ if in range else 1  \\
+        Distance to Resource & $]0, 1[$ if in range else 1  \\
+        Robot on the patch & 0 or 1 \\
+        \hline
+        \textbf{Partner choice module} & \\
+        Partner's investment & $]0, 10[$ \\
+        Robot's own investment & $]0, 10[$ \\
+        \hline
+    \end{tabular}
+    \caption{Neural Networks inputs}
+    \label{tab:ann_params}
+\end{table}
+
+\subsection{Phenotypic variability} \label{sec:phenovar}
+
+\citet{McNamara2010c} reviews different works that have shown the importance of variability in the level of investment in the population to allow agents' selectivity and thus enable the appearance of partner choice. Indeed, for selectivity to be a useful skill, the variability of investments between agents must be big enough so that the payoff variation between two different partners is sufficiently beneficial. In this case, selective robots have the upper hand against undiscriminating robots.
+
+This variability can be present by itself or enforced either with a very high mutation strength for the gene encoding the investment level for each agent, or by adding a noise to the genetically encoded investment level for each agent that will remain the same throughout the task. 
+
+\subsection{Learning}
+
+The weights of the neural networks and the investment value of a robot constitute its genome. In total, a robot has $369$ genes, the $g_x$ gene to encode the investment level and the 368 $g_{w_i}\,\forall i \in 0..368$ genes to encode the weights of the two neural networks. The value of $g_x$ is in $]0, 1[$, the investment level $x$ of the robot is defined by $x = 10 \times g_x$. The values of $g_{w_i}$ are in $]-10, 10[$.
+
+At the beginning of learning, the $g_{w_i}$ genes are randomly initialized in the range $]-1, 1[$ and the $g_x$ gene is randomly initialized in $]0, 1[$.
+
+We use the fitness-proportionate evolutionary algorithm described below for the learning of our robots. After each generation, the total payoffs of the agents represent their fitnesses. Thus, the fitness $F_i$ of the robot $i$  which had accepted $n$ interactions is described by (Eq.~\ref{eq:totalpayoff})
+
+
+\begin{equation}
+    F_i = \frac{1}{\tau} \sum_{j=0}^{n} P(x_i, x_j) \label{eq:totalpayoff}
+\end{equation}  
+
+with $x_j$ being the investment value of the robot's partner at the $j^{th}$ interaction. Each payoff is weighted by $\tau$ to normalize the total payoff gains by robots between conditions where $\tau$ varies.
+
+A new generation of robots is generated by randomly drawing the agents' genomes in proportion to their fitnesses. Then a mutation operation is applied to each agent of the new generation. Each $g_i$ gene of a robot has a probability $\mu = 0.01$ to mutate. If the gene is selected, then it has a probability of $0.1$ to mutate according to a uniform distribution $\mathcal{U}(]-10, 10[)$ and a probability of $0.9$ to mutate according to a normal distribution $\mathcal{N}(g_i, \sigma)$ with $\sigma = \sigma_w = 0.1$ for the weight genes and $\sigma = \sigma_x = 0.1$ for the investment gene. The new generation then performs the task and the process is repeated for $G = 200$ generations (see Table~\ref{tab:env_params} for a list of all the parameters). 
+
+\begin{table}
+    \centering
+    \begin{tabular}{clc}
+        \hline
+        \textbf{Param} & \textbf{Description}  & \textbf{Value} \\
+        \hline
+        \multicolumn{3}{l}{\textbf{Payoff}} \\
+        $a$ & Public good weight & 5 \\
+        $b$ & Prisoner's dilemma weight & 3 \\
+        %\hline
+        \multicolumn{3}{l}{\textbf{Environment}} \\
+        $T$ & Number of iterations per generation & $100\,000$ \\
+        $G$ & Number of generations per run & $200$ \\
+        & Arena diameter & 400px \\
+        & Robot size & 4px \\
+        & Robot max speed & 2px/iteration \\
+        $\omega$ & Number of patches & 30 \\
+        $\tau$ & End of interaction probability & \\
+        %\hline
+        \multicolumn{3}{l}{\textbf{Evolution hyper-parameters}} \\
+        $\mu$ & mutation probability & 0.01 \\
+        $\sigma_w$ & mutation strength of weight genes & 0.1 \\
+        $\sigma_x$ & mutation strength of investment gene& 0.1 \\
+        \hline
+    \end{tabular}
+    \caption{Experiment parameters}
+    \label{tab:env_params}
+\end{table}
+
+\section{Results}
+
+\subsection{Experimental setup}
+
+The environment is a circular arena with a diameter of 400px. The robots are 4px diameter disks. The robots have 8 equally distributed sensors with a range of 96px giving them information about their surroundings, such as the presence of other robots, of a resource or of a wall. The robots move through the environment at a maximum translation speed of 2px/iteration and a rotational speed of $30^\circ$/iteration. $N$ robots are spread randomly in the environment and 30 resources are randomly scattered throughout the arena. Each generation lasts $T = 100\,000$ iterations. The environment is represented in Figure~\ref{fig:env}.
+
+The results presented below are obtained by the behavioral study of the $200^{th}$ generation.  We ran 24 simulations per condition in all experiments.
+
+We have studied the influence of several factors that may facilitate the emergence of partner choice and cooperation behaviours: (i) the effect of population size (ii) the effect of the duration of interactions by changing the split probability ($\tau$), and (iii) the strength of the investment gene mutation ($\sigma_x$). 
+
+\subsection{Effect of the population size}
+
+We first wanted to test the impact of the population size on the emergence of partner choice. Does a bigger population size positively impact the emergence of cooperative behavior? %question
+To test the emergence of the cooperation behavior by partner choice, we set the parameters to be the most favourable for its emergence. We set $\tau = 0$ and the evaluation duration $T = 100\,000$ in order to grant a long search time for the robots and a very engaging commitment if they accept the interaction. %expe
+At $N = 50$, robots plays the defective strategy. the average investment level is very close to the social optimum for $N = 1\,000$ (see Figure~\ref{fig:do_coop}). % results 
+The robots evolve a cooperative behavior for $N$ sufficiently large. The denser the population, the higher the probability of encounters $\beta$ is. Thus, with 50 robots in the arena, the robots are unable to meet and sample enough partners to be selective before the end of the generation. Moreover, the robots are racing to find a partner quickly. Indeed, with $\tau = 0$, the more the task advances in time, the fewer agents are available in the arena and thus the more $\beta$ decreases throughout the evaluation. % interpretation
+
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3in]{media/wander_do_coop.pdf}
+        \vskip 0.25cm
+        \caption{The larger the population, the higher the agents' level of investment.
+        Mean investment of the population for 24 simulations per condition with a split probability $\tau = 0$ and a mutation strength for investment $\sigma_x = 0.1$. When the population is large, agents can easily find a partner and can be more selective. The pressure to invest a lot is then greater due to the effect of the partner choice.
+        }
+        \label{fig:do_coop}
+    \end{center}
+\end{figure}
+
+
+To show the importance of partner choice in the evolution of this cooperative behavior, %question
+we built a control condition where we deactivate the agents' ability to know their partner's investment in order to accept or not accept an interaction. % expe
+In this condition, whatever the number of agents in the environment, the average investment level is always $x_d$, that is a defective behaviour (see Fig.~\ref{fig:control}). % results
+In this situation, agents have no way to be selective and cannot choose a cooperative robot over a non-cooperative one. Thus, cooperative robots are not preferentially selected as partners and there is no incentive to invest more than the individual optimum. There is no selection pressure in favor of the most cooperative agents. % interpretation
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3in]{media/wander_control.pdf}
+        \vskip 0.25cm
+        \caption{Robots never cooperate in control condition. Mean investment of the population for 24 simulations per condition with a split probability $\tau = 0$ and a mutation strength of investment $\sigma_x = 0.1$. Robots never cooperate whatever the number of robots $N$ in the environment. Without access to their potential partner's investment level, agents cannot be selective and partner selection is impossible. Agents are under no pressure to invest a lot to be chosen. Therefore, they all play at the individually optimal investment level.
+        }
+        \label{fig:control}
+    \end{center}
+\end{figure}
+
+
+\subsection{Effect of the interaction length}
+
+%question
+%expe
+%results
+%interpretation
+
+According to the theoretical results, the longer the interaction, the greater the influence of the choice of partner (see section~\nameref{sec:market}). We test the reality of this prediction in our experimental setup. % question
+To do this, we vary the split probability $\tau$.  % expe
+The larger $\tau$ is, the shorter the interaction. When the split probability $\tau$ is null or low and the population size $N$ is large, the robots invest in a collectively optimal way and have a cooperative behavior (see Fig.~\ref{fig:corr_tau_comp}. % results
+The larger the $\tau$ becomes, the less cooperative the robots are even for a high population size. The robots plays systematically a defective investment with $\tau \geq 1\times 10^{-3}$ Thus, increasing the duration of interactions has a positive effect on the appearance of cooperative behavior by partner choice. % interpretation
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander_corr_tau_coop_pop1000.pdf}
+        \vskip 0.25cm
+        \caption{The smaller the split probability $\tau$ is, the more cooperative robots get. The robots invest cooperatively for $\tau \leq 2\times 10 ^{-5}$, and have a defective behaviour for $\tau > 2 \times 10^{-5}$. The higher $\tau$ is, the less long are the interaction and the more profitable it is to interact with a lot of bad partner compared to looking for a good partner and interact with it.
+        }
+        \label{fig:corr_tau_comp}
+    \end{center}
+\end{figure}
+
+
+\subsection{Effect of the mutation strength}
+
+As explained in the section \nameref{sec:phenovar}, different works have shown the importance of variability in the level of investment in the population to allow agents' selectivity and thus enable the appearance of partner choice \citep{McNamara2010c}.
+We test the influence of higher phenotypic variability in our task. % Question
+To do so, we (i) modified the strength $\sigma_x$ of the Gaussian mutation on the gene encoding the robot investment level and (ii) applied a constant noise on the robot investment level during a generation. % expe
+We observe very minor differences in the average investment level between the different simulations (see Fig~\ref{fig:varmut}). However, we note the presence of less variability between simulations when the mutation level is high. This can be explained by a more rapid convergence towards the optimal investment level. % resultats
+The fact that the variability of investment in the environment plays very little role in our task may be explained by the fact that all possible levels of investment are present in the first generation. The ability to be selective in the choice of partner may therefore emerge before the population is completely homogeneous and thus selectivity becomes an unnecessary skill. % interpretation
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=2.4in]{media/wander_varmut.pdf}
+        \vskip 0.25cm
+        \caption{A higher mutation strength has no impact on average cooperation but reduces variance in investment between simulations.
+        Average investment in the population for 24 simulations per condition with $\tau = 0$. The addition of phenotypic variability facilitates the appearance of agent selectivity at low investment mutation strength \citep{McNamara2010c}. Here, variations in mutation strength for investment $\sigma_x$ %or the addition of phenotypic variability
+        have only a small impact on the final investment level of the agents. This may be due to the fact that all possible investment levels are represented at the Initialization of the simulation.
+        }
+    \label{fig:varmut}
+    \end{center}
+\end{figure}
+
+
+
+\subsection{Control: population size vs number of generations}
+
+The difference in population size between low (50 robots) and high (1000 robots) population conditions could be explained by the lower number of evaluations that the 50 robot conditions have to evolve cooperative behaviour. Indeed, with the number of generations being constant ($G = 200$), the number of evaluations for the condition with 50 robots is $50 \times 200 = $10,000 and for the conditions with 1000 robots is $1,000 \times 200 = 200,000$. This difference in the number of evaluations could explain why cooperative behaviour has evolved in the conditions where $N$ is large and not in those where $N$ is small. Has the evolution converged in the small $N$ conditions? % Question
+To test the impact of this number of evaluations, we run a new control condition of 24 simulations with $G = 4,000$ for a population of $N=50$ robots, offering $200\,000$ evaluations. % Method
+The difference between the condition $N=50, G=200$ and $N=50, G=4000$ is marginal, but the difference between these conditions at the condition $N=1\,000, G=200$ is very large (Fig.~\ref{fig:gencomp}). Adding more generations does not improve the level of cooperation achieved for conditions with a small population. % results
+It is therefore the too low encounter probability $\beta$ that blocks the emergence of cooperative behavior under these conditions, not the fewer evaluations. % interpretation
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander_comp_genpop.pdf}
+        \vskip 0.25cm
+        \caption{More generations with small population does not lead to cooperative behaviour. The differences in robot investment between conditions $N=50$ and $N=1000$ cannot be attributed to fewer evaluations for small populations.
+        }
+        \label{fig:gencomp}
+    \end{center}
+\end{figure}
+
+
+\subsection{Control: Wandering vs Teleportation}
+
+
+Finally, we do a final control to test the influence of the wandering behaviour. Does this facilitate or not the emergence of cooperation by partner choice? % Question
+To test this, we compare the task with digestion time by wandering with a task with digestion time outside the arena. In this second condition, after one robot has interacted with another, it is placed outside the arena and has a $\tau$ probability of returning to the arena at each time step. When a robot is placed back into the arena, it is randomly placed back into the arena. This second condition is closer to the numerical simulations present in \citet{Debove2015c} than the wandering condition. We compare the results with the wandering condition and the condition outside the arena for several values of split probability $\tau$. % 2) Expe
+We find that in the wandering condition as well as in the off-arena condition, when the probability of split is low ($\tau < 1\times 10^{-5}$), the robots invest cooperatively. We also find that in the off-arena condition, the robots remain cooperative for higher values of split probability. For even higher split probabilities, the robots no longer have cooperative behaviors whatever the condition. % results
+The off-arena condition is more robust than the wander condition. Indeed, for higher split probability values, the agents still behave cooperatively. This can be explained by the fact that the arena is less crowded than in the wander condition. Indeed, a robot necessarily crosses potential partners in the off-arena condition, and is not blocked by agents in their digestion phase, as would be the case in the wander condition. Thus, the $\beta$ encounter probability is greater in the off-arena condition than in the wander condition. % 4) interpretation
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander_corr_tau_coop_pop1000_tp.pdf}
+        \vskip 0.25cm
+        \caption{Robots act cooperatively in both the wander and off-arena conditions for low split probability $\tau$. The off-arena condition is more robust to middle range values of $\tau$ than the wander condition.
+        }
+        \label{fig:comp_tau_wander_tp}
+    \end{center}
+\end{figure}
+
+
+
+\section{Conclusion}
+
+
+
+\section{Acknowledgements}
+
+
+
+\footnotesize
+\bibliographystyle{apalike}
+\bibliography{references}
+
+\clearpage
+
+\section{Supplementary}
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander_all_fit.pdf}
+        \vskip 0.25cm
+        \caption{Fitness is super noisy
+        }
+        \label{fig:allfit}
+    \end{center}
+\end{figure}
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander_vartau.pdf}
+        \vskip 0.25cm
+        \caption{Average investment over 24 simulations per condition with $\sigma = 0.1$. As a function of the duration of interactions between agents ($\tau$), the average level of investment for large populations varies greatly. \textbf{a. b.} If the interactions are very long (small $\tau$), then the search time is really very small compared to the interaction time, and the search for a partner is de facto very cheap. Agents would rather spend a lot of time finding a good partner than interact with the first partner they meet. There is therefore a lot of pressure to invest a lot so as to be chosen as a partner and agents invest at the socially optimal level. \textbf{c. d.} If the interactions are very short (high $\tau$), then the search time becomes very important compared to the interaction time, and the search for a partner becomes de facto very expensive. Agents would rather have many interactions with uncooperative partners than spend a lot of time searching for an efficient partner. There is therefore no pressure to invest a lot to be chosen as a partner and agents invest at the individually optimal level.
+        }
+        \label{fig:vartau}
+    \end{center}
+\end{figure}
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander_vartau.pdf}
+        \vskip 0.25cm
+        \caption{TP VERSION, ARENA 120px, NOT STRICTLY THE SAME
+        }
+        \label{fig:tp_vartau}
+    \end{center}
+\end{figure}
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{media/wander.pdf}
+        \vskip 0.25cm
+        \caption{Average investment over 24 simulations per condition with $\sigma = 0.1$. xx TODO
+        }
+        \label{fig:wander}
+    \end{center}
+\end{figure}
+
+
+
+
+\end{document}
diff --git a/Sources/Lions.tex b/Sources/Lions.tex
new file mode 100644
index 0000000..a9e8c54
--- /dev/null
+++ b/Sources/Lions.tex
@@ -0,0 +1,302 @@
+\documentclass[twocolumn]{article}
+\usepackage[utf8]{inputenc}
+
+\usepackage{geometry}
+\usepackage{authblk}
+
+ \geometry{
+ a4paper,
+ total={170mm,257mm},
+ left=20mm,
+ top=20mm
+ }
+
+\title{Nothing better to do? Environment quality and the evolution of cooperation by partner choice}
+
+
+\author[1]{Paul Ecoffet}
+\author[1,*]{Nicolas Bredeche}
+\author[2,*]{Jean-Baptiste André}
+\affil[1]{Institut des Systèmes Intelligents et Robotique, Sorbonne Université, Paris}
+\affil[2]{Institut Jean-Nicod, École Normale Supérieure, Paris}
+\affil[*]{\small{Equal contribution}}
+
+
+\date{\today}
+
+
+
+\usepackage{hyperref}
+\usepackage[english]{babel}
+\usepackage[numbers]{natbib}
+\usepackage{graphicx}
+\usepackage{amsfonts}
+\usepackage{amsmath}
+\usepackage{float}
+\usepackage{amssymb}
+\usepackage{stmaryrd} % for integer ranges
+\usepackage{hyperref}
+
+
+
+\begin{document}
+
+\maketitle
+
+\begin{abstract}
+    The evolution of cooperation
+\end{abstract}
+
+\section{Introduction}
+
+xx Intro trop brutale?
+
+Several mechanisms have been identified to explain the evolution of cooperation among non-kin \citep{Trivers1971, MaynardSmith1974, Axelrod1981}, including positive reciprocity \cite{Trivers1971, Axelrod1981, Andre2007}, punishment \cite{Bshary2005, Raihani2012} or partner choice \cite{Eshel1982, Bull1991, West2007, Schino2017}. Among these mechanisms, partner choice has been considered over the last twenty years as having probably played a particularly important role \cite{Baumard2013a, +ref}. When individuals can choose among several different partners, which they can compare and compete against each other as in an economic market, this generates a selection pressure to cooperate more, to appear as a good partner and attract others' cooperation \cite{Noe1994}.
+
+The effects of partner choice have been well documented in a large number of biological systems. For example, in the interaction between cleaner fishes and their clients the law of supply and demand determines the way in which the added value of the interaction is shared, in accordance with market principles \cite{Bshary2006}. When cleaners are rare, clients tolerate cheating on their part, while they become more picky when cleaners are numerous. The effects of partner choice have also been documented in primate grooming behavior in two meta-analyses, showing that female primates groom preferentially those that groom them most and that a positive relation exists between grooming and agonistic support \citep{Schino2007, Schino2008}. In vervet monkeys, individuals groom others in exchange for access to food and they do so for longer periods when fewer partners are available \cite{Fruteau2009}. Beyond cooperation, partner choice also plays a decisive role in mating, leading to the evolution of secondary sexual caracteristics and nuptial gifts, and/or to assortative matching (refs xxxTODOxxx \cite{Zahavi1975, xxTerrain}. Lastly, the effects of partner choice have also been documented in humans where it has been shown that the need to attract social partners is a major driver of cooperation \citep{Barclay2007a, Barclay2015, Barclay2016, Debove2015b,  Andre2011, Baumard2013a}.
+
+% xx \cite{Clutton-brock2009} qui discute plein de cas de réciprocité qui pourrait n'être qu'en fait manipulations et mutualismes
+
+
+There are, however, a number of biological situations in which one would typically expect partner choice to play an important role, but where no such effect has ever been demonstrated. These include most intraspecific collective actions in non-human animals. This is particularly salient in collective hunts such as collobus hunting in chimpanzees, or pack hunting in carnivores. No empirical evidence in these species suggests that individuals cooperate for reasons related to partner choice, either to attract partners or to be accepted by them in their hunts. On the contrary, the majority of available data are consistent with the more parcimonious explanation that individuals are simply doing what is in their immediate best interest at any given time \cite{Packer1986,Packer1988a, Melis2008, Melis2011}. In particular, if cooperation in collective hunts was driven in part by the need to appear as a good partner, individuals would be expected to  willingly share the product of their hunts in a way that depends on everyone's actual engagement, to encourage participation in other hunts in the future. However, such voluntary and conditional sharing has never been documented in animal collective hunts \cite{Melis2011}. In evolutionary terms, therefore, collective hunting in these species is most likely an instance of \textit{byproduct} cooperation, rather than an instance of reciprocal cooperation based on partner choice.
+
+Yet several models on the evolution of cooperation by  partner choice suggest that cooperation should evolve in these situations \cite{McNamara2008, Aktipis2011, Barclay2011, Campenni2014}. And, in humans, behaviours in collective actions are driven by the need to appear as a good partner, especially when it comes to sharing the benefits of cooperation (refs Alvard xx \cite{Baumard2013a}). One may therefore wonder why the same effects did not produce the same consequences in other species.
+
+Such a lack of observation could always be the consequence of methodological difficulty in empirically proving the existence of partner choice. However, we would like to suggest an alternative here, namely that there is in fact a strong constraint impeding partner choice in a large number of situations in animals.
+
+Partner choice requires that individuals can compare and choose among several opportunities for cooperation. In some cases, \textit{partners} themselves constitute opportunities for cooperation and partner choice then only requires that partners are many and accessible. This is the case, for instance, in mating markets, or in most instances of interspecific mutualism. In other cases, however, finding an opportunity for cooperation requires more than just finding a partner. This is what happens when cooperation consists of several individuals working together to exploit environmental resources. In this case, a cooperation opportunity requires both a partner(s) and a resource, which imposes an additional constraint limiting the scope of partner choice. When resources are scarce, there are always few options to compare, and partner choice cannot operate. This could explain the lack of cooperation, beyond by-product cooperation, in many instances of collective actions in the wild despite the availability of potential partners.
+
+In this article, we aim to test this idea using agent-based simulations. To do this, we simulate the evolution of agents placed in an environment containing resources that can be exploited collectively. We show that, in a low-resource environment, and even if there are plenty of partners, partner choice is not able to drive the evolution of cooperation as individuals cannot pit the few cooperation opportunities against each other. What is more, we also show that the number of potential partners actually has a negative effect on the evolution of cooperation when patches are scarce. When there are too many potential partners relative to the amount of patches available, there are always too many individuals on any given resource as individuals have nothing else to do anyway. Hence, there is no point in trying to attract partners but on the contrary there are benefits in trying to limit their number. We therefore show that partner choice is only effective when the number of available partners lies within a precise range of values, all the narrower as the availability of patches is low.
+
+We believe that this constraint plays a central role in explaining that, in many species, although individuals do participate in collective actions, sometimes finely coordinating their behaviour with that of others, individuals do not actually seek to cooperate beyond what is in their immediate personal interest. On the contrary, thanks to its cognitive capacities, the human species is able to extract resources from a greater variety of situations. As a result, we actually live in an environment that is much richer in resources than other species. Hence we can compare and compete a greater diversity of opportunities for cooperation against one another, and we are thus forced to cooperate more intensively to attract partners.
+
+
+\section{Methods}
+
+We consider a population of $N_T$ individuals living in an environment consisting of $\omega$ different patches on which resources are located. Every generation of the simulations is constituted of $T$ time steps during which individuals gather payoff units. At the end of these $T$ time steps, individuals reproduce in proportion to their total payoff, and die. During a time step, every individual is considered one by one in a random order. When her turn comes, an individual evaluates each of the $\omega$ patches of the environment, including the patch where she is currently located, assigns each a score, and then moves toward the patch with the highest score, or stays on her current patch if that's the one with the highest score. Once every individual has taken this decision, individuals express their cooperation strategy on their local patch, and they collect a payoff that depends on their own and their partners' cooperation strategy. Patches can disappear every time step, with a probability $d$, and are then immediately replaced by an empty patch.
+
+xx La taille de la population totale $N$ est toujours constante quelque soit le nombre d'individus présents dans l'environnement $N_T$ afin d'avoir le même nombre d'évaluations d'individus dans toutes les conditions. Pour $N_T < N$, $N_E = \lceil N_T / N \rceil$ environnements sont créés. Les individus sont répartis aléatoirement dans ces environnements afin que chaque environnement comporte $N_T$ individus. Pour le dernier environnement à compléter, s'il n'y a pas $N_T$ individus encore disponibles, alors des individus tirés d'autres environnements sont inclus dans l'environnement pour le compléter. Les gains obtenus par ces individus dans cet environnement ne sont pas considérés pour le calcul de leurs fitnesses.
+
+\subsection{The decision-making mechanisms}
+
+The individuals' strategy in this environment consists of two separate decisions.
+
+On the one hand, the individual must evaluate the different patches available and assign a score to each. This decision is made by an artificial neural network, called the "patch ranking" network. For each patch, this neural network has the following input information: (i) the number of other individuals already present on the patch, (ii) the average level of cooperation expressed by these individuals in the last time step, (iii) the level of cooperation that the focal individual would express should he join this patch, and (iv) a binary that indicates whether or not the individual would have to move in space in order to join this patch (i.e. this binary distinguishes the patch where the individual is currently located from all other patches).
+
+(xx En supplementaries plutôt ? C'est vraiment du détail d'implem…
+
+Pour (i), (ii) and (iii), leurs valeurs sont séparées en décimales et unités et envoyées dans des entrées différentes pour permettre au contrôleur de distinguer facilement de faible variations.
+)
+
+On the other hand, the individual must decide on a level of cooperation once she is on a patch. This decision is made by another artificial neuron network called the ``cooperation'' network (plus some phenotypic variability, see below). As an input, this neural network only has the number of other individuals present on the same patch as the focal. This entails that we assume that the agent cannot modulate her cooperation level in function of others' cooperation level. This assumption is meant to exclude the possibility that partner control strategies may evolve, and allows us to focus only on the effect of partner choice.
+
+The connection weights of both networks constitute the genome of each agent. They evolve by natural selection as exposed in the section \ref{sec:evolutionaryalgo}.
+
+
+\subsubsection{Phenotypic variability of cooperation}\label{ssec:phenotypic_var}
+
+As is now well established in the litterature, selective pressures in favor of any form of conditional cooperation, and therefore in particular in favor of partner choice, stem from the presence of some variability in partners’ cooperative behavior (see \cite{McNamara2010c} for a review of this idea). In order to capture the effect of variability in the simplest possible way, here we consider the effect of phenotypic variance in the expression of individuals' genes. At each generation of our simulations, each individual is subject to the effect of a \emph{phenotypic noise} that modifies her cooperation level. If $x_i^g$ is the cooperation level chosen by the genes of an individual (i.e. decided by her cooperation network), then the actual cooperation level player by the individual is $x = x_i^g + \epsilon$, where $\epsilon$ is drawn ramdomly as follows. The interval $[-1, 1]$ is uniformly split in $N_T$ values, and every individual gets one value of $\epsilon$ chosen among these $N_T$ values without replacement.
+
+
+\subsection{The payoff function}
+
+Each individual $i$ present on a patch invests a given amount $x_i$ into cooperation --where $x_i$ is decided by the individual's cooperation network. Individuals present on the same patch play a modified version of the n-player prisoner's dilemma. Consider a focal individual $i$ playing $x_i$, in a patch on which there are $n-1$ other individuals whose average level of cooperation is $\bar{x}_{-i}$ . The payoff of individual $i$ is given by
+
+\begin{equation}
+P(x_i, \bar{x}_{-i}, n) = F(n)  \times  \left[ a x_{i} +  b  \bar{x}_{-i} - \frac{1}{2}  x_i^2\right]
+\end{equation}
+xx où $a$ représente le bénéfice propre de l'agent et $b$ represents the social benefit of others' cooperation, and the function $F(n)$ is meant to capture the fact that there is an optimal number of individuals exploiting a patch and is given by
+
+\begin{equation}
+F(n) = e^{ - \left( {n - \hat{n} } \right)^2  / (2\sigma^2) } \label{eq:friction}
+\end{equation}where $\hat{n}$ is the optimal number of individuals per patch and $\sigma$ measures the strength of the penalty that stem from being a submoptimal number of individuals on the same patch.
+
+This payoff function has been chosen in such a way that, in the absence of partner choice, the evolutionarily stable strategy is always to invest the individually optimal investment (i.e. $x_{ESS} = a$), whereas the ``socially optimal'' cooperation, that is the level of cooperation that would maximise the average payoff of individuals on the patch, is to invest $\hat{x} = a + b$.
+
+
+% \begin{figure}[htbp]
+%     \centering
+%     \includegraphics[width=\linewidth]{media/methods/payoff.pdf}
+%     \caption{Variation of the payoff for the focal player according to its partner investment strategy for $n=2$}
+%     \label{fig:payoff}
+% \end{figure}
+
+% Note: pourquoi social optimum quand P(x, x)? Nécessite une explication qui ne va pas de soi (?).
+
+
+
+\subsection{The evolutionary algorithm}\label{sec:evolutionaryalgo}
+
+Each individual has a genome composed of the weights of its two neural networks, which makes a total of 84 genes $g = (g_{1}, \ldots, g_{84})$ with $ g_{i} \in ]-10, 10[$. We consider a population of fixed size $N$. The first generation is composed of $N$ individuals with random genes for the neural network weights, drawn uniformly in $]-1, 1[$. We then use a fitness proportionate evolutionary algorithm to simulate  evolution.  After the $T$ time steps of a generation have taken place, individuals all reproduce and die. A new population of $N$ individuals is built out of the previous generation by sampling randomly among the $N$ parents in proportion to their cumulated payoff, according to a Wright-Fisher process.
+
+A mutation operator is applied on each offspring. Every gene of every offspring has a probability $\mu$ to mutate and a probability $1-\mu$ to stay unchanged. If a gene $g_i$, with value $v_i$, mutates, it has a probability $0.9$ to mutate according a normal distribution and thus reach a new value sampled in $\mathcal{N}(v_i, 0.1)$ and a probability $0.1$ to mutate according to a uniform distribution and thus reach a new value sampled in $\mathcal{U}(]b_{min}, b_{max}[)$.
+
+The evolutionary algorithm is run for $G$ generations.
+
+
+\begin{table}
+    \centering
+    \begin{tabular}{clc}
+        \hline
+        \textbf{Parameter} & \textbf{Description} & \textbf{Value}  \\
+        \hline
+        \textbf{Environment} & & \\
+        $N$ & Population size & $100$ \\
+        $d$ & Probability of disappearance of partches, per time step & $1/1\ 000$ \\
+        $T$ & Number of timesteps per generation & $1\ 000$ \\
+        $c_{m}$ & Cost of moving to another patch & $0$ \\
+        $N_T$ & Number of individuals in the local environment & var \\
+
+        \textbf{Payoff } & & \\
+        $a$ & Immediate personal benefit of cooperation & $5$ \\
+        $b$ & Social benefit of cooperation & $5$ \\
+        $\hat{n}$ & Optimal number of individuals per patch & var \\
+        $\sigma$ & Tolerance to variations in the number of individuals per patch & var \\
+        \textbf{Evolution} & & \\
+        $G$ & Number of generations & $1\ 500$ \\
+        $\mu$ & Probability of mutation per gene per generation & $0.01$ \\
+        \hline
+
+    \end{tabular}
+    \caption{Parameters of the simulation}
+    \label{tab:parameters}
+\end{table}
+
+
+\section{Results}
+
+\subsection{Cooperation cannot evolve when patches are scarce}
+
+We simulated the evolution of a population of $N_T=100$ individuals for $G=1500$ generations, for different values of the number of resource patches $\omega$, but always in a situation where the optimal number of individuals per patch was $\hat{n}=2$. Cooperation only evolved when patches were more abundant than a threshold (Fig.~\ref{fig:varyingopp}, a). This can be understood as follows. When resource patches are few, precisely when $\omega < \frac{N_T}{\hat{n}}$, individuals have little cooperation opportunities and there is therefore always more individuals per patch than what would be optimal (in this case, the optimal number of individuals per patch is $\hat{n}=2$). As a result, additional individuals joining a patch are more of a nuisance than a benefit, and there is therefore no benefit in trying to attract partners by appearing cooperative.
+
+\begin{figure}[tb]
+    \centering
+    \includegraphics[width=\columnwidth]{media/results/byprod/varopp_hatn_1.pdf}
+    \caption{Mean investment in simulation for different number of opportunities $\omega$ and a fixed population of $N_T=100$ individuals. Results after $1\,500$ generations. \textbf{a.}~When $\hat{n} = 2$ Cooperation evolves when $\omega \geq 50$. \textbf{b-c.}~For $\hat{n} \geq 3$, cooperative behaviours never evolve. \textbf{d.}~When $\sigma \to \infty$, there is no pressure for agent to attract partners and cooperative behaviours never evolve.}
+    %\par \small
+    %When there is not enough patches to host all the agents in the environment, agents have no outside options. Therefore, they have no better choice than staying with their current partners.
+    %If there is enough patches, agents can easily find available opportunities. Therefore, they have plenty of outside options. Partners must invest sufficiently enough to satisfy the agent.
+
+    \label{fig:varyingopp}
+\end{figure}
+
+
+We then simulated the evolution of cooperation in situations where the optimal number of individuals per patch, $\hat{n}$, was larger (Fig. \ref{fig:varyingopp}, b-c). Overall, the outcome was even less favorable to cooperation. This may seem paradoxical but can be understood as a consequence of the law of large numbers. When the number of individuals per patch is large, whether it is greater or less than $\hat{n}$, the effect of each individual on the average quality of her patch is very small anyway. There is therefore little value for an individual to invest in cooperation to try and attract partners.
+
+Finally, we performed the same simulations in the case where the number of individuals per patch is neutral ($\sigma \rightarrow \infty$, Fig. \ref{fig:varyingopp}, d). Cooperation did not evolve either and this can be understood also because there cannot be any benefit in attracting partners when the number of individuals per patch does not matter.
+
+Overall, the evolution of cooperation by partner choice can only take place in the restricted conditions where (i) there is an optimal number of individuals per resource patch, (ii) this optimal number is low, and (ii) the number of resource patches in the environment is large.
+
+
+\subsection{Cooperation cannot evolve when there are too many partners around}
+
+In a second step, we simulated again the evolution of a population of $N=100$ individuals for $G=1500$ generations in a situation where the optimal number of individuals per patch was $\hat{n}=2$, but this time we held the number of patches constant, $\omega = 20$, while varying the actual number of individuals, $N_T$, present together in the environment.
+
+In this case, cooperation only evolved when the number of individuals in the environment was intermediate. This can be understood as follows. When the number of individuals in the environment, $N_T$, is too close to the number of individuals, $\hat{n}$, that are needed to exploit at least one patch --or even more so when $N_T < \hat{n}$ , then the number of available partners is limiting. As a result, the actual number of cooperation opportunities from which individuals can choose is very low, partner choice is thus a weak force, and the benefit of investing into cooperation is low. On the other hand, when the number of individuals in the environment, $N_T$ is larger than the total number of individuals that can be accomodated on the available patches, that is when $N_T > \hat{n} \omega$, the number of available patches is limiting. In this case we find the result described above (Fig. \ref{fig:gridtol1}, a). The problem is rather that there are always too many individuals on each patch than too few and partner choice is also a weak force. There is, therefore, a range of intermediate population densities, neither too low nor too high, for which cooperation can evolve.
+
+We then performed the same simulations again, but with more patches available in the environment (i.e. for larger $\omega$, Fig. \ref{fig:gridtol1}, b, c). We observed that the range of population densities for which cooperation could evolve was then broader. This can again be understood in the above framework. On one hand, the lower boundary of population density, $N_T \approx \hat{n}$, below which the number of individuals is a limiting factor, is unaffected by the amount of patches available.  On the other hand, the upper boundary of population density, $N_T > \hat{n}\omega$, above which the number of patches is a limiting factor, increases with the amount of patches, $\omega$ . As a result, the width of the range of population densities where partner choice is effective increases.
+
+\begin{figure*}
+    \centering
+    \includegraphics[width=\textwidth]{media/results/byprod/grid_1.pdf}
+    \caption{Effect on the population size in the environment with 20, 40 or 80 patches and an optimal number ofagents $\hat{n} = 2, 3$ and $\sigma = 1$. Agents have a cooperative behaviour for $\hat{n} < N_T < \omega\times \hat{n}$ and for $\hat{n} = 2$.}
+    \label{fig:gridtol1}
+\end{figure*}
+
+We then performed the same simulations, but this time in situations where the optimal number of individuals per patch, $\hat{n}$, was larger. The outcome was even less favorable to cooperation (Fig.~\ref{fig:gridtol1}, e-p). This is again a consequence of the dilution of the benefit of being a cooperator to attract others, when cooperation takes place in too large groups.
+
+\section{Discussion}
+
+Partner choice can lead to the evolution of cooperation when individuals can compare several opportunities for social interaction and choose the most advantageous. In this article, we have shown that the conditions for this to happen are, however, quite restrictive. They entail  that individuals really have access to a range of social opportunities. Yet, in many cases, social opportunities are very rare because they necessitate the co-occurrence of two things at the same time: (i) at least one available partner, and (ii) an exploitable resource or, more generally, ``something to do'' with that partner. 
+
+Cooperation by partner choice can therefore evolve in two situations. First, it can evolve if a partner constitutes in itself a resource as there is, in this case,  no further requirement for a social opportunity than the need to find a partner. This occurs, for instance, in sexual markets, or in the many instances of interspecific mutualisms where the other individual alone constitutes an opportunity to cooperate. It is therefore logical that partner choice plays an important role in these two types of interactions (refs xx).
+
+Second, it can evolve if individuals are very efficient at extracting opportunities for cooperation from their environment. This is particularly the case in the human species. In the same environment, there are more opportunities for cooperation for human beings than for individuals of most other species. This is a consequence of our skill-intensive strategy that allows us to transform and thereby extract high-value resources from our environment (Kaplan xx). We can thus understand why our cooperation is  related to our cognitive abilities. Having skills that increase the number of opportunities to do useful things also brings with it the possibility of choosing between different opportunities. This puts greater pressure on individuals, who are competing to attract partners on their own opportunity, rather than on another. 
+
+xx revoir cette partie = TODO PAUL = faire la biblio sur les différentes hypothèses sur la relation entre intelligence et coopération (rapidement ; voir mon mail)
+There is a large number of hypotheses in the literature on the relationship between intelligence and cooperation and it is often difficult to  sort them out, both in terms of their empirical predictions and in terms of their theoretical plausibility. One particularly prominent theory, the social brain hypothesis, considers that our cognitive abilities are a secondary consequence of selective pressures steming from social life (refs xx). Cooperation is a complex problem to solve that requires the evolution of sophisticated cognitive devices and, so the theory goes, this led to the evolution of intelligence in other domains as well. The social brain hypothesis, however, is based on the premisse that selection to deal with specifically social problems leads to the evolution of intelligence in other domains as well, which is not plausible (refs xx). Another more recent hypothesis considers that causality goes both ways (refs xx west). Intelligence makes cooperation more efficient, which increases the range of situations in which cooperation can evolve, which in turn selects for more intelligence to better benefit from cooperation and so on. The present hypothesis is not in oposition to the latter. It constitutes an additional effect, which specifically concerns reciprocal cooperation --cooperation made to attract partners-- and not other forms of cooperation such as kin altruism or byproduct cooperation. According to our hypothesis, intelligence does not make cooperation more useful, it makes all actions in the world more efficient, and thus leads to greater competition between individuals to attract partners, thereby forcing them to cooperate more.
+
+On the other hand, partner choice cannot lead to the evolution of cooperation when individuals are not very effective in finding cooperation opportunities in their environment. This explains why, in many species, social interactions show no evidence of cooperation beyond immediate self-interest (refs xx). Even when individuals engage in collective actions, for example when they hunt collectively, others have so few outside options anyway that there is no need to seek to draw them into the collective actions. They will come anyway, for want of anything better to do. Even worse than that, as opportunities for cooperation are rare, not only are there always enough partners in each collective action without it being necessary to actively attract them, but in fact the opposite is true. There are always too \textit{many} indviduals participating in each cooperation endeavour. This has been documented for instance in pack hunting in Lions, where Packer showed that lionesses often hunt in groups that are too large compared to what would be optimal (refs xx). In such a case, the average gain per individual in a collective action is reduced and not increased by the participation of others, and there is therefore no selection to attract partners but rather a selection to push them away at the time of sharing.
+
+%The difficulty is that, even in these cases, during the collective action itself, individuals do behave in a coordinated manner for a common goal, as they all wish for the eventual success. Yet, this is no indication that they are cooperating for anything other than their immediate benefit. In many cases, individuals actually live in groups for an independent reason (e.g., to protect against  foreign males in the case of lionnesses) and the most prominent collective actions (e.g. group hunting) are merely unwanted by-products of  group life, occuring because individuals simply have nothing better to do.
+
+
+\bibliographystyle{vancouver}
+\footnotesize{
+\bibliography{references}}
+
+\clearpage
+
+\section*{Supplementary Materials}
+
+\begin{figure}[htbp]
+    \centering
+    \includegraphics[width=\columnwidth]{media/results/byprod/varopp_1.pdf}
+    \caption{Mean investment in simulation for different number of opportunities $\omega$ and a fixed population of $N_T=100$ individuals and an optimal number of $\hat{n} = 2$ agents per opportunity. Results after $1\,500$ generations and $\sigma = 1$. Cooperation evolves when $\omega \geq 50$}
+    %\par \small
+    %When there is not enough resources to host all the agents in the environment, agents have no outside options. Therefore, they have no better choice than staying with their current partners.
+    %If there is enough resources, agents can easily find available opportunities. Therefore, they have plenty of outside options. Partners must invest sufficiently enough to satisfy the agent.
+
+    \label{fig:varyingopptol1}
+\end{figure}
+
+\begin{figure*}
+    \centering
+    \includegraphics[width=\textwidth]{media/results/byprod/varNrowOpp_1.pdf}
+    \caption{Effect on the population size in the environment with 10, 20 or 40 patches and an optimal number of 2 agents and $\sigma = 1$}
+    \label{fig:varNrowOpptol1}
+\end{figure*}
+
+\begin{figure}
+    \centering
+    \includegraphics[width=\columnwidth]{media/results/byprod/varN_inf.pdf}
+    \caption{Effect on the population size in the environment with 40 patches and an optimal number of 2 agents, $\hat{n} = 2$ and $\sigma = \infty$.}
+    \label{fig:varNrowtolinf}
+\end{figure}
+
+\begin{figure}[htbp]
+    \centering
+    \includegraphics[width=\columnwidth]{media/results/byprod/varopp_3.pdf}
+    \caption{Mean investment in simulation for different numbers of opportunities $\omega$ and a fixed population of $N_T=100$ individuals and an optimal number of $\hat{n} = 2$ agents per opportunity. Results after $1\,500$ generations and $\sigma = 1$. Cooperation evolves when $\omega \geq N_T/\hat{n}$}
+    %\par \small
+    %When there is not enough patches to host all the agents in the environment, agents have no outside options. Therefore, they have no better choice than staying with their current partners.
+    %If there is enough patches, agents can easily find available opportunities. Therefore, they have plenty of outside options. Partners must invest sufficiently enough to satisfy the agent.
+
+    \label{fig:varyingopptol3}
+\end{figure}
+
+\begin{figure*}
+    \centering
+    \includegraphics[width=\textwidth]{media/results/byprod/varNrowOpp_3.pdf}
+    \caption{Effect on the population size in the environment with 10, 20 or 40 patches and an optimal number of $\hat{n} = 2$ agents and $\sigma = 3$. Cooperative behaviour evolve for $\hat{n} < N_T < \omega\times \hat{n}$}
+    \label{fig:varNrowOpptol3}
+\end{figure*}
+
+\begin{figure*}
+    \centering
+    \includegraphics[width=\textwidth]{media/results/byprod/grid_3.pdf}
+    \caption{Effect on the population size in the environment with 10, 20 or 40 patches and an optimal number of agent $\hat{n} = 2, 3, 4, 20$ and $\sigma = 3$. Cooperative behaviour evolve for $\hat{n} < N_T < \omega\times \hat{n}$ and $\hat{n} \leq 3$. The mean investment value is further from the socially optimal investment than with $\sigma = 1$.}
+    \label{fig:gridtol3}
+\end{figure*}
+
+
+\begin{figure}[htbp]
+    \centering
+    \includegraphics[width=\columnwidth]{media/results/byprod/varopp_inf.pdf}
+    \caption{Mean investment in simulation for different number of opportunities $\omega$ and a fixed population of $N_T=100$ individuals and an optimal number of $\hat{n} = 2$ agents per opportunity. Results after $1\,500$ generations and $\sigma = \infty$. Cooperation never evolves.}
+    %\par \small
+    %When there is not enough patches to host all the agents in the environment, agents have no outside options. Therefore, they have no better choice than staying with their current partners.
+    %If there is enough patches, agents can easily find available opportunities. Therefore, they have plenty of outside options. Partners must invest sufficiently enough to satisfy the agent.
+
+    \label{fig:varyingopptolinf}
+\end{figure}
+
+\begin{figure*}
+    \centering
+    \includegraphics[width=\textwidth]{media/results/byprod/varNrowOpp_inf.pdf}
+    \caption{Effect on the population size in the environment with 10, 20 or 40 patches and an optimal number of 2 agents and $\sigma = \infty$}
+    \label{fig:varNrowOpptolinf}
+\end{figure*}
+
+
+\end{document}
diff --git a/chapters/01introduction.tex b/chapters/01introduction.tex
new file mode 100644
index 0000000..e69de29
diff --git a/chapters/02stateoftheart.tex b/chapters/02stateoftheart.tex
new file mode 100644
index 0000000..8e8b7c0
--- /dev/null
+++ b/chapters/02stateoftheart.tex
@@ -0,0 +1,125 @@
+\section{L'évolution de la coopération}
+
+\subsection{L'évolution}
+\label{ssec:evolution}
+
+La coopération est un problème important en biologie évolutionnaire, qui étudie l'apparition et le maintien de traits morphologiques et comportementaux dans le vivant. D'après la théorie de la biologie évolutionnaire, chaque individu possède un certain nombres de caractéristiques qui peuvent être soit avantageuses soit désavantageuses pour l'individu dans son environnement comparé à ses pairs. Une caractéristique est dite avantageuse si elle permet à l'individu d'être plus adapté à son environnement que ses pairs, c'est-à-dire avoir une meilleure fitness. Cela signifie que l'individu a plus de capacité à survivre et à se reproduire s'il possède cette caractéristique comparé à ses pairs.
+
+Si cette caractéristique est transmissible par reproduction, puisque l'individu se reproduit plus que les autres membres de la population, la caractéristique plus adaptée est de plus en plus fréquente dans la population. La descendance de cet individu est elle aussi plus adaptée que la descendance des autres individus, elle a donc un plus grand succès reproducteur. La descendance de la descendance de l'individu adapté est elle aussi plus adaptée comparé à celle de la descendance de la descendance des autres individus et ainsi de suite. La fréquence de la nouvelle caractéristique dans la population augmente au fil des générations au point que la caractéristique soit virtuellement dans toute la population. La caractéristique s'est fixée dans la population. C'est le mécanisme de sélection naturelle.
+
+Ainsi, par sélection naturelle et reproduction, les caractéristiques les plus adaptées se propagent et se fixent dans les populations. Nous avons pris ici le cas simple d'une unique caractéristique, dans un environnement stationnaire. La fixation de caractéristiques peut être beaucoup plus complexe, parce que les caractéristiques qui définissent un individu peuvent avoir des interactions entre elles, ou bien l'environnement dans lequel évolue la population peut changer, soit de manière exogène au processus d'adaptation des individus, ou bien parce que le comportement des individus suite à la fixation de nouvelles caractéristiques viennent modifier celui-ci.
+
+Nous avons ici présenté le premier ingrédient des mécanismes de l'évolution, la sélection. Le second élément est la variation. Nous avons précédemment postulé qu'un individu avait une caractéristique différente des autres membres de sa population. Comment peut-on avoir des variations de caractéristiques ? Précédemment, nous avons fait l'hypothèse implicite d'une transmission parfaite des caractéristiques du parent à la descendance. Faisons maintenant l'hypothèse explicite contraire : Il est possible que lors de la reproduction, il y ait une probabilité que la caractéristique transmise soit légèrement altérée. Cette altération peut avoir des conséquences marginales (la caractéristique est plus ou moins fortement exprimée), ou elle peut même avoir des conséquences importantes, comme changer complètement la nature de la caractéristique. Cette nouvelle version de la caractéristique peut être elle aussi adaptée ou non, et donc subir le même processus de sélection que décrit précédemment. 
+
+Ainsi, le mécanisme de variation introduit de nouvelles caractéristiques dans la population, tandis que le processus de sélection vient conserver les caractéristiques les plus adaptées qui vont se fixer dans la population au détriment des moins adaptées qui vont elles s'éteindre. Ce processus, l'évolution, conduit un processus d'optimisation. À chaque génération, les caractéristiques qui maximisent la survie et la reproduction des individus se propagent.
+
+
+\begin{verbatim}
+ajouter exemples?
+\end{verbatim}
+
+\subsection{Le problème de la coopération}
+
+En biologie de l'évolution, la coopération peut être définie comme le fait d'agir pour apporter un bénéfice à un autre individu. C'est un comportement très déroutant pour la biologie évolutionnaire. En effet, comment un tel comportement peut-il être adapté ? Un individu qui coopère dépense son temps et son énergie afin d'aider un autre individu. Il dépense des ressources pour augmenter la fitness, le succès reproducteur d'un autre. S'il est aisé de comprendre qu'être le bénéficiaire d'une coopération soit adapté et que ce comportement puisse être conservé par sélection naturelle, si tant est que ce serait transmissible, il est déroutant que la caractéristique \emph{d'être} coopérateur puisse être adapté. Il est difficile d'imaginer que cette caractéristique puisse être conservée par sélection naturelle. 
+
+Pourtant, de nombreux cas de coopération sont observés dans le vivant. Tout d'abord, l'un des exemples les plus marquants pourrait être les insectes eusociaux, dont font partie les fourmis ou les abeilles. Pour ces espèces, les membres de colonies entières se coordonnent et travaillent ensemble afin de permettre le succès reproducteur de leurs reines. C'est un cas de coopération que nous développerons dans la section~\ref{ssec:altruism_kin}~\emph{\nameref{ssec:altruism_kin}}. Des comportements coopératifs sont aussi observés chez les gobemouches noirs, qui aident les autres gobemouches lorsqu'ils se font chassés par une chouette en attaquant collectivement le prédateur.  Ainsi, les gobemouches qui viennent aider la proie risquent leurs vies pour sauver celle d'un autre individu. Enfin, les humains sont de très grands coopérateurs. Ils possèdent des structures sociales complexes, produisent leurs ressources ensemble, les échangent, donnent à des organisations caritatives.
+
+
+C'est comportements coopératifs sont comme dit précédemment très déroutant. Par exemple, il est aisé d'imaginer qu'un gobemouche noir qui n'attaque pas le prédateur d'un de ses partenaires pourrait avoir une meilleure fitness en ne mettant pas en danger sa vie. Comment expliquer que ces comportements n'ait pas été contre-sélectionnés ? Cette question est très fortement étudiée en biologie de l'évolution. De nombreux travaux ont identifié différents mécanismes qui permettrait de rendre les comportements de coopération viables. Ces mécanismes sont détaillés dans les prochaines sections.
+
+
+
+\subsection{L'altruisme et l'apparentement}
+\label{ssec:altruism_kin}
+
+Une première explication des comportements coopératifs est la sélection de parentèle \citep{Hamilton1964}. Les explications de l'évolution donnés dans la section~\ref{ssec:evolution}~\emph{\nameref{ssec:evolution}} étaient centrées sur l'individu. Cependant, la propagation de caractéristiques est plus complexe que cela. L'individu n'est qu'un véhicule transportant la caractéristique, et celle-ci . Le support de cette caractéristique dans le vivant est le gène. Le gène code cette caractéristique. Le support même. \todo{compléter} 
+
+Ainsi, il peut être rentable pour un individu d'aider un de ses apparentés, si le bénéfices $b$ pour le bénéficiaires pondérées par le degré d'apparentement $r$ surpasse les coûts engagés par l'acteur $c$, alors le comportement altruistique est stable. C'est la règle d'Hamilton définie dans l'équation~\ref{eq:hamiltonrule}.
+
+\begin{equation}
+r \times b > c \label{eq:hamiltonrule}
+\end{equation}
+
+
+Dans ce cas là, c'est aider des parties de soi plutôt que d'aider qqun d'autre. Cette implémentation de la coopération est mise en places chez les insectes eusociaux tel que les fourmis ou les abeilles, où le degré d'apparentement entre les individus d'une même colonie est de 75\%, ou bien entre les cellules d'organismes multicellulaires, où le niveau d'apparentement est ici de 100\%.
+
+Cependant, cela permet de comprendre qu'un sous-ensemble des comportements observés dans le vivant. Comment les comportements de coopération entre individus d'espèces différentes ont pu évoluer ? De même, comment les comportements entre individus d'une même espèce mais non-apparentés, comme on l'observe chez l'Humain ou chez les chauves-souris vampires par exemple peuvent se développer.
+
+\subsection{La réciprocité}
+
+Un gène ne peut être conservée au cours de l'évolution uniquement si ce gène déploie un phénotype plus adapté que le phénotype de ses concurrents pour sa reproduction. Comme vu dans la section~\ref{ssec:altruism_kin}~\emph{\nameref{ssec:altruism_kin}}, cela peut se faire de manière indirecte, ce qui permet d'expliquer les comportements de coopération entre apparentés. Dans le cas de coopération entre non-apparentés, c'est donc que le comportement de coopération exprimé par le gène d'un acteur apporte bien un bénéfice à cet acteur lui-même. En effet, sans sélection de parentèle, l'expression du gène ne possède aucun moyen de déterminer si le bénéficiaire de la coopération possède lui aussi le même exemplaire de ce gène. Quand bien même le bénéficiaire posséderait les mêmes expressions phénotypiques que l'acteur, cela ne garantit pas la propagation du gène responsable pour ce comportement. \todo{green beard \cite{Dawkins1976}} Ainsi, un gène qui code pour un comportement coopératif, c'est-à-dire qui code un comportement aillant un coût pour l'acteur et un bénéfice pour le bénéficiaire, doit forcément apporter un bénéfice comparé à ses concurrents.
+
+C'est le cas si le comportement coopératif de l'acteur entraîne un changement de comportement sur le bénéficiaire, comme le propose \citet{Trivers1971}. Ainsi, il est intéressant d'agir de manière coopérative avec un individu qui le sera lui aussi avec nous en retour. C'est la coopération conditionnelle ou la réciprocité. Cette réciprocité peut être soit positive, c'est à dire que le bénéficiaire coopère avec l'acteur en réponse à la coopération ; soit négative, le bénéficiaire punit l'acteur si celui-ci ne coopère pas. Dans les deux cas, il est dans l'intérêt de l'acteur de coopérer, puisque cela maximise son gain en vu de la réaction du bénéficiaire. Notons cependant qu'agir de manière coopérative n'a de sens ici uniquement si cela a un effet sur le comportement du bénéficiaire. Si le bénéficiaire aurait de toute façon aider l'acteur après coût, ou bien si le bénéficiaire lui aurait de toute façon causer dommage, alors l'acteur n'a aucun intérêt à coopérer en faveur du bénéficiaire. C'est en cela qu'il y a réciprocité. Les comportements de réciprocité peuvent être implémenté très facilement et sont particulièrement robuste, comme l'a montré \citet{Axelrod1981}. Ainsi, dans le cadre d'une situation de coopération modélisable comme un dilemme du prisonnier, avec interactions répétées, le comportement de tit-for-tat est une stratégie de réciprocité extrêmement robuste. De même, les comportements de punition chez les poissons nettoyeurs avec leurs clients \citep{bsharyxx}. Les comportements tit-for-tat se retrouvent aussi avec les gobemouches noirs \citep{Krams2008}, qui ne viennent défendre que les gobemouches noirs qui les ont déjà défendu auparavant, mais jamais ceux qui ne sont jamais venu les aider alors qu'ils en avaient besoin.
+
+Il y a ainsi deux types de réciprocité directe: Le Partner Fidelity Feedback et le Partner Choice \citep{Sachs2004}.
+
+\subsection{Le choix du partenaire et les marchés biologiques}
+
+\xx{Parler d'opportunités de coopération !!!}
+
+Parmi les implémentations de la coopération par réciprocité, le choix du partenaire semble être apparu de nombreuses fois et être un mécanisme particulièrement efficace. Dans l'espèce Humaine, il joue un rôle prépondérant dans le maintien des comportements de coopération. Le choix du partenaire permet l'apparition et le maintient de la coopération. Pour le comprendre, ne nous centrons plus sur l'acteur de la coopération, mais le bénéficiaire. Dans une tâche collective, il est toujours pertinent pour le bénéficiaire de la tâche collective de se mettre avec le meilleur acteur possible, c'est-à-dire l'acteur qui lui permettra d'obtenir le plus gros gain. Puisque pour réaliser la tâche, qui est \emph{collective}, l'acteur a aussi besoin du bénéficiaire, il est alors dans l'intérêt des acteurs d'être le plus coopératif possible afin d'être choisi par un bénéficiaire. Ainsi, il y a une pression a être coopératif afin d'être choisi par un bénéficiaire. Cette pression est d'autant plus forte si le nombre d'acteurs est particulièrement grand par rapport au nombre de bénéficiaire. Il y a un effet de marché \citep{Noe1994}.
+
+Imaginons maintenant une population d'individus qui cherche à être avec le meilleur partenaire possible, et que tous les individus présents dans cette population sont égoïstes. Dans cette population apparaît un mutant qui coopère plus que les autres. Ce mutant va être particulièrement recherché par les autres individus de la population. Il va donc interagir beaucoup et obtenir de nombreux gains. De la même manière, puisque de nombreux individus vont vouloir coopérer avec lui, il va pouvoir être sélectif. Il va pouvoir refuser les interactions avec les moins efficaces pour choisir les interactions avec les plus performants. Il y a donc un assortative matching qui se met en place. Les individus les plus performants qui interagissent ensemble vont recevoir de très nombreux bénéfices de leurs interactions. Ces bénéfices ont un impact positif sur leur fitness et ils vont se propager dans la population. Le fait d'être coopératif va se fixer dans la population et tous les individus seront des coopérateurs.
+
+Le choix du partenaire
+
+Ces mécanismes de choix du partenaire se retrouvent dans la nature. C'est ainsi eux qu'on observe dans les mutualismes inter-spécifique chez les cleaners fishes et leurs clients \citep{Bshary2002b} \todo{détailler} ou bien chez les mutualismes legumes-rhizobium \todo{source + détail}.
+
+
+partner switching
+
+\cite{Aktipis2011}
+
+
+\begin{verbatim}
+Choisir un partenaire pour le gain qu'il nous apporte, non pas pour l'aider lui. crée une boucle
+- grooming
+- vampire bats
+- cleaner/client
+\end{verbatim}
+
+\subsection{Pourquoi la coopération n'est pas partout ?}
+
+Bien que nous nous demandions en début de ce chapitre comment la coopération pourrait évoluer, après l'étude des différents mécanismes qui puissent l'implémenter, c'est maintenant la question contraire qui apparaît. Comment se fait-il que la coopération entre non-apparentés soit en fait si rare dans la nature ? En effet \xx{exemple de pas coopération même si c'est tendu}, et pourtant le choix du partenaire est un mécanisme particulièrement puissant chez l'Homme \todo{cite}. 
+
+Quels sont les paramètres qui pourrait empêcher l'apparition de réciprocité ?
+
+Tout d'abord, il faut souligner le problème du bootstrapping. S'il est aisé de comprendre comment les mécanismes de réciprocité peuvent maintenir les comportements de coopération, il est en fait plus compliqué d'expliquer comment ce mécanisme peut apparaître de lui-même. En effet, la réciprocité -- tant la Partner Fidelity Feedback et le Partner Choice -- requiert.
+
+bootstrapping
+Dans ce cas, pourquoi la coop n'est pas partout? Qu'est-ce qui la bloque ?
+Objectif de la thèse : capturer le pourquoi la coop c'est difficile
+
+
+%%%%%%%%%%%%%%%
+%%%%%%%%%%%%%%%
+
+\section{Modèles et méthodes}
+
+\subsection{Les modèles analytiques}
+
+Théorie des jeux évolutionnaire
+
+\subsection{Les modèles agents centrés}
+
+McNamara, Aktipis(?)
+
+\subsection{La robotique évolutionniste}
+
+importance des problèmes de coordination et mécanistes.
+Intéressant d'avoir des robots capables de coop
+
+\subsection{Le mapping génotypique-phénotypique}
+
+%%%%%%%%%%%%%%
+%%%%%%%%%%%%%%
+
+\section{Objectif de la thèse}
+
+\begin{verbatim}
+    
+comprendre pourquoi la coopération bloque, pourquoi c'est difficile à obtenir
+- Utilisation de modèles agents centrés et robotiques pour capturer les difficultés présentes et que ne capturent pas les autres modèles
+-> Contraintes de coordinations, d'accès au ressources, de densité, de navigation
+\end{verbatim}
+
diff --git a/chapters/03Lions.tex b/chapters/03Lions.tex
new file mode 100644
index 0000000..194838e
--- /dev/null
+++ b/chapters/03Lions.tex
@@ -0,0 +1,168 @@
+Several mechanisms have been identified to explain the evolution of cooperation among non-kin \citep{Trivers1971, MaynardSmith1974, Axelrod1981}, including positive reciprocity \cite{Trivers1971, Axelrod1981, Andre2007}, punishment \cite{Bshary2005, Raihani2012} or partner choice \cite{Eshel1982, Bull1991, West2007, Schino2017}. Among these mechanisms, partner choice has been considered over the last twenty years as having probably played a particularly important role \cite{Baumard2013a, +ref}. When individuals can choose among several different partners, which they can compare and compete against each other as in an economic market, this generates a selection pressure to cooperate more, to appear as a good partner and attract others' cooperation \cite{Noe1994}.
+
+The effects of partner choice have been well documented in a large number of biological systems. For example, in the interaction between cleaner fishes and their clients the law of supply and demand determines the way in which the added value of the interaction is shared, in accordance with market principles \cite{Bshary2006}. When cleaners are rare, clients tolerate cheating on their part, while they become more picky when cleaners are numerous. The effects of partner choice have also been documented in primate grooming behavior in two meta-analyses, showing that female primates groom preferentially those that groom them most and that a positive relation exists between grooming and agonistic support \citep{Schino2007, Schino2008}. In vervet monkeys, individuals groom others in exchange for access to food and they do so for longer periods when fewer partners are available \cite{Fruteau2009}. Beyond cooperation, partner choice also plays a decisive role in mating, leading to the evolution of secondary sexual caracteristics and nuptial gifts, and/or to assortative matching (refs xxxTODOxxx \cite{Zahavi1975, xxTerrain}. Lastly, the effects of partner choice have also been documented in humans where it has been shown that the need to attract social partners is a major driver of cooperation \citep{Barclay2007a, Barclay2015, Barclay2016, Debove2015b,  Andre2011, Baumard2013a}.
+
+% xx \cite{Clutton-brock2009} qui discute plein de cas de réciprocité qui pourrait n'être qu'en fait manipulations et mutualismes
+
+
+There are, however, a number of biological situations in which one would typically expect partner choice to play an important role, but where no such effect has ever been demonstrated. These include most intraspecific collective actions in non-human animals. This is particularly salient in collective hunts such as collobus hunting in chimpanzees, or pack hunting in carnivores. No empirical evidence in these species suggests that individuals cooperate for reasons related to partner choice, either to attract partners or to be accepted by them in their hunts. On the contrary, the majority of available data are consistent with the more parcimonious explanation that individuals are simply doing what is in their immediate best interest at any given time \cite{Packer1986,Packer1988a, Melis2008, Melis2011}. In particular, if cooperation in collective hunts was driven in part by the need to appear as a good partner, individuals would be expected to  willingly share the product of their hunts in a way that depends on everyone's actual engagement, to encourage participation in other hunts in the future. However, such voluntary and conditional sharing has never been documented in animal collective hunts \cite{Melis2011}. In evolutionary terms, therefore, collective hunting in these species is most likely an instance of \textit{byproduct} cooperation, rather than an instance of reciprocal cooperation based on partner choice.
+
+Yet several models on the evolution of cooperation by  partner choice suggest that cooperation should evolve in these situations \cite{McNamara2008, Aktipis2011, Barclay2011, Campenni2014}. And, in humans, behaviours in collective actions are driven by the need to appear as a good partner, especially when it comes to sharing the benefits of cooperation (refs Alvard xx \cite{Baumard2013a}). One may therefore wonder why the same effects did not produce the same consequences in other species.
+
+Such a lack of observation could always be the consequence of methodological difficulty in empirically proving the existence of partner choice. However, we would like to suggest an alternative here, namely that there is in fact a strong constraint impeding partner choice in a large number of situations in animals.
+
+Partner choice requires that individuals can compare and choose among several opportunities for cooperation. In some cases, \textit{partners} themselves constitute opportunities for cooperation and partner choice then only requires that partners are many and accessible. This is the case, for instance, in mating markets, or in most instances of interspecific mutualism. In other cases, however, finding an opportunity for cooperation requires more than just finding a partner. This is what happens when cooperation consists of several individuals working together to exploit environmental resources. In this case, a cooperation opportunity requires both a partner(s) and a resource, which imposes an additional constraint limiting the scope of partner choice. When resources are scarce, there are always few options to compare, and partner choice cannot operate. This could explain the lack of cooperation, beyond by-product cooperation, in many instances of collective actions in the wild despite the availability of potential partners.
+
+In this article, we aim to test this idea using agent-based simulations. To do this, we simulate the evolution of agents placed in an environment containing resources that can be exploited collectively. We show that, in a low-resource environment, and even if there are plenty of partners, partner choice is not able to drive the evolution of cooperation as individuals cannot pit the few cooperation opportunities against each other. What is more, we also show that the number of potential partners actually has a negative effect on the evolution of cooperation when patches are scarce. When there are too many potential partners relative to the amount of patches available, there are always too many individuals on any given resource as individuals have nothing else to do anyway. Hence, there is no point in trying to attract partners but on the contrary there are benefits in trying to limit their number. We therefore show that partner choice is only effective when the number of available partners lies within a precise range of values, all the narrower as the availability of patches is low.
+
+We believe that this constraint plays a central role in explaining that, in many species, although individuals do participate in collective actions, sometimes finely coordinating their behaviour with that of others, individuals do not actually seek to cooperate beyond what is in their immediate personal interest. On the contrary, thanks to its cognitive capacities, the human species is able to extract resources from a greater variety of situations. As a result, we actually live in an environment that is much richer in resources than other species. Hence we can compare and compete a greater diversity of opportunities for cooperation against one another, and we are thus forced to cooperate more intensively to attract partners.
+
+
+\section{Methods}
+
+We consider a population of $N_T$ individuals living in an environment consisting of $\omega$ different patches on which resources are located. Every generation of the simulations is constituted of $T$ time steps during which individuals gather payoff units. At the end of these $T$ time steps, individuals reproduce in proportion to their total payoff, and die. During a time step, every individual is considered one by one in a random order. When her turn comes, an individual evaluates each of the $\omega$ patches of the environment, including the patch where she is currently located, assigns each a score, and then moves toward the patch with the highest score, or stays on her current patch if that's the one with the highest score. Once every individual has taken this decision, individuals express their cooperation strategy on their local patch, and they collect a payoff that depends on their own and their partners' cooperation strategy. Patches can disappear every time step, with a probability $d$, and are then immediately replaced by an empty patch.
+
+xx La taille de la population totale $N$ est toujours constante quelque soit le nombre d'individus présents dans l'environnement $N_T$ afin d'avoir le même nombre d'évaluations d'individus dans toutes les conditions. Pour $N_T < N$, $N_E = \lceil N_T / N \rceil$ environnements sont créés. Les individus sont répartis aléatoirement dans ces environnements afin que chaque environnement comporte $N_T$ individus. Pour le dernier environnement à compléter, s'il n'y a pas $N_T$ individus encore disponibles, alors des individus tirés d'autres environnements sont inclus dans l'environnement pour le compléter. Les gains obtenus par ces individus dans cet environnement ne sont pas considérés pour le calcul de leurs fitnesses.
+
+\subsection{The decision-making mechanisms}
+
+The individuals' strategy in this environment consists of two separate decisions.
+
+On the one hand, the individual must evaluate the different patches available and assign a score to each. This decision is made by an artificial neural network, called the "patch ranking" network. For each patch, this neural network has the following input information: (i) the number of other individuals already present on the patch, (ii) the average level of cooperation expressed by these individuals in the last time step, (iii) the level of cooperation that the focal individual would express should he join this patch, and (iv) a binary that indicates whether or not the individual would have to move in space in order to join this patch (i.e. this binary distinguishes the patch where the individual is currently located from all other patches).
+
+(xx En supplementaries plutôt ? C'est vraiment du détail d'implem…
+
+Pour (i), (ii) and (iii), leurs valeurs sont séparées en décimales et unités et envoyées dans des entrées différentes pour permettre au contrôleur de distinguer facilement de faible variations.
+)
+
+On the other hand, the individual must decide on a level of cooperation once she is on a patch. This decision is made by another artificial neuron network called the ``cooperation'' network (plus some phenotypic variability, see below). As an input, this neural network only has the number of other individuals present on the same patch as the focal. This entails that we assume that the agent cannot modulate her cooperation level in function of others' cooperation level. This assumption is meant to exclude the possibility that partner control strategies may evolve, and allows us to focus only on the effect of partner choice.
+
+The connection weights of both networks constitute the genome of each agent. They evolve by natural selection as exposed in the section \ref{sec:evolutionaryalgo}.
+
+
+\subsubsection{Phenotypic variability of cooperation}\label{ssec:phenotypic_var}
+
+As is now well established in the litterature, selective pressures in favor of any form of conditional cooperation, and therefore in particular in favor of partner choice, stem from the presence of some variability in partners’ cooperative behavior (see \cite{McNamara2010c} for a review of this idea). In order to capture the effect of variability in the simplest possible way, here we consider the effect of phenotypic variance in the expression of individuals' genes. At each generation of our simulations, each individual is subject to the effect of a \emph{phenotypic noise} that modifies her cooperation level. If $x_i^g$ is the cooperation level chosen by the genes of an individual (i.e. decided by her cooperation network), then the actual cooperation level player by the individual is $x = x_i^g + \epsilon$, where $\epsilon$ is drawn ramdomly as follows. The interval $[-1, 1]$ is uniformly split in $N_T$ values, and every individual gets one value of $\epsilon$ chosen among these $N_T$ values without replacement.
+
+
+\subsection{The payoff function}
+
+Each individual $i$ present on a patch invests a given amount $x_i$ into cooperation --where $x_i$ is decided by the individual's cooperation network. Individuals present on the same patch play a modified version of the n-player prisoner's dilemma. Consider a focal individual $i$ playing $x_i$, in a patch on which there are $n-1$ other individuals whose average level of cooperation is $\bar{x}_{-i}$ . The payoff of individual $i$ is given by
+
+\begin{equation}
+P(x_i, \bar{x}_{-i}, n) = F(n)  \times  \left[ a x_{i} +  b  \bar{x}_{-i} - \frac{1}{2}  x_i^2\right]
+\end{equation}
+xx où $a$ représente le bénéfice propre de l'agent et $b$ represents the social benefit of others' cooperation, and the function $F(n)$ is meant to capture the fact that there is an optimal number of individuals exploiting a patch and is given by
+
+\begin{equation}
+F(n) = e^{ - \left( {n - \hat{n} } \right)^2  / (2\sigma^2) } \label{eq:friction}
+\end{equation}where $\hat{n}$ is the optimal number of individuals per patch and $\sigma$ measures the strength of the penalty that stem from being a submoptimal number of individuals on the same patch.
+
+This payoff function has been chosen in such a way that, in the absence of partner choice, the evolutionarily stable strategy is always to invest the individually optimal investment (i.e. $x_{ESS} = a$), whereas the ``socially optimal'' cooperation, that is the level of cooperation that would maximise the average payoff of individuals on the patch, is to invest $\hat{x} = a + b$.
+
+
+% \begin{figure}[htbp]
+%     \centering
+%     \includegraphics[width=\linewidth]{lions/methods/payoff.pdf}
+%     \caption{Variation of the payoff for the focal player according to its partner investment strategy for $n=2$}
+%     \label{fig:payoff}
+% \end{figure}
+
+% Note: pourquoi social optimum quand P(x, x)? Nécessite une explication qui ne va pas de soi (?).
+
+
+
+\subsection{The evolutionary algorithm}\label{sec:evolutionaryalgo}
+
+Each individual has a genome composed of the weights of its two neural networks, which makes a total of 84 genes $g = (g_{1}, \ldots, g_{84})$ with $ g_{i} \in ]-10, 10[$. We consider a population of fixed size $N$. The first generation is composed of $N$ individuals with random genes for the neural network weights, drawn uniformly in $]-1, 1[$. We then use a fitness proportionate evolutionary algorithm to simulate  evolution.  After the $T$ time steps of a generation have taken place, individuals all reproduce and die. A new population of $N$ individuals is built out of the previous generation by sampling randomly among the $N$ parents in proportion to their cumulated payoff, according to a Wright-Fisher process.
+
+A mutation operator is applied on each offspring. Every gene of every offspring has a probability $\mu$ to mutate and a probability $1-\mu$ to stay unchanged. If a gene $g_i$, with value $v_i$, mutates, it has a probability $0.9$ to mutate according a normal distribution and thus reach a new value sampled in $\mathcal{N}(v_i, 0.1)$ and a probability $0.1$ to mutate according to a uniform distribution and thus reach a new value sampled in $\mathcal{U}(]b_{min}, b_{max}[)$.
+
+The evolutionary algorithm is run for $G$ generations.
+
+
+\begin{table}
+    \centering
+    \begin{tabular}{clc}
+        \hline
+        \textbf{Parameter} & \textbf{Description} & \textbf{Value}  \\
+        \hline
+        \textbf{Environment} & & \\
+        $N$ & Population size & $100$ \\
+        $d$ & Probability of disappearance of partches, per time step & $1/1\ 000$ \\
+        $T$ & Number of timesteps per generation & $1\ 000$ \\
+        $c_{m}$ & Cost of moving to another patch & $0$ \\
+        $N_T$ & Number of individuals in the local environment & var \\
+
+        \textbf{Payoff } & & \\
+        $a$ & Immediate personal benefit of cooperation & $5$ \\
+        $b$ & Social benefit of cooperation & $5$ \\
+        $\hat{n}$ & Optimal number of individuals per patch & var \\
+        $\sigma$ & Tolerance to variations in the number of individuals per patch & var \\
+        \textbf{Evolution} & & \\
+        $G$ & Number of generations & $1\ 500$ \\
+        $\mu$ & Probability of mutation per gene per generation & $0.01$ \\
+        \hline
+
+    \end{tabular}
+    \caption{Parameters of the simulation}
+    \label{tab:parameters}
+\end{table}
+
+
+\section{Results}
+
+\subsection{Cooperation cannot evolve when patches are scarce}
+
+We simulated the evolution of a population of $N_T=100$ individuals for $G=1500$ generations, for different values of the number of resource patches $\omega$, but always in a situation where the optimal number of individuals per patch was $\hat{n}=2$. Cooperation only evolved when patches were more abundant than a threshold (Fig.~\ref{fig:varyingopp}, a). This can be understood as follows. When resource patches are few, precisely when $\omega < \frac{N_T}{\hat{n}}$, individuals have little cooperation opportunities and there is therefore always more individuals per patch than what would be optimal (in this case, the optimal number of individuals per patch is $\hat{n}=2$). As a result, additional individuals joining a patch are more of a nuisance than a benefit, and there is therefore no benefit in trying to attract partners by appearing cooperative.
+
+\begin{figure}[tb]
+    \centering
+    \includegraphics[width=\columnwidth]{lions/results/byprod/varopp_hatn_1.pdf}
+    \caption{Mean investment in simulation for different number of opportunities $\omega$ and a fixed population of $N_T=100$ individuals. Results after $1\,500$ generations. \textbf{a.}~When $\hat{n} = 2$ Cooperation evolves when $\omega \geq 50$. \textbf{b-c.}~For $\hat{n} \geq 3$, cooperative behaviours never evolve. \textbf{d.}~When $\sigma \to \infty$, there is no pressure for agent to attract partners and cooperative behaviours never evolve.}
+    %\par \small
+    %When there is not enough patches to host all the agents in the environment, agents have no outside options. Therefore, they have no better choice than staying with their current partners.
+    %If there is enough patches, agents can easily find available opportunities. Therefore, they have plenty of outside options. Partners must invest sufficiently enough to satisfy the agent.
+
+    \label{fig:varyingopp}
+\end{figure}
+
+
+We then simulated the evolution of cooperation in situations where the optimal number of individuals per patch, $\hat{n}$, was larger (Fig. \ref{fig:varyingopp}, b-c). Overall, the outcome was even less favorable to cooperation. This may seem paradoxical but can be understood as a consequence of the law of large numbers. When the number of individuals per patch is large, whether it is greater or less than $\hat{n}$, the effect of each individual on the average quality of her patch is very small anyway. There is therefore little value for an individual to invest in cooperation to try and attract partners.
+
+Finally, we performed the same simulations in the case where the number of individuals per patch is neutral ($\sigma \rightarrow \infty$, Fig. \ref{fig:varyingopp}, d). Cooperation did not evolve either and this can be understood also because there cannot be any benefit in attracting partners when the number of individuals per patch does not matter.
+
+Overall, the evolution of cooperation by partner choice can only take place in the restricted conditions where (i) there is an optimal number of individuals per resource patch, (ii) this optimal number is low, and (ii) the number of resource patches in the environment is large.
+
+
+\subsection{Cooperation cannot evolve when there are too many partners around}
+
+In a second step, we simulated again the evolution of a population of $N=100$ individuals for $G=1500$ generations in a situation where the optimal number of individuals per patch was $\hat{n}=2$, but this time we held the number of patches constant, $\omega = 20$, while varying the actual number of individuals, $N_T$, present together in the environment.
+
+In this case, cooperation only evolved when the number of individuals in the environment was intermediate. This can be understood as follows. When the number of individuals in the environment, $N_T$, is too close to the number of individuals, $\hat{n}$, that are needed to exploit at least one patch --or even more so when $N_T < \hat{n}$ , then the number of available partners is limiting. As a result, the actual number of cooperation opportunities from which individuals can choose is very low, partner choice is thus a weak force, and the benefit of investing into cooperation is low. On the other hand, when the number of individuals in the environment, $N_T$ is larger than the total number of individuals that can be accomodated on the available patches, that is when $N_T > \hat{n} \omega$, the number of available patches is limiting. In this case we find the result described above (Fig. \ref{fig:gridtol1}, a). The problem is rather that there are always too many individuals on each patch than too few and partner choice is also a weak force. There is, therefore, a range of intermediate population densities, neither too low nor too high, for which cooperation can evolve.
+
+We then performed the same simulations again, but with more patches available in the environment (i.e. for larger $\omega$, Fig. \ref{fig:gridtol1}, b, c). We observed that the range of population densities for which cooperation could evolve was then broader. This can again be understood in the above framework. On one hand, the lower boundary of population density, $N_T \approx \hat{n}$, below which the number of individuals is a limiting factor, is unaffected by the amount of patches available.  On the other hand, the upper boundary of population density, $N_T > \hat{n}\omega$, above which the number of patches is a limiting factor, increases with the amount of patches, $\omega$ . As a result, the width of the range of population densities where partner choice is effective increases.
+
+\begin{figure*}
+    \centering
+    \includegraphics[width=\textwidth]{lions/results/byprod/grid_1.pdf}
+    \caption{Effect on the population size in the environment with 20, 40 or 80 patches and an optimal number ofagents $\hat{n} = 2, 3$ and $\sigma = 1$. Agents have a cooperative behaviour for $\hat{n} < N_T < \omega\times \hat{n}$ and for $\hat{n} = 2$.}
+    \label{fig:gridtol1}
+\end{figure*}
+
+We then performed the same simulations, but this time in situations where the optimal number of individuals per patch, $\hat{n}$, was larger. The outcome was even less favorable to cooperation (Fig.~\ref{fig:gridtol1}, e-p). This is again a consequence of the dilution of the benefit of being a cooperator to attract others, when cooperation takes place in too large groups.
+
+\section{Discussion}
+
+Partner choice can lead to the evolution of cooperation when individuals can compare several opportunities for social interaction and choose the most advantageous. In this article, we have shown that the conditions for this to happen are, however, quite restrictive. They entail  that individuals really have access to a range of social opportunities. Yet, in many cases, social opportunities are very rare because they necessitate the co-occurrence of two things at the same time: (i) at least one available partner, and (ii) an exploitable resource or, more generally, ``something to do'' with that partner. 
+
+Cooperation by partner choice can therefore evolve in two situations. First, it can evolve if a partner constitutes in itself a resource as there is, in this case,  no further requirement for a social opportunity than the need to find a partner. This occurs, for instance, in sexual markets, or in the many instances of interspecific mutualisms where the other individual alone constitutes an opportunity to cooperate. It is therefore logical that partner choice plays an important role in these two types of interactions (refs xx).
+
+Second, it can evolve if individuals are very efficient at extracting opportunities for cooperation from their environment. This is particularly the case in the human species. In the same environment, there are more opportunities for cooperation for human beings than for individuals of most other species. This is a consequence of our skill-intensive strategy that allows us to transform and thereby extract high-value resources from our environment (Kaplan xx). We can thus understand why our cooperation is  related to our cognitive abilities. Having skills that increase the number of opportunities to do useful things also brings with it the possibility of choosing between different opportunities. This puts greater pressure on individuals, who are competing to attract partners on their own opportunity, rather than on another. 
+
+xx revoir cette partie = TODO PAUL = faire la biblio sur les différentes hypothèses sur la relation entre intelligence et coopération (rapidement ; voir mon mail)
+There is a large number of hypotheses in the literature on the relationship between intelligence and cooperation and it is often difficult to  sort them out, both in terms of their empirical predictions and in terms of their theoretical plausibility. One particularly prominent theory, the social brain hypothesis, considers that our cognitive abilities are a secondary consequence of selective pressures steming from social life (refs xx). Cooperation is a complex problem to solve that requires the evolution of sophisticated cognitive devices and, so the theory goes, this led to the evolution of intelligence in other domains as well. The social brain hypothesis, however, is based on the premisse that selection to deal with specifically social problems leads to the evolution of intelligence in other domains as well, which is not plausible (refs xx). Another more recent hypothesis considers that causality goes both ways (refs xx west). Intelligence makes cooperation more efficient, which increases the range of situations in which cooperation can evolve, which in turn selects for more intelligence to better benefit from cooperation and so on. The present hypothesis is not in oposition to the latter. It constitutes an additional effect, which specifically concerns reciprocal cooperation --cooperation made to attract partners-- and not other forms of cooperation such as kin altruism or byproduct cooperation. According to our hypothesis, intelligence does not make cooperation more useful, it makes all actions in the world more efficient, and thus leads to greater competition between individuals to attract partners, thereby forcing them to cooperate more.
+
+On the other hand, partner choice cannot lead to the evolution of cooperation when individuals are not very effective in finding cooperation opportunities in their environment. This explains why, in many species, social interactions show no evidence of cooperation beyond immediate self-interest (refs xx). Even when individuals engage in collective actions, for example when they hunt collectively, others have so few outside options anyway that there is no need to seek to draw them into the collective actions. They will come anyway, for want of anything better to do. Even worse than that, as opportunities for cooperation are rare, not only are there always enough partners in each collective action without it being necessary to actively attract them, but in fact the opposite is true. There are always too \textit{many} indviduals participating in each cooperation endeavour. This has been documented for instance in pack hunting in Lions, where Packer showed that lionesses often hunt in groups that are too large compared to what would be optimal (refs xx). In such a case, the average gain per individual in a collective action is reduced and not increased by the participation of others, and there is therefore no selection to attract partners but rather a selection to push them away at the time of sharing.
+
+%The difficulty is that, even in these cases, during the collective action itself, individuals do behave in a coordinated manner for a common goal, as they all wish for the eventual success. Yet, this is no indication that they are cooperating for anything other than their immediate benefit. In many cases, individuals actually live in groups for an independent reason (e.g., to protect against  foreign males in the case of lionnesses) and the most prominent collective actions (e.g. group hunting) are merely unwanted by-products of  group life, occuring because individuals simply have nothing better to do.
diff --git a/chapters/04RoboCoop.tex b/chapters/04RoboCoop.tex
new file mode 100644
index 0000000..39b8dad
--- /dev/null
+++ b/chapters/04RoboCoop.tex
@@ -0,0 +1,270 @@
+
+
+\section{Introduction}
+
+- Cooperation avec clonal Floreano
+- Cooperation swarm partner choice (no evo) \cite{Aktipis2011}
+
+
+In collective robotics, the accomplishment of a task for a population is often not aligned with the individual objective of each robot. Thus, a population of robots maximizing their individual gains may interfere with the execution of the collective task. How can we then align the individual goals of the agents to this collective success? This problem has been extensively studied in game theory and evolutionary biology \citep{Axelrod1981}. Several mechanisms have been identified that allow this alignment to take place \citep{West2007a}. Among these mechanisms, partner choice is an efficient mechanism. Each individual seeks to maximize his own gain and must interact with another partner. If individuals have the ability to select their partner, then it is in their interest to find the best possible partner as quickly as possible. Thus, individuals, in order to be chosen as a partner, have an interest in being more cooperative than the individually optimal action is. There is pressure to be cooperative in order to be chosen as a partner. Theoretical results have shown that for partner selection to be effective, the time spent searching for a partner compared to the time spent interacting with partners should be as short as possible \citep{Debove2015b}. Let $\beta$ be the meeting probability per time step for an individual, and $\tau$ be the cessation probability per time step of two partners after an interaction, so $\beta / \tau$ must be large for partner selection to be effective. Our goal is to identify robotic environments where partner choice is effective. To do so, we have built a pseudo-realistic environment where robots meet on patches and can interact together. We modulate our environment according to different parameters and study the emergence of partner choice behavior and the appearance of cooperation behavior under these different conditions. We have shown that for partner choice to be possible, constraints are very strong. The robot population must be very dense in order to have a very high $\beta$ encounter probability. Moreover, the interactions between two robots must be very long (very low $\tau$) in order for the search time to be small enough compared to the interaction time.
+
+\section{Methods}
+
+\subsection{Environment}
+
+We define a collective forage task where $N$ robots move and consume resources in pairs in a circular arena. The resources are spread randomly throughout the arena (see Fig.~\ref{fig:env}). Resources can be seen by robots and are surrounded by patches. Robots must move on the patches to consume the resource and increase their scores. A robot alone cannot exploit a resource. When two robots are on the same patch, they can collaborate to exploit a resource. Each robot receives a payoff based on its own investment and that of its partner. Depending on its investment, a robot can act either by cheating (it invests to maximize its own gain) or by cooperating (it invests to maximize the gain of the pair). 
+
+\begin{figure}
+    \begin{center}
+        \includegraphics[width=2.5in]{robocoop/wander_env.png}
+        \vskip 0.25cm
+        \caption{The environment. Each blue dot is a robot. Each green dot is a resource and the light green circle around it is the patch. Robots can see the resources, and when two robots walk on a patch, they can interact together.
+        }
+    \label{fig:env}
+    \end{center}
+\end{figure}
+
+When two robots are on the same patch, they can choose to interact together and exploit the resource. First, each robot accesses the action that its partner intends to do, then it decides whether or not to accept the interaction. If one of the robots choose not to interact, then the resource disappears and the robots continue their course. If both robots accept, the resource disappears and they play the announced investments to get their payoffs. The robots switch then to a wandering behaviour for a certain period of time. It represents the amount of time the robots interact with each other, or a digestion period. Each robot has a probability $\tau$ of returning to the game at each iteration. The expected duration of an interaction for an agent is therefore $1/\tau$. Two robots that have interacted together may not come back to partner seeking behaviour at the same iteration. When a resource disappears, a new resource appears in the arena at the next iteration.
+
+\subsection{Cooperation Market} \label{sec:market}
+According to the theoretical results on partner choice \citep{Debove2015b}, the efficiency of this strategy depends on the meeting probability of an agent ($\beta$) and the split probability of an interaction ($\tau$). If the meeting probability is big compared to the split probability, that is $\beta/\tau$ is large, then partner choice is a viable strategy and can emerge. Indeed, for partner choice to be effective, when an agent refuses to interact with a partner, it must do so because its expectation of gain in finding a better partner outweighs the gain missed by rejecting the interaction with the wrong partner and the implied cost paid by looking for a new partner. Thus, if search time is short compared to interaction time, it is profitable to spend more time searching for a good partner than interacting with more uncooperative partners.
+
+The $\beta$ parameter is determined by the ability of the robots to meet on a patch and varies as the robots evolve, but also depending on the density of robots in the arena, and especially the robots that are also seeking for partner. In our model, the split probability $\tau$ parameter is chosen experimentally.
+
+\subsection{Objective function}
+
+When two robots interact with each other, they earn a gain determined by the investment of the two agents. The gain of an agent $a_i$ investing $x_i$ with its partner $a_j$ investing $x_j$ is determined by the function $P(x_i, x_j)$ described in the equation~\ref{eq:payoff}.
+
+\begin{align}
+PG(x_i, x_j) &= \frac{a}{2} (x_i + x_j) \\
+PD(x_j) &= \frac{b}{2} (x_j) \\
+C(x_i) &= \frac{1}{2} x_i^2 \\
+P(x_i, x_j)& = PG(x_i, x_j) + PD(x_j) - C(x_i) \label{eq:payoff}
+\end{align}
+
+This function is a mixture of a public good ($PG$, modulated by $a$) and a prisoner's dilemma ($PD$, modulated by $b$) and a quadratic cost $C$. For $a_i$ to maximize its individual gain ($P(x_i, x_j)$), the optimal investment is $x_d = \frac{a}{2}$, which correspond to the defective behaviour. For the group to maximize their total gain, both agents must invest $\hat{x} = a + \frac{b}{2}$, which correspond to the cooperative behaviour. The Figure~\ref{fig:payoff} is a plot of the payoff function with different partner's investment values.
+
+
+
+\begin{figure}[htpb]
+    \centering
+    \includegraphics[width=\columnwidth]{robocoop/payoff.pdf}
+    \caption{Payoff function with different partner's investment value. The individually optimal investment is $x_d = \frac{a}{2}$ whatever the constant value the partner invests, which correspond to a defective behaviour. If both robots invest the same value, then the socially optimal investment is $\hat{x} = a + \frac{b}{2}$, which correspond to a cooperative behaviour.}
+    \label{fig:payoff}
+\end{figure}
+
+\subsection{Controller}
+
+The robot control system is composed of the investment value ($x \in [0, 10]$) during interaction and two decision modules: The movement module and the partner choice module. The robot always invests the same value and the modules remain fixed throughout the task. 
+
+The movement module is an artificial neural network with 1 hidden layer of 10 neurons. All the nodes have a $\tanh$ activation function. The input of the network is the detailed information from the 8 sensors of the robot. The network gives as output the speed of translation and rotation between $]-1, 1[$. These values are then resized to match the maximum translation and rotation speeds of the robot.
+
+The partner choice module is also a artificial neural network. It is activated only when an agent is with another agent on the same patch. This network receives as inputs the investment level of the robot as well as the investment level of its partner. It is composed of 1 hidden layer of 3 neurons and has a $\tanh$ activation function. It gives as output a value ($a \in ]-1, 1[$), which correspond to the response to the partner. If the output is greater than 0, then the robot accepts the interaction, otherwise it refuses it and the interaction does not take place.  The details of the inputs of each network are given in the Table~\ref{tab:ann_params}.
+
+All neural network weights are bounded in the range $]-10, 10[$. In total, the two neural networks consist of 368 weights.
+
+\begin{table}
+    \centering
+    \begin{tabular}{cc}
+        \hline
+        \textbf{Input} & \textbf{Value}  \\
+        \hline
+        \textbf{Movement module} & \\
+        \textit{Per sensor ($\times 8$)}& \\
+        Distance to Robot &  $]0, 1[$ if in range else 1 \\
+        Distance to Wall & $]0, 1[$ if in range else 1  \\
+        Distance to Resource & $]0, 1[$ if in range else 1  \\
+        Robot on the patch & 0 or 1 \\
+        \hline
+        \textbf{Partner choice module} & \\
+        Partner's investment & $]0, 10[$ \\
+        Robot's own investment & $]0, 10[$ \\
+        \hline
+    \end{tabular}
+    \caption{Neural Networks inputs}
+    \label{tab:ann_params}
+\end{table}
+
+\subsection{Phenotypic variability} \label{sec:phenovar}
+
+\citet{McNamara2010c} reviews different works that have shown the importance of variability in the level of investment in the population to allow agents' selectivity and thus enable the appearance of partner choice. Indeed, for selectivity to be a useful skill, the variability of investments between agents must be big enough so that the payoff variation between two different partners is sufficiently beneficial. In this case, selective robots have the upper hand against undiscriminating robots.
+
+This variability can be present by itself or enforced either with a very high mutation strength for the gene encoding the investment level for each agent, or by adding a noise to the genetically encoded investment level for each agent that will remain the same throughout the task. 
+
+\subsection{Learning}
+
+The weights of the neural networks and the investment value of a robot constitute its genome. In total, a robot has $369$ genes, the $g_x$ gene to encode the investment level and the 368 $g_{w_i}\,\forall i \in 0..368$ genes to encode the weights of the two neural networks. The value of $g_x$ is in $]0, 1[$, the investment level $x$ of the robot is defined by $x = 10 \times g_x$. The values of $g_{w_i}$ are in $]-10, 10[$.
+
+At the beginning of learning, the $g_{w_i}$ genes are randomly initialized in the range $]-1, 1[$ and the $g_x$ gene is randomly initialized in $]0, 1[$.
+
+We use the fitness-proportionate evolutionary algorithm described below for the learning of our robots. After each generation, the total payoffs of the agents represent their fitnesses. Thus, the fitness $F_i$ of the robot $i$  which had accepted $n$ interactions is described by (Eq.~\ref{eq:totalpayoff})
+
+
+\begin{equation}
+    F_i = \frac{1}{\tau} \sum_{j=0}^{n} P(x_i, x_j) \label{eq:totalpayoff}
+\end{equation}  
+
+with $x_j$ being the investment value of the robot's partner at the $j^{th}$ interaction. Each payoff is weighted by $\tau$ to normalize the total payoff gains by robots between conditions where $\tau$ varies.
+
+A new generation of robots is generated by randomly drawing the agents' genomes in proportion to their fitnesses. Then a mutation operation is applied to each agent of the new generation. Each $g_i$ gene of a robot has a probability $\mu = 0.01$ to mutate. If the gene is selected, then it has a probability of $0.1$ to mutate according to a uniform distribution $\mathcal{U}(]-10, 10[)$ and a probability of $0.9$ to mutate according to a normal distribution $\mathcal{N}(g_i, \sigma)$ with $\sigma = \sigma_w = 0.1$ for the weight genes and $\sigma = \sigma_x = 0.1$ for the investment gene. The new generation then performs the task and the process is repeated for $G = 200$ generations (see Table~\ref{tab:env_params} for a list of all the parameters). 
+
+\begin{table}
+    \centering
+    \begin{tabular}{clc}
+        \hline
+        \textbf{Param} & \textbf{Description}  & \textbf{Value} \\
+        \hline
+        \multicolumn{3}{l}{\textbf{Payoff}} \\
+        $a$ & Public good weight & 5 \\
+        $b$ & Prisoner's dilemma weight & 3 \\
+        %\hline
+        \multicolumn{3}{l}{\textbf{Environment}} \\
+        $T$ & Number of iterations per generation & $100\,000$ \\
+        $G$ & Number of generations per run & $200$ \\
+        & Arena diameter & 400px \\
+        & Robot size & 4px \\
+        & Robot max speed & 2px/iteration \\
+        $\omega$ & Number of patches & 30 \\
+        $\tau$ & End of interaction probability & \\
+        %\hline
+        \multicolumn{3}{l}{\textbf{Evolution hyper-parameters}} \\
+        $\mu$ & mutation probability & 0.01 \\
+        $\sigma_w$ & mutation strength of weight genes & 0.1 \\
+        $\sigma_x$ & mutation strength of investment gene& 0.1 \\
+        \hline
+    \end{tabular}
+    \caption{Experiment parameters}
+    \label{tab:env_params}
+\end{table}
+
+\section{Results}
+
+\subsection{Experimental setup}
+
+The environment is a circular arena with a diameter of 400px. The robots are 4px diameter disks. The robots have 8 equally distributed sensors with a range of 96px giving them information about their surroundings, such as the presence of other robots, of a resource or of a wall. The robots move through the environment at a maximum translation speed of 2px/iteration and a rotational speed of $30^\circ$/iteration. $N$ robots are spread randomly in the environment and 30 resources are randomly scattered throughout the arena. Each generation lasts $T = 100\,000$ iterations. The environment is represented in Figure~\ref{fig:env}.
+
+The results presented below are obtained by the behavioral study of the $200^{th}$ generation.  We ran 24 simulations per condition in all experiments.
+
+We have studied the influence of several factors that may facilitate the emergence of partner choice and cooperation behaviours: (i) the effect of population size (ii) the effect of the duration of interactions by changing the split probability ($\tau$), and (iii) the strength of the investment gene mutation ($\sigma_x$). 
+
+\subsection{Effect of the population size}
+
+We first wanted to test the impact of the population size on the emergence of partner choice. Does a bigger population size positively impact the emergence of cooperative behavior? %question
+To test the emergence of the cooperation behavior by partner choice, we set the parameters to be the most favourable for its emergence. We set $\tau = 0$ and the evaluation duration $T = 100\,000$ in order to grant a long search time for the robots and a very engaging commitment if they accept the interaction. %expe
+At $N = 50$, robots plays the defective strategy. the average investment level is very close to the social optimum for $N = 1\,000$ (see Figure~\ref{fig:do_coop}). % results 
+The robots evolve a cooperative behavior for $N$ sufficiently large. The denser the population, the higher the probability of encounters $\beta$ is. Thus, with 50 robots in the arena, the robots are unable to meet and sample enough partners to be selective before the end of the generation. Moreover, the robots are racing to find a partner quickly. Indeed, with $\tau = 0$, the more the task advances in time, the fewer agents are available in the arena and thus the more $\beta$ decreases throughout the evaluation. % interpretation
+
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3in]{robocoop/wander_do_coop.pdf}
+        \vskip 0.25cm
+        \caption{The larger the population, the higher the agents' level of investment.
+        Mean investment of the population for 24 simulations per condition with a split probability $\tau = 0$ and a mutation strength for investment $\sigma_x = 0.1$. When the population is large, agents can easily find a partner and can be more selective. The pressure to invest a lot is then greater due to the effect of the partner choice.
+        }
+        \label{fig:do_coop}
+    \end{center}
+\end{figure}
+
+
+To show the importance of partner choice in the evolution of this cooperative behavior, %question
+we built a control condition where we deactivate the agents' ability to know their partner's investment in order to accept or not accept an interaction. % expe
+In this condition, whatever the number of agents in the environment, the average investment level is always $x_d$, that is a defective behaviour (see Fig.~\ref{fig:control}). % results
+In this situation, agents have no way to be selective and cannot choose a cooperative robot over a non-cooperative one. Thus, cooperative robots are not preferentially selected as partners and there is no incentive to invest more than the individual optimum. There is no selection pressure in favor of the most cooperative agents. % interpretation
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3in]{robocoop/wander_control.pdf}
+        \vskip 0.25cm
+        \caption{Robots never cooperate in control condition. Mean investment of the population for 24 simulations per condition with a split probability $\tau = 0$ and a mutation strength of investment $\sigma_x = 0.1$. Robots never cooperate whatever the number of robots $N$ in the environment. Without access to their potential partner's investment level, agents cannot be selective and partner selection is impossible. Agents are under no pressure to invest a lot to be chosen. Therefore, they all play at the individually optimal investment level.
+        }
+        \label{fig:control}
+    \end{center}
+\end{figure}
+
+
+\subsection{Effect of the interaction length}
+
+%question
+%expe
+%results
+%interpretation
+
+According to the theoretical results, the longer the interaction, the greater the influence of the choice of partner (see section~\nameref{sec:market}). We test the reality of this prediction in our experimental setup. % question
+To do this, we vary the split probability $\tau$.  % expe
+The larger $\tau$ is, the shorter the interaction. When the split probability $\tau$ is null or low and the population size $N$ is large, the robots invest in a collectively optimal way and have a cooperative behavior (see Fig.~\ref{fig:corr_tau_comp}. % results
+The larger the $\tau$ becomes, the less cooperative the robots are even for a high population size. The robots plays systematically a defective investment with $\tau \geq 1\times 10^{-3}$ Thus, increasing the duration of interactions has a positive effect on the appearance of cooperative behavior by partner choice. % interpretation
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{robocoop/wander_corr_tau_coop_pop1000.pdf}
+        \vskip 0.25cm
+        \caption{The smaller the split probability $\tau$ is, the more cooperative robots get. The robots invest cooperatively for $\tau \leq 2\times 10 ^{-5}$, and have a defective behaviour for $\tau > 2 \times 10^{-5}$. The higher $\tau$ is, the less long are the interaction and the more profitable it is to interact with a lot of bad partner compared to looking for a good partner and interact with it.
+        }
+        \label{fig:corr_tau_comp}
+    \end{center}
+\end{figure}
+
+
+\subsection{Effect of the mutation strength}
+
+As explained in the section \nameref{sec:phenovar}, different works have shown the importance of variability in the level of investment in the population to allow agents' selectivity and thus enable the appearance of partner choice \citep{McNamara2010c}.
+We test the influence of higher phenotypic variability in our task. % Question
+To do so, we (i) modified the strength $\sigma_x$ of the Gaussian mutation on the gene encoding the robot investment level and (ii) applied a constant noise on the robot investment level during a generation. % expe
+We observe very minor differences in the average investment level between the different simulations (see Fig~\ref{fig:varmut}). However, we note the presence of less variability between simulations when the mutation level is high. This can be explained by a more rapid convergence towards the optimal investment level. % resultats
+The fact that the variability of investment in the environment plays very little role in our task may be explained by the fact that all possible levels of investment are present in the first generation. The ability to be selective in the choice of partner may therefore emerge before the population is completely homogeneous and thus selectivity becomes an unnecessary skill. % interpretation
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=2.4in]{robocoop/wander_varmut.pdf}
+        \vskip 0.25cm
+        \caption{A higher mutation strength has no impact on average cooperation but reduces variance in investment between simulations.
+        Average investment in the population for 24 simulations per condition with $\tau = 0$. The addition of phenotypic variability facilitates the appearance of agent selectivity at low investment mutation strength \citep{McNamara2010c}. Here, variations in mutation strength for investment $\sigma_x$ %or the addition of phenotypic variability
+        have only a small impact on the final investment level of the agents. This may be due to the fact that all possible investment levels are represented at the Initialization of the simulation.
+        }
+    \label{fig:varmut}
+    \end{center}
+\end{figure}
+
+
+
+\subsection{Control: population size vs number of generations}
+
+The difference in population size between low (50 robots) and high (1000 robots) population conditions could be explained by the lower number of evaluations that the 50 robot conditions have to evolve cooperative behaviour. Indeed, with the number of generations being constant ($G = 200$), the number of evaluations for the condition with 50 robots is $50 \times 200 = $10,000 and for the conditions with 1000 robots is $1,000 \times 200 = 200,000$. This difference in the number of evaluations could explain why cooperative behaviour has evolved in the conditions where $N$ is large and not in those where $N$ is small. Has the evolution converged in the small $N$ conditions? % Question
+To test the impact of this number of evaluations, we run a new control condition of 24 simulations with $G = 4,000$ for a population of $N=50$ robots, offering $200\,000$ evaluations. % Method
+The difference between the condition $N=50, G=200$ and $N=50, G=4000$ is marginal, but the difference between these conditions at the condition $N=1\,000, G=200$ is very large (Fig.~\ref{fig:gencomp}). Adding more generations does not improve the level of cooperation achieved for conditions with a small population. % results
+It is therefore the too low encounter probability $\beta$ that blocks the emergence of cooperative behavior under these conditions, not the fewer evaluations. % interpretation
+
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{robocoop/wander_comp_genpop.pdf}
+        \vskip 0.25cm
+        \caption{More generations with small population does not lead to cooperative behaviour. The differences in robot investment between conditions $N=50$ and $N=1000$ cannot be attributed to fewer evaluations for small populations.
+        }
+        \label{fig:gencomp}
+    \end{center}
+\end{figure}
+
+
+\subsection{Control: Wandering vs Teleportation}
+
+
+Finally, we do a final control to test the influence of the wandering behaviour. Does this facilitate or not the emergence of cooperation by partner choice? % Question
+To test this, we compare the task with digestion time by wandering with a task with digestion time outside the arena. In this second condition, after one robot has interacted with another, it is placed outside the arena and has a $\tau$ probability of returning to the arena at each time step. When a robot is placed back into the arena, it is randomly placed back into the arena. This second condition is closer to the numerical simulations present in \citet{Debove2015c} than the wandering condition. We compare the results with the wandering condition and the condition outside the arena for several values of split probability $\tau$. % 2) Expe
+We find that in the wandering condition as well as in the off-arena condition, when the probability of split is low ($\tau < 1\times 10^{-5}$), the robots invest cooperatively. We also find that in the off-arena condition, the robots remain cooperative for higher values of split probability. For even higher split probabilities, the robots no longer have cooperative behaviors whatever the condition. % results
+The off-arena condition is more robust than the wander condition. Indeed, for higher split probability values, the agents still behave cooperatively. This can be explained by the fact that the arena is less crowded than in the wander condition. Indeed, a robot necessarily crosses potential partners in the off-arena condition, and is not blocked by agents in their digestion phase, as would be the case in the wander condition. Thus, the $\beta$ encounter probability is greater in the off-arena condition than in the wander condition. % 4) interpretation
+
+\begin{figure}[tbhp]
+    \begin{center}
+        \includegraphics[width=3.3in]{robocoop/wander_corr_tau_coop_pop1000_tp.pdf}
+        \vskip 0.25cm
+        \caption{Robots act cooperatively in both the wander and off-arena conditions for low split probability $\tau$. The off-arena condition is more robust to middle range values of $\tau$ than the wander condition.
+        }
+        \label{fig:comp_tau_wander_tp}
+    \end{center}
+\end{figure}
+
diff --git a/chapters/05Discussion.tex b/chapters/05Discussion.tex
new file mode 100644
index 0000000..e69de29
diff --git a/main.tex b/main.tex
new file mode 100644
index 0000000..573ecfd
--- /dev/null
+++ b/main.tex
@@ -0,0 +1,75 @@
+\documentclass[12pt,a4paper]{report}
+\usepackage[utf8]{inputenc}
+\usepackage[T1]{fontenc}
+\usepackage{lmodern}
+\usepackage{graphicx}
+\usepackage{amsmath}
+\usepackage{hyperref}
+\usepackage[french]{babel}
+\graphicspath{ {media/} }
+\usepackage{csquotes}
+
+%% Create the xx commande
+\usepackage{soulutf8}  % Add the \hl command from soulutf8
+\usepackage[dvipsnames]{xcolor}  % Add the color Apricot
+\newcommand{\xx}[1]{%
+  \begingroup
+  \sethlcolor{Apricot}%
+  \hl{xx #1 xx}%
+  \endgroup
+}
+
+% To remove avec todo
+\setlength {\marginparwidth }{2cm} % ajoute un peu de marges pour les todo
+\usepackage{todonotes}
+% To remove avec todo
+
+\usepackage[natbib=true,backend=biber,style=apa]{biblatex}
+\addbibresource{references.bib}
+
+
+\title{
+{Evolution of Cooperation in Collective Adaptative Systems}\\
+{\large Sorbonne Université}\\
+{\includegraphics[width=\textwidth]{university.png}}
+}
+\author{Paul Ecoffet}
+\date{\today}
+
+\begin{document}
+
+\maketitle
+
+\chapter*{Abstract}
+Abstract goes here
+
+\chapter*{Dedication}
+To mum and dad
+
+\chapter*{Declaration}
+I declare that..
+
+\chapter*{Acknowledgements}
+I want to thank...
+
+\tableofcontents
+
+\chapter{Introduction}
+\input{chapters/01introduction}
+
+\chapter{State of the Art}
+\input{chapters/02stateoftheart}
+
+\chapter{Nothing better to do? Environment quality and the evolution of cooperation by partner choice}
+\input{chapters/03Lions}
+
+\chapter{Cooperation in robotic swarms}
+\input{chapters/04RoboCoop}
+
+\chapter{Discussion}
+\input{chapters/05Discussion}
+
+\printbibliography
+
+
+\end{document}
\ No newline at end of file
diff --git a/media/lions/results/byprod/grid_1.pdf b/media/lions/results/byprod/grid_1.pdf
new file mode 100644
index 0000000..2078048
Binary files /dev/null and b/media/lions/results/byprod/grid_1.pdf differ
diff --git a/media/lions/results/byprod/grid_3.pdf b/media/lions/results/byprod/grid_3.pdf
new file mode 100644
index 0000000..7e9d547
Binary files /dev/null and b/media/lions/results/byprod/grid_3.pdf differ
diff --git a/media/lions/results/byprod/grid_inf.pdf b/media/lions/results/byprod/grid_inf.pdf
new file mode 100644
index 0000000..d6b01c7
Binary files /dev/null and b/media/lions/results/byprod/grid_inf.pdf differ
diff --git a/media/lions/results/byprod/varN_inf.pdf b/media/lions/results/byprod/varN_inf.pdf
new file mode 100644
index 0000000..623aa0e
Binary files /dev/null and b/media/lions/results/byprod/varN_inf.pdf differ
diff --git a/media/lions/results/byprod/varNrowOpp_1.pdf b/media/lions/results/byprod/varNrowOpp_1.pdf
new file mode 100644
index 0000000..282aa4b
Binary files /dev/null and b/media/lions/results/byprod/varNrowOpp_1.pdf differ
diff --git a/media/lions/results/byprod/varNrowOpp_3.pdf b/media/lions/results/byprod/varNrowOpp_3.pdf
new file mode 100644
index 0000000..3a25dd6
Binary files /dev/null and b/media/lions/results/byprod/varNrowOpp_3.pdf differ
diff --git a/media/lions/results/byprod/varNrowOpp_inf.pdf b/media/lions/results/byprod/varNrowOpp_inf.pdf
new file mode 100644
index 0000000..97f4f84
Binary files /dev/null and b/media/lions/results/byprod/varNrowOpp_inf.pdf differ
diff --git a/media/lions/results/byprod/varopp_1.pdf b/media/lions/results/byprod/varopp_1.pdf
new file mode 100644
index 0000000..cb3a448
Binary files /dev/null and b/media/lions/results/byprod/varopp_1.pdf differ
diff --git a/media/lions/results/byprod/varopp_3.pdf b/media/lions/results/byprod/varopp_3.pdf
new file mode 100644
index 0000000..ca1a93d
Binary files /dev/null and b/media/lions/results/byprod/varopp_3.pdf differ
diff --git a/media/lions/results/byprod/varopp_hatn_1.pdf b/media/lions/results/byprod/varopp_hatn_1.pdf
new file mode 100644
index 0000000..9fb2e58
Binary files /dev/null and b/media/lions/results/byprod/varopp_hatn_1.pdf differ
diff --git a/media/lions/results/byprod/varopp_inf.pdf b/media/lions/results/byprod/varopp_inf.pdf
new file mode 100644
index 0000000..206878a
Binary files /dev/null and b/media/lions/results/byprod/varopp_inf.pdf differ
diff --git a/media/robocoop/2fit.pdf b/media/robocoop/2fit.pdf
new file mode 100644
index 0000000..3d7d638
Binary files /dev/null and b/media/robocoop/2fit.pdf differ
diff --git a/media/robocoop/all_fit.pdf b/media/robocoop/all_fit.pdf
new file mode 100644
index 0000000..40365f2
Binary files /dev/null and b/media/robocoop/all_fit.pdf differ
diff --git a/media/robocoop/comp_genpop.pdf b/media/robocoop/comp_genpop.pdf
new file mode 100644
index 0000000..babf63c
Binary files /dev/null and b/media/robocoop/comp_genpop.pdf differ
diff --git a/media/robocoop/comp_wander_genpop.pdf b/media/robocoop/comp_wander_genpop.pdf
new file mode 100644
index 0000000..db5db13
Binary files /dev/null and b/media/robocoop/comp_wander_genpop.pdf differ
diff --git a/media/robocoop/control.pdf b/media/robocoop/control.pdf
new file mode 100644
index 0000000..00c2c84
Binary files /dev/null and b/media/robocoop/control.pdf differ
diff --git a/media/robocoop/control.png b/media/robocoop/control.png
new file mode 100644
index 0000000..52f51ac
Binary files /dev/null and b/media/robocoop/control.png differ
diff --git a/media/robocoop/corr_tau_coop_pop1000.pdf b/media/robocoop/corr_tau_coop_pop1000.pdf
new file mode 100644
index 0000000..e484635
Binary files /dev/null and b/media/robocoop/corr_tau_coop_pop1000.pdf differ
diff --git a/media/robocoop/do_coop.pdf b/media/robocoop/do_coop.pdf
new file mode 100644
index 0000000..a9b8f12
Binary files /dev/null and b/media/robocoop/do_coop.pdf differ
diff --git a/media/robocoop/env.png b/media/robocoop/env.png
new file mode 100644
index 0000000..57992cb
Binary files /dev/null and b/media/robocoop/env.png differ
diff --git a/media/robocoop/long.pdf b/media/robocoop/long.pdf
new file mode 100644
index 0000000..cf0b22a
Binary files /dev/null and b/media/robocoop/long.pdf differ
diff --git a/media/robocoop/payoff.pdf b/media/robocoop/payoff.pdf
new file mode 100644
index 0000000..bde8f56
Binary files /dev/null and b/media/robocoop/payoff.pdf differ
diff --git a/media/robocoop/varmut.pdf b/media/robocoop/varmut.pdf
new file mode 100644
index 0000000..2a59cfa
Binary files /dev/null and b/media/robocoop/varmut.pdf differ
diff --git a/media/robocoop/vartau.pdf b/media/robocoop/vartau.pdf
new file mode 100644
index 0000000..e89e0e0
Binary files /dev/null and b/media/robocoop/vartau.pdf differ
diff --git a/media/robocoop/wander.pdf b/media/robocoop/wander.pdf
new file mode 100644
index 0000000..2a05cec
Binary files /dev/null and b/media/robocoop/wander.pdf differ
diff --git a/media/robocoop/wander_all_fit.pdf b/media/robocoop/wander_all_fit.pdf
new file mode 100644
index 0000000..02ff6f3
Binary files /dev/null and b/media/robocoop/wander_all_fit.pdf differ
diff --git a/media/robocoop/wander_comp_genpop.pdf b/media/robocoop/wander_comp_genpop.pdf
new file mode 100644
index 0000000..2621b58
Binary files /dev/null and b/media/robocoop/wander_comp_genpop.pdf differ
diff --git a/media/robocoop/wander_control.pdf b/media/robocoop/wander_control.pdf
new file mode 100644
index 0000000..ac6ba2b
Binary files /dev/null and b/media/robocoop/wander_control.pdf differ
diff --git a/media/robocoop/wander_corr_tau_coop_pop1000.pdf b/media/robocoop/wander_corr_tau_coop_pop1000.pdf
new file mode 100644
index 0000000..a7a3f56
Binary files /dev/null and b/media/robocoop/wander_corr_tau_coop_pop1000.pdf differ
diff --git a/media/robocoop/wander_corr_tau_coop_pop1000_comp_tp.pdf b/media/robocoop/wander_corr_tau_coop_pop1000_comp_tp.pdf
new file mode 100644
index 0000000..96a1a34
Binary files /dev/null and b/media/robocoop/wander_corr_tau_coop_pop1000_comp_tp.pdf differ
diff --git a/media/robocoop/wander_corr_tau_coop_pop1000_tp.pdf b/media/robocoop/wander_corr_tau_coop_pop1000_tp.pdf
new file mode 100644
index 0000000..e7b9bce
Binary files /dev/null and b/media/robocoop/wander_corr_tau_coop_pop1000_tp.pdf differ
diff --git a/media/robocoop/wander_do_coop.pdf b/media/robocoop/wander_do_coop.pdf
new file mode 100644
index 0000000..8ada6e7
Binary files /dev/null and b/media/robocoop/wander_do_coop.pdf differ
diff --git a/media/robocoop/wander_env.png b/media/robocoop/wander_env.png
new file mode 100644
index 0000000..33c0f5a
Binary files /dev/null and b/media/robocoop/wander_env.png differ
diff --git a/media/robocoop/wander_varmut.pdf b/media/robocoop/wander_varmut.pdf
new file mode 100644
index 0000000..ea0d27f
Binary files /dev/null and b/media/robocoop/wander_varmut.pdf differ
diff --git a/media/robocoop/wander_vartau.pdf b/media/robocoop/wander_vartau.pdf
new file mode 100644
index 0000000..eee8b42
Binary files /dev/null and b/media/robocoop/wander_vartau.pdf differ
diff --git a/media/robocoop/wander_vartau_beta.pdf b/media/robocoop/wander_vartau_beta.pdf
new file mode 100644
index 0000000..b82148a
Binary files /dev/null and b/media/robocoop/wander_vartau_beta.pdf differ
diff --git a/media/robocoop/wander_vartau_betata.pdf b/media/robocoop/wander_vartau_betata.pdf
new file mode 100644
index 0000000..cea2964
Binary files /dev/null and b/media/robocoop/wander_vartau_betata.pdf differ
diff --git a/media/university.png b/media/university.png
new file mode 100644
index 0000000..d111821
Binary files /dev/null and b/media/university.png differ
diff --git a/references.bib b/references.bib
new file mode 100644
index 0000000..375c4cb
--- /dev/null
+++ b/references.bib
@@ -0,0 +1,4571 @@
+@article{Bergstrom2016,
+    title = {{(Se) correspondre en ligne}},
+    year = {2016},
+    journal = {Soci{\'{e}}t{\'{e}}s contemporaines},
+    author = {Bergstr{\"{o}}m, Marie},
+    number = {4},
+    pages = {13},
+    volume = {104},
+    url = {http://www.cairn.info/revue-societes-contemporaines-2016-4-page-13.htm},
+    isbn = {9782724634747},
+    doi = {10.3917/soco.104.0013},
+    issn = {1150-1944}
+}
+
+@incollection{Packer1986,
+    title = {{19. The Ecology of Sociality in Felids}},
+    year = {1986},
+    booktitle = {Ecological Aspects of Social Evolution},
+    author = {Packer, Craig},
+    editor = {Rubenstein, Daniel I. and Wrangham, Richard W.},
+    month = {12},
+    pages = {429--451},
+    publisher = {Princeton University Press},
+    url = {http://www.degruyter.com/view/books/9781400858149/9781400858149.429/9781400858149.429.xml},
+    address = {Princeton},
+    doi = {10.1515/9781400858149.429}
+}
+
+@article{Wyatt2014,
+    title = {{A biological market analysis of the plant-mycorrhizal symbiosis}},
+    year = {2014},
+    journal = {Evolution},
+    author = {Wyatt, Gregory A.K. and Toby Kiers, E. and Gardner, Andy and West, Stuart A.},
+    number = {9},
+    pages = {2603--2618},
+    volume = {68},
+    doi = {10.1111/evo.12466},
+    issn = {15585646},
+    keywords = {Bargaining power, Cournot competition, Darwinian agriculture, Mutualism, Partner choice, Ricardian economics}
+}
+
+@misc{Packer,
+    title = {{A comparative analysis of non-offspring nursing}},
+    author = {Packer, Craig and Lewis, Susan and Pusey, Anne}
+}
+
+@article{Scharff1991,
+    title = {{A comparative study of the behavioral deficits following lesions}},
+    year = {1991},
+    author = {Scharff, C and Nottebohm, F},
+    number = {September},
+    volume = {7}
+}
+
+@article{Pepper2002,
+    title = {{A Mechanism for the Evolution of Altruism among Nonkin: Positive Assortment through Environmental Feedback}},
+    year = {2002},
+    journal = {The American Naturalist},
+    author = {{Pepper} and {Smuts}},
+    number = {2},
+    pages = {205},
+    volume = {160},
+    doi = {10.2307/3079138},
+    issn = {00030147}
+}
+
+@article{Baumard2013a,
+    title = {{A mutualistic approach to morality: The evolution of fairness by partner choice}},
+    year = {2013},
+    journal = {Behavioral and Brain Sciences},
+    author = {Baumard, Nicolas and Andr{\'{e}}, Jean Baptiste and Sperber, Dan},
+    number = {1},
+    pages = {59--78},
+    volume = {36},
+    doi = {10.1017/S0140525X11002202},
+    issn = {14691825},
+    pmid = {23445574},
+    keywords = {cooperation, economic games, evolutionary psychology, fairness, morality, partner choice}
+}
+
+@article{Tchernichovski2000,
+    title = {{A procedure for an automated measurement of song similarity}},
+    year = {2000},
+    journal = {Animal Behaviour},
+    author = {Tchernichovski, Ofer and Nottebohm, Fernando and Ho, Ching Elizabeth and Pesaran, Bijan and Mitra, Partha Pratim},
+    number = {6},
+    pages = {1167--1176},
+    volume = {59},
+    isbn = {0003-3472},
+    doi = {10.1006/anbe.1999.1416},
+    issn = {00033472},
+    pmid = {10877896}
+}
+
+@article{Becker2016,
+    title = {{A Theory of Marriage : Part I Stable URL : http://www.jstor.org/stable/1831130 Linked references are available on JSTOR for this article :}},
+    year = {2016},
+    author = {Becker, Gary S},
+    number = {4},
+    pages = {813--846},
+    volume = {81}
+}
+
+@article{Baranes2013,
+    title = {{Active learning of inverse models with intrinsically motivated goal exploration in robots}},
+    year = {2013},
+    journal = {Robotics and Autonomous Systems},
+    author = {Baranes, Adrien and Oudeyer, Pierre Yves},
+    number = {1},
+    pages = {49--73},
+    volume = {61},
+    isbn = {0921889012000},
+    doi = {10.1016/j.robot.2012.05.008},
+    issn = {09218890},
+    pmid = {23416936},
+    arxivId = {1301.4862},
+    keywords = {Active learning, Autonomous motor learning, Competence based intrinsic motivation, Curiosity-driven task space exploration, Developmental robotics, Goal babbling, Inverse models, Motor development}
+}
+
+@article{Packer1983,
+    title = {{Adaptations of female lions to infanticide by incoming males ( Panthera leo).}},
+    year = {1983},
+    journal = {American Naturalist},
+    author = {Packer, C. and Pusey, A. E.},
+    number = {5},
+    pages = {716--728},
+    volume = {121},
+    doi = {10.1086/284097},
+    issn = {00030147}
+}
+
+@article{Barto1995,
+    title = {{Adaptive Critics and the Basal Ganglia}},
+    year = {1995},
+    author = {Barto, Andrew},
+    keywords = {earth simulator, esc, high performance computing, high resolution simulations and, hydrostatic atmospheric model of, non-hydrostatic agcm, preliminary, regional forecasting, results of weather forecasting, theme i, with the regional non-}
+}
+
+@article{Redish2004,
+    title = {{Addiction as a computational process gone awry (Supplementary Materials)}},
+    year = {2004},
+    journal = {Science},
+    author = {Redish, A David},
+    number = {5703},
+    pages = {1944--1947},
+    volume = {306},
+    isbn = {1095-9203 (Electronic) 1095-9203 (Linking)},
+    doi = {10.1126/science.1102384},
+    issn = {0036-8075},
+    pmid = {15591205}
+}
+
+@article{Kennedy2018,
+    title = {{Altruism in a volatile world}},
+    year = {2018},
+    journal = {Nature},
+    author = {Kennedy, Patrick and Higginson, Andrew D. and Radford, Andrew N. and Sumner, Seirian},
+    number = {7696},
+    pages = {359--362},
+    volume = {555},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nature25965},
+    isbn = {2017070955},
+    doi = {10.1038/nature25965},
+    issn = {14764687},
+    pmid = {11507039},
+    arxivId = {NIHMS150003}
+}
+
+@article{Fehr2002,
+    title = {{Altruistic punishment in humans}},
+    year = {2002},
+    journal = {Nature},
+    author = {Fehr, Ernst and G{\"{a}}chter, Simon},
+    number = {6868},
+    pages = {137--140},
+    volume = {415},
+    isbn = {0028-0836},
+    doi = {10.1038/415137a},
+    issn = {00280836},
+    pmid = {11805825}
+}
+
+@article{Troyer2000,
+    title = {{An Associational Model of Birdsong Sensorimotor Learning I. Efference Copy and the Learning of Song Syllables}},
+    year = {2000},
+    journal = {Journal of Neurophysiology},
+    author = {Troyer, Todd W. and Doupe, Allison J.},
+    number = {3},
+    pages = {1204--1223},
+    volume = {84},
+    url = {http://www.physiology.org/doi/10.1152/jn.2000.84.3.1204},
+    isbn = {1137811390},
+    doi = {10.1152/jn.2000.84.3.1204},
+    issn = {0022-3077},
+    pmid = {10979996}
+}
+
+@article{Bell2007,
+    title = {{An Efference Copy Which is Modified by Reafferent Input Science is currently published by American Association for the Advancement of Science . http://www.jstor.org/about/terms.html . JSTOR ' s Terms and Conditions of Use provides , in part , that unless }},
+    year = {2007},
+    author = {Bell, Curtis C and Aaronson, A},
+    number = {4519},
+    pages = {450--453},
+    volume = {214}
+}
+
+@article{Koechlin2014,
+    title = {{An evolutionary computational theory of prefrontal executive function in decision-making}},
+    year = {2014},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Koechlin, Etienne},
+    number = {1655},
+    volume = {369},
+    isbn = {1471-2970 (Electronic){\textbackslash}r0962-8436 (Linking)},
+    doi = {10.1098/rstb.2013.0474},
+    issn = {14712970},
+    pmid = {25267817},
+    keywords = {Bayesian inference, Decision-making, Executive control, Prefrontal cortex, Reasoning, Reinforcement learning}
+}
+
+@book{Davies2012,
+    title = {{An introduction to behavioural ecology, 4th Edition}},
+    year = {2012},
+    booktitle = {Wiley-Blackwell},
+    author = {Davies, Nicholas B. and Krebs, John R. and West, Stuart A.},
+    pages = {337},
+    isbn = {9788578110796},
+    doi = {10.1017/CBO9781107415324.004},
+    issn = {1098-6596},
+    pmid = {25246403},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@book{Helekar2013,
+    title = {{Animal models of speech and language disorders}},
+    year = {2013},
+    booktitle = {Animal Models of Speech and Language Disorders},
+    author = {Helekar, Santosh A.},
+    pages = {1--295},
+    isbn = {9781461484004},
+    doi = {10.1007/978-1-4614-8400-4}
+}
+
+@article{Connor,
+    title = {{Are Dolphins Reciprocal Altruists}},
+    author = {Connor, Richard and Norris, Kenneth},
+    pages = {3--8}
+}
+
+@article{Marck2017,
+    title = {{Are we reaching the limits of Homo sapiens?}},
+    year = {2017},
+    journal = {Frontiers in Physiology},
+    author = {Marck, Adrien and Antero, Juliana and Berthelot, Geoffroy and Sauli{\`{e}}re, Guillaume and Jancovici, Jean Marc and Masson-Delmotte, Valérie and Boeuf, Gilles and Spedding, Michael and Le Bourg, Éric and Toussaint, Jean François},
+    number = {OCT},
+    volume = {8},
+    isbn = {1664-042X (Print)1664-042X (Linking)},
+    doi = {10.3389/fphys.2017.00812},
+    issn = {1664042X},
+    pmid = {29123486},
+    keywords = {Anthropocene, Biometry, Environment, Human upper limits, Life span, Longevity, Performance, Public health}
+}
+
+@article{Eshel1982,
+    title = {{Assortment of encounters and evolution of cooperativeness}},
+    year = {1982},
+    journal = {Proceedings of the National Academy of Sciences of the United States of America},
+    author = {Eshel, I. and Cavalli-Sforza, L. L.},
+    number = {4 I},
+    pages = {1331--1335},
+    volume = {79},
+    doi = {10.1073/pnas.79.4.1331},
+    issn = {00278424}
+}
+
+@article{Bshary2002a,
+    title = {{Asymmetric cheating opportunities and partner control in a cleaner fish mutualism}},
+    year = {2002},
+    journal = {Animal Behaviour},
+    author = {Bshary, Redouan and Grutter, Alexandra S.},
+    number = {3},
+    pages = {547--555},
+    volume = {63},
+    isbn = {0003-3472},
+    doi = {10.1006/anbe.2001.1937},
+    issn = {00033472},
+    pmid = {16791194}
+}
+
+@article{Chou2011,
+    title = {{Automatic birdsong recognition with MFCC based syllable feature extraction}},
+    year = {2011},
+    journal = {Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)},
+    author = {Chou, Chih Hsun and Ko, Hui Yu},
+    number = {707},
+    pages = {185--196},
+    volume = {6905 LNCS},
+    isbn = {9783642236402},
+    doi = {10.1007/978-3-642-23641-9{\_}17},
+    issn = {03029743},
+    keywords = {MFCC, linear discriminant analysis, syllable, transition matrix}
+}
+
+@article{Lebreton2015,
+    title = {{Automatic integration of confidence in the brain valuation signal}},
+    year = {2015},
+    journal = {Nature Neuroscience},
+    author = {Lebreton, Maël and Abitbol, Raphaëlle and Daunizeau, Jean and Pessiglione, Mathias},
+    number = {8},
+    pages = {1159--1167},
+    volume = {18},
+    isbn = {1546-1726 (Electronic){\textbackslash}r1097-6256 (Linking)},
+    doi = {10.1038/nn.4064},
+    issn = {15461726},
+    pmid = {26192748}
+}
+
+@article{Boari2015,
+    title = {{Automatic reconstruction of physiological gestures used in a model of birdsong production}},
+    year = {2015},
+    journal = {Journal of Neurophysiology},
+    author = {Boari, Santiago and Sanz Perl, Yonatan and Amador, Ana and Margoliash, Daniel and Mindlin, Gabriel B.},
+    pages = {jn.00385.2015},
+    url = {http://jn.physiology.org/lookup/doi/10.1152/jn.00385.2015},
+    isbn = {9788578110796},
+    doi = {10.1152/jn.00385.2015},
+    issn = {0022-3077},
+    pmid = {25246403},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@article{Wolpert2004,
+    title = {{Bayesian integration in sensorimotor learning}},
+    year = {2004},
+    journal = {Nature},
+    author = {Wolpert, Daniel M and K{\"{o}}rding, Konrad P},
+    number = {January},
+    pages = {244--247},
+    volume = {427},
+    isbn = {9781509017904},
+    doi = {10.1002/mrm.20321},
+    issn = {15394565},
+    pmid = {14724638}
+}
+
+@article{Jaynes2014,
+    title = {{Bayesian Programming Principles}},
+    year = {2014},
+    author = {Jaynes, Edwin T},
+    pages = {2014}
+}
+
+@article{Tervo2014,
+    title = {{Behavioral variability through stochastic choice and its gating by anterior cingulate cortex}},
+    year = {2014},
+    journal = {Cell},
+    author = {Tervo, Dougal G.R. and Proskurin, Mikhail and Manakov, Maxim and Kabra, Mayank and Vollmer, Alison and Branson, Kristin and Karpova, Alla Y.},
+    number = {1},
+    pages = {21--32},
+    volume = {159},
+    publisher = {Elsevier Inc.},
+    url = {http://dx.doi.org/10.1016/j.cell.2014.08.037},
+    isbn = {1097-4172 (Electronic){\textbackslash}r0092-8674 (Linking)},
+    doi = {10.1016/j.cell.2014.08.037},
+    issn = {10974172},
+    pmid = {25259917}
+}
+
+@article{Bredeche2017,
+    title = {{Benefits of proportionate selection in embodied evolution: A case study with behavioural specialization}},
+    year = {2017},
+    journal = {GECCO 2017 - Proceedings of the Genetic and Evolutionary Computation Conference Companion},
+    author = {Bredeche, N. and Montanier, J.-M. and Carrignon, S.},
+    pages = {2016--2017},
+    isbn = {9781450349390},
+    doi = {10.1145/3067695.3082551},
+    keywords = {Embodied Evolution, Evolutionary Robotics, Swarm Robotics}
+}
+
+@article{Amador2008,
+    title = {{Beyond harmonic sounds in a simple model for birdsong production}},
+    year = {2008},
+    journal = {Chaos},
+    author = {Amador, Ana and Mindlin, Gabriel B.},
+    number = {4},
+    pages = {1--6},
+    volume = {18},
+    isbn = {1054-1500},
+    doi = {10.1063/1.3041023},
+    issn = {10541500},
+    pmid = {19123633}
+}
+
+@article{Barclay2016,
+    title = {{Biological markets and the effects of partner choice on cooperation and friendship}},
+    year = {2016},
+    journal = {Current Opinion in Psychology},
+    author = {Barclay, Pat},
+    month = {2},
+    pages = {33--38},
+    volume = {7},
+    doi = {10.1016/j.copsyc.2015.07.012},
+    issn = {2352250X}
+}
+
+@article{Bshary2003,
+    title = {{Biological Markets The Ubiquitous Influence of Partner Choice on the Dynamics of Cleaner}},
+    year = {2003},
+    journal = {Genetic and Cultural Evolution of Cooperation},
+    author = {Bshary, Redouan and Noe, Roland},
+    pages = {167--184},
+    isbn = {0262083264}
+}
+
+@article{Noe1994,
+    title = {{Biological markets: supply and demand determine the effect of partner choice in cooperation, mutualism and mating}},
+    year = {1994},
+    journal = {Behavioral Ecology and Sociobiology},
+    author = {No{\"{e}}, Ronald and Hammerstein, Peter},
+    number = {1},
+    month = {7},
+    pages = {1--11},
+    volume = {35},
+    publisher = {Springer-Verlag},
+    url = {https://doi.org/10.1007/BF00167053 http://link.springer.com/10.1007/BF00167053},
+    isbn = {0340-5443},
+    doi = {10.1007/BF00167053},
+    issn = {1432-0762},
+    pmid = {242},
+    arxivId = {arXiv:1011.1669v3},
+    keywords = {Cooperation Mutualism, ESS, Market genes, Sexual selection, ess - cooperation, market games, mutualism - sexual selection}
+}
+
+@article{Hammerstein2016,
+    title = {{Biological trade and markets}},
+    year = {2016},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Hammerstein, Peter and No{\"{e}}, Ronald},
+    number = {1687},
+    volume = {371},
+    isbn = {9780935015997},
+    doi = {10.1098/rstb.2015.0101},
+    issn = {14712970},
+    pmid = {26729940},
+    keywords = {Biological markets, Comparative advantage, Cooperation, Mutualism, Partner choice, Principal-agent problem}
+}
+
+@article{Hammerstein2016a,
+    title = {{Biological trade and markets}},
+    year = {2016},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Hammerstein, Peter and No{\"{e}}, Ronald},
+    number = {1687},
+    month = {2},
+    pages = {20150101},
+    volume = {371},
+    url = {http://rstb.royalsocietypublishing.org/lookup/doi/10.1098/rstb.2015.0101},
+    isbn = {9780935015997},
+    doi = {10.1098/rstb.2015.0101},
+    issn = {14712970},
+    pmid = {26729940},
+    keywords = {Biological markets, Comparative advantage, Cooperation, Mutualism, Partner choice, Principal-agent problem}
+}
+
+@article{Doupe1999,
+    title = {{BIRDSONG AND HUMAN SPEECH: Common Themes and Mechanisms}},
+    year = {1999},
+    journal = {Annual Review of Neuroscience},
+    author = {Doupe, Allison J. and Kuhl, Patricia K.},
+    number = {1},
+    pages = {567--631},
+    volume = {22},
+    url = {http://www.annualreviews.org/doi/10.1146/annurev.neuro.22.1.567},
+    isbn = {0147-006X},
+    doi = {10.1146/annurev.neuro.22.1.567},
+    issn = {0147-006X},
+    pmid = {10202549},
+    keywords = {auditory, critical period, innate, learning, perception, vocalization}
+}
+
+@article{Bshary2002,
+    title = {{Biting cleaner fish use altruism to deceive image-scoring client reef fish}},
+    year = {2002},
+    journal = {Proceedings of the Royal Society B: Biological Sciences},
+    author = {Bshary, Redouan},
+    number = {1505},
+    pages = {2087--2093},
+    volume = {269},
+    isbn = {0962-8452},
+    doi = {10.1098/rspb.2002.2084},
+    issn = {14712970},
+    pmid = {12396482},
+    keywords = {Communication network, Indirect reciprocity, Labroides dimidiatus, Mutualism, Tactical deception}
+}
+
+@article{Efron2014,
+    title = {{Bootstrap confidence intervals Jeremy Orlo ff and Jonathan Bloom}},
+    year = {2014},
+    author = {Efron, Bradley}
+}
+
+@article{Maestre2015,
+    title = {{Bootstrapping interactions with objects from raw sensorimotor data: A novelty search based approach}},
+    year = {2015},
+    journal = {5th Joint International Conference on Development and Learning and Epigenetic Robotics, ICDL-EpiRob 2015},
+    author = {Maestre, Carlos and Cully, Antoine and Gonzales, Christophe and Doncieux, Stephane},
+    number = {2},
+    pages = {7--12},
+    isbn = {9781467393201},
+    doi = {10.1109/DEVLRN.2015.7346098},
+    issn = {1051-4651},
+    pmid = {22255825},
+    arxivId = {1204.3968}
+}
+
+@article{Zimmer2018,
+    title = {{Bootstrapping Q-Learning for Robotics from Neuro-Evolution Results}},
+    year = {2018},
+    journal = {IEEE Transactions on Cognitive and Developmental Systems},
+    author = {Zimmer, Matthieu and Doncieux, Stephane},
+    number = {1},
+    pages = {102--119},
+    volume = {10},
+    isbn = {0890-9369 (Print){\textbackslash}r0890-9369 (Linking)},
+    doi = {10.1109/TCDS.2016.2628817},
+    issn = {23798939},
+    pmid = {11390359},
+    keywords = {Generation of representation during development, robots with development and learning skills, transfer learning}
+}
+
+@article{Engelhardt2020,
+    title = {{Broad definitions of enforcement are unhelpful for understanding evolutionary mechanisms of cooperation}},
+    year = {2020},
+    journal = {Nature Ecology {\&} Evolution},
+    author = {Engelhardt, Sacha C and Taborsky, Michael},
+    pages = {41559},
+    publisher = {Springer US},
+    url = {http://dx.doi.org/10.1038/s41559-019-1088-7},
+    doi = {10.1038/s41559-019-1088-7},
+    issn = {2397-334X}
+}
+
+@article{LEIMAR2003,
+    title = {{By-product benefits, reciprocity, and pseudoreciprocity in mutualism.}},
+    year = {2003},
+    journal = {In: Genetic and Cultural Evolution of Cooperation. P.Hammerstein (ed.). p.203-222. The MIT Press. ISBN 0-262-08326-4. 2003.},
+    author = {LEIMAR, OLOF and RICHARD C CONNOR.},
+    pages = {20},
+    isbn = {0262083264}
+}
+
+@article{Gianetto2015,
+    title = {{Catalysts of cooperation in system of systems: The role of diversity and network structure}},
+    year = {2015},
+    journal = {IEEE Systems Journal},
+    author = {Gianetto, David A. and Heydari, Babak},
+    number = {1},
+    pages = {303--311},
+    volume = {9},
+    publisher = {IEEE},
+    isbn = {1932-8184},
+    doi = {10.1109/JSYST.2013.2284959},
+    issn = {19379234},
+    keywords = {Agent-based simulation, competition, complex networks, cooperation, diversity, prisoner's dilemma, system of systems (SoS)}
+}
+
+@book{Schino2009a,
+    title = {{Chapter 2 Reciprocal Altruism in Primates. Partner Choice, Cognition, and Emotions}},
+    year = {2009},
+    booktitle = {Advances in the Study of Behavior},
+    author = {Schino, Gabriele and Aureli, Filippo},
+    edition = {1},
+    number = {09},
+    pages = {45--69},
+    volume = {39},
+    publisher = {Elsevier Inc.},
+    url = {http://dx.doi.org/10.1016/S0065-3454(09)39002-6},
+    isbn = {9780123744746},
+    doi = {10.1016/S0065-3454(09)39002-6},
+    issn = {00653454},
+    keywords = {Altruism, Emotions, Partner choice, Primates, Reciprocation}
+}
+
+@article{Jensen2007,
+    title = {{Chimpanzees are rational maximizers in an ultimatum game}},
+    year = {2007},
+    journal = {Science},
+    author = {Jensen, Keith and Call, Josep and Tomasello, Michael},
+    number = {5847},
+    pages = {107--109},
+    volume = {318},
+    doi = {10.1126/science.1145850},
+    issn = {00368075},
+    pmid = {17916736}
+}
+
+@article{Melis2006a,
+    title = {{Chimpanzees recruit the best collaborators}},
+    year = {2006},
+    journal = {Science},
+    author = {Melis, Alicia P. and Hare, Brian and Tomasello, Michael},
+    number = {5765},
+    pages = {1297--1300},
+    volume = {311},
+    doi = {10.1126/science.1123007},
+    issn = {00368075}
+}
+
+@article{Bullinger2011a,
+    title = {{Chimpanzees, Pan troglodytes, prefer individual over collaborative strategies towards goals}},
+    year = {2011},
+    journal = {Animal Behaviour},
+    author = {Bullinger, Anke F. and Melis, Alicia P. and Tomasello, Michael},
+    number = {5},
+    pages = {1135--1141},
+    volume = {82},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.anbehav.2011.08.008},
+    doi = {10.1016/j.anbehav.2011.08.008},
+    issn = {00033472},
+    keywords = {Chimpanzee, Collaboration, Competition, Motivation, Pan troglodytes}
+}
+
+@article{Melis2011,
+    title = {{Chimpanzees, Pan troglodytes, share food in the same way after collaborative and individual food acquisition}},
+    year = {2011},
+    journal = {Animal Behaviour},
+    author = {Melis, Alicia P. and Schneider, Anna Claire and Tomasello, Michael},
+    number = {3},
+    pages = {485--493},
+    volume = {82},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.anbehav.2011.05.024},
+    doi = {10.1016/j.anbehav.2011.05.024},
+    issn = {00033472},
+    keywords = {Chimpanzee, Collaboration, Competition, Dominance, Fairness, Food sharing, Pan troglodytes}
+}
+
+@article{Wubs2016a,
+    title = {{Coevolution between positive reciprocity, punishment, and partner switching in repeated interactions}},
+    year = {2016},
+    journal = {Proceedings of the Royal Society B: Biological Sciences},
+    author = {Wubs, Matthias and Bshary, Redouan and Lehmann, Laurent},
+    number = {1832},
+    month = {6},
+    pages = {20160488},
+    volume = {283},
+    publisher = {The Royal Society},
+    url = {https://royalsocietypublishing.org/doi/10.1098/rspb.2016.0488},
+    isbn = {0891-2432},
+    doi = {10.1098/rspb.2016.0488},
+    issn = {14712954},
+    pmid = {27306050},
+    keywords = {Partner control mechanism, Partner switching, Positive reciprocity, Punishment, partner control mechanism, partner switching, positive reciprocity, punishment}
+}
+
+@article{Barrett2010,
+    title = {{Coevolution of cooperation, causal cognition and mindreading}},
+    year = {2010},
+    journal = {Communicative {\&} Integrative Biology},
+    author = {Barrett, H. Clark and Cosmides, Leda and Tooby, John},
+    number = {6},
+    pages = {522--524},
+    volume = {3},
+    doi = {10.4161/cib.3.6.12604},
+    issn = {1942-0889},
+    keywords = {cheater, cooperation, detection, evolution, free-riding, mindreading}
+}
+
+@article{Barrett2010a,
+    title = {{Coevolution of cooperation, causal cognition and mindreading}},
+    year = {2010},
+    journal = {Communicative {\&} Integrative Biology},
+    author = {Barrett, H. Clark and Cosmides, Leda and Tooby, John},
+    number = {6},
+    pages = {522--524},
+    volume = {3},
+    doi = {10.4161/cib.3.6.12604},
+    issn = {1942-0889},
+    keywords = {cheater, cooperation, detection, evolution, free-riding, mindreading}
+}
+
+@article{MacPherson2012,
+    title = {{Cognitive penetration of colour experience: Rethinking the issue in light of an indirect mechanism}},
+    year = {2012},
+    journal = {Philosophy and Phenomenological Research},
+    author = {MacPherson, Fiona},
+    number = {1},
+    pages = {24--62},
+    volume = {84},
+    isbn = {1933-1592},
+    doi = {10.1111/j.1933-1592.2010.00481.x},
+    issn = {00318205}
+}
+
+@article{Millikan1990,
+    title = {{Compare and contrast Dretske, Fodor, and Millikan on teleosemantics}},
+    year = {1990},
+    journal = {Philosophical Topics},
+    author = {Millikan, Ruth Garrett},
+    number = {2},
+    pages = {151--61},
+    volume = {18},
+    isbn = {7134611493},
+    doi = {10.1177/109442810034002},
+    issn = {02762080, 2154154X},
+    keywords = {mental content/causal theories/teleological approa}
+}
+
+@article{Raihani2015a,
+    title = {{Competitive helping in online giving}},
+    year = {2015},
+    journal = {Current Biology},
+    author = {Raihani, Nichola J. and Smith, Sarah},
+    number = {9},
+    pages = {1183--1186},
+    volume = {25},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.cub.2015.02.042},
+    isbn = {0960-9822},
+    doi = {10.1016/j.cub.2015.02.042},
+    issn = {09609822},
+    pmid = {25891407}
+}
+
+@article{Barclay2011,
+    title = {{Competitive helping increases with the size of biological markets and invades defection}},
+    year = {2011},
+    journal = {Journal of Theoretical Biology},
+    author = {Barclay, Pat},
+    number = {1},
+    pages = {47--55},
+    volume = {281},
+    publisher = {Elsevier},
+    url = {http://dx.doi.org/10.1016/j.jtbi.2011.04.023},
+    doi = {10.1016/j.jtbi.2011.04.023},
+    issn = {00225193},
+    keywords = {Biological markets, Competitive altruism, Cooperation, Helping, Partner choice}
+}
+
+@article{Packer1988,
+    title = {{Constraints on the evolution of reciprocity: Lessons from cooperative hunting}},
+    year = {1988},
+    journal = {Ethology and Sociobiology},
+    author = {Packer, Craig},
+    number = {2-4},
+    pages = {137--147},
+    volume = {9},
+    doi = {10.1016/0162-3095(88)90018-0},
+    issn = {01623095}
+}
+
+@article{Andre2015,
+    title = {{Contingency in the Evolutionary Emergence of Reciprocal Cooperation}},
+    year = {2015},
+    journal = {The American Naturalist},
+    author = {Andr{\'{e}}, Jean-Baptiste},
+    number = {3},
+    pages = {303--316},
+    volume = {185},
+    url = {http://www.journals.uchicago.edu/doi/10.1086/679625},
+    doi = {10.1086/679625},
+    issn = {0003-0147},
+    pmid = {25674686},
+    keywords = {bootstrapping, evolution of cooperation, mechanistic con-, reciprocity, straints}
+}
+
+@article{Crandall2018,
+    title = {{Cooperating with machines}},
+    year = {2018},
+    journal = {Nature Communications},
+    author = {Crandall, Jacob W. and Oudah, Mayada and {Tennom} and Ishowo-Oloko, Fatimah and Abdallah, Sherief and Bonnefon, Jean François and Cebrian, Manuel and Shariff, Azim and Goodrich, Michael A. and Rahwan, Iyad},
+    number = {1},
+    volume = {9},
+    doi = {10.1038/s41467-017-02597-8},
+    issn = {20411723},
+    pmid = {29339817},
+    arxivId = {1703.06207}
+}
+
+@article{Mitani2009,
+    title = {{Cooperation and competition in chimpanzees: Current understanding and future challenges}},
+    year = {2009},
+    journal = {Evolutionary Anthropology},
+    author = {Mitani, John C.},
+    number = {5},
+    pages = {215--227},
+    volume = {18},
+    doi = {10.1002/evan.20229},
+    issn = {10601538},
+    keywords = {Behavior, Chimpanzee, Pan troglodytes}
+}
+
+@article{Moll2007,
+    title = {{Cooperation and human cognition: The Vygotskian intelligence hypothesis}},
+    year = {2007},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Moll, Henrike and Tomasello, Michael},
+    number = {1480},
+    pages = {639--648},
+    volume = {362},
+    doi = {10.1098/rstb.2006.2000},
+    issn = {09628436},
+    keywords = {Communication, Cooperation, Human children, Primate cognitive evolution, Social intelligence, Vygotskian intelligence hypothesis}
+}
+
+@article{Clutton-brock2009,
+    title = {{Cooperation between non-kin in animal societies}},
+    year = {2009},
+    journal = {Nature},
+    author = {Clutton-Brock, Tim},
+    number = {7269},
+    pages = {51--57},
+    volume = {462},
+    publisher = {Nature Publishing Group},
+    isbn = {1476-4687 (Electronic){\textbackslash}r0028-0836 (Linking)},
+    doi = {10.1038/nature08366},
+    issn = {00280836},
+    pmid = {19890322}
+}
+
+@article{Hayes2014,
+    title = {{Cooperation came first: Evolution and human cognition}},
+    year = {2014},
+    journal = {Journal of the Experimental Analysis of Behavior},
+    author = {Hayes, Steven C. and Sanford, Brandon T.},
+    number = {1},
+    pages = {112--129},
+    volume = {101},
+    doi = {10.1002/jeab.64},
+    issn = {00225002},
+    keywords = {Cooperation, Eusociality, Evolution, Language, Relational frame theory, Symmetry}
+}
+
+@article{Leimar2010,
+    title = {{Cooperation for direct fitness benefits}},
+    year = {2010},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Leimar, Olof and Hammerstein, Peter},
+    number = {1553},
+    pages = {2619--2626},
+    volume = {365},
+    isbn = {1471-2970 (Electronic){\textbackslash}n0962-8436 (Linking)},
+    doi = {10.1098/rstb.2010.0116},
+    issn = {14712970},
+    pmid = {20679106},
+    keywords = {Biological markets, By-product benefits, Common interest, Mutualism, Pseudo-reciprocity}
+}
+
+@article{Powers,
+    title = {{Cooperation in large-scale human societies -- What, if anything, makes it unique, and how did it evolve?}},
+    author = {Powers, Simon and Schaik, Carel P. van and Lehmann, Laurent},
+    publisher = {OSF Preprints},
+    url = {https://osf.io/v47ap/},
+    doi = {10.31219/OSF.IO/V47AP},
+    keywords = {Biology, Life Sciences, Social and Behavioral Sciences, cooperation, cultural group selection, evolutionary psychology, human social evolution, institutions, large, punishment, scale societies}
+}
+
+@article{Boesch1994,
+    title = {{Cooperative hunting in wild chimpanzees}},
+    year = {1994},
+    journal = {Animal Behaviour},
+    author = {Boesch, Christophe},
+    number = {3},
+    pages = {653--667},
+    volume = {48},
+    doi = {10.1006/anbe.1994.1285},
+    issn = {00033472}
+}
+
+@article{Sylwester2010,
+    title = {{Cooperators benefit through reputation-based partner choice in economic games}},
+    year = {2010},
+    journal = {Biology Letters},
+    author = {Sylwester, Karolinaz and Roberts, Gilbert},
+    number = {5},
+    month = {10},
+    pages = {659--662},
+    volume = {6},
+    publisher = {The Royal Society},
+    url = {http://www.ncbi.nlm.nih.gov/pubmed/20410026 http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=PMC2936156},
+    doi = {10.1098/rsbl.2010.0209},
+    issn = {1744957X},
+    pmid = {20410026},
+    keywords = {Competitive altruism, Cooperation, Indirect reciprocity, Reputation}
+}
+
+@article{Sylwester2010a,
+    title = {{Cooperators benefit through reputation-based partner choice in economic games}},
+    year = {2010},
+    journal = {Biology Letters},
+    author = {Sylwester, Karolinaz and Roberts, Gilbert},
+    number = {5},
+    pages = {659--662},
+    volume = {6},
+    doi = {10.1098/rsbl.2010.0209},
+    issn = {1744957X},
+    keywords = {Competitive altruism, Cooperation, Indirect reciprocity, Reputation}
+}
+
+@article{Daw2006,
+    title = {{Cortical substrates for exploratory decisions in humans}},
+    year = {2006},
+    journal = {Nature},
+    author = {Daw, Nathaniel D. and O'Doherty, John P. and Dayan, Peter and Seymour, Ben and Dolan, Raymond J.},
+    number = {7095},
+    pages = {876--879},
+    volume = {441},
+    isbn = {0028-0836},
+    doi = {10.1038/nature04766},
+    issn = {14764687},
+    pmid = {16778890},
+    arxivId = {0803973233}
+}
+
+@article{Gintis2001,
+    title = {{Costly signaling and cooperation}},
+    year = {2001},
+    journal = {Journal of Theoretical Biology},
+    author = {Gintis, Herbert and Smith, Eric Alden and Bowles, Samuel},
+    number = {1},
+    pages = {103--119},
+    volume = {213},
+    doi = {10.1006/jtbi.2001.2406},
+    issn = {00225193}
+}
+
+@article{Pusey1978,
+    title = {{Devide we fall {\_}Packer{\_}Pusey}},
+    year = {1978},
+    author = {Pusey, Anne E},
+    number = {May 1997}
+}
+
+@article{Cunningham2014,
+    title = {{Dimensionality reduction for large-scale neural recordings}},
+    year = {2014},
+    journal = {Nature Neuroscience},
+    author = {Cunningham, John P. and Yu, Byron M.},
+    number = {11},
+    pages = {1500--1509},
+    volume = {17},
+    isbn = {1097-6256},
+    doi = {10.1038/nn.3776},
+    issn = {15461726},
+    pmid = {25151264},
+    arxivId = {15334406}
+}
+
+@article{VanVeelen2012,
+    title = {{Direct reciprocity in structured populations}},
+    year = {2012},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {van Veelen, M. and Garcia, J. and Rand, D. G. and Nowak, M. A.},
+    number = {25},
+    pages = {9929--9934},
+    volume = {109},
+    url = {http://www.pnas.org/cgi/doi/10.1073/pnas.1206694109},
+    isbn = {1091-6490 (Electronic){\textbackslash}r0027-8424 (Linking)},
+    doi = {10.1073/pnas.1206694109},
+    issn = {0027-8424},
+    pmid = {22665767}
+}
+
+@article{Katz2016,
+    title = {{Dissociated functional significance of decision-related activity in the primate dorsal stream}},
+    year = {2016},
+    journal = {Nature},
+    author = {Katz, Leor N. and Yates, Jacob L. and Pillow, Jonathan W. and Huk, Alexander C.},
+    number = {7611},
+    pages = {285--288},
+    volume = {535},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nature18617},
+    isbn = {0028-0836},
+    doi = {10.1038/nature18617},
+    issn = {14764687},
+    pmid = {27376476},
+    arxivId = {15334406}
+}
+
+@article{Hanks2015,
+    title = {{Distinct relationships of parietal and prefrontal cortices to evidence accumulation}},
+    year = {2015},
+    journal = {Nature},
+    author = {Hanks, Timothy D. and Kopec, Charles D. and Brunton, Bingni W. and Duan, Chunyu A. and Erlich, Jeffrey C. and Brody, Carlos D.},
+    number = {7546},
+    pages = {220--223},
+    volume = {520},
+    isbn = {1476-4687},
+    doi = {10.1038/nature14066},
+    issn = {14764687},
+    pmid = {25600270},
+    arxivId = {NIHMS150003}
+}
+
+@article{Bshary2008b,
+    title = {{Distinguishing four fundamental approaches to the evolution of helping}},
+    year = {2008},
+    journal = {Journal of Evolutionary Biology},
+    author = {Bshary, R. and Bergm{\"{u}}ller, R.},
+    number = {2},
+    pages = {405--420},
+    volume = {21},
+    isbn = {1010-061X},
+    doi = {10.1111/j.1420-9101.2007.01482.x},
+    issn = {1010061X},
+    pmid = {18179515},
+    keywords = {Altruism, Cognition, Control mechanism, Cooperation, Ecology, Kin selection, Life histories, Strategies}
+}
+
+@article{Bull1991,
+    title = {{Distinguishing mechanisms for the evolution of co-operation}},
+    year = {1991},
+    journal = {Journal of Theoretical Biology},
+    author = {Bull, J. J. and Rice, W. R.},
+    number = {1},
+    pages = {63--74},
+    volume = {149},
+    doi = {10.1016/S0022-5193(05)80072-4},
+    issn = {10958541},
+    pmid = {1881147}
+}
+
+@article{Packer2009,
+    title = {{Divided We Fall: Cooperation among Lions}},
+    year = {2009},
+    journal = {Scientific American},
+    author = {Packer, Craig and Pusey, Anne E.},
+    number = {5},
+    pages = {52--59},
+    volume = {276},
+    doi = {10.1038/scientificamerican0597-52},
+    issn = {0036-8733}
+}
+
+@article{Melis2008,
+    title = {{Do chimpanzees reciprocate received favours?}},
+    year = {2008},
+    journal = {Animal Behaviour},
+    author = {Melis, Alicia P. and Hare, Brian and Tomasello, Michael},
+    number = {3},
+    pages = {951--962},
+    volume = {76},
+    doi = {10.1016/j.anbehav.2008.05.014},
+    issn = {00033472},
+    keywords = {Pan troglodytes, chimpanzee, cognitive constraint, reciprocal altruism, reciprocity, recruitment}
+}
+
+@article{Connor2007,
+    title = {{Dolphin social intelligence: Complex alliance relationships in bottlenose dolphins and a consideration of selective environments for extreme brain size evolution in mammals}},
+    year = {2007},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Connor, Richard C.},
+    number = {1480},
+    pages = {587--602},
+    volume = {362},
+    isbn = {0962-8436},
+    doi = {10.1098/rstb.2006.1997},
+    issn = {09628436},
+    pmid = {17296597},
+    keywords = {Alliances, Brain size, Dolphins, Social complexity}
+}
+
+@article{Hollerman1998,
+    title = {{Dopamine neurons report an error in the temporal prediction of reward during learning}},
+    year = {1998},
+    journal = {Nature Neuroscience},
+    author = {Hollerman, Jeffrey R. and Schultz, Wolfram},
+    number = {4},
+    pages = {304--309},
+    volume = {1},
+    url = {http://www.nature.com/articles/nn0898_304},
+    isbn = {1097-6256},
+    doi = {10.1038/1124},
+    issn = {1097-6256},
+    pmid = {10195164},
+    arxivId = {NIHMS150003}
+}
+
+@article{Sutton2002,
+    title = {{Dyna-Style Planning with Linear Function Approximation and Prioritized Sweeping Richard}},
+    year = {2002},
+    journal = {Journal of Geophysical Research},
+    author = {Sutton, Richard and Szepesvari, Csaba and Geramifard, Alborz and Bowling, Michael},
+    number = {D4},
+    pages = {4036},
+    volume = {107},
+    url = {http://doi.wiley.com/10.1029/2000JD000149},
+    arxivId = {1206.3285}
+}
+
+@article{Sitt2008,
+    title = {{Dynamical origin of spectrally rich vocalizations in birdsong}},
+    year = {2008},
+    journal = {Physical Review E - Statistical, Nonlinear, and Soft Matter Physics},
+    author = {Sitt, J. D. and Amador, A. and Goller, F. and Mindlin, G. B.},
+    number = {1},
+    pages = {1--6},
+    volume = {78},
+    isbn = {1539-3755 (Print)1539-3755 (Linking)},
+    doi = {10.1103/PhysRevE.78.011905},
+    issn = {15393755},
+    pmid = {18763980}
+}
+
+@article{Tchernichovski2001,
+    title = {{Dynamics of the vocal imitation process: How a zebra finch learns its song}},
+    year = {2001},
+    journal = {Science},
+    author = {Tchernichovski, O. and Mitra, P. P. and Lints, T. and Nottebohm, F.},
+    number = {5513},
+    pages = {2564--2569},
+    volume = {291},
+    isbn = {0036-8075},
+    doi = {10.1126/science.1058522},
+    issn = {00368075},
+    pmid = {11283361},
+    arxivId = {NIHMS150003}
+}
+
+@book{Korb2008,
+    title = {{Ecology of Social Evolution}},
+    year = {2008},
+    author = {Korb, Judith},
+    url = {http://link.springer.com/10.1007/978-3-540-75957-7},
+    isbn = {978-3-540-75956-0},
+    doi = {10.1007/978-3-540-75957-7},
+    issn = {1098-6596},
+    pmid = {25246403},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@book{Dittami2003,
+    title = {{Economics in Nature, Social Dilemmas, Mate Choice and Biological Markets}},
+    year = {2003},
+    booktitle = {Ethology},
+    author = {Dittami, John},
+    number = {7},
+    pages = {614--615},
+    volume = {109},
+    isbn = {9780521650144},
+    doi = {10.1046/j.1439-0310.2003.00891.x},
+    issn = {0179-1613}
+}
+
+@article{Robson1990,
+    title = {{Efficiency in evolutionary games: Darwin, nash and the secret handshake}},
+    year = {1990},
+    journal = {Journal of Theoretical Biology},
+    author = {Robson, Arthur J.},
+    number = {3},
+    pages = {379--396},
+    volume = {144},
+    doi = {10.1016/S0022-5193(05)80082-7},
+    issn = {10958541}
+}
+
+@article{Smith2006,
+    title = {{Efficient auditory coding}},
+    year = {2006},
+    journal = {Nature},
+    author = {Smith, Evan C. and Lewicki, Michael S.},
+    number = {7079},
+    pages = {978--982},
+    volume = {439},
+    isbn = {1476-4687 (Electronic){\textbackslash}r0028-0836 (Linking)},
+    doi = {10.1038/nature04485},
+    issn = {14764687},
+    pmid = {16495999},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@article{Packer2001a,
+    title = {{Egalitarianism in female lions}},
+    year = {2001},
+    journal = {Science},
+    author = {Packer, Craig and Pusey, Anne E and Eberly, Lynn E},
+    number = {November 1959},
+    pages = {690--693},
+    volume = {293}
+}
+
+@article{Connor1992,
+    title = {{Egg-trading in simultaneous hermaphrodites: an alternative to Tit-for-Tat}},
+    year = {1992},
+    journal = {Journal of Evolutionary Biology},
+    author = {Connor, Richard C.},
+    number = {3},
+    month = {5},
+    pages = {523--528},
+    volume = {5},
+    url = {http://doi.wiley.com/10.1046/j.1420-9101.1992.5030523.x},
+    doi = {10.1046/j.1420-9101.1992.5030523.x},
+    issn = {1010-061X}
+}
+
+@article{Connor1992a,
+    title = {{Egg‐trading in simultaneous hermaphrodites: an alternative to Tit‐for‐Tat}},
+    year = {1992},
+    journal = {Journal of Evolutionary Biology},
+    author = {Connor, Richard C.},
+    number = {3},
+    month = {5},
+    pages = {523--528},
+    volume = {5},
+    publisher = {John Wiley {\&} Sons, Ltd (10.1111)},
+    url = {http://doi.wiley.com/10.1046/j.1420-9101.1992.5030523.x},
+    doi = {10.1046/j.1420-9101.1992.5030523.x},
+    issn = {14209101},
+    keywords = {Reciprocity, cooperation, egg‐trading, reciprocal altruism, seabasses, simultaneous hermaphroditism, tit‐for‐tat}
+}
+
+@article{Amador2013,
+    title = {{Elemental gesture dynamics are encoded by song premotor cortical neurons}},
+    year = {2013},
+    journal = {Nature},
+    author = {Amador, Ana and Perl, Yonatan Sanz and Mindlin, Gabriel B. and Margoliash, Daniel},
+    number = {7439},
+    pages = {59--64},
+    volume = {495},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nature11967},
+    isbn = {1476-4687 (Electronic){\textbackslash}r0028-0836 (Linking)},
+    doi = {10.1038/nature11967},
+    issn = {00280836},
+    pmid = {23446354},
+    arxivId = {NIHMS150003}
+}
+
+@article{Baker2019,
+    title = {{Emergent Tool Use From Multi-Agent Autocurricula}},
+    year = {2019},
+    author = {Baker, Bowen and Kanitscheider, Ingmar and Markov, Todor and Wu, Yi and Powell, Glenn and McGrew, Bob and Mordatch, Igor},
+    url = {http://arxiv.org/abs/1909.07528},
+    arxivId = {1909.07528}
+}
+
+@article{Agren2019,
+    title = {{Enforcement is central to the evolution of cooperation}},
+    year = {2019},
+    journal = {Nature Ecology and Evolution},
+    author = {{\AA}gren, J. Arvid and Davies, Nicholas G. and Foster, Kevin R.},
+    number = {July},
+    volume = {3},
+    publisher = {Springer US},
+    url = {http://dx.doi.org/10.1038/s41559-019-0907-1},
+    doi = {10.1038/s41559-019-0907-1},
+    issn = {2397334X}
+}
+
+@article{Melis2006,
+    title = {{Engineering cooperation in chimpanzees: tolerance constraints on cooperation}},
+    year = {2006},
+    journal = {Animal Behaviour},
+    author = {Melis, Alicia P. and Hare, Brian and Tomasello, Michael},
+    number = {2},
+    pages = {275--286},
+    volume = {72},
+    doi = {10.1016/j.anbehav.2005.09.018},
+    issn = {00033472}
+}
+
+@article{Jackson2015,
+    title = {{Epiphenomenal Qualia}},
+    year = {2015},
+    author = {Jackson, Frank},
+    number = {99},
+    pages = {126--137},
+    volume = {25}
+}
+
+@article{Margoliash2002,
+    title = {{Evaluating theories of bird song learning: Implications for future directions}},
+    year = {2002},
+    journal = {Journal of Comparative Physiology A: Neuroethology, Sensory, Neural, and Behavioral Physiology},
+    author = {Margoliash, D.},
+    number = {11-12},
+    pages = {851--866},
+    volume = {188},
+    isbn = {0340-7594},
+    doi = {10.1007/s00359-002-0351-5},
+    issn = {03407594},
+    pmid = {12471486},
+    keywords = {Error-driven and reinforcement learning, Instruction and selection, Sleep and replay, Template theory}
+}
+
+@book{MaynardSmith,
+    title = {{Evolution and the Theory of Games}},
+    year = {1982},
+    author = {Maynard Smith, John},
+    publisher = {Cambridge University Press},
+    url = {http://ebooks.cambridge.org/ref/id/CBO9780511806292},
+    address = {Cambridge},
+    isbn = {9780511806292},
+    doi = {10.1017/CBO9780511806292}
+}
+
+@article{Dunbar2007,
+    title = {{Evolution in the social brain}},
+    year = {2007},
+    journal = {Science},
+    author = {Dunbar, R. I.M. and Shultz, Susanne},
+    number = {5843},
+    pages = {1344--1347},
+    volume = {317},
+    doi = {10.1126/science.1145463},
+    issn = {00368075},
+    pmid = {17823343}
+}
+
+@article{Ranjbar-Sahraei2014,
+    title = {{Evolution of cooperation in arbitrary complex networks}},
+    year = {2014},
+    journal = {Proceedings of the 13th International Conference on Autonomous Agents and Multiagent Systems},
+    author = {Ranjbar-Sahraei, Bijan and Bou Ammar, Haitham and Bloembergen, Daan and Tuyls, Karl and Weiss, Gerhard},
+    pages = {677--684},
+    url = {http://dl.acm.org/citation.cfm?id=2615731.2615841},
+    isbn = {978-1-4503-2738-1},
+    keywords = {evolution of cooperation, repeated games on graphs}
+}
+
+@article{BRAUCHLI1999,
+    title = {{Evolution of cooperation in spatially structured populations}},
+    year = {1999},
+    journal = {Journal of Theoretical Biology},
+    author = {BRAUCHLI, Kurt and KILLINGBACK, Timothy and Doebeli, MICHAEL},
+    number = {4},
+    month = {10},
+    pages = {405--417},
+    volume = {200},
+    publisher = {Academic Press},
+    url = {https://www.sciencedirect.com/science/article/pii/S0022519399910007 https://pdfs.semanticscholar.org/83e5/34783c7c8746be0c47856541e2c15285368a.pdf http://linkinghub.elsevier.com/retrieve/pii/S0022519399910007},
+    doi = {10.1006/jtbi.1999.1000},
+    issn = {00225193}
+}
+
+@article{Hilbe2018,
+    title = {{Evolution of cooperation in stochastic games}},
+    year = {2018},
+    journal = {Nature},
+    author = {Hilbe, Christian and {\v{S}}imsa, Štěpán and Chatterjee, Krishnendu and Nowak, Martin A.},
+    number = {7713},
+    pages = {246--249},
+    volume = {559},
+    isbn = {1476-4687 (Electronic) 0028-0836 (Linking)},
+    doi = {10.1038/s41586-018-0277-x},
+    issn = {14764687},
+    pmid = {29973718}
+}
+
+@article{Chen2016,
+    title = {{Evolution of cooperation in the spatial public goods game with adaptive reputation assortment}},
+    year = {2016},
+    journal = {Physics Letters, Section A: General, Atomic and Solid State Physics},
+    author = {Chen, Mei Huan and Wang, Li and Sun, Shi Wen and Wang, Juan and Xia, Cheng Yi},
+    number = {1-2},
+    month = {1},
+    pages = {40--47},
+    volume = {380},
+    publisher = {North-Holland},
+    url = {https://www.sciencedirect.com/science/article/pii/S0375960115008506},
+    doi = {10.1016/j.physleta.2015.09.047},
+    issn = {03759601},
+    keywords = {Evolutionary game theory, Individual diversity, Promotion of cooperation, Public goods game, Reputation assortment}
+}
+
+@article{Ale2013,
+    title = {{Evolution of Cooperation: Combining Kin Selection and Reciprocal Altruism into Matrix Games with Social Dilemmas}},
+    year = {2013},
+    journal = {PLoS ONE},
+    author = {Ale, Som B. and Brown, Joel S. and Sullivan, Amy T.},
+    number = {5},
+    pages = {1--8},
+    volume = {8},
+    isbn = {1932-6203 (Electronic){\textbackslash}r1932-6203 (Linking)},
+    doi = {10.1371/journal.pone.0063761},
+    issn = {19326203},
+    pmid = {23717479}
+}
+
+@article{Delton2011,
+    title = {{Evolution of direct reciprocity under uncertainty can explain human generosity in one-shot encounters}},
+    year = {2011},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Delton, A. W. and Krasnow, M. M. and Cosmides, L. and Tooby, J.},
+    number = {32},
+    pages = {13335--13340},
+    volume = {108},
+    url = {http://www.pnas.org/cgi/doi/10.1073/pnas.1102131108},
+    isbn = {0027-8424},
+    doi = {10.1073/pnas.1102131108},
+    issn = {0027-8424},
+    pmid = {21788489},
+    arxivId = {2716}
+}
+
+@article{Debove2015c,
+    title = {{Evolution of equal division among unequal partners}},
+    year = {2015},
+    journal = {Evolution},
+    author = {Debove, Stéphane and Baumard, Nicolas and Andr{\'{e}}, Jean Baptiste},
+    number = {2},
+    pages = {561--569},
+    volume = {69},
+    isbn = {0014-3820},
+    doi = {10.1111/evo.12583},
+    issn = {15585646},
+    pmid = {25522195},
+    keywords = {Egalitarianism, Fairness, Game theory, Partner choice}
+}
+
+@misc{Nowak2005,
+    title = {{Evolution of indirect reciprocity}},
+    year = {2005},
+    booktitle = {Nature},
+    author = {Nowak, Martin A. and Sigmund, Karl},
+    number = {7063},
+    month = {10},
+    pages = {1291--1298},
+    volume = {437},
+    publisher = {Nature Publishing Group},
+    url = {http://www.nature.com/articles/nature04131},
+    doi = {10.1038/nature04131},
+    issn = {14764687},
+    keywords = {Humanities and Social Sciences, Science, multidisciplinary}
+}
+
+@article{Mohtashemi2003,
+    title = {{Evolution of indirect reciprocity by social information: The role of trust and reputation in evolution of altruism}},
+    year = {2003},
+    journal = {Journal of Theoretical Biology},
+    author = {Mohtashemi, Mojdeh and Mui, Lik},
+    number = {4},
+    pages = {523--531},
+    volume = {223},
+    doi = {10.1016/S0022-5193(03)00143-7},
+    issn = {00225193},
+    keywords = {Collective memory, Growth, Order, Reciprocity, Reputation, Social information, Trust}
+}
+
+@article{Ferrante2015,
+    title = {{Evolution of Self-Organized Task Specialization in Robot Swarms}},
+    year = {2015},
+    journal = {PLoS Computational Biology},
+    author = {Ferrante, Eliseo and Turgut, Ali Emre and Du{\'{e}}{\~{n}}ez-Guzm{\'{a}}n, Edgar and Dorigo, Marco and Wenseleers, Tom},
+    editor = {Sporns, Olaf},
+    number = {8},
+    month = {8},
+    pages = {e1004273},
+    volume = {11},
+    publisher = {Public Library of Science},
+    url = {http://dx.plos.org/10.1371/journal.pcbi.1004273},
+    doi = {10.1371/journal.pcbi.1004273},
+    issn = {15537358}
+}
+
+@article{Geritz1997,
+    title = {{Evolutionarily singular strategies and the adaptive growth and branching of the evolutionary tree}},
+    year = {1997},
+    journal = {Evolutionary Ecology},
+    author = {Geritz, S and Kisdi, E and Mesze NA, G},
+    url = {http://www.springerlink.com/index/J7073UU8U1701MG2.pdf%5Cnpapers2://publication/uuid/4CF42845-96B5-4AEC-9DC8-7FEA67DBC770},
+    keywords = {adaptive dynamics, evolutionarily singular strategy, evolutionary, evolutionary branching}
+}
+
+@article{Garcia2014,
+    title = {{Evolutionary determinants of sociality: the role of group formation}},
+    year = {2014},
+    author = {Garcia, Thomas and Pierre, L Université and Marie, E T},
+    url = {http://hal.upmc.fr/tel-01018209/}
+}
+
+@article{Ohtsuki2018,
+    title = {{Evolutionary Dynamics of Coordinated Cooperation}},
+    year = {2018},
+    journal = {Frontiers in Ecology and Evolution},
+    author = {Ohtsuki, Hisashi},
+    number = {May},
+    pages = {1--12},
+    volume = {6},
+    url = {https://www.frontiersin.org/article/10.3389/fevo.2018.00062/full},
+    doi = {10.3389/fevo.2018.00062},
+    issn = {2296-701X},
+    keywords = {conditional cooperation, conditional cooperation, evolutionary game theory,, evolutionary game theory, finite population, negotiation, replicator dynamics}
+}
+
+@article{West2007,
+    title = {{Evolutionary Explanations for Cooperation}},
+    year = {2007},
+    journal = {Current Biology},
+    author = {West, Stuart A. and Griffin, Ashleigh S. and Gardner, Andy},
+    number = {16},
+    pages = {661--672},
+    volume = {17},
+    isbn = {0960-9822 (Print){\textbackslash}n0960-9822 (Linking)},
+    doi = {10.1016/j.cub.2007.06.004},
+    issn = {09609822},
+    pmid = {17714660},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@article{Adami2016,
+    title = {{Evolutionary game theory using agent-based methods}},
+    year = {2016},
+    journal = {Physics of Life Reviews},
+    author = {Adami, Christoph and Schossau, Jory and Hintze, Arend},
+    pages = {1--26},
+    volume = {19},
+    publisher = {Elsevier B.V.},
+    url = {http://dx.doi.org/10.1016/j.plrev.2016.08.015},
+    isbn = {1873-1457 (Electronic){\textbackslash}r1571-0645 (Linking)},
+    doi = {10.1016/j.plrev.2016.08.015},
+    issn = {15710645},
+    pmid = {27617905},
+    arxivId = {1404.0994},
+    keywords = {Agent-based modeling, Evolutionary game theory}
+}
+
+@article{Nowak,
+    title = {{Evolutionary games and spatial chaos}},
+    author = {Nowak, Martin A. and May, Robert. M}
+}
+
+@article{Petrie2018,
+    title = {{Evolutionary Innovation}},
+    year = {2018},
+    journal = {Science},
+    author = {Petrie, Katherine L and Palmer, Nathan D and Johnson, Daniel T and Medina, Sarah J and Yan, Stephanie J and Li, Victor and Burmeister, Alita R and Meyer, Justin R},
+    number = {March},
+    pages = {1542--1545},
+    volume = {1545},
+    isbn = {0226586944 (ISBN); 0226586952 (ISBN)},
+    doi = {10.1126/science.aar1954},
+    issn = {0036-8075},
+    pmid = {29599247}
+}
+
+@article{Szabo2002,
+    title = {{Evolutionary prisoner’s dilemma games with voluntary participation}},
+    year = {2002},
+    journal = {Physical Review E - Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics},
+    author = {Szab{\'{o}}, György and Hauert, Christoph},
+    number = {6},
+    month = {12},
+    pages = {4},
+    volume = {66},
+    url = {http://www.ncbi.nlm.nih.gov/pubmed/12513331 https://link.aps.org/doi/10.1103/PhysRevE.66.062903},
+    doi = {10.1103/PhysRevE.66.062903},
+    issn = {1063651X},
+    pmid = {12513331}
+}
+
+@article{BussDavidM.1995,
+    title = {{Evolutionary Psychology: A New Paradigm for Psychological Science.}},
+    year = {1995},
+    journal = {Psychological Inquiry},
+    author = {{Buss,David M.}},
+    number = {1},
+    pages = {1--1},
+    volume = {6},
+    url = {http://search.ebscohost.com/login.aspx?direct=true&#38;db=aph&#38;AN=7396669&#38;site=ehost-live},
+    isbn = {1047840X}
+}
+
+@article{Andre2016,
+    title = {{Evolutionary robotics simulations help explain why reciprocity is rare in nature}},
+    year = {2016},
+    journal = {Scientific Reports},
+    author = {Andr{\'{e}}, Jean-Baptiste and Nolfi, Stefano},
+    number = {1},
+    month = {12},
+    pages = {32785},
+    volume = {6},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/srep32785 http://www.nature.com/articles/srep32785},
+    doi = {10.1038/srep32785},
+    issn = {2045-2322}
+}
+
+@article{Doncieux2015a,
+    title = {{Evolutionary Robotics: What, Why, and Where to}},
+    year = {2015},
+    journal = {Frontiers in Robotics and AI},
+    author = {Doncieux, Stephane and Bredeche, Nicolas and Mouret, Jean-Baptiste and Eiben, Agoston E. (Gusz)},
+    number = {March},
+    pages = {1--18},
+    volume = {2},
+    url = {http://www.frontiersin.org/Evolutionary_Robotics/10.3389/frobt.2015.00004/abstract},
+    isbn = {2296-9144},
+    doi = {10.3389/frobt.2015.00004},
+    issn = {2296-9144},
+    keywords = {embodied intelligence, evolutionary algorithms, evolutionary biology, evolutionary robotics, evolutionary robotics, embodied intelligence, evol, robotics}
+}
+
+@book{trianni2008evolutionary,
+    title = {{Evolutionary swarm robotics: evolving self-organising behaviours in groups of autonomous robots}},
+    year = {2008},
+    author = {Trianni, Vito},
+    volume = {108},
+    publisher = {Springer}
+}
+
+@article{Sutton2014,
+    title = {{Experience Replay and Planning}},
+    year = {2014},
+    author = {{Sutton}},
+    number = {1},
+    pages = {56 p.},
+    volume = {14},
+    url = {http://www.nasbe.org/wp-content/uploads/Standard_Mar2014_full_online.pdf},
+    keywords = {learning}
+}
+
+@article{Adam2012,
+    title = {{Experience replay for real-time reinforcement learning control}},
+    year = {2012},
+    journal = {IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews},
+    author = {Adam, Sander and Bu{\c{s}}oniu, Lucian and Babu{\v{s}}ka, Robert},
+    number = {2},
+    pages = {201--212},
+    volume = {42},
+    isbn = {1094-6977},
+    doi = {10.1109/TSMCC.2011.2106494},
+    issn = {10946977},
+    pmid = {12588315},
+    keywords = {Experience replay (ER), Q-learning, SARSA, real-time control, reinforcement learning (RL), robotics}
+}
+
+@article{Krams2008,
+    title = {{Experimental evidence of reciprocal altruism in the pied flycatcher}},
+    year = {2008},
+    journal = {Behavioral Ecology and Sociobiology},
+    author = {Krams, Indrikis and Krama, Tatjana and Igaune, Kristine and M{\"{a}}nd, Raivo},
+    number = {4},
+    pages = {599--605},
+    volume = {62},
+    isbn = {0340-5443},
+    doi = {10.1007/s00265-007-0484-1},
+    issn = {03405443},
+    keywords = {Anti-predator behaviour, Co-operation, Mobbing, Pied flycatcher, Reciprocal altruism}
+}
+
+@article{Bshary2002b,
+    title = {{Experimental evidence that partner choice is a driving force in the payoff distribution among cooperators or mutualists: The cleaner fish case}},
+    year = {2002},
+    journal = {Ecology Letters},
+    author = {Bshary, Redouan and Grutter, Alexandra S.},
+    number = {1},
+    pages = {130--136},
+    volume = {5},
+    isbn = {1461-023X},
+    doi = {10.1046/j.1461-0248.2002.00295.x},
+    issn = {1461023X},
+    keywords = {Biological markets, Labroides dimidiatus, Mutualism, Partner choice}
+}
+
+@article{Geoffroy2018,
+    title = {{Explaining fine-grained properties of human cooperation . Insights from evolutionary game theory}},
+    year = {2018},
+    author = {Geoffroy, Félix}
+}
+
+@article{Moulin-Frier2014,
+    title = {{Explauto: An open-source Python library to study autonomous exploration in developmental robotics}},
+    year = {2014},
+    journal = {IEEE ICDL-EPIROB 2014 - 4th Joint IEEE International Conference on Development and Learning and on Epigenetic Robotics},
+    author = {Moulin-Frier, Clement and Rouanet, Pierre and Oudeyer, Pierre Yves},
+    pages = {171--172},
+    isbn = {9781479975402},
+    doi = {10.1109/DEVLRN.2014.6982976}
+}
+
+@article{DeLavilleon2015,
+    title = {{Explicit memory creation during sleep demonstrates a causal role of place cells in navigation}},
+    year = {2015},
+    journal = {Nature Neuroscience},
+    author = {De Lavill{\'{e}}on, Gaetan and Lacroix, Marie Masako and Rondi-Reig, Laure and Benchenane, Karim},
+    number = {4},
+    pages = {493--495},
+    volume = {18},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nn.3970},
+    isbn = {1546-1726 (Electronic){\textbackslash}r1097-6256 (Linking)},
+    doi = {10.1038/nn.3970},
+    issn = {15461726},
+    pmid = {25751533}
+}
+
+@article{Schmid2017,
+    title = {{Feel good, do good? Disentangling reciprocity from unconditional prosociality}},
+    year = {2017},
+    journal = {Ethology},
+    author = {Schmid, Res and Schneeberger, Karin and Taborsky, Michael},
+    number = {9},
+    pages = {640--647},
+    volume = {123},
+    doi = {10.1111/eth.12636},
+    issn = {14390310},
+    keywords = {affective state, cooperation, decision-making, experience, mood, prosocial behaviour, reciprocal altruism}
+}
+
+@misc{Mller1988,
+    title = {{Female choice selects for male sexual tail ornaments in the monogamous swallow}},
+    year = {1988},
+    booktitle = {Nature},
+    author = {M{\o}ller, Anders Pape},
+    number = {6165},
+    pages = {640--642},
+    volume = {332},
+    doi = {10.1038/332640a0},
+    issn = {00280836}
+}
+
+@article{Nowak2006,
+    title = {{Five rules for the evolution of cooperation}},
+    year = {2006},
+    journal = {Science},
+    author = {Nowak, Martin A.},
+    number = {5805},
+    pages = {1560--1563},
+    volume = {314},
+    isbn = {doi:10.1126/science.1133755},
+    doi = {10.1126/science.1133755},
+    issn = {00368075},
+    pmid = {17158317},
+    arxivId = {arXiv:1208.2666v4}
+}
+
+@article{Fodor2012,
+    title = {{Fodor ' s Guide to Mental Representation : Auntie ' s Vade-Mecum The Intelligent}},
+    year = {2012},
+    journal = {Mind},
+    author = {Fodor, J.},
+    number = {373},
+    pages = {76--100},
+    volume = {94}
+}
+
+@article{Johnstone2002,
+    title = {{From parasitism to mutualism: Partner control in asymmetric interactions}},
+    year = {2002},
+    journal = {Ecology Letters},
+    author = {Johnstone, Rufus A. and Bshary, Redouan},
+    number = {5},
+    pages = {634--639},
+    volume = {5},
+    isbn = {1461-0248},
+    doi = {10.1046/j.1461-0248.2002.00358.x},
+    issn = {1461023X},
+    keywords = {Cleaner-fish, Cooperation, Mutualism, Partner control, Punishment, Reciprocal altruism}
+}
+
+@article{Bshary2004,
+    title = {{Game Structures in Mutualistic Interactions: What Can the Evidence Tell Us About the Kind of Models We Need?}},
+    year = {2004},
+    journal = {Advances in the Study of Behavior},
+    author = {Bshary, Redouan and Bronstein, Judith L.},
+    pages = {59--101},
+    volume = {34},
+    isbn = {0120045346},
+    doi = {10.1016/S0065-3454(04)34002-7},
+    issn = {00653454}
+}
+
+@book{Gibbons1992,
+    title = {{Game Theory For Applied Economists}},
+    year = {1992},
+    booktitle = {Princeton University Press},
+    author = {Gibbons, Robert},
+    number = {6},
+    pages = {1--142},
+    volume = {71},
+    url = {http://scholar.google.com/scholar?hl=en&btnG=Search&q=intitle:Game+Theory+For+Applied+Economists#0},
+    isbn = {0691003955},
+    doi = {10.1017/CBO9780511791307.017},
+    issn = {07450115},
+    pmid = {613706},
+    arxivId = {arXiv:gr-qc/9809069v1},
+    archivePrefix = {arXiv},
+    eprint = {9809069v1},
+    primaryClass = {gr-qc}
+}
+
+@book{Gibbons2014,
+    title = {{Game Theory for Applied Economists}},
+    year = {2014},
+    booktitle = {Journal of Environmental Studies and Sciences},
+    author = {Gibbons, Robert},
+    number = {4},
+    pages = {360--363},
+    volume = {4},
+    isbn = {0-691-04308-6},
+    keywords = {City, Habitat, Sustainability, Urban}
+}
+
+@article{Waibel2009,
+    title = {{Genetic Team Composition and Level of Selection in the Evolution of Cooperation}},
+    year = {2009},
+    journal = {IEEE Transactions on Evolutionary Computation},
+    author = {Waibel, Markus and Keller, Laurent and Floreano, Dario},
+    number = {3},
+    month = {6},
+    pages = {648--660},
+    volume = {13},
+    url = {http://ieeexplore.ieee.org/document/5089892/},
+    doi = {10.1109/TEVC.2008.2011741},
+    issn = {1089-778X},
+    keywords = {Altruism, Artificial evolution, Cooperation, Evolutionary robotics, Fitness allocation, Multiagent systems (MAS), Team composition}
+}
+
+@article{Schino2007,
+    title = {{Grooming and agonistic support: a meta-analysis of primate reciprocal altruism}},
+    year = {2007},
+    journal = {Behavioral Ecology},
+    author = {Schino, Gabriele},
+    number = {1},
+    month = {1},
+    pages = {115--120},
+    volume = {18},
+    url = {https://academic.oup.com/beheco/article-lookup/doi/10.1093/beheco/arl045},
+    doi = {10.1093/beheco/arl045},
+    issn = {1465-7279},
+    keywords = {Agonistic support, Grooming, Meta-analysis, Primates, Reciprocal altruism}
+}
+
+@article{Schino2008,
+    title = {{Grooming reciprocation among female primates: a meta-analysis}},
+    year = {2008},
+    journal = {Biology Letters},
+    author = {Schino, Gabriele and Aureli, Filippo},
+    number = {1},
+    month = {2},
+    pages = {9--11},
+    volume = {4},
+    url = {https://royalsocietypublishing.org/doi/10.1098/rsbl.2007.0506},
+    doi = {10.1098/rsbl.2007.0506},
+    issn = {1744-9561},
+    keywords = {altruism, grooming, reciprocation}
+}
+
+@article{Garcia2013,
+    title = {{Group formation and the evolution of sociality}},
+    year = {2013},
+    journal = {Evolution},
+    author = {Garcia, Thomas and De Monte, Silvia},
+    number = {1},
+    pages = {131--141},
+    volume = {67},
+    isbn = {1558-5646},
+    doi = {10.1111/j.1558-5646.2012.01739.x},
+    issn = {00143820},
+    pmid = {23289567},
+    keywords = {Assortative mechanisms, Evolution of altruism, Group size, Public goods games, Social dilemma}
+}
+
+@article{Scheel1991,
+    title = {{Group hunting behaviour of lions: a search for cooperation}},
+    year = {1991},
+    journal = {Animal Behaviour},
+    author = {Scheel, D. and Packer, C.},
+    number = {4},
+    pages = {697--709},
+    volume = {41},
+    doi = {10.1016/S0003-3472(05)80907-8},
+    issn = {00033472}
+}
+
+@article{Mosser2009,
+    title = {{Group territoriality and the benefits of sociality in the African lion, Panthera leo}},
+    year = {2009},
+    journal = {Animal Behaviour},
+    author = {Mosser, Anna and Packer, Craig},
+    number = {2},
+    pages = {359--370},
+    volume = {78},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.anbehav.2009.04.024},
+    doi = {10.1016/j.anbehav.2009.04.024},
+    issn = {00033472},
+    keywords = {Panthera leo, aggression, competition, evolution, fission-fusion, geographical information system, group territoriality, lion, sociality}
+}
+
+@article{Mosser2009a,
+    title = {{Group territoriality and the benefits of sociality in the African lion, Panthera leo}},
+    year = {2009},
+    journal = {Animal Behaviour},
+    author = {Mosser, Anna and Packer, Craig},
+    number = {2},
+    pages = {359--370},
+    volume = {78},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.anbehav.2009.04.024},
+    doi = {10.1016/j.anbehav.2009.04.024},
+    issn = {00033472},
+    keywords = {Panthera leo, aggression, competition, evolution, fission-fusion, geographical information system, group territoriality, lion, sociality}
+}
+
+@book{Basar2018,
+    title = {{Handbook of dynamic game theory}},
+    year = {2018},
+    booktitle = {Handbook of Dynamic Game Theory},
+    author = {Ba{\c{s}}ar, Tamer and Zaccour, Georges},
+    pages = {1--1285},
+    isbn = {9783319443744},
+    doi = {10.1007/978-3-319-44374-4}
+}
+
+@article{Gracia-Lazaro2012,
+    title = {{Heterogeneous networks do not promote cooperation when humans play a Prisoner's Dilemma}},
+    year = {2012},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Gracia-Lazaro, C. and Ferrer, Alfredo and Ruiz, Gonzalo and Tarancon, A. and Cuesta, J. A. and Sanchez, A. and Moreno, Y. and Gracia-l, Carlos and Ferrer, Alfredo and Ruiz, Gonzalo and Taranc, Alfonso},
+    number = {32},
+    month = {8},
+    pages = {12922--12926},
+    volume = {109},
+    url = {http://www.pnas.org/cgi/doi/10.1073/pnas.1206681109},
+    isbn = {1292212926},
+    doi = {10.1073/pnas.1206681109},
+    issn = {0027-8424}
+}
+
+@article{Lipkind2013a,
+    title = {{HHS Public Access}},
+    year = {2013},
+    author = {Lipkind, Dina and Marcus, Gary F and Bemis, Douglas and Sasahara, Kazutoshi and Jacoby, Nori and Takahashi, Miki and Suzuki, Kenta and Feher, Olga and Ravbar, Primoz and Okanoya, Kazuo and Tchernichovski, Ofer and Gan, Ramat},
+    number = {7452},
+    pages = {104--108},
+    volume = {498},
+    isbn = {9780954882501},
+    doi = {10.1038/nature12173.Stepwise}
+}
+
+@article{Lewis2014,
+    title = {{High mobility explains demand sharing and enforced cooperation in egalitarian hunter-gatherers}},
+    year = {2014},
+    journal = {Nature Communications},
+    author = {Lewis, Hannah M. and Vinicius, Lucio and Strods, Janis and MacE, Ruth and Migliano, Andrea Bamberg},
+    pages = {1--8},
+    volume = {5},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/ncomms6789},
+    isbn = {2041-1723 (Electronic){\textbackslash}r2041-1723 (Linking)},
+    doi = {10.1038/ncomms6789},
+    issn = {20411723},
+    pmid = {25511874}
+}
+
+@article{Gupta2010,
+    title = {{Hippocampal Replay Is Not a Simple Function of Experience}},
+    year = {2010},
+    journal = {Neuron},
+    author = {Gupta, Anoopum S. and van der Meer, Matthijs A.A. and Touretzky, David S. and Redish, A. David},
+    number = {5},
+    pages = {695--705},
+    volume = {65},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.neuron.2010.01.034},
+    isbn = {1097-4199 (Electronic) 0896-6273 (Linking)},
+    doi = {10.1016/j.neuron.2010.01.034},
+    issn = {08966273},
+    pmid = {20223204},
+    arxivId = {15334406},
+    keywords = {SYSNEURO}
+}
+
+@article{Davidson2009,
+    title = {{Hippocampal Replay of Extended Experience}},
+    year = {2009},
+    journal = {Neuron},
+    author = {Davidson, Thomas J. and Kloosterman, Fabian and Wilson, Matthew A.},
+    number = {4},
+    pages = {497--507},
+    volume = {63},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.neuron.2009.07.027},
+    isbn = {0896-6273},
+    doi = {10.1016/j.neuron.2009.07.027},
+    issn = {08966273},
+    pmid = {19709631},
+    arxivId = {quant-ph/0604011},
+    keywords = {SYSNEURO}
+}
+
+@misc{Watson2016,
+    title = {{How Can Evolution Learn?}},
+    year = {2016},
+    booktitle = {Trends in Ecology and Evolution},
+    author = {Watson, Richard A. and Szathm{\'{a}}ry, Eörs},
+    number = {2},
+    pages = {147--157},
+    volume = {31},
+    publisher = {Elsevier Ltd},
+    doi = {10.1016/j.tree.2015.11.009},
+    issn = {01695347}
+}
+
+@article{Suchak2016a,
+    title = {{How chimpanzees cooperate in a competitive world}},
+    year = {2016},
+    journal = {Proceedings of the National Academy of Sciences of the United States of America},
+    author = {Suchak, Malini and Eppleya, Timothy M. and Campbell, Matthew W. and Feldmana, Rebecca A. and Quarlesc, Luke F. and De Waal, Frans B.M.},
+    number = {36},
+    pages = {10215--10220},
+    volume = {113},
+    doi = {10.1073/pnas.1611826113},
+    issn = {10916490},
+    keywords = {Enforcement, Freeloading, Pan troglodytes, Partner choice, Punishment}
+}
+
+@article{SebastienDeregnaucourt2005,
+    title = {{How sleep affects the developmental learning of bird song}},
+    year = {2005},
+    journal = {Nature},
+    author = {{Sebastien Deregnaucourt} and {Partha P.Mitra} and {Olga Feher} and {Carolyn Pytte} and {Ofer Tchernichovski}},
+    number = {February},
+    volume = {433}
+}
+
+@article{Rand2013,
+    title = {{Human cooperation}},
+    year = {2013},
+    journal = {Trends in cognitive sciences},
+    author = {Rand, David G. and Nowak, Martin A.},
+    number = {8},
+    month = {8},
+    pages = {413--25},
+    volume = {17},
+    publisher = {Elsevier Current Trends},
+    url = {https://www.sciencedirect.com/science/article/pii/S1364661313001216?dgcid=api_sd_search-api-endpoint http://www.ncbi.nlm.nih.gov/pubmed/23856025},
+    doi = {10.1016/j.tics.2013.06.003},
+    issn = {1879-307X},
+    pmid = {23856025}
+}
+
+@article{Collins2014,
+    title = {{Human EEG Uncovers Latent Generalizable Rule Structure during Learning}},
+    year = {2014},
+    journal = {Journal of Neuroscience},
+    author = {Collins, A. G. E. and Cavanagh, J. F. and Frank, M. J.},
+    number = {13},
+    pages = {4677--4685},
+    volume = {34},
+    url = {http://www.jneurosci.org/cgi/doi/10.1523/JNEUROSCI.3900-13.2014},
+    isbn = {1529-2401 (Electronic){\textbackslash}n0270-6474 (Linking)},
+    doi = {10.1523/JNEUROSCI.3900-13.2014},
+    issn = {0270-6474},
+    pmid = {24672013},
+    keywords = {eeg, prefrontal cortex, reinforcement learning, rules, task-set}
+}
+
+@article{Ostendorf2010,
+    title = {{Human thalamus contributes to perceptual stability across eye movements}},
+    year = {2010},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Ostendorf, F. and Liebermann, D. and Ploner, C. J.},
+    number = {3},
+    month = {1},
+    pages = {1229--1234},
+    volume = {107},
+    url = {http://www.pnas.org/cgi/doi/10.1073/pnas.0910742107},
+    doi = {10.1073/pnas.0910742107},
+    issn = {0027-8424}
+}
+
+@article{Boesch1989,
+    title = {{Hunting behavior of wild chimpanzees in the Ta{\"{i}} National Park}},
+    year = {1989},
+    journal = {American Journal of Physical Anthropology},
+    author = {Boesch, Christophe and Boesch, Hedwige},
+    number = {4},
+    pages = {547--573},
+    volume = {78},
+    doi = {10.1002/ajpa.1330780410},
+    issn = {10968644},
+    keywords = {Cooperation, Sharing, Traditions}
+}
+
+@article{Crandall2007,
+    title = {{HVC Neural Sleep Activity Increases With Development and Parallels Nightly Changes in Song Behavior}},
+    year = {2007},
+    journal = {Journal of Neurophysiology},
+    author = {Crandall, S. R. and Adam, M. and Kinnischtzke, A. K. and Nick, T. A.},
+    number = {1},
+    pages = {232--240},
+    volume = {98},
+    url = {http://jn.physiology.org/cgi/doi/10.1152/jn.00128.2007},
+    isbn = {2156623929},
+    doi = {10.1152/jn.00128.2007},
+    issn = {0022-3077},
+    pmid = {17428907},
+    arxivId = {NIHMS150003}
+}
+
+@article{Kay2008,
+    title = {{Identifying natural images from human brain activity}},
+    year = {2008},
+    journal = {Nature},
+    author = {Kay, Kendrick N. and Naselaris, Thomas and Prenger, Ryan J. and Gallant, Jack L.},
+    number = {7185},
+    pages = {352--355},
+    volume = {452},
+    isbn = {1476-4687 (Electronic)},
+    doi = {10.1038/nature06713},
+    issn = {14764687},
+    pmid = {18322462},
+    arxivId = {NIHMS150003}
+}
+
+@article{Bshary2006,
+    title = {{Image scoring and cooperation in a cleaner fish mutualism}},
+    year = {2006},
+    journal = {Nature},
+    author = {Bshary, Redouan and Grutter, Alexandra S.},
+    number = {7096},
+    pages = {975--978},
+    volume = {441},
+    isbn = {1476-4687 (Electronic){\textbackslash}r0028-0836 (Linking)},
+    doi = {10.1038/nature04755},
+    issn = {14764687},
+    pmid = {16791194}
+}
+
+@article{Giacomini1965,
+    title = {{Imetodi attuali per la diagnosi della gravidanza protratta.}},
+    year = {1965},
+    journal = {Rivista di ostetricia e ginecologia},
+    author = {Giacomini, G. and Bonfirraro, G.},
+    number = {5},
+    pages = {361--374},
+    volume = {20},
+    isbn = {9788578110796},
+    doi = {10.1126/science.aad3023},
+    issn = {10959203},
+    pmid = {25246403},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@article{Shu2018,
+    title = {{Impacts of memory on a regular lattice for different population sizes with asynchronous update in spatial snowdrift game}},
+    year = {2018},
+    journal = {Physics Letters, Section A: General, Atomic and Solid State Physics},
+    author = {Shu, Feng and Liu, Xingwen and Li, Min},
+    number = {20},
+    pages = {1317--1323},
+    volume = {382},
+    publisher = {Elsevier B.V.},
+    url = {https://doi.org/10.1016/j.physleta.2018.03.033},
+    doi = {10.1016/j.physleta.2018.03.033},
+    issn = {03759601},
+    keywords = {Cooperation, Evolutionary game, Memory-based snowdrift game, Population sizes}
+}
+
+@article{Jones2012,
+    title = {{Inferred But Not Cached Values}},
+    year = {2012},
+    author = {{Jones} and Mirenzi, Aaron and Schoenbaum, Geoffrey},
+    number = {November},
+    pages = {953--956},
+    volume = {338},
+    isbn = {9781137362506},
+    doi = {10.1126/science.1227489.Orbitofrontal}
+}
+
+@article{Bergmuller2007,
+    title = {{Integrating cooperative breeding into theoretical concepts of cooperation}},
+    year = {2007},
+    journal = {Behavioural Processes},
+    author = {Bergm{\"{u}}ller, Ralph and Johnstone, Rufus A. and Russell, Andrew F. and Bshary, Redouan},
+    number = {2},
+    pages = {61--72},
+    volume = {76},
+    isbn = {0376-6357},
+    doi = {10.1016/j.beproc.2007.07.001},
+    issn = {03766357},
+    pmid = {17703898},
+    keywords = {Altruistic behaviour, Cooperative breeding, Evolution, Group augmentation, Pay-to-stay, Prestige, Reciprocity}
+}
+
+@book{Eiben2003,
+    title = {{Introduction to Evolutionary Computing}},
+    year = {2003},
+    booktitle = {The Manipulation of Literature:Strategies and Methods for Translating Theatre Texts.},
+    author = {Eiben, A. E. and Smith, J. E.},
+    pages = {87--102},
+    series = {Natural Computing Series},
+    publisher = {Springer Berlin Heidelberg},
+    url = {http://link.springer.com/10.1007/978-3-662-05094-1},
+    address = {Berlin, Heidelberg},
+    isbn = {978-3-642-07285-7},
+    doi = {10.1007/978-3-662-05094-1}
+}
+
+@article{VanBavel2017,
+    title = {{Introduction: Agent-Based Modelling as a Tool to Advance Evolutionary Population Theory}},
+    year = {2017},
+    author = {Van Bavel, Jan and Grow, André},
+    pages = {3--27},
+    volume = {41},
+    url = {http://link.springer.com/10.1007/978-3-319-32283-4_1},
+    isbn = {978-3-319-32281-0},
+    doi = {10.1007/978-3-319-32283-4{\_}1}
+}
+
+@article{Brette2017,
+    title = {{Is coding a relevant metaphor for the brain?}},
+    year = {2017},
+    author = {Brette, Romain},
+    url = {http://www.albayan.ae},
+    doi = {10.1101/168237},
+    keywords = {10, 1101, 168237, 2017, 27, a license to display, action, biorxiv preprint first posted, doi, dx, funder, http, information, is the author, neural coding, online jul, org, perception, sensorimotor, the copyright holder for, the preprint in perpetuity, this preprint, which was not peer-reviewed, who has granted biorxiv}
+}
+
+@article{Aktipis2011,
+    title = {{Is cooperation viable in mobile organisms? Simple Walk Away rule favors the evolution of cooperation in groups}},
+    year = {2011},
+    journal = {Evolution and Human Behavior},
+    author = {Aktipis, C Athena},
+    number = {4},
+    month = {7},
+    pages = {263--276},
+    volume = {32},
+    publisher = {NIH Public Access},
+    url = {http://www.ncbi.nlm.nih.gov/pubmed/21666771 http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=PMC3110732},
+    doi = {10.1016/j.evolhumbehav.2011.01.002},
+    issn = {10905138},
+    pmid = {21666771},
+    keywords = {Conditional movement, Contingent movement, Dispersal, Group selection, Motility, Multilevel selection, Social dilemma, Walk away}
+}
+
+@article{Stanley1993,
+    title = {{Iterated Prisoner's Dilemma with Choice and Refusal of Partners}},
+    year = {1993},
+    author = {Stanley, E Ann and Ashlock, Dan and Tesfatsion, Leigh}
+}
+
+@article{Liu2004,
+    title = {{Juvenile zebra finches can use multiple strategies to learn the same song}},
+    year = {2004},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Liu, W.-c. and Gardner, T. J. and Nottebohm, F.},
+    number = {52},
+    pages = {18177--18182},
+    volume = {101},
+    url = {http://www.pnas.org/cgi/doi/10.1073/pnas.0408065101},
+    isbn = {0027-8424},
+    doi = {10.1073/pnas.0408065101},
+    issn = {0027-8424},
+    pmid = {15608063}
+}
+
+@article{Aktipis2004,
+    title = {{Know when to walk away: Contingent movement and the evolution of cooperation}},
+    year = {2004},
+    journal = {Journal of Theoretical Biology},
+    author = {Aktipis, C. Athena},
+    number = {2},
+    pages = {249--260},
+    volume = {231},
+    isbn = {0022-5193 (Print){\textbackslash}n0022-5193 (Linking)},
+    doi = {10.1016/j.jtbi.2004.06.020},
+    issn = {00225193},
+    pmid = {15380389},
+    keywords = {Agent-based, Cooperation, Evolution, Exit, Movement, Tit-for-tat, Walk away}
+}
+
+@article{Ecarlat2003,
+    title = {{Learning a high diversity of object manipulations through an evolutionary-based babbling}},
+    year = {2003},
+    journal = {Review of Economic Studies},
+    author = {Ecarlat, Pierre and Cully, Antoine and Maestre, Carlos and Doncieux, Stephane},
+    number = {3},
+    pages = {649--665},
+    volume = {70},
+    doi = {10.1111/1467-937X.00260},
+    issn = {00346527},
+    arxivId = {1504.04909}
+}
+
+@article{Grohens2017,
+    title = {{Learning conditional cooperation in evolutionary swarm robotics}},
+    year = {2017},
+    author = {Grohens, Théotime and Bredeche, Nicolas and Intelligents, Systèmes and Robotique, De},
+    number = {July},
+    pages = {1--18}
+}
+
+@article{Behrens2007,
+    title = {{Learning the value of information in an uncertain world}},
+    year = {2007},
+    journal = {Nature Neuroscience},
+    author = {Behrens, Timothy E.J. and Woolrich, Mark W. and Walton, Mark E. and Rushworth, Matthew F.S.},
+    number = {9},
+    pages = {1214--1221},
+    volume = {10},
+    isbn = {1097-6256 (Print){\textbackslash}n1097-6256 (Linking)},
+    doi = {10.1038/nn1954},
+    issn = {10976256},
+    pmid = {17676057},
+    arxivId = {NIHMS150003}
+}
+
+@article{Boyan1999,
+    title = {{Least-Squares Temporal Difference Learning Justin}},
+    year = {1999},
+    author = {Boyan, Justin A},
+    pages = {1--8},
+    url = {papers://d471b97a-e92c-44c2-8562-4efc271c8c1b/Paper/p92}
+}
+
+@article{Glascher2012,
+    title = {{Lesion mapping of cognitive control and value-based decision making in the prefrontal cortex}},
+    year = {2012},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Glascher, J. and Adolphs, R. and Damasio, H. and Bechara, A. and Rudrauf, D. and Calamia, M. and Paul, L. K. and Tranel, D.},
+    number = {36},
+    pages = {14681--14686},
+    volume = {109},
+    url = {http://www.pnas.org/cgi/doi/10.1073/pnas.1206608109},
+    isbn = {1091-6490 (Electronic){\textbackslash}n0027-8424 (Linking)},
+    doi = {10.1073/pnas.1206608109},
+    issn = {0027-8424},
+    pmid = {22908286},
+    arxivId = {1408.1149}
+}
+
+@article{Wolf2007,
+    title = {{Life-history trade-offs favour the evolution of animal personalities}},
+    year = {2007},
+    journal = {Nature},
+    author = {Wolf, Max and Van Doorn, G. Sander and Leimar, Olof and Weissing, Franz J.},
+    number = {7144},
+    pages = {581--584},
+    volume = {447},
+    isbn = {0028-0836},
+    doi = {10.1038/nature05835},
+    issn = {14764687},
+    pmid = {17538618},
+    arxivId = {NIHMS150003}
+}
+
+@article{Noe2017,
+    title = {{Local mating markets in humans and non-human animals}},
+    year = {2017},
+    journal = {Behavioral Ecology and Sociobiology},
+    author = {No{\"{e}}, Ronald},
+    number = {10},
+    volume = {71},
+    publisher = {Behavioral Ecology and Sociobiology},
+    doi = {10.1007/s00265-017-2376-3},
+    issn = {03405443},
+    keywords = {Biological market, Marriage market, Matching, Mating, Operational sex ratio, Sexual selection}
+}
+
+@article{Amador2014,
+    title = {{Low dimensional dynamics in birdsong production}},
+    year = {2014},
+    journal = {European Physical Journal B},
+    author = {Amador, Ana and Mindlin, Gabriel B.},
+    number = {12},
+    volume = {87},
+    isbn = {1434-6028 1434-6036},
+    doi = {10.1140/epjb/e2014-50566-5},
+    issn = {14346036},
+    keywords = {Colloquium}
+}
+
+@article{Lerer2017,
+    title = {{Maintaining cooperation in complex social dilemmas using deep reinforcement learning}},
+    year = {2017},
+    author = {Lerer, Adam and Peysakhovich, Alexander},
+    url = {http://arxiv.org/abs/1707.01068},
+    doi = {10.1002/bit.20168.155.},
+    arxivId = {1707.01068}
+}
+
+@article{West2015,
+    title = {{Major evolutionary transitions in individuality}},
+    year = {2015},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {West, Stuart A. and Fisher, Roberta M. and Gardner, Andy and Kiers, E. Toby},
+    number = {33},
+    pages = {10112--10119},
+    volume = {112},
+    url = {http://www.pnas.org/lookup/doi/10.1073/pnas.1421402112},
+    isbn = {1091-6490 (Electronic){\textbackslash}r0027-8424 (Linking)},
+    doi = {10.1073/pnas.1421402112},
+    issn = {0027-8424},
+    pmid = {25964342}
+}
+
+@article{Low2008,
+    title = {{Mammalian-like features of sleep structure in zebra finches}},
+    year = {2008},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Low, Philip Steven and Shank, Sylvan S. and Sejnowski, Terrence J. and Margoliash, Daniel},
+    number = {26},
+    pages = {9081--9086},
+    volume = {105},
+    url = {http://www.pnas.org/lookup/doi/10.1073/pnas.0703452105},
+    isbn = {1091-6490 (Electronic){\textbackslash}n0027-8424 (Linking)},
+    doi = {10.1073/pnas.0703452105},
+    issn = {0027-8424},
+    pmid = {18579776}
+}
+
+@incollection{Parker1983,
+    title = {{Mate quality and mating decisions}},
+    year = {1983},
+    booktitle = {Mate Choice},
+    author = {Parker, Geoff A},
+    pages = {141--166},
+    publisher = {Cambridge University Press},
+    isbn = {9780521272070}
+}
+
+@article{Zahavi1975,
+    title = {{Mate selection-A selection for a handicap}},
+    year = {1975},
+    journal = {Journal of Theoretical Biology},
+    author = {Zahavi, Amotz},
+    number = {1},
+    month = {9},
+    pages = {205--214},
+    volume = {53},
+    url = {https://linkinghub.elsevier.com/retrieve/pii/0022519375901113},
+    doi = {10.1016/0022-5193(75)90111-3},
+    issn = {10958541},
+    pmid = {1195756}
+}
+
+@article{Dempster1991,
+    title = {{Maximum Likelihood from Incomplete Data via the EM Algorithm}},
+    year = {1991},
+    journal = {Toxicologic Pathology},
+    author = {Dempster, A. P. and Rubin, D. B.},
+    number = {3},
+    pages = {293--297},
+    volume = {19},
+    isbn = {9781450319522},
+    doi = {10.1177/019262339101900314},
+    issn = {01926233},
+    pmid = {9501024},
+    arxivId = {0710.5696v2}
+}
+
+@article{Andre2014,
+    title = {{Mechanistic constraints and the unlikely evolution of reciprocal cooperation}},
+    year = {2014},
+    journal = {Journal of Evolutionary Biology},
+    author = {Andr{\'{e}}, J. B.},
+    number = {4},
+    pages = {784--795},
+    volume = {27},
+    isbn = {1010-061x},
+    doi = {10.1111/jeb.12351},
+    issn = {14209101},
+    pmid = {24618005},
+    keywords = {Bootstrapping, Evolution of cooperation, Genetic constraints, Reciprocity}
+}
+
+@article{Krasnow2013,
+    title = {{Meeting now suggests we will meet again: Implications for debates on the evolution of cooperation}},
+    year = {2013},
+    journal = {Scientific Reports},
+    author = {Krasnow, Max M. and Delton, Andrew W. and Tooby, John and Cosmides, Leda},
+    number = {1},
+    month = {12},
+    pages = {1747},
+    volume = {3},
+    publisher = {Nature Publishing Group},
+    url = {http://www.nature.com/articles/srep01747},
+    doi = {10.1038/srep01747},
+    issn = {20452322},
+    keywords = {Evolution, Evolutionary theory, Psychology, Social evolution}
+}
+
+@article{Deregnaucourt2012,
+    title = {{Melatonin affects the temporal pattern of vocal signatures in birds}},
+    year = {2012},
+    journal = {Journal of Pineal Research},
+    author = {Der{\'{e}}gnaucourt, Sébastien and Saar, Sigal and Gahr, Manfred},
+    number = {3},
+    pages = {245--258},
+    volume = {53},
+    isbn = {1600-079X (Electronic) 0742-3098 (Linking)},
+    doi = {10.1111/j.1600-079X.2012.00993.x},
+    issn = {07423098},
+    pmid = {22506964},
+    keywords = {Japanese quail, Zebra finch, birdsong, learning, melatonin, motor control, pinealectomy, timing, vocalization}
+}
+
+@article{Fiete2007,
+    title = {{Model of Birdsong Learning Based on Gradient Estimation by Dynamic Perturbation of Neural Conductances}},
+    year = {2007},
+    journal = {Journal of Neurophysiology},
+    author = {Fiete, I. R. and Fee, M. S. and Seung, H. S.},
+    number = {4},
+    pages = {2038--2057},
+    volume = {98},
+    url = {http://jn.physiology.org/cgi/doi/10.1152/jn.01311.2006},
+    isbn = {0022-3077},
+    doi = {10.1152/jn.01311.2006},
+    issn = {0022-3077},
+    pmid = {17652414},
+    arxivId = {1507.07580}
+}
+
+@article{Daw2011,
+    title = {{Model-based influences on humans' choices and striatal prediction errors}},
+    year = {2011},
+    journal = {Neuron},
+    author = {Daw, Nathaniel D. and Gershman, Samuel J. and Seymour, Ben and Dayan, Peter and Dolan, Raymond J.},
+    number = {6},
+    pages = {1204--1215},
+    volume = {69},
+    publisher = {Elsevier Inc.},
+    url = {http://dx.doi.org/10.1016/j.neuron.2011.02.027},
+    isbn = {1097-4199 (Electronic) 0896-6273 (Linking)},
+    doi = {10.1016/j.neuron.2011.02.027},
+    issn = {08966273},
+    pmid = {21435563},
+    arxivId = {NIHMS150003}
+}
+
+@article{Nick2015,
+    title = {{Models of vocal learning in the songbird: Historical frameworks and the stabilizing critic}},
+    year = {2015},
+    journal = {Developmental Neurobiology},
+    author = {Nick, Teresa A.},
+    number = {10},
+    pages = {1091--1113},
+    volume = {75},
+    isbn = {1932-846X (Electronic){\textbackslash}r1932-8451 (Linking)},
+    doi = {10.1002/dneu.22189},
+    issn = {1932846X},
+    pmid = {24841478},
+    keywords = {Basal ganglia, Disinhibition, Mirror, Motor control, Oscillation}
+}
+
+@article{Leibo2017,
+    title = {{Multi-agent Reinforcement Learning in Sequential Social Dilemmas}},
+    year = {2017},
+    author = {Leibo, Joel Z. and Zambaldi, Vinicius and Lanctot, Marc and Marecki, Janusz and Graepel, Thore},
+    url = {http://arxiv.org/abs/1702.03037},
+    isbn = {0000000348935},
+    doi = {10.1098/rstb.2015.0272},
+    issn = {0962-8436},
+    pmid = {27114574},
+    arxivId = {1702.03037},
+    keywords = {agent-based, cooperation, markov games, non-cooperative games, social dilemmas, social simulation}
+}
+
+@article{Johnstone1996,
+    title = {{Mutual Mate Choice and Sex Differences in Choosiness}},
+    year = {1996},
+    journal = {Evolution},
+    author = {Johnstone, Rufus A. and Reynolds, John D. and Deutsch, James C.},
+    number = {4},
+    pages = {1382},
+    volume = {50},
+    doi = {10.2307/2410876},
+    issn = {00143820},
+    keywords = {-mate choice, 1995, accepted august 7, and ornamentation in many, parental care, parental investment, received january 23, sex differences in weaponry, sex role reversal, sexual selection}
+}
+
+@article{Johnstone2008,
+    title = {{Mutualism, market effects and partner control}},
+    year = {2008},
+    journal = {Journal of Evolutionary Biology},
+    author = {Johnstone, R. A. and Bshary, R.},
+    number = {3},
+    pages = {879--888},
+    volume = {21},
+    isbn = {1420-9101 (Electronic){\textbackslash}r1010-061X (Linking)},
+    doi = {10.1111/j.1420-9101.2008.01505.x},
+    issn = {1010061X},
+    pmid = {18312320},
+    keywords = {Biological markets, Cleaner fish, Cooperation, Mutualism, Partner control, Punishment, Reciprocal altruism}
+}
+
+@article{Schwartz2001,
+    title = {{Natural signal statistics and sensory}},
+    year = {2001},
+    journal = {Nature neuroscience},
+    author = {Schwartz, Odelia and Simoncelli, Eero P},
+    number = {8},
+    pages = {819--825},
+    volume = {1},
+    url = {http://www.ncbi.nlm.nih.gov/pubmed/11477428},
+    isbn = {1097-6256 (Print)},
+    doi = {10.1038/90526},
+    issn = {1097-6256},
+    pmid = {11477428},
+    keywords = {Acoustic Stimulation, Action Potentials, Action Potentials: physiology, Animals, Auditory Perception, Auditory Perception: physiology, Central Nervous System, Central Nervous System: physiology, Cochlear Nerve, Cochlear Nerve: physiology, Data Interpretation, Macaca, Macaca: anatomy {\&} histology, Macaca: physiology, Models, Neurological, Neurons, Neurons: physiology, Nonlinear Dynamics, Photic Stimulation, Reaction Time, Reaction Time: physiology, Saimiri, Saimiri: anatomy {\&} histology, Saimiri: physiology, Sensation, Sensation: physiology, Signal Transduction, Signal Transduction: physiology, Statistical, Synaptic Transmission, Synaptic Transmission: physiology, Visual Cortex, Visual Cortex: physiology, Visual Perception, Visual Perception: physiology}
+}
+
+@article{Liljeholm2011,
+    title = {{Neural Correlates of Instrumental Contingency Learning: Differential Effects of Action-Reward Conjunction and Disjunction}},
+    year = {2011},
+    journal = {Journal of Neuroscience},
+    author = {Liljeholm, M. and Tricomi, E. and O'Doherty, J. P. and Balleine, B. W.},
+    number = {7},
+    pages = {2474--2480},
+    volume = {31},
+    url = {http://www.jneurosci.org/cgi/doi/10.1523/JNEUROSCI.3354-10.2011},
+    isbn = {1529-2401 (Electronic){\textbackslash}r0270-6474 (Linking)},
+    doi = {10.1523/JNEUROSCI.3354-10.2011},
+    issn = {0270-6474},
+    pmid = {21325514}
+}
+
+@article{Hayden2011,
+    title = {{Neuronal basis of sequential foraging decisions in a patchy environment}},
+    year = {2011},
+    journal = {Nature Neuroscience},
+    author = {Hayden, Benjamin Y. and Pearson, John M. and Platt, Michael L.},
+    number = {7},
+    pages = {933--939},
+    volume = {14},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nn.2856},
+    isbn = {1546-1726 (Electronic){\textbackslash}n1097-6256 (Linking)},
+    doi = {10.1038/nn.2856},
+    issn = {10976256},
+    pmid = {21642973}
+}
+
+@article{Hardy2006,
+    title = {{Nice guys finish first: The competitive altruism hypothesis}},
+    year = {2006},
+    journal = {Personality and Social Psychology Bulletin},
+    author = {Hardy, Charlie L. and Van Vugt, Mark},
+    number = {10},
+    pages = {1402--1413},
+    volume = {32},
+    doi = {10.1177/0146167206291006},
+    issn = {01461672},
+    keywords = {Altruism, Costly signals, Public goods, Reputation, Status}
+}
+
+@article{Aktipis2012,
+    title = {{NIH Public Access}},
+    year = {2012},
+    author = {Aktipis, C Athena},
+    number = {4},
+    pages = {263--276},
+    volume = {32},
+    doi = {10.1016/j.evolhumbehav.2011.01.002.Is},
+    keywords = {conditional movement, contingent movement, group selection, migration, multilevel selection, social dilemma, walk away}
+}
+
+@article{Nelson2012,
+    title = {{NIH Public Access}},
+    year = {2012},
+    author = {Nelson, Eric E and Guyer, Amanda E},
+    number = {3},
+    pages = {233--245},
+    volume = {1},
+    isbn = {6176321972},
+    doi = {10.1016/j.dcn.2011.01.002.The},
+    issn = {15378276},
+    arxivId = {NIHMS150003},
+    keywords = {adolescence, affiliative, childhood, emotion}
+}
+
+@article{Wilkinson2016,
+    title = {{Non-kin cooperation in bats.}},
+    year = {2016},
+    journal = {Philosophical transactions of the Royal Society of London. Series B, Biological sciences},
+    author = {Wilkinson, Gerald S and Carter, Gerald G and Bohn, Kirsten M and Adams, Danielle M},
+    number = {1687},
+    pages = {20150095},
+    volume = {371},
+    doi = {10.1098/rstb.2015.0095},
+    issn = {1471-2970},
+    pmid = {26729934},
+    keywords = {by-product mutualism, group augmentation, partner choice, reciprocity, spear-nosed bats, vampire bats}
+}
+
+@article{Barto2013,
+    title = {{Novelty or Surprise?}},
+    year = {2013},
+    journal = {Frontiers in Psychology},
+    author = {Barto, Andrew and Mirolli, Marco and Baldassarre, Gianluca},
+    number = {DEC},
+    pages = {1--15},
+    volume = {4},
+    isbn = {1664-1078 (Electronic){\textbackslash}r1664-1078 (Linking)},
+    doi = {10.3389/fpsyg.2013.00907},
+    issn = {16641078},
+    pmid = {24376428},
+    keywords = {Expectation, Intrinsic motivation, Novelty, Novelty detection, Surprise}
+}
+
+@article{Vengadesh2008,
+    title = {{Occurrence of the "peaking effect" corresponding to the "Highest range of effective intensities" exhibited by bacteriorhodopsin (bR) - Carboxymethylcellulose (CMC) biosensor upon illumination}},
+    year = {2008},
+    journal = {Malaysian Journal of Science},
+    author = {Vengadesh, P. and Majid, Wan Haliza Abdul and Shanmugam, S. Anandan and Low, K. S.},
+    number = {1},
+    pages = {121--127},
+    volume = {27},
+    doi = {10.1111/j.1420-9101.2006.01119.x},
+    issn = {13943065},
+    keywords = {Bacteriorhodopsin, Highest range of effective intensities, Peaking effect, bR-CMC photosensor}
+}
+
+@article{Margoliash2003,
+    title = {{Offline learning and the role of autogenous speech: New suggestions from birdsong research}},
+    year = {2003},
+    journal = {Speech Communication},
+    author = {Margoliash, Daniel},
+    number = {1},
+    pages = {165--178},
+    volume = {41},
+    isbn = {0167-6393},
+    doi = {10.1016/S0167-6393(02)00101-2},
+    issn = {01676393},
+    keywords = {Autogenous speech, Birdsong, Evolution of speech, Sleep and learning}
+}
+
+@article{Chou2008,
+    title = {{On the studies of syllable segmentation and improving MFCCs for automatic birdsong recognition}},
+    year = {2008},
+    journal = {Proceedings of the 3rd IEEE Asia-Pacific Services Computing Conference, APSCC 2008},
+    author = {Chou, Chih Hsun and Liu, Pang Hsin and Cai, Bingjing},
+    pages = {745--750},
+    isbn = {9780769534732},
+    doi = {10.1109/APSCC.2008.6}
+}
+
+@article{VanderWaal2009a,
+    title = {{Optimal group size, dispersal decisions and postdispersal relationships in female African lions}},
+    year = {2009},
+    journal = {Animal Behaviour},
+    author = {VanderWaal, Kimberly L. and Mosser, Anna and Packer, Craig},
+    number = {4},
+    pages = {949--954},
+    volume = {77},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.anbehav.2008.12.028},
+    doi = {10.1016/j.anbehav.2008.12.028},
+    issn = {00033472},
+    keywords = {African lion, Panthera leo, dispersal, group fission, optimal group size, postdispersal relationship}
+}
+
+@book{Osborne2004,
+    title = {{Osborne-a-Course-in-Game-Theory-Mit-1994}},
+    year = {2004},
+    author = {Osborne, Martin J},
+    pages = {1--373},
+    url = {papers2://publication/uuid/2874ABB7-F16E-42B6-AD33-9FF0A35DDBFC},
+    isbn = {0262150417}
+}
+
+@article{Bshary2008a,
+    title = {{Pairs of cooperating cleaner fish provide better service quality than singletons}},
+    year = {2008},
+    journal = {Nature},
+    author = {Bshary, Redouan and Grutter, Alexandra S. and Willener, Astrid S T and Leimar, Olof},
+    number = {7215},
+    pages = {964--966},
+    volume = {455},
+    isbn = {0028-0836},
+    doi = {10.1038/nature07184},
+    issn = {14764687},
+    pmid = {18923522}
+}
+
+@incollection{Trivers2017,
+    title = {{Parental investment and sexual selection}},
+    year = {1972},
+    booktitle = {Sexual Selection and the Descent of Man: The Darwinian Pivot},
+    author = {Trivers, Robert L.},
+    editor = {Trivers, Robert L.},
+    month = {7},
+    pages = {136--179},
+    publisher = {Routledge},
+    url = {https://www.taylorfrancis.com/books/9781351491112/chapters/10.4324/9781315129266-7},
+    isbn = {9781351491112},
+    doi = {10.4324/9781315129266-7}
+}
+
+@article{Trivers1972,
+    title = {{Parental investment and sexual selection. In Sexual selection and the descent of Man (Campbell B. ed)}},
+    year = {1972},
+    journal = {Sexual Selection and the Descent of Man: The Darwinian Pivot},
+    author = {Trivers, Robert L.},
+    editor = {Trivers, Robert L.},
+    number = {2},
+    month = {7},
+    pages = {136--179},
+    volume = {13},
+    publisher = {Routledge},
+    url = {https://www.taylorfrancis.com/books/9781351491112/chapters/10.4324/9781315129266-7},
+    isbn = {9781351491112},
+    doi = {10.4324/9781315129266-7}
+}
+
+@article{Barclay2007a,
+    title = {{Partner choice creates competitive altruism in humans}},
+    year = {2007},
+    journal = {Proceedings of the Royal Society B: Biological Sciences},
+    author = {Barclay, Pat and Willer, Robb},
+    number = {1610},
+    pages = {749--753},
+    volume = {274},
+    isbn = {0962-8452},
+    doi = {10.1098/rspb.2006.0209},
+    issn = {14712970},
+    pmid = {17255001},
+    arxivId = {cs/9605103},
+    keywords = {Competitive altruism, Cooperation, Costly signalling, Reputation, Trust}
+}
+
+@article{Debove2015b,
+    title = {{Partner choice creates fairness in humans}},
+    year = {2015},
+    journal = {Proceedings of the Royal Society B: Biological Sciences},
+    author = {Debove, Stephane and Andre, Jean-Baptiste and Baumard, Nicolas},
+    pages = {20150392},
+    volume = {282},
+    doi = {10.1098/rspb.2015.0392},
+    issn = {1471-2954},
+    pmid = {25972467},
+    keywords = {behaviour, cognition, evolution}
+}
+
+@article{Roberts2015,
+    title = {{Partner choice drives the evolution of cooperation via indirect reciprocity}},
+    year = {2015},
+    journal = {PLoS ONE},
+    author = {Roberts, Gilbert},
+    number = {6},
+    month = {6},
+    pages = {e0129442},
+    volume = {10},
+    publisher = {Public Library of Science},
+    url = {https://dx.plos.org/10.1371/journal.pone.0129442},
+    doi = {10.1371/journal.pone.0129442},
+    issn = {19326203}
+}
+
+@article{Simms2002,
+    title = {{Partner choice in nitrogen-fixation mutualisms of legumes and rhizobia}},
+    year = {2002},
+    journal = {Integrative and Comparative Biology},
+    author = {Simms, Ellen L. and Lee Taylor, D.},
+    number = {2},
+    pages = {369--380},
+    volume = {42},
+    doi = {10.1093/icb/42.2.369},
+    issn = {00031569}
+}
+
+@article{Campenni2014,
+    title = {{Partner choice promotes cooperation: The two faces of testing with agent-based models}},
+    year = {2014},
+    journal = {Journal of Theoretical Biology},
+    author = {Campenn{\`{i}}, Marco and Schino, Gabriele},
+    pages = {49--55},
+    volume = {344},
+    publisher = {Elsevier},
+    url = {http://dx.doi.org/10.1016/j.jtbi.2013.11.019},
+    doi = {10.1016/j.jtbi.2013.11.019},
+    issn = {10958541},
+    keywords = {Evolution, Proximate mechanisms, Social relationships}
+}
+
+@article{Hilbe2018a,
+    title = {{Partners and rivals in direct reciprocity}},
+    year = {2018},
+    journal = {Nature Human Behaviour},
+    author = {Hilbe, Christian and Chatterjee, Krishnendu and Nowak, Martin A.},
+    number = {7},
+    pages = {469--477},
+    volume = {2},
+    publisher = {Springer US},
+    url = {http://dx.doi.org/10.1038/s41562-018-0320-9},
+    doi = {10.1038/s41562-018-0320-9},
+    issn = {23973374}
+}
+
+@article{Tamosiunaite2008,
+    title = {{Path-finding in real and simulated rats: Assessing the influence of path characteristics on navigation learning}},
+    year = {2008},
+    journal = {Journal of Computational Neuroscience},
+    author = {Tamosiunaite, Minija and Ainge, James and Kulvicius, Tomas and Porr, Bernd and Dudchenko, Paul and W{\"{o}}rg{\"{o}}tter, Florentin},
+    number = {3},
+    pages = {562--582},
+    volume = {25},
+    isbn = {1082700800},
+    doi = {10.1007/s10827-008-0094-6},
+    issn = {09295313},
+    pmid = {18446432},
+    keywords = {Function approximation, Place field system, Reinforcement learning, SARSA, Weight decay}
+}
+
+@article{Andre2007,
+    title = {{Perfect reciprocity is the only evolutionarily stable strategy in the continuous iterated prisoner's dilemma}},
+    year = {2007},
+    journal = {Journal of Theoretical Biology},
+    author = {Andr{\'{e}}, Jean Baptiste and Day, Troy},
+    number = {1},
+    pages = {11--22},
+    volume = {247},
+    doi = {10.1016/j.jtbi.2007.02.007},
+    issn = {00225193},
+    pmid = {17397874},
+    keywords = {Continuous prisoner's dilemma, Game theory, Iterated game, Negotiation, Reciprocity}
+}
+
+@article{Picardo2016,
+    title = {{Population-Level Representation of a Temporal Sequence Underlying Song Production in the Zebra Finch}},
+    year = {2016},
+    journal = {Neuron},
+    author = {Picardo, Michel A. and Merel, Josh and Katlowitz, Kalman A. and Vallentin, Daniela and Okobi, Daniel E. and Benezra, Sam E. and Clary, Rachel C. and Pnevmatikakis, Eftychios A. and Paninski, Liam and Long, Michael A.},
+    number = {4},
+    pages = {866--876},
+    volume = {90},
+    publisher = {Elsevier Inc.},
+    url = {http://dx.doi.org/10.1016/j.neuron.2016.02.016},
+    isbn = {0000100064},
+    doi = {10.1016/j.neuron.2016.02.016},
+    issn = {10974199},
+    pmid = {27196976},
+    arxivId = {15334406}
+}
+
+@article{Rubenstein2014,
+    title = {{Programmable self-assembly in a thousand-robot swarm}},
+    year = {2014},
+    journal = {Science},
+    author = {Rubenstein, Michael and Cornejo, Alejandro and Nagpal, Radhika},
+    number = {6198},
+    pages = {795--799},
+    volume = {345},
+    isbn = {1853467960},
+    doi = {10.1126/science.1254295},
+    issn = {10959203},
+    pmid = {25124435}
+}
+
+@article{Raihani2004,
+    title = {{Punishers Benefit From Third-Party}},
+    year = {2004},
+    journal = {Science},
+    author = {Raihani, Nichola J and Grutter, Alexandra S and Bshary, Redouan},
+    number = {8},
+    pages = {171},
+    volume = {327},
+    doi = {10.1126/science.1183068},
+    issn = {0036-8075}
+}
+
+@article{Brandt2005,
+    title = {{Punishing and abstaining for public goods}},
+    year = {2005},
+    journal = {Proceedings of the National Academy of Sciences},
+    author = {Brandt, H. and Hauert, C. and Sigmund, K.},
+    number = {2},
+    pages = {495--497},
+    volume = {103},
+    doi = {10.1073/pnas.0507229103},
+    issn = {0027-8424}
+}
+
+@article{Raihani2012,
+    title = {{Punishment and cooperation in nature}},
+    year = {2012},
+    journal = {Trends in Ecology and Evolution},
+    author = {Raihani, Nichola J. and Thornton, Alex and Bshary, Redouan},
+    number = {5},
+    pages = {288--295},
+    volume = {27},
+    publisher = {Elsevier Ltd},
+    url = {http://dx.doi.org/10.1016/j.tree.2011.12.004},
+    isbn = {0169-5347},
+    doi = {10.1016/j.tree.2011.12.004},
+    issn = {01695347},
+    pmid = {22284810}
+}
+
+@article{Bshary2005,
+    title = {{Punishment and partner switching cause cooperative behaviour in a cleaning mutualism}},
+    year = {2005},
+    journal = {Biology Letters},
+    author = {Bshary, Redouan and Grutter, Alexandra S},
+    number = {4},
+    month = {12},
+    pages = {396--399},
+    volume = {1},
+    url = {https://royalsocietypublishing.org/doi/10.1098/rsbl.2005.0344},
+    doi = {10.1098/rsbl.2005.0344},
+    issn = {1744-9561},
+    keywords = {biological market, cooperation, labroides dimidiatus, mutualism, partner choice, punishment}
+}
+
+@article{Roberts2010,
+    title = {{Rapid spine stabilization and synaptic enhancement at the onset of behavioural learning}},
+    year = {2010},
+    journal = {Nature},
+    author = {Roberts, Todd F. and Tschida, Katherine A. and Klein, Marguerita E. and Mooney, Richard},
+    number = {7283},
+    pages = {948--952},
+    volume = {463},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nature08759},
+    isbn = {1476-4687 (Electronic) 0028-0836 (Linking)},
+    doi = {10.1038/nature08759},
+    issn = {00280836},
+    pmid = {20164928}
+}
+
+@article{Wilson1994,
+    title = {{Reactivation of Hippocampal Ensemble Memories During Sleep Matthew}},
+    year = {1994},
+    journal = {Science},
+    author = {Wilson, Matthew A and Mcnaughton, Bruce L},
+    number = {July},
+    pages = {5--8},
+    volume = {265},
+    isbn = {1111111111}
+}
+
+@article{Schino2009,
+    title = {{Reciprocal Altruism in Primates}},
+    year = {2009},
+    journal = {Advances in the Study of Behavior},
+    author = {Schino, Gabriele and Aureli, Filippo},
+    number = {1},
+    pages = {45--69},
+    volume = {39},
+    url = {https://linkinghub.elsevier.com/retrieve/pii/S0065345409390026},
+    doi = {10.1016/S0065-3454(09)39002-6}
+}
+
+@article{Leimar1997,
+    title = {{Reciprocity and communication of partner quality}},
+    year = {1997},
+    journal = {Proceedings of the Royal Society B: Biological Sciences},
+    author = {Leimar, O.},
+    number = {1385},
+    pages = {1209--1215},
+    volume = {264},
+    isbn = {978-3-319-14644-7},
+    doi = {10.1098/rspb.1997.0167},
+    issn = {14712970}
+}
+
+@article{Schino2017,
+    title = {{Reciprocity in group-living animals: Partner control versus partner choice}},
+    year = {2017},
+    journal = {Biological Reviews},
+    author = {Schino, Gabriele and Aureli, Filippo},
+    number = {2},
+    pages = {665--672},
+    volume = {92},
+    doi = {10.1111/brv.12248},
+    issn = {1469185X},
+    keywords = {Cooperation, Partner choice, Partner control, Proximate mechanisms, Reciprocity}
+}
+
+@article{Perl2011,
+    title = {{Reconstruction of physiological instructions from Zebra finch song}},
+    year = {2011},
+    journal = {Physical Review E - Statistical, Nonlinear, and Soft Matter Physics},
+    author = {Perl, Yonatan Sanz and Arneodo, Ezequiel M. and Amador, Ana and Goller, Franz and Mindlin, Gabriel B.},
+    number = {5},
+    pages = {1--8},
+    volume = {84},
+    isbn = {0031182012000},
+    doi = {10.1103/PhysRevE.84.051909},
+    issn = {15393755},
+    pmid = {22181446},
+    arxivId = {NIHMS150003}
+}
+
+@article{Burkov2014,
+    title = {{Repeated games for multiagent systems: a survey}},
+    year = {2014},
+    journal = {The Knowledge Engineering Review},
+    author = {Burkov, Andriy and Chaib-Draa, Brahim},
+    number = {1},
+    month = {1},
+    pages = {1--30},
+    volume = {29},
+    url = {https://www.cambridge.org/core/product/identifier/S026988891300009X/type/journal_article},
+    doi = {10.1017/S026988891300009X},
+    issn = {0269-8889},
+    keywords = {Autoimmune neurology, Beh{\c{c}}et, IgG-4-related disease, Neuro-rheumatology, Neurologic complications, Neurosarcoidosis, Rheumatoid arthritis, Rheumatology, Sj{\"{o}}gren syndrome, Systemic lupus erythematosus}
+}
+
+@article{Leimar1997a,
+    title = {{Repeated games: A state space approach}},
+    year = {1997},
+    journal = {Journal of Theoretical Biology},
+    author = {Leimar, Olof},
+    number = {4},
+    pages = {471--498},
+    volume = {184},
+    doi = {10.1006/jtbi.1996.0286},
+    issn = {00225193}
+}
+
+@article{Skaggs2016,
+    title = {{Replay of Neuronal Firing Sequences in Rat Hippocampus During Sleep Following Spatial Experience Author ( s ): William E . Skaggs and Bruce L . McNaughton Published by : American Association for the Advancement of Science Stable URL : http://www.jstor.org}},
+    year = {2016},
+    author = {Skaggs, William E and Mcnaughton, Bruce L},
+    number = {5257},
+    pages = {1870--1873},
+    volume = {271}
+}
+
+@article{Peyrache2009,
+    title = {{Replay of rule-learning related neural patterns in the prefrontal cortex during sleep}},
+    year = {2009},
+    journal = {Nature Neuroscience},
+    author = {Peyrache, Adrien and Khamassi, Mehdi and Benchenane, Karim and Wiener, Sidney I. and Battaglia, Francesco P.},
+    number = {7},
+    pages = {919--926},
+    volume = {12},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nn.2337},
+    isbn = {1546-1726 (Electronic)},
+    doi = {10.1038/nn.2337},
+    issn = {10976256},
+    pmid = {19483687}
+}
+
+@article{Doncieux2015,
+    title = {{Representational redescription: the next challenge?}},
+    year = {2015},
+    journal = {AMD Newsletter},
+    author = {Doncieux, S},
+    number = {1},
+    pages = {16--17},
+    volume = {12}
+}
+
+@article{Parker2015,
+    title = {{Reprinted from Bateson ( ed .) Mate Choice Printed in Great Britain}},
+    year = {2015},
+    author = {Parker, Geoff A},
+    number = {January 1983}
+}
+
+@article{Diekmann2014,
+    title = {{Reputation Formation and the Evolution of Cooperation in Anonymous Online Markets}},
+    year = {2014},
+    journal = {American Sociological Review},
+    author = {Diekmann, Andreas and Jann, Ben and Przepiorka, Wojtek and Wehrli, Stefan},
+    number = {1},
+    month = {2},
+    pages = {65--85},
+    volume = {79},
+    publisher = {SAGE PublicationsSage CA: Los Angeles, CA},
+    url = {http://journals.sagepub.com/doi/10.1177/0003122413512316},
+    doi = {10.1177/0003122413512316},
+    issn = {0003-1224},
+    keywords = {cooperation, online markets, reciprocity, reputation, trust}
+}
+
+@article{Swakman2016,
+    title = {{Reputation-based cooperation: Empirical evidence for behavioral strategies}},
+    year = {2016},
+    journal = {Evolution and Human Behavior},
+    author = {Swakman, Violet and Molleman, Lucas and Ule, Aljaž and Egas, Martijn},
+    number = {3},
+    month = {5},
+    pages = {230--235},
+    volume = {37},
+    publisher = {Elsevier},
+    url = {https://www.sciencedirect.com/science/article/pii/S109051381500121X?via%3Dihub},
+    doi = {10.1016/j.evolhumbehav.2015.12.001},
+    issn = {10905138},
+    keywords = {Experiment, Human cooperation, Indirect reciprocity, Individual differences}
+}
+
+@article{Raihani2011,
+    title = {{Resolving the iterated prisoner's dilemma: Theory and reality}},
+    year = {2011},
+    journal = {Journal of Evolutionary Biology},
+    author = {Raihani, Nichola J. and Bshary, R.},
+    number = {8},
+    pages = {1628--1639},
+    volume = {24},
+    isbn = {1420-9101 (Electronic){\textbackslash}r1010-061X (Linking)},
+    doi = {10.1111/j.1420-9101.2011.02307.x},
+    issn = {1010061X},
+    pmid = {21599777},
+    keywords = {Cooperation, Prisoner's dilemma, Pseudo-reciprocity, Punishment, Reciprocity, Tit-for-tat}
+}
+
+@article{Guthrie1934,
+    title = {{Reward and punishment}},
+    year = {1934},
+    journal = {Psychological Review},
+    author = {Guthrie, E. R.},
+    number = {5},
+    month = {1},
+    pages = {450--460},
+    volume = {41},
+    publisher = {National Academy of Sciences},
+    url = {https://www.pnas.org/content/103/2/495},
+    doi = {10.1037/h0074245},
+    issn = {0033295X},
+    pmid = {11553811},
+    keywords = {ATTENTION, CONDITIONING, IN LEARNING, LEARNING, LEARNING AND, MEMORY AND THOUGHT, MOTOR PHENOMENA AND ACTION, PUNISHMENT, PUNISHMENT IN, REWARD, REWARD IN}
+}
+
+@article{Li2018,
+    title = {{Reward depending on public funds stimulates cooperation in spatial prisoner's dilemma games}},
+    year = {2018},
+    journal = {Chaos, Solitons and Fractals},
+    author = {Li, Ya and Chen, Shanxiong and Niu, Ben},
+    pages = {38--45},
+    volume = {114},
+    doi = {10.1016/j.chaos.2018.07.002},
+    issn = {09600779},
+    keywords = {Evolutionary game, Prisoner's dilemma, Reward mechanism, Square lattice}
+}
+
+@article{Vukov2013,
+    title = {{Reward from Punishment Does Not Emerge at All Costs}},
+    year = {2013},
+    journal = {PLoS Computational Biology},
+    author = {Vukov, Jeromos and Pinheiro, Flávio L. and Santos, Francisco C. and Pacheco, Jorge M.},
+    number = {1},
+    volume = {9},
+    doi = {10.1371/journal.pcbi.1002868},
+    issn = {1553734X}
+}
+
+@article{Samuni2018,
+    title = {{Reward of labor coordination and hunting success in wild chimpanzees}},
+    year = {2018},
+    journal = {Communications Biology},
+    author = {Samuni, Liran and Preis, Anna and Deschner, Tobias and Crockford, Catherine and Wittig, Roman M.},
+    number = {1},
+    pages = {1--9},
+    volume = {1},
+    publisher = {Springer US},
+    url = {http://dx.doi.org/10.1038/s42003-018-0142-3},
+    doi = {10.1038/s42003-018-0142-3},
+    issn = {23993642}
+}
+
+@article{Jocham2016,
+    title = {{Reward-Guided Learning with and without Causal Attribution}},
+    year = {2016},
+    journal = {Neuron},
+    author = {Jocham, Gerhard and Brodersen, Kay H H. and Constantinescu, Alexandra O O. and Kahn, Martin C C. and Ianni, Angela M. and Walton, Mark E E. and Rushworth, Matthew F F.S. and Behrens, Timothy E E.J.},
+    number = {1},
+    pages = {177--190},
+    volume = {90},
+    publisher = {The Authors},
+    url = {http://dx.doi.org/10.1016/j.neuron.2016.02.018},
+    isbn = {1097-4199 (Electronic){\textbackslash}r0896-6273 (Linking)},
+    doi = {10.1016/j.neuron.2016.02.018},
+    issn = {10974199},
+    pmid = {26971947}
+}
+
+@article{Lynch2016,
+    title = {{Rhythmic Continuous-Time Coding in the Songbird Analog of Vocal Motor Cortex}},
+    year = {2016},
+    journal = {Neuron},
+    author = {Lynch, Galen F. and Okubo, Tatsuo S. and Hanuschkin, Alexander and Hahnloser, Richard H.R. and Fee, Michale S.},
+    number = {4},
+    pages = {877--892},
+    volume = {90},
+    publisher = {Elsevier Inc.},
+    url = {http://dx.doi.org/10.1016/j.neuron.2016.04.021},
+    isbn = {1097-4199 (Electronic) 0896-6273 (Linking)},
+    doi = {10.1016/j.neuron.2016.04.021},
+    issn = {10974199},
+    pmid = {27196977}
+}
+
+@article{Bredeche2013,
+    title = {{Roborobo! a Fast Robot Simulator for Swarm and Collective Robotics}},
+    year = {2013},
+    author = {Bredeche, Nicolas and Montanier, Jean-Marc and Weel, Berend and Haasdijk, Evert},
+    number = {Ppsn},
+    pages = {1--2},
+    url = {http://arxiv.org/abs/1304.2888},
+    arxivId = {1304.2888}
+}
+
+@article{Stulp2013,
+    title = {{Robot Skill Learning: From Reinforcement Learning to Evolution Strategies}},
+    year = {2013},
+    journal = {Paladyn, Journal of Behavioral Robotics},
+    author = {Stulp, Freek and Sigaud, Olivier},
+    number = {1},
+    pages = {49--61},
+    volume = {4},
+    url = {http://www.degruyter.com/view/j/pjbr.2013.4.issue-1/pjbr-2013-0003/pjbr-2013-0003.xml},
+    doi = {10.2478/pjbr-2013-0003},
+    issn = {2081-4836},
+    keywords = {black-box optimization, dynamic movement primitives, evolution strategies, reinforcement learning}
+}
+
+@inproceedings{Otsuka2017,
+    title = {{Robust spread of cooperation by expectation-of-cooperation strategy with simple labeling method}},
+    year = {2017},
+    booktitle = {Proceedings of the International Conference on Web Intelligence - WI '17},
+    author = {Otsuka, Tomoaki and Sugawara, Toshiharu},
+    pages = {483--490},
+    publisher = {ACM Press},
+    url = {http://dl.acm.org/citation.cfm?doid=3106426.3106458},
+    address = {New York, New York, USA},
+    isbn = {9781450349512},
+    doi = {10.1145/3106426.3106458},
+    keywords = {agent network, cooperation, prisoner's dilemma, reinforcement learning}
+}
+
+@article{Mehta2002,
+    title = {{Role of experience and oscillations in transforming a rate code into a temporal code}},
+    year = {2002},
+    author = {Mehta, M R and Lee, A K and Wilson, M A},
+    pages = {8--11},
+    doi = {10.1038/nature00808.1.}
+}
+
+@article{Seely2011,
+    title = {{Role of mutual inhibition in binocular rivalry}},
+    year = {2011},
+    journal = {Journal of Neurophysiology},
+    author = {Seely, J. and Chow, C. C.},
+    number = {5},
+    pages = {2136--2150},
+    volume = {106},
+    url = {http://jn.physiology.org/cgi/doi/10.1152/jn.00228.2011},
+    isbn = {1522-1598 (Electronic){\textbackslash}r0022-3077 (Linking)},
+    doi = {10.1152/jn.00228.2011},
+    issn = {0022-3077},
+    pmid = {21775721}
+}
+
+@article{Alvard2002,
+    title = {{Rousseau’s Whale Hunt?}},
+    year = {2002},
+    journal = {Current Anthropology},
+    author = {Alvard, Michael S. and Nolin, David A.},
+    number = {4},
+    month = {8},
+    pages = {533--559},
+    volume = {43},
+    url = {http://www.journals.uchicago.edu/doi/10.1086/341653},
+    doi = {10.1086/341653},
+    issn = {0011-3204}
+}
+
+@article{Chiang2010,
+    title = {{Self-interested partner selection can lead to the emergence of fairness}},
+    year = {2010},
+    journal = {Evolution and Human Behavior},
+    author = {Chiang, Yen Sheng},
+    number = {4},
+    pages = {265--270},
+    volume = {31},
+    publisher = {Elsevier B.V.},
+    url = {http://dx.doi.org/10.1016/j.evolhumbehav.2010.03.003},
+    doi = {10.1016/j.evolhumbehav.2010.03.003},
+    issn = {10905138},
+    keywords = {Fairness, Partner selection, Ultimatum game, Assortativity,, Self-interest}
+}
+
+@article{Smart1995,
+    title = {{Sensations and brain processes.}},
+    year = {1995},
+    journal = {Behavioural brain research},
+    author = {Smart, J.C.C.},
+    number = {1-2},
+    pages = {157--61},
+    volume = {71},
+    url = {http://www.ncbi.nlm.nih.gov/pubmed/8747183},
+    doi = {10.2307/2182164},
+    issn = {0166-4328},
+    pmid = {8747183},
+    keywords = {Animals, Brain, Brain: physiology, Consciousness, Consciousness: physiology, Humans, Mental Processes, Mental Processes: physiology, Sensation, Sensation: physiology}
+}
+
+@article{Paul2002,
+    title = {{Sexual Selection and Mate Choice}},
+    year = {2002},
+    journal = {International Journal of Primatology},
+    author = {Andersson, Malte and Simmons, Leigh W. and Paul, Andreas},
+    number = {4},
+    pages = {877--904},
+    volume = {23},
+    url = {https://doi.org/10.1023/A:1015533100275},
+    doi = {10.1023/A:1015533100275},
+    issn = {1573-8604},
+    keywords = {Mate choice, Nonhuman primates, Polyandrous mating, Sex roles, Sexual selection, mate choice, nonhuman, polyandrous mating, sex roles, sexual selection}
+}
+
+@article{Andersson2006a,
+    title = {{Sexual selection and mate choice}},
+    year = {2006},
+    journal = {Trends in Ecology and Evolution},
+    author = {Andersson, Malte and Simmons, Leigh W.},
+    number = {6},
+    pages = {296--302},
+    volume = {21},
+    doi = {10.1016/j.tree.2006.03.015},
+    issn = {01695347},
+    pmid = {16769428}
+}
+
+@article{west1979sexual,
+    title = {{Sexual selection, social competition, and evolution}},
+    year = {1979},
+    journal = {Proceedings of the American Philosophical Society},
+    author = {West-Eberhard, Mary Jane},
+    number = {4},
+    pages = {222--234},
+    volume = {123},
+    publisher = {JSTOR}
+}
+
+@article{Bessiere2012,
+    title = {{Simplexit{\'{e}} et probabilit{\'{e}}s subjectives}},
+    year = {2012},
+    journal = {Colloque'Complexit{\'{e}}-Simplexit{\'{e}}'},
+    author = {Bessiere, P},
+    pages = {125--130},
+    url = {http://hal.archives-ouvertes.fr/hal-00724405/},
+    isbn = {9782722603301},
+    keywords = {()}
+}
+
+@article{Shank2009,
+    title = {{Sleep and sensorimotor integration during early vocal learning in a songbird}},
+    year = {2009},
+    journal = {Nature},
+    author = {Shank, Sylvan S. and Margoliash, Daniel},
+    number = {7234},
+    pages = {73--77},
+    volume = {458},
+    isbn = {0028-0836},
+    doi = {10.1038/nature07615},
+    issn = {00280836},
+    pmid = {19079238}
+}
+
+@article{Margoliash2010,
+    title = {{Sleep, off-line processing, and vocal learning}},
+    year = {2010},
+    journal = {Brain and Language},
+    author = {Margoliash, Daniel and Schmidt, Marc F.},
+    number = {1},
+    pages = {45--58},
+    volume = {115},
+    isbn = {0093-934X},
+    doi = {10.1016/j.bandl.2009.09.005},
+    issn = {0093934X},
+    pmid = {19906416},
+    arxivId = {NIHMS150003},
+    keywords = {Birdsong, Language, Neuromodulator, Sensory memory, Speech, Template, Vocal learning}
+}
+
+@article{West2007a,
+    title = {{Social semantics: Altruism, cooperation, mutualism, strong reciprocity and group selection}},
+    year = {2007},
+    journal = {Journal of Evolutionary Biology},
+    author = {West, S. A. and Griffin, A. S. and Gardner, A.},
+    number = {2},
+    pages = {415--432},
+    volume = {20},
+    isbn = {1010-061X},
+    doi = {10.1111/j.1420-9101.2006.01258.x},
+    issn = {1010061X},
+    pmid = {17305808},
+    keywords = {Direct fitness, Hamilton's rule, Inclusive fitness, Kin selection, Reciprocal altruism, Social evolution, Social selection}
+}
+
+@article{Verhaltensphysiologie1983,
+    title = {{Song Learning in the Zebra Finch ( Taeniopygia Guttata ):}},
+    year = {1983},
+    author = {Verhaltensphysiologie, Lehrstuhl and Biologie, Fakultiit and Bielefeld, Universitiit and Germany, West},
+    pages = {369--374}
+}
+
+@article{Dave2000,
+    title = {{Song replay during sleep and computational rules for sensorimotor vocal learning}},
+    year = {2000},
+    journal = {Science},
+    author = {Dave, A. S. and Margoliash, D.},
+    number = {5492},
+    pages = {812--816},
+    volume = {290},
+    isbn = {0036-8075, 1095-9203},
+    doi = {10.1126/science.290.5492.812},
+    issn = {00368075},
+    pmid = {11052946}
+}
+
+@article{Chade2017,
+    title = {{Sorting through Search and matching models in economics}},
+    year = {2017},
+    journal = {Journal of Economic Literature},
+    author = {Chade, Hector and Eeckhout, Jan and Smith, Lones},
+    number = {2},
+    pages = {493--544},
+    volume = {55},
+    doi = {10.1257/jel.20150777},
+    issn = {00220515}
+}
+
+@article{Repp1997,
+    title = {{Spectral envelope and context effects in the tritone paradox}},
+    year = {1997},
+    journal = {Perception},
+    author = {Repp, Bruno H.},
+    number = {5},
+    pages = {645--665},
+    volume = {26},
+    isbn = {0301-0066 (Print)0301-0066 (Linking)},
+    doi = {10.1068/p260645},
+    issn = {03010066},
+    pmid = {9488887}
+}
+
+@misc{Perc2017,
+    title = {{Statistical physics of human cooperation}},
+    year = {2017},
+    booktitle = {Physics Reports},
+    author = {Perc, Matjaž and Jordan, Jillian J. and Rand, David G. and Wang, Zhen and Boccaletti, Stefano and Szolnoki, Attila},
+    month = {5},
+    pages = {1--51},
+    volume = {687},
+    publisher = {North-Holland},
+    url = {https://www.sciencedirect.com/science/article/pii/S0370157317301424},
+    doi = {10.1016/j.physrep.2017.05.004},
+    issn = {03701573}
+}
+
+@article{Lipkind2013,
+    title = {{Stepwise acquisition of vocal combinatorial capacity in songbirds and human infants}},
+    year = {2013},
+    journal = {Nature},
+    author = {Lipkind, Dina and Marcus, Gary F. and Bemis, Douglas K. and Sasahara, Kazutoshi and Jacoby, Nori and Takahasi, Miki and Suzuki, Kenta and Feher, Olga and Ravbar, Primoz and Okanoya, Kazuo and Tchernichovski, Ofer},
+    number = {7452},
+    pages = {104--108},
+    volume = {498},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nature12173},
+    isbn = {0028-0836},
+    doi = {10.1038/nature12173},
+    issn = {00280836},
+    pmid = {23719373},
+    arxivId = {NIHMS150003}
+}
+
+@article{Barclay2013,
+    title = {{Strategies for cooperation in biological markets, especially for humans}},
+    year = {2013},
+    journal = {Evolution and Human Behavior},
+    author = {Barclay, Pat},
+    number = {3},
+    month = {5},
+    pages = {164--175},
+    volume = {34},
+    publisher = {Elsevier Inc.},
+    url = {http://dx.doi.org/10.1016/j.evolhumbehav.2013.02.002 https://linkinghub.elsevier.com/retrieve/pii/S1090513813000214},
+    doi = {10.1016/j.evolhumbehav.2013.02.002},
+    issn = {10905138},
+    keywords = {Arms race, Biological markets, Competitive altruism, Cooperation, Friendship, Generosity, Helping, Partner choice, Reciprocity, Reputation}
+}
+
+@article{Shakeshaft2013,
+    title = {{Strong genetic influence on a UK nationwide test of educational achievement at the end of compulsory education at age 16}},
+    year = {2013},
+    journal = {PLoS ONE},
+    author = {Shakeshaft, Nicholas G. and Trzaskowski, Maciej and McMillan, Andrew and Rimfeld, Kaili and Krapohl, Eva and Haworth, Claire M.A. and Dale, Philip S. and Plomin, Robert},
+    number = {12},
+    volume = {8},
+    isbn = {0020739X},
+    doi = {10.1371/journal.pone.0080341},
+    issn = {19326203},
+    pmid = {24349000}
+}
+
+@article{Fruteau2009,
+    title = {{Supply and demand determine the market value of food providers in wild vervet monkeys}},
+    year = {2009},
+    journal = {Proceedings of the National Academy of Sciences of the United States of America},
+    author = {Fruteau, Cécile and Voelkl, Bernhard and Van Damme, Eric and No{\"{e}}, Ronald},
+    number = {29},
+    pages = {12007--12012},
+    volume = {106},
+    doi = {10.1073/pnas.0812280106},
+    issn = {00278424},
+    keywords = {Biological markets, Cooperation, Economic behavior, Primates, Reciprocity}
+}
+
+@article{Kennett2001,
+    title = {{Tactile±Visual Links in Exogenous Spatial Attention under Different Postures: Convergent Evidence from Psychophysics and ERPs}},
+    year = {2001},
+    author = {Kennett, Steffan and Eimer, Martin and Spence, Charles and Driver, Jon},
+    pages = {1--29}
+}
+
+@article{Bshary2008,
+    title = {{Tapping out a message}},
+    year = {2008},
+    journal = {Nature},
+    author = {Bshary, R},
+    number = {November},
+    volume = {456},
+    url = {http://www.nature.com/nature/journal/v456/n7218/full/456037a.html}
+}
+
+@article{McNamara2008,
+    title = {{The coevolution of choosiness and cooperation}},
+    year = {2008},
+    journal = {Nature},
+    author = {McNamara, John M. and Barta, Zoltan and Fromhage, Lutz and Houston, Alasdair I.},
+    number = {7175},
+    pages = {189--192},
+    volume = {451},
+    isbn = {0028-0836},
+    doi = {10.1038/nature06455},
+    issn = {14764687},
+    pmid = {18185587}
+}
+
+@article{DosSantos2018,
+    title = {{The coevolution of cooperation and cognition in humans}},
+    year = {2018},
+    journal = {Proceedings of the Royal Society B: Biological Sciences},
+    author = {dos Santos, Miguel and West, Stuart A.},
+    number = {1879},
+    volume = {285},
+    isbn = {0000000221},
+    doi = {10.1098/rspb.2018.0723},
+    issn = {14712954},
+    keywords = {Intelligence, Kin selection, Social dilemmas}
+}
+
+@article{Bernstein2002,
+    title = {{The Complexity of Decentralized Control of Markov Decision Processes}},
+    year = {2002},
+    journal = {Mathematics of Operations Research},
+    author = {Bernstein, Daniel S. and Givan, Robert and Immerman, Neil and Zilberstein, Shlomo},
+    number = {4},
+    pages = {819--840},
+    volume = {27},
+    url = {http://pubsonline.informs.org/doi/abs/10.1287/moor.27.4.819.297},
+    isbn = {1558607099},
+    doi = {10.1287/moor.27.4.819.297},
+    issn = {0364-765X},
+    arxivId = {1301.3836}
+}
+
+@misc{Stigler,
+    title = {{The Economics of Information}},
+    author = {Stigler, George},
+    isbn = {0022-3808},
+    doi = {10.2307/1829263},
+    issn = {0022-3808},
+    pmid = {17891731},
+    arxivId = {1829263}
+}
+
+@article{Kurzban2015,
+    title = {{The Evolution of Altruism in Humans}},
+    year = {2015},
+    journal = {Annual Review of Psychology},
+    author = {Kurzban, Robert and Burton-Chellew, Maxwell N. and West, Stuart A.},
+    number = {1},
+    pages = {575--599},
+    volume = {66},
+    url = {http://www.annualreviews.org/doi/10.1146/annurev-psych-010814-015355},
+    isbn = {978-0-8243-0266-5},
+    doi = {10.1146/annurev-psych-010814-015355},
+    issn = {0066-4308},
+    pmid = {25061670},
+    arxivId = {10.28},
+    keywords = {adaptationism, conflict, cooperation, kinship, prosociality, reciprocity}
+}
+
+@article{Axelrod1981,
+    title = {{The evolution of cooperation}},
+    year = {1981},
+    journal = {Science},
+    author = {Axelrod, R and Hamilton, W.},
+    number = {4489},
+    month = {3},
+    pages = {1390--1396},
+    volume = {211},
+    url = {http://www.sciencemag.org/cgi/doi/10.1126/science.7466396},
+    doi = {10.1126/science.7466396},
+    issn = {0036-8075}
+}
+
+@article{JoelLSachsUlrichGMuellerThomasPWilcox2004,
+    title = {{The Evolution of Cooperation}},
+    year = {2004},
+    journal = {The Quarterly Review of Biology},
+    author = {Sachs, Joel L and Mueller, Ulrich G and Wilcox, Thomas P and Bull, James J},
+    number = {2},
+    month = {6},
+    pages = {135--160},
+    volume = {79},
+    publisher = {The University of Chicago Press},
+    url = {https://www.journals.uchicago.edu/doi/10.1086/383541 http://www.jstor.org/stable/10.1086/383541 .},
+    doi = {10.1086/383541},
+    issn = {0033-5770},
+    pmid = {785523},
+    keywords = {byproducts, directed reciprocation, mutualism, partner choice, partner fidelity feedback, shared genes, symbiosis}
+}
+
+@article{Lehmann2006,
+    title = {{The evolution of cooperation and altruism - A general framework and a classification of models}},
+    year = {2006},
+    journal = {Journal of Evolutionary Biology},
+    author = {Lehmann, L. and Keller, L.},
+    number = {5},
+    pages = {1365--1376},
+    volume = {19},
+    doi = {10.1111/j.1420-9101.2006.01119.x},
+    issn = {1010061X},
+    keywords = {Altruism, Cooperation, Group selection, Kin selection, Punishment, Strong reciprocity}
+}
+
+@article{Enquist1993,
+    title = {{The evolution of cooperation in mobile organisms}},
+    year = {1993},
+    journal = {Animal Behaviour},
+    author = {Enquist, Magnus and Leimar, Olof},
+    number = {4},
+    month = {4},
+    pages = {747--757},
+    volume = {45},
+    publisher = {Academic Press},
+    url = {https://www.sciencedirect.com/science/article/pii/S0003347283710894},
+    doi = {10.1006/anbe.1993.1089},
+    issn = {00033472}
+}
+
+@article{Packer1988a,
+    title = {{The evolution of cooperative hunting}},
+    year = {1988},
+    journal = {American Naturalist},
+    author = {Packer, C. and Ruttan, L.},
+    number = {2},
+    pages = {159--198},
+    volume = {132},
+    doi = {10.1086/284844},
+    issn = {00030147}
+}
+
+@article{Andre2011,
+    title = {{The evolution of fairness in a biological market}},
+    year = {2011},
+    journal = {Evolution},
+    author = {Andr{\'{e}}, Jean Baptiste and Baumard, Nicolas},
+    number = {5},
+    pages = {1447--1456},
+    volume = {65},
+    doi = {10.1111/j.1558-5646.2011.01232.x},
+    issn = {00143820},
+    keywords = {Evolution of cooperation, Game theory, Moral psychology, Mutualism, Partner-choice, Ultimatum game}
+}
+
+@article{Raihani2011a,
+    title = {{The evolution of punishment in n-player public goods games: A volunteer's dilemma}},
+    year = {2011},
+    journal = {Evolution},
+    author = {Raihani, Nichola J. and Bshary, Redouan},
+    number = {10},
+    pages = {2725--2728},
+    volume = {65},
+    isbn = {0014-3820},
+    doi = {10.1111/j.1558-5646.2011.01383.x},
+    issn = {00143820},
+    pmid = {21967415}
+}
+
+@article{Trivers1971,
+    title = {{The Evolution of Reciprocal Altruism}},
+    year = {1971},
+    journal = {The Quarterly Review of Biology},
+    author = {Trivers, Robert L.},
+    number = {1},
+    month = {3},
+    pages = {35--57},
+    volume = {46},
+    url = {https://www.journals.uchicago.edu/doi/10.1086/406755},
+    isbn = {00335770},
+    doi = {10.1086/406755},
+    issn = {0033-5770},
+    pmid = {724},
+    arxivId = {arXiv:gr-qc/9809069v1},
+    archivePrefix = {arXiv},
+    eprint = {9809069v1},
+    primaryClass = {gr-qc}
+}
+
+@article{Fogarty2011,
+    title = {{The evolution of teaching}},
+    year = {2011},
+    journal = {Evolution},
+    author = {Fogarty, L. and Strimling, P. and Laland, K. N.},
+    number = {10},
+    pages = {2760--2770},
+    volume = {65},
+    isbn = {1558-5646 (Electronic) 0014-3820 (Linking)},
+    doi = {10.1111/j.1558-5646.2011.01370.x},
+    issn = {00143820},
+    pmid = {21967419},
+    keywords = {Asocial learning, Cooperation, Cumulative culture, Evolution, Social learning, Teaching}
+}
+
+@article{Barclay2015,
+    title = {{The Evolutionary Psychology of Human Pro-sociality: Adaptations, Byproducts, and Mistakes}},
+    year = {2015},
+    journal = {Handbook of Prosocial Behavior},
+    author = {Barclay, Pat and van Vugt, Mark},
+    pages = {37--60},
+    isbn = {9780195399813},
+    doi = {10.1093/oxfordhb/9780195399813.013.029},
+    issn = {0195399811},
+    keywords = {altruism, cooperation, costly signaling, evolutionary psychology, levels of analysis, reciprocity}
+}
+
+@article{Hamilton1964,
+    title = {{The genetical evolution of social behaviour. I}},
+    year = {1964},
+    journal = {Journal of Theoretical Biology},
+    author = {Hamilton, W.D.},
+    number = {1},
+    month = {7},
+    pages = {1--16},
+    volume = {7},
+    url = {https://linkinghub.elsevier.com/retrieve/pii/0022519364900384},
+    isbn = {0022-5193},
+    doi = {10.1016/0022-5193(64)90038-4},
+    issn = {00225193}
+}
+
+@article{Barlow2015,
+    title = {{The impact of agent size and number of rounds on cooperation in the iterated Prisoner's Dilemma}},
+    year = {2015},
+    journal = {IEEE SSCI 2014 - 2014 IEEE Symposium Series on Computational Intelligence - FOCI 2014: 2014 IEEE Symposium on Foundations of Computational Intelligence, Proceedings},
+    author = {Barlow, Lee Ann},
+    number = {1},
+    pages = {120--127},
+    publisher = {IEEE},
+    isbn = {9781479944927},
+    doi = {10.1109/FOCI.2014.7007816},
+    issn = {23254270}
+}
+
+@article{Rigotti2013,
+    title = {{The importance of mixed selectivity in complex cognitive tasks}},
+    year = {2013},
+    journal = {Nature},
+    author = {Rigotti, Mattia and Barak, Omri and Warden, Melissa R. and Wang, Xiao Jing and Daw, Nathaniel D. and Miller, Earl K. and Fusi, Stefano},
+    number = {7451},
+    pages = {585--590},
+    volume = {497},
+    publisher = {Nature Publishing Group},
+    url = {http://dx.doi.org/10.1038/nature12160},
+    isbn = {doi:10.1038/nature12160},
+    doi = {10.1038/nature12160},
+    issn = {00280836},
+    pmid = {23685452},
+    arxivId = {arXiv:1011.1669v3}
+}
+
+@article{Cartwright2017,
+    title = {{The importance of selection in the evolution of blindness in cavefish}},
+    year = {2017},
+    journal = {BMC Evolutionary Biology},
+    author = {Cartwright, Reed A. and Schwartz, Rachel S. and Merry, Alexandra L. and Howell, Megan M.},
+    number = {1},
+    pages = {1--14},
+    volume = {17},
+    isbn = {1951-6401},
+    doi = {10.1186/s12862-017-0876-4},
+    issn = {14712148},
+    pmid = {28173751},
+    arxivId = {q-bio/0512045},
+    keywords = {Migration-selection balance, Models/simulations, Mutations, Population genetics}
+}
+
+@article{Brosnan2010,
+    title = {{The interplay of cognition and cooperation}},
+    year = {2010},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Brosnan, Sarah F. and Salwiczek, Lucie and Bshary, Redouan},
+    number = {1553},
+    pages = {2699--2710},
+    volume = {365},
+    doi = {10.1098/rstb.2010.0154},
+    issn = {14712970},
+    keywords = {Cognition, Comparative approach, Cooperation, Mutualism, Reciprocity}
+}
+
+@article{Smith2005,
+    title = {{The Marriage Model with Search Frictions}},
+    year = {2005},
+    journal = {SSRN Electronic Journal},
+    author = {Smith, Lones},
+    month = {2},
+    url = {http://www.ssrn.com/abstract=34242},
+    doi = {10.2139/ssrn.34242},
+    issn = {1556-5068}
+}
+
+@article{Smith2006a,
+    title = {{The marriage model with search frictions}},
+    year = {2006},
+    journal = {Journal of Political Economy},
+    author = {Smith, Lones},
+    number = {6},
+    pages = {1124--1144},
+    volume = {114},
+    doi = {10.1086/510440},
+    issn = {00223808}
+}
+
+@article{Bernard2016a,
+    title = {{The Mechanics of Coordination and the Evolution of Cooperation}},
+    year = {2016},
+    author = {Bernard, Arthur},
+    url = {https://drive.google.com/open?id=0B3F8ElZ2Cjn7TzFRVFJSaUo0UkU}
+}
+
+@article{Harvey,
+    title = {{The Microbial Genetic Algorithm 2 GAs Stripped to the Minimum}},
+    author = {Harvey, Inman},
+    keywords = {genetic}
+}
+
+@article{Presti2012,
+    title = {{The Mind-Body Problem}},
+    year = {2012},
+    journal = {Encyclopedia of Human Behavior: Second Edition},
+    author = {Presti, D. E.},
+    pages = {615--621},
+    isbn = {9780123750006},
+    doi = {10.1016/B978-0-12-375000-6.00392-X},
+    issn = {00368733},
+    pmid = {7209483},
+    keywords = {Awareness, Brain, Consciousness, Information, Mental, Metaphysics, Mind, Phenomenology, Physical, Reality, Unconscious}
+}
+
+@inproceedings{Kaplan2001,
+    title = {{The natural history of human food sharing and cooperation: a review and a new multi-individual approach to the negotiation of norms}},
+    year = {2001},
+    booktitle = {Conference on the Structure and Evolution of Strong Reciprocity, Santa Fe},
+    author = {Kaplan, Hillard and Gurven, Michael},
+    month = {2},
+    url = {http://www.anth.ucsb.edu/faculty/gurven/papers/kaplangurven.pdf},
+    isbn = {0262072521 (alk. paper)},
+    issn = {0016-6464},
+    pmid = {13640227},
+    keywords = {TRANQUILIZING AGENTS}
+}
+
+@article{Nottebohm2005,
+    title = {{The Neural Basis of Birdsong}},
+    year = {2005},
+    journal = {PLoS Biology},
+    author = {Nottebohm, Fernando},
+    number = {5},
+    pages = {e164},
+    volume = {3},
+    url = {https://dx.plos.org/10.1371/journal.pbio.0030164},
+    isbn = {1544-9173 U6 - ctx{\_}ver=Z39.88-2004{\&}ctx{\_}enc=info{\%}3Aofi{\%}2Fenc{\%}3AUTF-8{\&}rfr{\_}id=info:sid/summon.serialssolutions.com{\&}rft{\_}val{\_}fmt=info:ofi/fmt:kev:mtx:journal{\&}rft.genre=article{\&}rft.atitle=The+neural+basis+of+birdsong{\&}rft.jtitle=PLoS+biology{\&}rft.au=Nottebohm{\%}2C+Fernando{\&}rft.date=2005-01-01{\&}rft.pub=PUBLIC+LIBRARY+SCIENCE{\&}rft.issn=1544-9173{\&}rft.volume=3{\&}rft.issue=5{\&}rft.spage=e164{\&}rft.epage=761{\&}rft{\_}id=info:doi/10.1371{\%}2Fjournal.pbio.0030164{\&}rft.externalDBID=5PM{\&}rft.externalDocID=15884976 U7 - Journal Arti},
+    doi = {10.1371/journal.pbio.0030164},
+    issn = {15457885},
+    pmid = {15884976}
+}
+
+@article{Domenech2018,
+    title = {{The Neuro-Computational Architecture of Value-Based Selection in the Human Brain}},
+    year = {2018},
+    journal = {Cerebral Cortex},
+    author = {Domenech, Philippe and Redout{\'{e}}, Jérôme and Koechlin, Etienne and Dreher, Jean Claude},
+    number = {2},
+    pages = {585--601},
+    volume = {28},
+    doi = {10.1093/cercor/bhw396},
+    issn = {14602199},
+    pmid = {28057725},
+    keywords = {MVPA, drift-diffusion model, fMRI, neuroeconomics, value-based decision}
+}
+
+@article{Roper2006,
+    title = {{The onset of song learning and song tutor selection in fledgling zebra finches}},
+    year = {2006},
+    journal = {Ethology},
+    author = {Roper, Annabelle and Zann, Richard},
+    number = {5},
+    pages = {458--470},
+    volume = {112},
+    isbn = {0179-1613},
+    doi = {10.1111/j.1439-0310.2005.01169.x},
+    issn = {01791613}
+}
+
+@article{Carter2014,
+    title = {{The Reciprocity Controversy}},
+    year = {2014},
+    journal = {Animal Behavior and Cognition},
+    author = {Carter, Gerald},
+    number = {3},
+    pages = {368},
+    volume = {1},
+    url = {http://www.animalbehaviorandcognition.org/uploads/journals/3/11.Carter_FINAL.pdf},
+    doi = {10.12966/abc.08.11.2014},
+    issn = {2372-4323},
+    keywords = {and behavioral ecologists often, and methods, asking questions at different, behavior, co operation, comparative psychologists, cooperation using different theories, cues trigger the cooperative, evolutionary psychologists, how does it develop, levels of analysis, prisoner, pseudoreciprocity, reciprocal altruism, reciprocity, s dilemma, study, what, when did it evolve, why is it adaptive}
+}
+
+@book{Dawkins1976,
+    title = {{The Selfish Gene}},
+    year = {1976},
+    author = {Dawkins, Richard},
+    publisher = {Oxford University Press}
+}
+
+@article{Johnstone1997,
+    title = {{The tactics of mutual mate choice and competitive search}},
+    year = {1997},
+    journal = {Behavioral Ecology and Sociobiology},
+    author = {Johnstone, Rufus A.},
+    number = {1},
+    pages = {51--59},
+    volume = {40},
+    doi = {10.1007/s002650050315},
+    issn = {03405443},
+    keywords = {Dynamic game, Mate choice, Search behaviour, Sexual selection}
+}
+
+@article{MaynardSmith1974,
+    title = {{The theory of games and the evolution of animal conflicts}},
+    year = {1974},
+    journal = {Journal of Theoretical Biology},
+    author = {Maynard Smith, J.},
+    number = {1},
+    month = {9},
+    pages = {209--221},
+    volume = {47},
+    url = {https://linkinghub.elsevier.com/retrieve/pii/0022519374901106},
+    doi = {10.1016/0022-5193(74)90110-6},
+    issn = {00225193},
+    pmid = {4459582}
+}
+
+@article{Schwitzgebel2008,
+    title = {{The Unreliability of Naive Introspection}},
+    year = {2008},
+    journal = {Philosophical Review},
+    author = {Schwitzgebel, E.},
+    number = {2},
+    pages = {245--273},
+    volume = {117},
+    url = {https://read.dukeupress.edu/the-philosophical-review/article/117/2/245-273/2787},
+    isbn = {0031-8108},
+    doi = {10.1215/00318108-2007-037},
+    issn = {0031-8108},
+    pmid = {8130951}
+}
+
+@article{Devaine2014,
+    title = {{Theory of mind: Did evolution fool us?}},
+    year = {2014},
+    journal = {PLoS ONE},
+    author = {Devaine, Marie and Hollard, Guillaume and Daunizeau, Jean},
+    number = {2},
+    volume = {9},
+    isbn = {1932-6203 (Electronic){\textbackslash}r1932-6203 (Linking)},
+    doi = {10.1371/journal.pone.0087619},
+    issn = {19326203},
+    pmid = {24505296}
+}
+
+@article{Marler1997,
+    title = {{Three Models of Song Learning: Evidence from Behavior widespread ocurrence of species-specific song univer}},
+    year = {1997},
+    journal = {Journal of Neurobiology},
+    author = {Marler, Peter},
+    number = {5},
+    pages = {501--516},
+    volume = {33},
+    url = {http://citeseerx.ist.psu.edu/viewdoc/download?doi=10.1.1.132.8740&rep=rep1&type=pdf},
+    doi = {10.1016/0141-0229(81)90084-3},
+    issn = {01410229},
+    pmid = {20598878},
+    arxivId = {1311.4725},
+    keywords = {birdsong learning, ing, instruction-based learn-, learning preferences, selection-based learning}
+}
+
+@article{Nowak1992,
+    title = {{Tit for tat in heterogeneous populations}},
+    year = {1992},
+    journal = {Nature},
+    author = {Nowak, Martin A and Sigmund, Karl},
+    number = {6357},
+    month = {1},
+    pages = {250--253},
+    volume = {355},
+    url = {http://www.nature.com/articles/355250a0},
+    doi = {10.1038/355250a0},
+    issn = {0028-0836}
+}
+
+@article{Bernard2016,
+    title = {{To Cooperate or Not to Cooperate: Why Behavioural Mechanisms Matter}},
+    year = {2016},
+    journal = {PLoS Computational Biology},
+    author = {Bernard, Arthur and Andr{\'{e}}, Jean Baptiste and Bredeche, Nicolas},
+    number = {5},
+    pages = {1--14},
+    volume = {12},
+    doi = {10.1371/journal.pcbi.1004886},
+    issn = {15537358},
+    pmid = {27148874}
+}
+
+@article{Rockenbach2011,
+    title = {{To qualify as a social partner, humans hide severe punishment, although their observed cooperativeness is decisive}},
+    year = {2011},
+    journal = {Proceedings of the National Academy of Sciences of the United States of America},
+    author = {Rockenbach, Bettina and Milinski, Manfred},
+    number = {45},
+    pages = {18307--18312},
+    volume = {108},
+    isbn = {1108996108},
+    doi = {10.1073/pnas.1108996108},
+    issn = {00278424},
+    keywords = {Cooperation, Economic experiment, Signaling}
+}
+
+@article{Holmes2007,
+    title = {{Tool-Use: Capturing Multisensory Spatial Attention or Extending Multisensory Peripersonal Space?}},
+    year = {2007},
+    journal = {Cortex},
+    author = {Holmes, Nicholas P. and Sanabria, Daniel and Calvert, Gemma A. and Spence, Charles},
+    number = {3},
+    month = {1},
+    pages = {469--489},
+    volume = {43},
+    url = {http://koreascience.or.kr/journal/view.jsp?kj=JOSRB5&py=2016&vnc=v5n3&sp=551 https://linkinghub.elsevier.com/retrieve/pii/S0010945208704714},
+    isbn = {0256-9574},
+    doi = {10.1016/S0010-9452(08)70471-4},
+    issn = {00109452},
+    keywords = {crossmodal, multisensory, peripersonal space, tool-use, touch, vision}
+}
+
+@article{Bergstrom2000,
+    title = {{Towards a theory of mutual mate choice: Lessons from two-sided matching}},
+    year = {2000},
+    journal = {Evolutionary Ecology Research},
+    author = {Bergstrom, Carl T. and Real, Leslie A.},
+    number = {4},
+    pages = {493--508},
+    volume = {2},
+    issn = {15220613},
+    keywords = {Assortative mating, Coalitions, Game theory, Group formation, Mating systems, Sexual selection}
+}
+
+@article{Taylor2007,
+    title = {{Transforming the dilemma}},
+    year = {2007},
+    journal = {Evolution},
+    author = {Taylor, Christine and Nowak, Martin A.},
+    number = {10},
+    pages = {2281--2292},
+    volume = {61},
+    doi = {10.1111/j.1558-5646.2007.00196.x},
+    issn = {00143820},
+    keywords = {Direct and indirect reciprocity, Evolution of cooperation, Group selection, Kin selection, Network reciprocity (graph selection), Prisoner's Dilemma}
+}
+
+@article{VanSeijen2014,
+    title = {{True Online TD({$\lambda$})}},
+    year = {2014},
+    journal = {Icml},
+    author = {van Seijen, Harm and Sutton, Richard},
+    pages = {692--700},
+    volume = {32},
+    url = {http://www.jmlr.org/proceedings/papers/v32/seijen14.pdf%0Ahttp://jmlr.org/proceedings/papers/v32/seijen14.html},
+    isbn = {9781634393973},
+    doi = {10.13140/2.1.1456.2568},
+    issn = {1938-7228}
+}
+
+@article{Tomasello2012b,
+    title = {{Two key steps in the evolution of human cooperation: The interdependence Hypothesis}},
+    year = {2012},
+    journal = {Current Anthropology},
+    author = {Tomasello, Michael and Melis, Alicia P. and Tennie, Claudio and Wyman, Emily and Herrmann, Esther},
+    number = {6},
+    month = {12},
+    pages = {673--692},
+    volume = {53},
+    url = {https://www.journals.uchicago.edu/doi/10.1086/668207},
+    doi = {10.1086/668207},
+    issn = {0011-3204}
+}
+
+@article{Bertram2014,
+    title = {{Two neural streams, one voice: Pathways for theme and variation in the songbird brain}},
+    year = {2014},
+    journal = {Neuroscience},
+    author = {Bertram, R. and Daou, A. and Hyson, R. L. and Johnson, F. and Wu, W.},
+    pages = {806--817},
+    volume = {277},
+    isbn = {0306-4522},
+    doi = {10.1016/j.neuroscience.2014.07.061},
+    issn = {18737544},
+    pmid = {25106128},
+    keywords = {Basal ganglia, Motor memory, Premotor cortex, Sensory-motor integration, Vocal learning}
+}
+
+@article{Jeon2013,
+    title = {{Two principles of organization in the prefrontal cortex are cognitive hierarchy and degree of automaticity}},
+    year = {2013},
+    journal = {Nature Communications},
+    author = {Jeon, Hyeon-Ae and Friederici, Angela D.},
+    number = {May},
+    pages = {1--8},
+    volume = {4},
+    publisher = {Nature Publishing Group},
+    url = {http://www.nature.com/doifinder/10.1038/ncomms3041},
+    isbn = {2041-1723 (Electronic){\textbackslash}r2041-1723 (Linking)},
+    doi = {10.1038/ncomms3041},
+    issn = {2041-1723},
+    pmid = {23787807}
+}
+
+@article{Baumard2008,
+    title = {{Une th{\'{e}}orie naturaliste et mutualiste de la morale}},
+    year = {2008},
+    author = {Baumard, Nicolas},
+    pages = {1--308}
+}
+
+@article{Long2008,
+    title = {{Using temperature to analyse temporal dynamics in the songbird motor pathway}},
+    year = {2008},
+    journal = {Nature},
+    author = {Long, Michael A. and Fee, Michale S.},
+    number = {7219},
+    pages = {189--194},
+    volume = {456},
+    isbn = {6173240173},
+    doi = {10.1038/nature07448},
+    issn = {14764687},
+    pmid = {19005546}
+}
+
+@article{McNamara2010c,
+    title = {{Variation and the response to variation as a basis for successful cooperation}},
+    year = {2010},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {McNamara, John M. and Leimar, Olof},
+    number = {1553},
+    pages = {2627--2633},
+    volume = {365},
+    isbn = {1471-2970 (Electronic){\textbackslash}n0962-8436 (Linking)},
+    doi = {10.1098/rstb.2010.0159},
+    issn = {14712970},
+    pmid = {20679107},
+    keywords = {Assessment, Negotiation, Reputation, Social sensitivity}
+}
+
+@article{Hobaiter2017,
+    title = {{Variation in hunting behaviour in neighbouring chimpanzee communities in the Budongo forest, Uganda}},
+    year = {2017},
+    journal = {PLoS ONE},
+    author = {Hobaiter, Catherine and Samuni, Liran and Mullins, Caroline and Akankwasa, Walter John and Zuberb{\"{u}}hler, Klaus},
+    number = {6},
+    pages = {1--17},
+    volume = {12},
+    isbn = {1111111111},
+    doi = {10.1371/journal.pone.0178065},
+    issn = {19326203}
+}
+
+@article{Bhalla1997,
+    title = {{Visual-motor recalibration in geographical slant perception. Apr 1997}},
+    year = {1997},
+    journal = {Dissertation Abstracts International: Section B: The Sciences and Engineering},
+    author = {Bhalla, Mukul},
+    number = {10-B},
+    pages = {pp},
+    volume = {.57},
+    isbn = {Print 0419-4217 Dissertation Abstracts International ProQuest Information {\&} Learning; US AAM9708597 Electronic, Print Electronic},
+    keywords = {Cognitive Processes [2340], General Psychology [2100], Visual-motor recalibration in geographical slant p, elderly)}
+}
+
+@article{Admin2010,
+    title = {{Voice Recognition Algorithms using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) Techniques}},
+    year = {2010},
+    author = {{Admin}},
+    number = {3},
+    pages = {138--143},
+    volume = {2},
+    arxivId = {1003.4083}
+}
+
+@article{Hauert2002,
+    title = {{Volunteering as Red Queen mechanism for cooperation in public goods games}},
+    year = {2002},
+    journal = {Science},
+    author = {Hauert, Christoph and De Monte, Silvia and Hofbauer, Josef and Sigmund, Karl},
+    number = {5570},
+    month = {5},
+    pages = {1129--1132},
+    volume = {296},
+    publisher = {American Association for the Advancement of Science},
+    url = {http://www.ncbi.nlm.nih.gov/pubmed/12004134},
+    doi = {10.1126/science.1070582},
+    issn = {00368075},
+    pmid = {12004134}
+}
+
+@article{Geoffroy2019,
+    title = {{Why cooperation is not running away}},
+    year = {2019},
+    journal = {bioRxiv},
+    author = {Geoffroy, Felix and Baumard, Nicolas and Andre, Jean-Baptiste},
+    month = {1},
+    pages = {316117},
+    url = {http://biorxiv.org/content/early/2019/01/11/316117.abstract},
+    doi = {10.1101/316117}
+}
+
+@article{Krosnick2009,
+    title = {{Why do People Vote? A Psychological Analysis of the Causes of Voter Turnout}},
+    year = {2009},
+    journal = {Democracy and Disenfranchisement},
+    author = {Krosnick, Jon A. and Harder, Joshua},
+    number = {3},
+    pages = {525--549},
+    volume = {64},
+    isbn = {9781405191265},
+    doi = {10.1002/9781444307337.ch6},
+    issn = {00224537},
+    keywords = {Characteristics of a Particular Election, Demographic Factors, Registration, Social and Psychological Factors, The Effects of Canvassing, Polling, and Election O}
+}
+
+@article{Opp2001,
+    title = {{Why Do People Vote? The Cognitive-Illusion Proposition and Its Test}},
+    year = {2001},
+    journal = {Kyklos},
+    author = {Opp, Karl-Dieter},
+    number = {2-3},
+    pages = {355--378},
+    volume = {54},
+    url = {http://doi.wiley.com/10.1111/j.0023-5962.2001.00158.x},
+    doi = {10.1111/j.0023-5962.2001.00158.x},
+    issn = {0023-5962}
+}
+
+@article{Blondel2007,
+    title = {{Why Do Rational People Vote in Large Elections with Costs to Vote?}},
+    year = {2007},
+    journal = {Sciences-New York},
+    author = {Blondel, Serge and L{\'{e}}vy-Garboua, L.},
+    url = {http://ead.univ-angers.fr/~granem08/IMG/pdf/DT_GRANEM_005.pdf}
+}
+
+@article{Packer1990,
+    title = {{Why lions form groups: food is not enough}},
+    year = {1990},
+    journal = {American Naturalist},
+    author = {Packer, C. and Scheel, D. and Pusey, A. E.},
+    number = {1},
+    pages = {1--19},
+    volume = {136},
+    doi = {10.1086/285079},
+    issn = {00030147}
+}
+
+@article{Schram1996,
+    title = {{Why people vote: Experimental evidence}},
+    year = {1996},
+    journal = {Journal of Economic Psychology},
+    author = {Schram, Arthur and Sonnemans, Joep},
+    number = {4},
+    pages = {417--442},
+    volume = {17},
+    isbn = {0167-4870},
+    doi = {10.1016/0167-4870(96)00022-0},
+    issn = {01674870},
+    keywords = {Experiment, Voting}
+}
+
+@article{DeCheveigne2002,
+    title = {{YIN, a fundamental frequency estimator for speech and music}},
+    year = {2002},
+    journal = {The Journal of the Acoustical Society of America},
+    author = {de Cheveign{\'{e}}, Alain and Kawahara, Hideki},
+    number = {4},
+    pages = {1917--1930},
+    volume = {111},
+    url = {http://asa.scitation.org/doi/10.1121/1.1458024},
+    isbn = {0001-4966 (Print)},
+    doi = {10.1121/1.1458024},
+    issn = {0001-4966},
+    pmid = {12002874}
+}
+
+@article{Swaddle2010,
+    title = {{Zebra Finches}},
+    year = {2010},
+    journal = {Encyclopedia of Animal Behavior},
+    author = {Swaddle, J.P.},
+    number = {March},
+    pages = {629--632},
+    url = {http://linkinghub.elsevier.com/retrieve/pii/B9780080453378000486},
+    isbn = {9780080453378},
+    doi = {10.1016/B978-0-08-045337-8.00048-6}
+}
+
+@article{Gilby2015,
+    title = {{‘Impact hunters’ catalyse cooperative hunting in two wild chimpanzee communities}},
+    year = {2015},
+    journal = {Philosophical Transactions of the Royal Society B: Biological Sciences},
+    author = {Gilby, Ian C. and Machanda, Zarin P. and Mjungu, Deus C. and Rosen, Jeremiah and Muller, Martin N. and Pusey, Anne E. and Wrangham, Richard W.},
+    number = {1683},
+    volume = {370},
+    doi = {10.1098/rstb.2015.0005},
+    issn = {14712970},
+    keywords = {By-product mutualism, Chimpanzee, Collective action, Cooperation, Hunting, Predation}
+}
+
+@article{Baumard2013,
+    title = {{“Fair” outcomes without morality in cleaner wrasse mutualism (response to BBS paper "A mutualistic approach to morality: The evolution of fairness by partner choice")}},
+    year = {2013},
+    journal = {Behavioral and brain sciences},
+    author = {Baumard, Nicolas and Andr{\'{e}}, Jean Baptiste and Sperber, Dan},
+    pages = {59--78},
+    volume = {6}
+}
+
+@article{Gurven2000,
+    title = {{“It’s a Wonderful Life”: signaling generosity among the Ache of Paraguay}},
+    year = {2000},
+    author = {Gurven, Michael and Allen-Arave, Wesley and Hill, Kim and Hurtado, Magdalena},
+    pages = {263--282},
+    volume = {21},
+    keywords = {all over town collecting, altruism, bert, ble and they scattered, food sharing, generosity, george, god, he never thinks about, himself, hunter-gatherers, mary did it, money, people you were in, reputation, s in trouble, s why he, she told a few, status quest, t ask any questions, that, they didn, trou-, uncle billy}
+}
+
+@article{2017a,
+    title = {{病院・介護施設におけるノロウイルス感染症の拡大防止対策を 目的とした吐物の飛散状況に関する研究No Title}},
+    year = {2017},
+    journal = {感染症誌},
+    author = {{林伸行}},
+    pages = {399--404},
+    volume = {91}
+}
+
+@article{Noe1995BiologicalMarkets,
+    title = {{Biological markets}},
+    year = {1995},
+    journal = {Trends in Ecology {\&} Evolution},
+    author = {No{\"{e}}, Ronald and Hammerstein, Peter},
+    doi = {10.1016/S0169-5347(00)89123-5},
+    issn = {01695347},
+    pmid = {21237061}
+}
+
+@article{WilliamsBirdBehavior,
+    title = {{Bird song and singing behavior}},
+    author = {Williams, Heather}
+}
+
+@article{Mccall1965ECONOMICSInformat,
+    title = {{ECONOMICS OF INFORMATION AND JOB SEARCH * In the recent literature A . A . Alchian and W . R . Allen ,' G . J . Stigler , 2 and probably others have suggested that unemployed re- sources may be productive in a world where uncertainty prevails and informat}},
+    year = {1965},
+    journal = {Office},
+    author = {Mccall, J J},
+    number = {3},
+    pages = {113--126},
+    volume = {38}
+}
+
+@article{Coen2007LearningBirdsong,
+    title = {{Learning to Sing Like a Bird : Self-Supervised Acquisition of Birdsong}},
+    year = {2007},
+    author = {Coen, Michael H}
+}
+
+@article{Song2015MasterTo,
+    title = {{Master Thesis A Bio-inspired Cognitive Architecture of the Zebra Finch ’ s Birdsong Learning System to}},
+    year = {2015},
+    author = {Song, Tutor},
+    number = {November}
+}
+
+@book{Binmore2007NaturalJustice,
+    title = {{Natural Justice}},
+    year = {2007},
+    booktitle = {Natural Justice},
+    author = {Binmore, Ken},
+    pages = {1--208},
+    isbn = {9780199783670},
+    doi = {10.1093/acprof:oso/9780195178111.001.0001},
+    keywords = {Cooperation, Egalitarianism, Evolutionary ethics, Fairness norms, Game theory, Hunter-gatherers, Nash equilibrium, Reciprocal altruism, Social contract, Utilitarianism}
+}
+
+@book{AssociationforComputingMachinerySpecialInterestGrouponArtificialIntelligence2014ProceedingsSystems.,
+    title = {{Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems.}},
+    year = {2014},
+    booktitle = {Proceedings of the 2014 international conference on Autonomous agents and multi-agent systems},
+    author = {Association for Computing Machinery Special Interest Group on Artificial Intelligence, Bijan and Bou Ammar, Haitham and Bloembergen, Daan and Tuyls, Karl and Weiss, Gerhard},
+    pages = {677--684},
+    publisher = {International Foundation for Autonomous Agents and Multiagent Systems},
+    url = {https://dl.acm.org/citation.cfm?id=2615841},
+    isbn = {9781450327381},
+    keywords = {evolution of cooperation, repeated games on graphs}
+}
+
+@book{Alexander2017TheSystems,
+    title = {{The biology of moral systems}},
+    year = {2017},
+    booktitle = {The Biology of Moral Systems},
+    author = {Alexander, Richard D.},
+    isbn = {9781351329309},
+    doi = {10.4324/9780203700976}
+}
+
+@article{Johnson2011TheOverconfidence,
+    title = {{The evolution of overconfidence}},
+    year = {2011},
+    journal = {Nature},
+    author = {Johnson, Dominic D. P. and Fowler, James H.},
+    number = {7364},
+    month = {9},
+    pages = {317--320},
+    volume = {477},
+    publisher = {Nature Publishing Group},
+    url = {http://www.nature.com/articles/nature10384},
+    doi = {10.1038/nature10384},
+    issn = {0028-0836},
+    keywords = {Anthropology, Psychology}
+}
+
+@article{Raihani2015Third-partySo,
+    title = {{Third-party punishers are rewarded, but third-party helpers even more so}},
+    year = {2015},
+    journal = {Evolution},
+    author = {Raihani, Nichola J. and Bshary, Redouan},
+    number = {4},
+    volume = {69},
+    doi = {10.1111/evo.12637},
+    issn = {15585646}
+}
+
+@article{EisenbruchWhyPreprint,
+    title = {{Why warmth matters more than competence - preprint}},
+    author = {Eisenbruch, Adar and Krasnow, Max},
+    number = {917},
+    doi = {10.31219/osf.io/562ke}
+}
\ No newline at end of file