Introduction

Preserving the memory of an important life event requires successful integration of several modalities that constitute a multisensory trace, often referred to as the memory engram [1]. In particular, spatial memory is defined as the brain’s ability to encode key features of the external environment and to navigate within the boundaries of this mental representation, also known as cognitive map [2, 3]. On the physiological level, it manifests as populations of neurons exhibiting activity tuned to specific aspects of the external spatial context, in particular, firing correlated with an animal’s presence in a certain, unique location within the environment [4, 5]. Such neuron populations, called place cells [6, 7], are mainly located in the hippocampus, yet their specificity is influenced by several other groups of spatially tuned neurons, most notably grid cells, border cells and head direction cells [8,9,10,11]. These cells are in turn distributed among several cortical and sub-cortical structures. Said network is further connected by a plethora of direct and indirect projections, serving both top-down and bottom-up processing of spatial information and associating it with representations stored by other circuits.

In the case of episodic memories, their spatial context can be, among others, linked to the emotional state during learning. This phenomenon is often investigated with aversive states, like those triggered by physical pain or alarming stimuli. However, it occurs regardless of the valence and can also involve appetitive states; in particular reward-related ones, on which we focus in this work. These are processed by yet another crucial functional element of the brain: the reward system, responsible for cognitive fundamentals of motivation and reinforcement, as well as a crucial agent in many psychological disorders. The physiological connections between these systems have also been established; for instance, it was demonstrated that a projection from the hippocampus to the ventral tegmental area (VTA) mediated relations between context and reward [12, 13].

Appetitive states can be naturally evoked by likes of attractive foods or positive social interactions [14,15,16], but also induced with substances that directly activate the reward system, that is pharmacological rewards [17,18,19]. One of them is amphetamine, which is known to reliably activate the reward system [20,21,22] and to induce the emission of 50 kHz band ultrasonic vocalisations, which are a marker of affective appetitive states in rats [18, 23, 24]. Given these effects, it is often used to induce a consistent appetitive emotional state to be linked with spatial cues during the experiment.

The immense complexity of brain mechanics makes their quantification a highly non-trivial endeavour. Henceforth, contemporary approaches have to focus on narrow aspects of activity, especially those accessible for measurements and relatively straightforward to interpret. On the transcriptomic level, one of such convenient manifestations is the transient expression of immediate early genes (IEGs), in particular, cFos, Arc, Homer and zif268 [25,26,27]. They can be used to visualise and quantify the cellular ensembles involved in the memory trace, as demonstrated in [28, 29].

The memory has a very rich dynamic [30]. In a global view, it consists of memory formation, consolidation, recognition, recall, re-consolidation and extinction; yet all of these processes have their own local dynamics of variable time scales and involve complex phenomena, for instance, engram shifts between brain structures. Consequently, the investigation of temporal aspects is crucial for the analyses of memory mechanics. Thanks to the previously identified distinctive patterns of IEG expression in time, they are an important tool for this task.

In this work, we aim to investigate the neural mechanisms involved in the interplay of spatial memory and reward processing systems, underlying the emotional perception of context, which is in turn crucial to understanding goal-directed behaviour associated with reward-seeking, as well as spatial memory storage, maturation and consolidation. Rodent models offer a way to explore these relationships experimentally on the behavioural and molecular levels. One of the simplest and most commonly used models is the conditioned place preference (CPP) paradigm, where the animal learns to associate a particular area of experimental enclosure (facilitated by spatial cues) with an emotional experience. We have devised an approach extending CPP with an open-field paradigm, orchestrating a reward-seeking task based on spatial cues in a 5-region cage (with 4 corners and interconnecting centre space). For conditioning, we have used amphetamine administration in one of the corners.

To investigate the mechanism underlying such evoked memory, we have used the CatFISH method (Cellular compartment analysis of temporal activity by fluorescent in situ hybridisation) to measure the co-expression of two IEGs, Arc and Homer-1A, in nine brain areas covering both the cortex and sub-cortical structures [31]. In order to elucidate larger-scale temporal dynamics, we have analysed two groups of rats, with a recent and remote memory recall. By using machine learning, we were able to pinpoint the circuits responsible for behavioural responses related to reward-seeking based on spatial cues.

Results

Experiment overview

Rats were trained to associate a particular corner of a rectangular enclosure with the effects of amphetamine injection. In training, doors in cage partitions were closed, constraining rats to stay in the corner they were placed in. On the other hand, for testing, said doors were opened and rats were allowed to freely roam around the cage; yet, they were not given amphetamine.

The testing session included two 5-min entries, separated by a 20-min break; afterwards, rats were sacrificed and their brains analysed (Fig. 1). Due to such a course, we were able to exploit the temporal expression patterns of two IEGs to untangle the brain activity in either entry. Precisely, we were able to measure which fraction of neuronal nuclei within a given brain structure was active during any entry, either entry, exclusively either entry and finally during both entries (Fig. 1E). Out of these descriptors, we have also calculated the co-localisation coefficient, expressing the degree to which the neuron activity is consistent across entries.

Fig. 1: Overview of the experimental procedure.
figure 1

A Each rat is habituated to an experimental cage, trained to associate corner c3 with amphetamine effects and undergoes testing which consists of two entries into the context. Rats are divided into recent and remote recall groups, which differ only in an additional 2-week delay between training and testing sessions in the remote group. B Details of the 6-day training routine; on even days rat is placed in the conditioned corner c3 and given amphetamine, while on odd days placed in each other corner and given sham, saline injection. C Scheme of the experimental cage; a 1 m \(\times\) 1 m box is partitioned into 5 regions by transparent walls equipped with doors that can be opened (during habituation and testing) or closed (during training). An over-hanged illumination assembly provides animals with spatial cues. D Scheme of the CatFISH method used to elucidate changes in brain activity between test entries. mRNAs of two IEGs, Arc and Homer, are detected; the temporal distribution of entries is set up so that at the observation point, we can independently detect expression of IEGs triggered by the activity at either test entry. E Image analysis pipeline; brain structures are identified and marked as ROIs using shapes from a reference atlas [54]. Within each ROI, we identify nuclei and, according to the IEG expression detected within them, classify into inactive, active during either entry and active during both entries.

This experiment was repeated in two variants, investigating recent and remote (consolidated) recall; the first one was carried out the day after training, while the second one was 2 weeks after the last training session. Both sessions were video-recorded, which allowed us to quantify animals’ behavioural propensity to the conditioned corner.

We assumed that placing the animal in the cage context evokes the recall and possibly re-consolidation of the trained spatial-emotional association, consequently that this overall set-up will allow us to elucidate short- and long-term dynamics on the structural level.

Remote recall involves smaller, more consolidated neuronal sub-populations in certain structures

First, we investigated whether there are qualitative signs of strongly consolidated memory in the remote recall group, focusing on the most direct descriptors of inter-entry variability; the outcome of this analysis is summarised in Fig. 2A.

Fig. 2: The differences in overall activity and activity co-localisation in investigated structures between rats with recent and remote recall.
figure 2

A Comparison of the fraction of active nuclei (top) and co-localisation coefficient (bottom); both conditioned on the brain structure. In the remote recall, the activity has generally dropped in comparison to the recent recall, while the co-localisation has increased, which suggests the development of specialised neuron sub-populations. Box-plots adhere to a standard Tukey’s definition, original data is superimposed as jittered points. B Heatmap showing the values of brain activity descriptors found to significantly differ between groups in a machine learning-based multivariate analysis. Most straightforward effects were identified in the VTA and hippocampus, but also in cortical regions and amygdala. Values were shown as ranks for clarity, with rank one given to the lowest value (blue) and rank sixteen to the highest (red).

When comparing the overall brain activity in the remote group compared to the recent group, expressed as a percentage of any active nuclei, we see that there is a significant (p = 0.04) drop of activity, conditioned over structures. Moreover, this drop in overall activity was accompanied by an increase in co-localisation (p < 0.001), suggesting an emergence of specialised neuronal sub-populations. These differences were too subtle to be attributed to particular structures with standard statistical methods, however.

The activity was also fairly consistent between individual rats; the notable exception from the rule is nucleus accumbens, which noted striking individual variability in activity, which was not explained by the group. It can be explained by the animals’ behaviour, though.

Recall of the remote memory has a distinct activity pattern from the recent recall

The Boruta [32] machine learning-based analysis of the differences between recent and remote recall has uncovered additional, more nuanced interactions, presented on Fig. 2B.

For VTA, it has detected the aforementioned pattern of higher overall activation in the recent group, yet more specialised in the remote group. For hippocampus, the dentate gyrus area (DG) activity is higher in the recent recall group and this effect is especially pronounced at the first entry. On the other hand, in CA3 we observed disjoint populations, active only in the first entry and indicating the recent group, as well as active only in the second entry and indicating the remote group. We have found that two cortical regions contain specific neuronal populations more active in the remote group; in the prelimbic cortex (PrL), it was active at the first entry, while in the agranular retrosplenial cortex (RSA), only at the second.

Two parameters of the amygdala activity at the first entry have been also found to be significant; overall activity of the lateral part and exclusive activity of the central part. Generally, they were higher in the recent group, but also in two rats from the remote group with slightly elevated hippocampal activity and high VTA co-localisation.

High, memory remoteness-independent variation in NAcc is explained by behaviour

As mentioned in Section “Remote recall involves smaller, more consolidated neuronal sub-populations in certain structures”, there is a substantial variation of activity in certain structures which is not explained by the recent/remote group; in particular in the nucleus accumbens. By using the trajectory data extracted from the video recordings of the behaviour, we have quantified the inclination of a rat to stay in the corner that was previously associated with the amphetamine injection. To rule out the impact of variable exploration tendencies, we have expressed it as a correctness score, defined as a fraction of the total time spent in any corner which was spent in a conditioned one.

While this score was not significantly correlated with the recent/remote recall group, we have identified significant molecular associations with machine learning; they are collected in Fig. 3.

Fig. 3: The factors associated with the correctness score.
figure 3

Panels correspond to the in the first test entry (A) and second test entry (B). Heatmaps show the values of brain activity descriptors found to be significantly associated with correctness in a machine learning-based multivariate analysis. The high first-entry correctness score was related to nucleus accumbens and CA3 activity, while the central amygdala was a key driver in the second entry. Values were shown as ranks for clarity, with rank one given to a lowest value (blue) and rank sixteen to the highest (red); grey encodes missing values.

High correctness was identified to be explained by the activity of hippocampal CA3 as well as both the shell and core of the nucleus accumbens. Moreover, these activity levels were fairly consistent across two entries, henceforth almost all pattern parameters were selected. A similar pattern applies to caudate putamen (CPu), yet to a lesser extent, because CPu activity at the second entry is less discriminating.

Finally, there are also significant correlations between first entry correctness and overall activity of the lateral amygdala (LA) as well as activity co-localisation in the granular retrosplenial cortex (RSG).

Behaviour at each entry has distinct neuronal activity correlates

While the overall activity in NAcc and CA3, which are strong correlates of correctness at the first entry, remains fairly consistent between entries, they cease to strongly explain the correctness at the second entry. Only co-localisations at CA3 and NAcc core and overall activity of CA3 remain to be important.

On the other hand, the activity of the central amygdala (CeA) becomes a key driver, we also see significant interactions involving the infralimbic cortex (IL) and VTA. They are rather complex and involve a structure state at the first entry.

Strongest inter-structural co-activation occurs at second entry

We have investigated activity correlations across all investigated neuronal sub-populations in all structures. The graphs constructed from these identified to be statistically significant ones are presented in Fig. 4—there were none such in the Any and Second classes.

Fig. 4: Significant monotonic correlations between values of activity descriptors between investigated brain structures.
figure 4

Each panel presents a graph for a certain activity descriptor; panels with no links (for an activity at any entry, any and at the second entry, 2nd) were omitted. The densest, non-trivial networks can be observed between sizes of neuron sub-populations active exclusively at certain entries. The weight of a link corresponds to an absolute value of Spearman correlation coefficient; black links represent positive, while red ones negative correlations.

This analysis has mostly uncovered trivial, inter-structural correlations, in particular within amygdala (BLA-LA-CeA), NAcc (shell & core), retrosplenial cortex (RSG-RSA) and hippocampus (CA1-CA3, though not DG).

When analysing populations active exclusively during the first entry, we found a significant negative correlation between CeA and PrL. In comparison to the general first entry activity graphs, we see that both the internal coupling in NAcc and between CA1-CA3 have disappeared, which indicates that this synchronisation has persisted over both entries. On the other hand, RSG-RSA link is present in both exclusive views but not in any more general one, suggesting it is a result of a more temporally localised phenomenon.

Most of the significant correlations identified are between the sizes of neuronal populations active exclusively during the second entry. Here, we see intra-structural coherence in the amygdala, hippocampus, retrosplenial cortex and between prelimbic and infralimbic cortices, yet not in NAcc. There are also inter-structural correlations, positive between the hippocampus, retrosplenial cortex and, to a lesser extent, amygdala; as well as negative correlations between the prelimbic cortex and both basolateral amygdala (BLA) and DG.

One should note, however, that the sample sizes are quite limited for a correlation network study and this analysis is likely to have low sensitivity.

Suppression of central amygdala during training affects the recall

We have performed an additional experiment, following the same protocol as for remote recall, but using rats in which CeA was selectively suppressed during training with a chemogenetic modification. A group of 17 rats were bilaterally injected with a viral construct into CeA; it causes the neural activity to be suppressed when an exogenous, neutral ligand (JHU 37160) is administered. Eight animals were receiving JHU for training, while the rest were left as controls and received saline to keep the injection regime the same.

Our aim was to verify the hypothesis that CeA is crucial to integrate the emotional state with the spatial context; with its activity suppressed, we expected rats to lose their propensity to the conditioned corner in testing, even though the activity of CeA was unaltered at that time. Figure 5 illustrates our findings; as expected, rats with modification noted significantly lower correctness scores than controls. Amphetamine is a classical substance used to trigger positive, 50 kHz band ultrasonic vocalisations (USVs) [17, 18]; to this end, we also analysed the count of USVs emitted during training. This parameter followed the same patterns as correctness, namely it was significantly lowered in the modified group; both parameters were also significantly positively correlated.

Fig. 5: Behavioural effects of suppressing CeA during training.
figure 5

A Comparison of the correctness score at the first test entry between rats with chemogenetic modification and controls. B Same for the USV count during training. C Relation between USV count and the correctness score. Vocalisation counts shown in log-scale.

Discussion

Contemporary models of memory generally involve multiple functional tiers. Memories are first formed in short-term storage, which is transient as it must withstand a constant influx of new information. To this end, encoded data is then selectively moved to long-term memory, based on a certain assessment of its possible usefulness—this process may take substantial time, and it is generally faster when similar memories (often referred to as schemes) are already present in the long-term memory.

Long-term memory, on the other hand, is the first to react during recall, triggered by a stimulus consistent with a certain record. The activation of the short-term system follows, as recall almost always happens in circumstances where novel, relevant information can be acquired and such would likely be used to update the recalled memory in a process of re-consolidation.

Physiological changes consistent with memory consolidation

It is reasonable to assume that certain memory utilises a progressively smaller and more consolidated population of neurons as it moves from a fresh, most verbose representation of current stimuli and emotional state to a consolidated, processed record. This is indeed what we see when comparing rats with a remote recall from those with a recent recall; the observed activity is generally lower but more specialised.

Machine learning-based mining, capable of uncovering non-monotonic and multivariate interactions, has revealed a more detailed picture, showing significant differences on the level of a particular structure. They were especially pronounced in the VTA and DG, but also in the amygdala, prelimbic and retrosplenial cortices, as well as in CA3.

The dynamics in the VTA were closest to the general trend of diminishing but specialising activity. This structure is attributed to be an element of the reward system [33, 34] as well to be involved in memory processes [35, 36], especially when reward-related memories are considered. Henceforth, the strong rewarding effect of amphetamine might have led to a formation of a specific neuronal population in this structure.

The higher activity of DG in the recent group is quite interesting, especially because it was also the least active part of the hippocampus; this may be connected with its speculated role in refining memories during encoding [37]. This process is expected to be more pronounced in the recent recall group, where animals experience a sudden break of the training routine.

Cortical activity patterns are particularly interesting, since they both consider an increase of activity in the remote recall group, suggesting that rats have in fact developed a specified neuron population during the hold off period. They could not be detected by simple co-localisation analysis, however, because relevant cortical structures were not uniformly responding in either test entry—in particular, said population in the prelimbic cortex was only active in the first entry, while the one in RSA only in the second one.

A similar argument can be made about CA3; the literature suggests it should be involved in the investigated phenomena, and indeed this is the case, yet its activity pattern goes beyond a simple consolidation model. Precisely, our results suggest that the recent recall group has a specific CA3 population active exclusively in the first entry, while the remote group has a similar population, yet active exclusively in the second entry. A possible explanation of this pattern considers the fact that test sessions give an experience conflicting with training—amphetamine is not administered at all, not even in the conditioned corner. This fact undoubtedly supports the dissolution of the trained association, which may be aided by CA3. One can hypothesise that fresh memories are more volatile and a single conflicting observation is sufficient to trigger revision, while consolidated ones require stronger support and start to activate at the second entry.

The recent recall group was also characterised by an elevated activation of the lateral and central amygdala during the first entry; this is consistent with the aforementioned activity patterns of CA3 and VTA, leading to a conclusion that this trio was jointly involved in the encoding of freshly formed spatial-emotional association, utilising a network of projections well established in the literature. Our results suggest that this action is transient and can be easily diminished by conflicting experiences, though.

Interestingly, clustering based on ML-selected parameters has uncovered two clear outliers (a05 and a02) which, despite being in a recent recall group, exhibited activity patterns of the remote group. We interpret this as a sign of early consolidation, possibly triggered by an earlier spatial memory or individual differences in emotional state or a reaction to amphetamine.

Utilisation of a memory depends on different factors than its consolidation

We have used a correctness score to quantify the animal’s propensity to the corner associated with the amphetamine injection, which we believe corresponds to goal-directed drug seeking. There is no group effect in this parameter, which we interpret as an indication that all animals have correctly recalled the spatial organisation of the train/test cage and identified the conditioned corner, yet adjusted the seeking and anticipatory behaviours based on other factors. In particular, this involves both areas of NAcc, which express higher activity in rats with a larger initial correctness score; this is also consistent with a very high individual variability of NAcc activity patterns.

NAcc is a key hub in the cortico-limbic circuitry directing action selection and decision-making in reward-related tasks [38], and its activity is bound to the expression of emotion, in particular, reflected by ultrasonic vocalisation [18, 39,40,41] and with processing current values and reward prediction [42,43,44,45]. To this end, we conclude that the experiment likely captured activation of NAcc neurons positively associated with reward anticipation, which was a reaction to the recognition of substance administration context, possibly induced directly by cortex through well-established connections [46,47,48,49].

The strength of this action was shaped by the individual variability among rats, however, which caused different behavioural outcomes. The very origin of this variability is unclear, yet it likely arises at a network level because of the aforementioned hub nature of NAcc. In particular, the first-entry correctness is also positively correlated with the activity of the caudate putamen and hippocampus CA3, as well as, to a lower extent, with the overall activity of amygdala LA and activity co-localisation in RSG.

It is crucial to note that the NAcc activity persisted between test entries, despite the fact that it was not indicative of the correctness in the second entry. In general, we have not assumed both test sessions to be equal, as the first one establishes a novel and conflicting context for the conditioned corner, as there is no amphetamine injection consistent with previous experience. This is directly visible on Fig. 6, which summarises all the machine-learning results.

Fig. 6: Overview of the overlap between machine learning-selected significantly important factors connected with the memory recall group and correctness score in either test entry.
figure 6

Although hippocampus, amygdala and nucleus accumbens re-appear in all contexts, their involvement differs at the level of finer-grained neuronal populations.

While the overall NAcc activity has not diminished due to this fact yet the behaviour has changed, we believe this is a consequence of a tertiary mechanism that has modulated the NAcc influence. The machine learning analysis of the correlates of the second entry correctness scores suggests that this can be attributed to an aforementioned CA3-amygdala-VTA trio. In particular, a higher score is predicted by activation of the CeA, but only during the first entry; this can be attributed to a hypothesis that the sustenance of the anticipatory actions is promoted by the original emotional arousal. CA3 and VTA activity effects are more complex and require further insights.

Remembered emotional context drives the behaviour

The role of CeA was further analysed in a follow-up experiment. We suppressed its activity during training using chemogenetics. In the outcome, we observed both the reduction of emotional processing expressed as 50 kHz band USV emission intensity in training, followed by a reduction of rats’ inclination to the conditioned corner in test, in spite of lack of CaA suppression.

This result validates the circuit inferred from ML and correlation analyses, with CeA influencing memory encoding via hippocampus, in particular DG and CA3. On the general level, this result strengthens the concept that although the memory is based on particular neuronal populations representing certain physical entities, the actual stored and ultimately retrieved information lies predominantly in the relations between them.

Conclusions

Inspired by theoretical models of memory dynamics, we formed a hypothesis that memory consolidation involves the emergence of specialised neuronal populations. To this end, we have scanned the brain for traces of such, i.e., regions which exhibit lower but more co-localised activity when comparing rats with maturated memories to freshly trained ones, within structures connected with memory functions and emotional processing. We have confirmed that pattern on the whole-brain level, and we showed that it is most pronounced in the VTA.

The dynamics of cortical regions, generally regarded to be the prime destination for consolidating memories, proved to be more complex. This may be a sign of higher-order phenomena triggered by an asymmetry between training and testing conditions, namely a need to re-encode the spatial memory with a novel, conflicting experience acquired in testing. This process appears to involve hippocampus CA3 and causes increasing intra-structural coordination, but further insight is required to verify these observations.

Finally, we have shown that the behavioural outcome induced by memory recall can be heavily modulated by independent factors, including individual variability, especially when emotional processing is substantially involved—in particular, we have found nucleus accumbens to be a key structure enforcing action, while the amygdala to likely be either an integrator or a modulator. Anyhow, molecular and physiological observations provide a more direct and reliable quantification of the properties of memories. On the other hand, considering individual variability is crucial for the development of effective interventions, both experimental and therapeutic.

Material and methods

Animals

Adult (9–10 weeks) male Long-Evans rats (\(n=18+17\), 180\(\pm\)20 g) were used in experiments. Because we used non-parameteric statistical methods and machine learning, sample sizes were established based on simulation and our previous experience with similar experiments; no animals were excluded, and experimental groups were assigned at random. Animals were purchased from a licensed breeder (the Polish Academy of Science Medical Research Center, Warsaw, Poland), and housed in standard laboratory conditions under 12 h light:dark cycles (lights on at 7 a.m.), at a constant temperature (21\(\pm\)2 °C) with \(70 \%\) humidity. Rats had free access to both food and water. All experiments were performed in accordance with the European Communities Council Directive of 24 November 1986 (86/609 EEC) & the Polish Act on Animal Welfare, under permission no. 539/201 from the First Warsaw Ethical Committee on Animal Research.

Behavioural experiments

All experiments utilised the same 1 m \(\times\) 1 m cage, partitioned into 5 regions (four corners and a central, interconnecting space) with translucent walls equipped with doors. In the training configuration doors are closed, henceforth a rat can be confined in a selected corner but retains the capacity to observe the whole cage through the wall. On the other hand, in the testing configuration, doors are opened and a rat can freely roam around the cage. The whole setup is softly illuminated by an overhung asymmetric array of static colour lights, which serves as a visual-spatial cue. Otherwise, corners were identical in terms of surface texture, colour or size; the cage was cleaned with 70/100 ethanol solution before each rat placement to remove dirt and scent marks.

Animals were first habituated to the experimenter and cage; in this phase, doors were opened and the visual cue array was inactive. After habituation, the cage was switched to the training configuration and animals were trained in the following manner. In each of the six training days, the animal was placed confined in a corner c1, c3, c2, c3, c4 and c3, respectively, for 15 min. During this time, the animal was injected with either amphetamine at a dose of 1.5 mg/kg, when in corner c3, or saline (1 ml/kg; i.p.), in other corners; this way, c3 was the conditioned corner.

After training, animals were, according to the pre-assigned group, either immediately transferred to testing (recent group), or first underwent a hold-off in their home cage for 14 days (remote group), and then tested. The testing procedure was identical for all animals and was performed as follows. The animal was placed in the centre of the cage in the testing configuration, with doors opened. Then, it was allowed to freely move and explore for 5 min; in particular, the rat could visit the conditioned corner. After this first entry, animals were put on hold in a separate cage for 20 min. Finally, animals were placed again in the centre of the cage for another 5 min, which constituted the second entry.

Immediately after testing, each animal was decapitated under isoflurane anaesthesia; its brain was isolated and flash-frozen in a dry-ice-cooled isopentane bath. Afterwards, brains were awaiting catFISH reaction stored aluminium foil-wrapped in a deep freezer set to −80 °C.

All sessions (habituation, six training entries and two test entries) took place under video surveillance; recordings of the test sessions were later used to reconstruct rat trajectories using the DeepLabCut toolbox [50]. No blinding was applied at this stage of experiments.

Fluorescent staining of IEGs

To optimise the slicing process, we arranged brains side by side in blocks of four, embedded in a medium with an optimal slicing temperature (OCT; Sakura). The blocks were cut in a cryostat (Leica CM 1850, Germany) into 20\(\mu m\) sections, which were mounted on gelatin-coated SuperFrost slides (Thermo Fisher).

Next, slices undergo fluorescent staining according to an established protocol [51,52,53]. Fluorescein- and digoxigenin-labelled antisense riboprobes for 3’UTR of, respectively, Homer-1a and Arc mRNA were applied to slices and allowed to complete hybridisation overnight, in a single step.

Then, both kinds of probes were sequentially detected, first with anti-fluorescein (#NEL701A001KT, Perkin Elmer)and then with anti-digoxigenin (#NEL704A001KT, Perkin Elmer). Homer 1a probes were stained with a tyramide-fluorescein signal amplification system (TSA-Fluorescein) and Arc probes with TSA-Cy3 (Perkin-Elmer). Furthermore, the slides were also incubated with DAPI nuclear counter-stain (Invitrogen, Thermo Fisher), to visualise the nuclei. Finally, prepared slides were cover-slipped with an anti-fade media (Vectashield, Vector labs) and sealed with nail polish.

Image acquisition and analysis

Stained slices containing representative samples of interesting structures were scanned using Inverted Leica DMI 6000 microscope with Andor DSD2 Confocal Module and HC PL APO CS2 20x/0.70 Immersive objective. Fluorescence was excited with a 200W halogen lamp. To detect nuclei, Homer-1a and Arc, three fluorescent filtersets were used, respectively: DAPI (exc. 390\(\pm\)40 nm, 405 DM, em. 452\(\pm\)45 nm), FITC (exc. 483\(\pm\)28 nm, 488 DM, em. 525\(\pm\)45 nm) and Cy3 (exc. 556\(\pm\)20 nm, 561 DM, em. 609\(\pm\)54 nm).

We optimised the settings to obtain bright, intranuclear foci of ongoing IEG transcription. The laser power, gain and offset, as well as exposure times, were always set for the whole slide to ensure the best possible signal without substantial spatial variation. The scan was repeated to construct a z-stack; we rejected three bottom- and top-most layers so that only whole cells were considered later. Signals from individual filters were routed into separate channels of the final image.

For each structure, we performed computer-aided manual alignment of a brain atlas [54] to the sub-sampled, flattened images of appropriate slices (Fig. 1E); this way, we have defined numerically defined ROIs for further automatic analysis.

Next, for each ROI, we used a computer vision pipeline to interpret the geometry of the distribution of reporter proteins. We have pooled each structure from both cerebral hemispheres. In particular, for FITC and DIG/Cy3 emission channels, we have used the ICY dot-finder to identify localised aggregations of the reporter, marking the locations of the mRNA particle of the corresponding IEG. For the nuclei identification, we have fitted ellipses to a coherent circular bright patch in the DAPI emission channel, using a custom code based on the watershed algorithm [55]. Finally, for each nucleus region, we counted the number of dots for either IEG; this way, each ROI got quantified as fractions of nuclei of four classes: both Arc and Homer-1a positive, Arc-only positive, Homer-1a only positive and negative. This analysis was repeated for each z-layer, and we collected the median values of said fractions as a final result.

Additionally, we have calculated the co-localisation coefficient, given as a fraction of both Arc and Homer-1a positive nuclei divided by its expected value under the independent placement hypothesis, which is a product of fractions of Arc and Homer-1a positive nuclei. This coefficient is positive but unbounded and is expected to be one given the independence of neuron activation in both entries, under one when the activation is somehow exclusive to either entry and finally over one when neuron activations in both entries are correlated.

Chemogenetic intervention

Surgery

Rats received intracranial injections of the DREADD viral vector expressing pAAV-hSyn-hM4D(Gi)-mCherry (gift from Bryan Roth, Addgene viral prep #50475-AAV5; RRID:Addgene_50475) 3 weeks before the behavioural experiment. All surgical instruments were sterilised before the procedure. Rats were anaesthetised with isoflurane (5% for induction, 2% for maintenance) and administered a subcutaneous analgesic injection (Butorphanol, Butomidor, 3 mg/kg). An ocular lubricant was applied to prevent drying of the eyes, and the scalp was shaved. Following preparation, the rats were placed in a stereotaxic apparatus (RWD Life Science). The scalp was disinfected with 70% ethanol, incised and retracted. Two small burr holes were drilled to accommodate a 1 μL NanoFil syringe with a G33 needle (World Precision Instruments), which was then lowered into the target brain region. Stereotaxic coordinates for the central amygdala (CeA) were determined using the rat brain atlas, with anteroposterior (AP), mediolateral (ML) and dorsoventral (DV) positions referenced from Bregma: AP, −2.52 mm; ML, \(\pm\)4.8 mm; DV (from dura), −8.0 mm. The viral vector was bilaterally delivered into the CeA (7\(\times {10}^{12}\) vg/mL, 350 nL/site, 100 nL/min) using a UMP3 UltraMicroPump (World Precision Instruments). Following injection, the syringe remained in place for 7 min to ensure proper diffusion of the viral vector. Postoperative care included the administration of an analgesic (Tolfenamic acid, Tolfedine; 4 mg/kg, s.c.) and an antibiotic (Enrofloxacin, Baytril; 2.5 mg/kg, s.c.). To prevent dehydration, rats received 1 mL of 0.95% NaCl per 100 g of body weight via subcutaneous injection. Rats were kept on a heating pad until full recovery from anaesthesia and then placed in individual cages.

Behavioural experiment & post-hoc histology

The behavioural experiment was conducted in the same protocol as the remote group in the main test, as described in Section “Behavioural experiments”. JHU37160 dihydrochloride (Hello Bio Ltd cat. no: HB6261; DREADD ligand, water soluble) in dose 0.3 mg/kg (0.3 mg/mL) was administered 30 min before saline or amphetamine injection during days of training, due to its potency and lack of systemic effects [56].

After the conclusion of the behavioural experiment, brains were sliced and counterstained with a combination of anti-mCherry polyclonal antibody (#PA5-34974, Thermo Fisher, 1:1000) and anti-goat Alexa Fluor 568 (#A-11011, Thermo Fisher, 1:500). Samples were imaged using Olympus VS110 Fluorescence Microscope. Animals with lack of viral expression in the target region were excluded from behavioural analysis.

Recording and analysis of ultrasonic vocalisations

Rat calls were recorded using UltraSoundGate Condenser Microphones CM16 (Avisoft Bioacoustics, Berlin, Germany). The microphones were positioned in each of the four corners of the cage, 35 cm above the floor. These microphones were sensitive to frequencies ranging from 15 to 180 kHz, with a flat frequency response (\(\pm\)6 dB) between 25 and 140 kHz. The microphones were connected to an UltraSoundGate 416H (Avisoft Bioacoustics, Berlin, Germany), and then linked to a computer. Recordings were made using Avisoft-Recorder software, and the data were processed with custom-made Rat-Rec pro 7.5 software. The signals underwent discrete Fourier transformation (1024 samples, Hamming window) and were displayed as colour spectrograms. Each USV episode has been then manually identified and labelled, using blinded recording identifiers; we have verified that all calls were from the appetitive group, in the 50 kHz band.

Statistical analysis

For a basic comparison of the activity between the recall and modification groups, we have used a stratified, two-sided Mann-Whitney-Wilcoxon test. The two-sided Spearman correlation test was used for the reconstruction of inter-structural interaction graphs and to assess parameter correlations. We have applied 0.05 as a significance threshold for p-values, and use Holm’s correction [57] for multiple testing, except in the cross-structure correlation analysis which used Benjami-Hochberg FDR correction.

The Boruta method [32] was applied for the machine learning search for multivariate and non-linear interactions, using the standard Random Forest importance source with 50,000 trees wrapped into the impute transdapter; tentative selections were collected together with the confirmed ones. To stabilise the results, we repeated this procedure thirty times and finally reported features selected in at least half of them.