Priors / Finetuning
In the past two weeks, I’ve been focused on producing biologically grounded priors for Beads. These priors include constants (such as voltage thresholds, inductance, etc) and unitary measurements (no. of cells, vesicles, etc). I’ve also continued to fix bugs and bridge components in the audio and visual pipelines.
Priors / Finetuning
These priors come from a variety of sources - past neuroscience research, computational ease and physical constraints. I’ve put in my best effort to keep them biologically grounded along with a provision for improvement in future phases of Beads.
Photoreceptors
- We model the circular photoreceptors as hexagonal lattice sites. Each lattice site inside the central fovea is further broken down into six cone subunits; on the other hand, sites outside the fovea have an increased (stochastic) density of rods that peaks at intermediate eccentricity. This preserves mosaic irregularity in the retina
- The short / medium / long (wavelength) cycle of cone clusters produces short-range color sampling and gives every local patch access to all three cone types (at the hex scale). This design ensures that local color opponency is always possible
- Spectral sensitivity is represented by shifted Govardoskii nomograms (with wavelength peaking at 445 nm for short, 535 nm for medium, 565 nm for long; rods peak at 498 nm). RGB values from streams are converted into estimated luminance and finally into photoisomerizations per sec for spectral processing
- An estimated 9.9 million photoreceptors across a 1.248 mm radius patch are generated: approx. 6.39 M cones and approx. 3.52 M rods; the total area is approx. 4.893 mm²
Inner Retina / Plexiform Layers
- A horizontal cell is tied to each photoreceptor cell and has a radium of 10 microns, implying its inhibition neighbors are in the order of 100
- Bipolar cell time constants of 0.07 to 0.1 s correspond to low-pass corner frequencies around 1.6 to 2.3 Hz. This ensures that noise and jitter is filtered out. Additionally, bipolar cells have leaky integrators to help clear input drive over time
- AII amacrine cells specialize in scotopic routing (i.e., dim light vision) and apply piecewise functions to couple rod bipolar input with cone bipolar ON / OFF pathways
- 1000s of ON cone bipolar cells are clustered (with a radius of 50 microns) for direction selectivity by starburst amacrine cells
- Direction selective ganglion cells (DSGC) further cluster starburst amacrine cells to form the direction component of the optical spike trains. A tunable binary threshold regulates spikes in DSGC
- Midget, parasol and small-bistratified cells are instantiated by clustering cone bipolar clusters (greedy proximity clustering with group sizes 6 to 8) and computing an radially exponential weighted sum of bipolar outputs (radius is 30 microns for midget, 80 microns for parasol and 60 microns for bistratified)
Visual Cortex
- A multi-spectral gabor / bank is constructed which stores 8 orientations and 2 temporal affinities. Studies (here and here) have shown that primary sensory areas have linear dynamics that can be explained by STRFs and gabors
- In the primary visual cortex’s L4 units, multicompartment interneuron spikes are detected when somatic voltage crosses -30 mV
- The higher cortices contain esoteric constants that I’ve avoided here
Outer Ear
- Pinna notches are created at 6 kHz / 8 kHz, and peaks at 4 kHz / 12 kHz
- The ear canal length is 25 mm which gives a decibel boost of approx. 12 dB. The fundamental base frequency of the ear canal is approx. 3430 Hz
Cochlea
- 200 log-spaced basilar membrane segments are created with center frequencies between 20 Hz and 20000 Hz. The duct depth (for segments to vibrate) is 0.5 mm
- Outer hair cells are attached 1:1 to the above segments and produce a maximum length change of 10 nm on the basilar segments
- MET has a maximum current of 200 pico amps
- The refractory time constant of the membrane (i.e., how fast it can recoil to a useful state) is 0.8 ms and the maximum vesicle release rate is 5000 per sec
- Further esoteric constants have been avoided here
Auditory Cortex
- Auditory nerve fibers are simulated using a leaky integrate-and-fire model with threshold voltage as -50mV and resting voltage as -65 mV. The refractory period is 0.8 ms, capacitance time constant (time it takes to charge the membrane) and the electrical charge per vesicle is about 10^-13 coulombs
- Most auditory cortex integrate-and-fire models are approximated as C dV / dt = - (V - V_rest)/R + I_synaptic
- Binaural processing uses a discrete-delay coincidence detector for ITD (MSO) i.e., left and right ear spike trains are shifted across a bank of delays (−0.5 ms to +0.5 ms with 9 delays) and contributions are summed into time bins. If any time bins exceed a coincidence threshold, MSO events (for locating sounds) are produced
- An LSO comparator computes per bin excitatory minus inhibitory counts and thresholds the difference to produce ILD / LSO events
- Feedback loops from this component are fed into OHCs for efferent gain
- Similar to the visual cortex, a spectrotemporal receptive field helps match incoming spike trains to multicompartment interneurons. Studies (here and here) have shown that primary sensory areas have linear dynamics that can be explained by STRFs and gabors
Algorithms / Development
CMU Development
Minor tweaks across the CMU and cortex, to streamline cell methods into distinct cell creation, organization and functioning. Please find the changes in the commits (linked below for reference).
Development Activity - https://github.com/akhil-reddy/beads/graphs/commit-activity
Please note that some code (class templates, function comments, etc) is AI generated, so that I spend more of my productive time thinking and designing. However, I cross-verify each block of generated code with its corresponding design choice before moving ahead.
Next Steps
Deployment
- Code optimization for channel processing
- Post processing in the visual cortex
- Overlaying audio clips onto the cochlea, including optimization for wave segment processing
- Post processing in the auditory cortex
- Parallelization / streaming of cellular events via Flink or equivalent
Building the Environmental Response System (ERS)
- Building the ERUs
- Neurotransmitters - Fed by vision’s bipolar and amacrine cells, for example, to act on contrasting and/or temporal stimulus
- Focus - Building focus and its supporting mechanisms (of which acetylcholine is one)