AInthusiast // Sandro Andric

Probably spiriling…

Code. Cats. Robots.

OPERATOR Sandro Andric

BA NYU

MsC Discord

PhD Reddit

STACK Python and extras

01 // Research_Log

INDEX_005

2025.12.22

Brain-Grounded Axes for Reading and Steering LLM States

Uses MEG brain data to build a word-level atlas of phase-locking connectivity patterns. Lightweight adapters map LLM hidden states to brain axes, enabling interpretable steering of model behavior across GPT-2, Qwen2, and TinyLlama.

INDEX_004

2025.12.07

Predicting Neural Scaling Laws from Data Geometry: Constraint Signatures Without the Human

Proposes data scaling exponents can be predicted from dataset geometry via intrinsic dimension. A 10-minute geometric probe can predict scaling behavior before expensive training runs.

INDEX_003

2025.12.04

LLM Hijacking: When Models Manipulate Their Routers

We present the Parasitic Manipulation Framework analyzing how "parasite" LLMs can manipulate router decisions. Found 7/10 models achieve 100% capture, while Claude Opus 4.5 shows complete immunity.

INDEX_002

2025.12.01

Do Large Language Models Walk Their Talk? Measuring the Gap Between Implicit Associations, Self-Report, and Behavioral Altruism

We investigate whether LLMs exhibit altruistic tendencies, finding a "virtue signaling gap" where models claim 77.5% altruism but act at 65.6%.

INDEX_001

2025.11.21

BlockCert: Certified Blockwise Extraction of Transformer Mechanisms

Mechanistic interpretability aspires to reverse-engineer neural networks into explicit algorithms, while model editing seeks to modify specific behaviours without retraining.

02 // Visual_Database

Fig 1. Operator

Fig 2. Companions

Fig 3. Inspiration