What question did this study set out to answer?

To develop a pipeline that converts dense float embeddings into binary hypervectors while maintaining their similarity structure for symbolic reasoning.

April 10, 2026Open Access

Model Transplantation: Absorbing Pretrained Neural Knowledge into Binary Hypervectors

Read Full Paperexternally

Key Points

To develop a pipeline that converts dense float embeddings into binary hypervectors while maintaining their similarity structure for symbolic reasoning.
Conversion of dense float embeddings into 10,240-bit binary hypervectors using a Gaussian random matrix followed by sign binarization.
Implementation and benchmarking of the conversion pipeline in the NSCK codebase.
Evaluation on multiple embedding sources including Gaussian synthetic and BERT-based embeddings.
Assessment of correlation between source cosine similarities and projected hypervector Hamming similarities.
Achieved a Spearman rank correlation of ϱ > 0.98 across all embedding sources.
Contextual BERT embeddings demonstrated a correlation of ϱ = 0.9906 and Recall@10 = 0.889.
Transplanted hypervectors showed a significant increase in accuracy in a relation-retrieval task, with Top-1 accuracy of 0.650 compared to a random-HV baseline of 0.250.

Abstract

Large pretrained neural networks encode rich semantic knowledge in dense embeddingmatrices, but this knowledge is locked in a format incompatible with symbolic reasoning. Wepresent Model Transplantation, a pipeline that converts dense float embeddings into 10,240-bit binary hypervectors (HVs) while preserving similarity structure, enabling the full suite ofVector Symbolic Architecture (VSA) operations—binding, bundling, and analogical query—over transplanted knowledge. The core projection step applies a seeded random Gaussianmatrix P ∈ Rd×D followed by sign binarisation, an instance of the Johnson–Lindenstrauss(JL) random projection family.We implement and benchmark the pipeline in the NSCK (Neuro-Symbolic CognitiveKernel) codebase and evaluate on four embedding sources: (i) Gaussian synthetic embeddings(d = 32, n ≤ 1,000); (ii) bert-base-uncased contextual sentence embeddings (d = 768,n = 4,096); (iii) all-MiniLM-L6-v2 curated sentence embeddings (d = 384, n = 280); and(iv) all-MiniLM-L6-v2 scaled phrase embeddings (d = 384, n = 5,000). Across all sources weobserve Spearman rank correlation ϱ > 0.98 between source cosine similarities and projectedHV Hamming similarities. Contextual BERT achieves ϱ = 0.9906, Recall@10 = 0.889, andARI = 0.723; MiniLM-curated achieves Recall@10 = 0.898; and the scaled MiniLM vocabulary(n = 5,000) retains ϱ = 0.9839 with Recall@10 = 0.954. Round-trip encoding fidelity is 1.000(deterministic). We further demonstrate VSA analogy on a balanced 50-equation benchmark(5 categories × 10 analogies): king ⊕ man ⊕ woman ≈ queen. Aggregate accuracy is Top-1= 39/50 = 78% and Top-5 = 48/50 = 96%. Finally, on a downstream synonym-substitutedrelation-retrieval task, transplanted HVs achieve Top-1 accuracy 0.650 versus a random-HVbaseline of 0.250 (absolute gain +0.400), demonstrating functional reasoning gains beyondgeometric preservation. The JL analysis confirms D = 10,240 exceeds the lower bound by≥1.40× for the tested vocabulary sizes.Keywords: Vector Symbolic Architecture, model transplantation, random projection,Johnson–Lindenstrauss lemma, hypervector computing, neuro-symbolic integration, em-bedding compression

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Discussion

Cite this study

Shivam K. Prajapati (Thu,) studied this question.

synapsesocial.com/papers/69d896566c1944d70ce07b6c — DOI: https://doi.org/10.5281/zenodo.19476846

Also consider

Synapse has enriched 5 closely related papers on similar clinical questions. Consider them for comparative context:

Shift-Equivariant Similarity-Preserving Hypervector Representations of Sequences· 2024 · 14 citations
Vector-Symbolic Hybrid Systems Bypass Single-Vector Embedding Limitations on Structured Queries· 2025
HyperVQ: MLR-based Vector Quantization in Hyperbolic Space· 2024
HSViT: Horizontally Scalable Vision Transformer· 2024 · 2 citations
Hyperdimensional Probe: Decoding LLM Representations via Vector Symbolic Architectures· 2025

Authors

Shivam K. Prajapati

University of Prince Edward Island

Actions

Institutions

University of Prince Edward Island

References and Citations

Connected Papers

Building similarity graph...

Analyzing shared references across papers

Model Transplantation: Absorbing Pretrained Neural Knowledge into Binary Hypervectors

Key Points

Abstract

Citation Network

Connected Papers

Discussion

Cite this study

Also consider

Authors

Actions

Institutions

References and Citations

Citation Network

Connected Papers

Discussion

Also consider