Skip to content

A memory-efficient, dense, random projection of sparse vectors

License

Notifications You must be signed in to change notification settings

mhendrey/pocket_dimension

Repository files navigation

A small spiral galaxy inside a small glass sphere


Pocket Dimension provides a memory-efficient, dense, random projection of sparse vectors. This random projection is the used to be able to take records {"id": str, "features": List[bytes], "counts": List[int]}, convert them into sparse random vectors using scikit-learn's FeatureHasher, and then project them down to lower dimensional dense vectors.

When the very large sparse universe becomes too inhospitable, escape into a cozy pocket dimension.

Documentation

Documentation for the API and theoretical foundations of the algorithms can be found at https://mhendrey.github.io/pocket_dimension

Installation

Pocket Dimension may be install using pip:

pip install pocket_dimension

I'm working on a conda-forge version, but this uses pybloomfiltermmap3 which is currently only on PyPi.

About

A memory-efficient, dense, random projection of sparse vectors

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Languages