Skip to content
View ashvardanian's full-sized avatar
โ˜•
Less Slow
โ˜•
Less Slow

Block or report ashvardanian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
ashvardanian/README.md

Hey, I'm Ash, and I Love Building Infra

  • Building Unum Cloud since 2015.
  • Computer Science & AI researcher w/out publications.
  • Twice an Astrophysics dropout. Bioinformatics fan.
  • Investing in deep-tech, cloud, & semiconductors.
  • Fluent in English, Russian & Armenian.
  • Lived in ๐Ÿ‡บ๐Ÿ‡ธ๐Ÿ‡ฌ๐Ÿ‡ง๐Ÿ‡ท๐Ÿ‡บ๐Ÿ‡ฆ๐Ÿ‡ฒ & ๐Ÿ‡ฒ๐Ÿ‡ฝ๐Ÿ‡ต๐Ÿ‡ฆ๐Ÿ‡ฆ๐Ÿ‡ท๐Ÿ‡ฉ๐Ÿ‡ช๐Ÿ‡ฆ๐Ÿ‡ช๐Ÿ‡น๐Ÿ‡ญ๐Ÿ‡ฒ๐Ÿ‡พ๐Ÿ‡ป๐Ÿ‡ณ๐Ÿ‡ฎ๐Ÿ‡ฉ.
  • Frequent host of "Systems" meetups in Armenia, and beyond.

For the past 20ish years, I have been coding mainly in C++, CUDA, Python, and optimizing Assembly on x86/ARM. Prefer spaces over tabs, east-const, and procedural programming over object-oriented or functional.

Want to chat? My nick is @ashvardanian on GitHub, LinkedIn, Twitter, Facebook, and YouTube. If you're looking for Venture Capital for something deeply technical or scientific โ€“ feel free to send links or materials to [email protected] ๐Ÿค—

GitHub Org's stars: unum-cloud GitHub User's stars: ashvardanian HackerNews User Karma
USearch Python installs SimSIMD Python installs StringZilla Python installs

Repositories

  • USearch - single-file search engine for C, C++, Rust, Swift, Go, Java, C#, Python, JS..., FAISS alternative
  • StringZilla - faster strings with SIMD/SWAR for C, C++, Rust, Python, and Swift, GlibC & STL alternative
  • UForm - small multimodal pre-trained AI models with SDKs for Python, JS, and Swift, CLIP alternative
  • SimSIMD - fast vector-vector math library for C, Python, Rust, and JavaScript, BLAS l1 alternative
  • UCall - networking server library for C and Python designed for Linux kernel bypass, FastAPI alternative

Some of those are used in ClickHouse, DuckDB, TiDB, ScyllaDB, YugaByte, LangChain, SemanticKernel, MemGraph, KuzuDB, and many other less "open" systems, such as competitive AI labs, Cloud companies, Fortune 500, iOS and Android apps with 100M-1B MAU, and government agencies. Other software projects & datasets include:

  • usearch-molecules - 28 billion embeddings - one of the largest datasets in Computational Chemistry
  • less_slow.cpp - teaches a performance oriented mindset for C++, CUDA, PTX, and ASM
    • less_slow.rs - Rust adaptation with a focus on higher-level abstractions
    • less_slow.py - Python adaptation with a focus on scripting & data-management
  • fork_union - OpenMP-like Rust & C++ scoped thread-pool w/out mutexes, allocs, exceptions, or CAS
  • UStore - multimodal embedded database for C, C++, and Python designed around key-value stores
  • TinySemVer - Semantic versioning GitHub CI tool that doesn't take 300K lines of JavaScript
  • affine-gaps - Less wrong local and global Gotoh sequence alignments in one NumBa Python file
  • SpaceV - 1 billion vectors from Microsoft SpaceV extended for usability

And more demos, benchmarks, and fun hackathon projects:

  • StringWa.rs - micro-benchmarking StringZilla against the best Rust tools
  • scaling-democracy - combinatorial voting method implemented in CUDA for H100 GPUs
  • SwiftSemanticSearch - example of on-device real-time AI using UForm and USearch on iOS
  • LibSee - non-intrusively profiling LibC calls with LD_PRELOAD tricks
  • ParallelReductionsBenchmark - GPGPU benchmarks for SyCL, CUDA, OpenCL, Vulkan, etc.
  • cuda-python-starter-kit - C++ and CUDA starter kit for Python developers avoiding CMake
  • ucsb - parallel benchmarks for ACID persistent key-value stores, like RocksDB
  • jaccard-index - optimizing CPU port utilization with Carry-Save Adders & Lookups
  • USearchBench.py - Billion-scale search benchmarks against FAISS, Weaviate, and Qdrant
  • USearchBench.java - Billion-scale search scaling benchmarks against Lucene, using Spark

Materials

Cherry picks:

Pinned Loading

  1. unum-cloud/usearch unum-cloud/usearch Public

    Fast Open-Source Search & Clustering engine ร— for Vectors & Arbitrary Objects ร— in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram ๐Ÿ”

    C++ 3.1k 218

  2. StringZilla StringZilla Public

    Up to 10x faster strings for C, C++, Python, Rust, Swift & Go, leveraging NEON, AVX2, AVX-512, SVE, & SWAR to accelerate search, hashing, sort, edit distances, and memory ops ๐Ÿฆ–

    C 2.7k 88

  3. unum-cloud/ucall unum-cloud/ucall Public

    Web Serving and Remote Procedure Calls at 50x lower latency and 70x higher bandwidth than FastAPI, implementing JSON-RPC & REST over io_uring โ˜Ž๏ธ

    C 1.2k 51

  4. SimSIMD SimSIMD Public

    Up to 200x Faster Dot Products & Similarity Metrics โ€” for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, โ€ฆ

    C 1.5k 84

  5. unum-cloud/uform unum-cloud/uform Public

    Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and ๐Ÿ”œ video, up to 5x faster than OpenAI CLIP and LLaVA ๐Ÿ–ผ๏ธ & ๐Ÿ–‹๏ธ

    Python 1.2k 75

  6. less_slow.cpp less_slow.cpp Public

    Playing around "Less Slow" coding practices in C++ 20, C, CUDA, PTX, & Assembly, from numerics & SIMD to coroutines, ranges, exception handling, networking and user-space IO

    C++ 1.8k 71