Skip to content
View im-anishraj's full-sized avatar
:octocat:
Open Source Contributor
:octocat:
Open Source Contributor

Block or report im-anishraj

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
im-anishraj/README.md

Anish Raj

Google Student Ambassador · CS undergrad · open-source maintainer building Python/C++ data tooling, retrieval systems, and local-first developer tools

Jaipur, India · Building in public since 2024 · Available for open-source, internships, and systems/AI tooling work

LinkedIn arnio on PyPI dooma on PyPI Email

current_focus/
  arnio       C++ accelerated data preparation for Python and pandas
  dooma       Offline-first DSA practice CLI with local progress tracking
  BhojRAG     Hybrid retrieval for low-resource Bhojpuri / Indic NLP
  nactograph  Local-first evidence recorder for AI coding-agent runs

What I Build

I like software that has a real engine under the surface: native extensions, search pipelines, CLIs, data tools, and systems where performance and correctness matter.

  • Python + C++ tooling: pybind11, CMake, wheels, typed Python APIs, pandas/NumPy bridges.
  • Retrieval and AI systems: BM25, FAISS, RRF fusion, evaluation metrics, RAG pipelines.
  • Developer tools: terminal-first workflows, local state, reproducible reports, CI-friendly packaging.
  • Open-source maintainership: issue triage, contribution guides, release automation, tests, docs, and community onboarding.

Featured Work

C++ accelerated data quality toolkit for Python.

Native CSV parsing, cleaning, schema validation, pandas integration, benchmarks, CI, wheel builds, and PyPI distribution.

arnio stars arnio forks arnio PyPI version

Offline-first DSA interview-prep CLI.

3,310 questions, 17,931 company mappings, fuzzy search, SQLite progress tracking, packaged runtime index, PyPI release, and multi-version CI.

dooma stars dooma PyPI version

Hybrid retrieval research framework for low-resource Bhojpuri NLP.

Character n-gram BM25, dense retrieval, reciprocal-rank fusion, IR metrics, ablations, and paper-ready evaluation assets.

Local-first flight recorder for AI coding agents.

PTY capture, git snapshots, JSONL event logs, redaction, risk detection, static HTML reports, and npm distribution.

nactograph npm version

Open Source Signal

  • Maintaining public projects with issue templates, contributor docs, release notes, CI, and packaging.
  • Using issues and pull requests to track release work, contributor tasks, roadmap items, and quality improvements across my own projects.
  • Active GSSoC 2026 contributor/maintainer workflow on selected projects.

Stack

C++ Python TypeScript pybind11 FAISS FastAPI React GitHub Actions

GitHub Snapshot

GitHub profile summary

Repos per language Most committed language GitHub stats

Pinned Loading

  1. arnio arnio Public

    C++ accelerated data quality toolkit for Python: CSV parsing, cleaning, schema validation, profiling, and pandas integration.

    Python 67 232

  2. bhojpuri-lang bhojpuri-lang Public

    Programming ab Bhojpuri mein! A fun, esoteric programming language built with Python. Perfect for beginners learning parsers. GSSoC '26!

    JavaScript 2

  3. dooma dooma Public

    Offline-first terminal DSA interview-prep CLI with 3,310 questions, company mappings, fuzzy search, mocks, and local SQLite progress.

    Python 5 10

  4. Hybrid-Search-RAG-Engine Hybrid-Search-RAG-Engine Public

    Reference hybrid-search RAG API combining FAISS dense retrieval, BM25 sparse retrieval, and reciprocal-rank fusion.

    Python