|
i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world. not demos, actual systems with real users and real traffic. before this i was at Afiniti and Cloud Kinetics for a few years. fraud detection, voice analytics, enterprise search. the kind of stuff that pages you at 3am when something breaks. honestly what keeps me going is when an agent you built solves something you never explicitly told it to do. that feeling never gets old. what i'm working on right now:
|
|
|
Agentic AI Workflows |
RAG Enterprise Search |
|
Voice AI Platform |
LLM Fine-Tuning LoRA |
|
RLHF LLM Optimization |
Sentinel Fraud Detection |
not going to pretend i use everything equally. here's what i actually reach for:
the full picture (click to expand)
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
Production Rag Pipelines With Re Ranking
|
Multi Agent Ai Orchestration Patterns
|
|
|
💬 Commented on [Enhancement] A hybrid filtering strategy for Weaviate HNSW in weaviate/weaviate (2026-07-02)
💬 Commented on Memory backend: Dakera — decay-weighted persistent memory fo in Significant-Gravitas/AutoGPT (2026-07-02)
💬 Commented on [BUG/Help] <title>UI Text Encoding Bug: Greek lowercase 'ω' in zai-org/ChatGLM-6B (2026-07-02)
💬 Commented on PipelineController daemon crashes with AttributeError: 'None in clearml/clearml (2026-07-02)
💬 Commented on Feature request : Advanced Ontology Management in neuml/txtai (2026-07-02)
💬 Commented on Just wanna know in predibase/lorax (2026-07-02)
💬 Commented on When tabby is running, unstaged git changes are randomly rev in TabbyML/tabby (2026-07-02)
💬 Commented on [Bug] DeepEP low_latency buffer lazy init fails during CUDA in sgl-project/sglang (2026-07-02)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 Multi-Agent AI Orchestration Patterns
🔬 Production RAG Pipelines with Re-ranking
🔬 Automated Machine Learning (AutoML) for Time Series Forecasting
🔬 AutoML for Complex, High-Dimensional Data
🔬 Real-Time Data Quality Monitoring for ML Systems
🔬 Fine-Tuning LLMs with Parameter-Efficient Methods (LoRA/QLoRA) at Scale
📌 RAG Relevance Scorer using Cross-Encoder — Production Pattern (Python) (2026-07-02)
📌 Real-Time Feature Store Client — Production Pattern (Python) (2026-07-01)
📌 Async Retry Pattern with Exponential Backoff — Production Pattern (Python) (2026-06-29)
🤖 Profile auto-updated on 2026-07-02 19:40 UTC


