Simanta Lahkar simantalahkar

👋 Hey there! I'm Simanta Lahkar

🔬 Computational Physicist turned Scientific Software Developer and Data Engineer
🧑‍💻 Currently designing databases for large-scale atomistic simulation data at TU Eindhoven × IBM Research
🚀 Passionate about bridging scientific computing with modern data engineering and AI technologies
💡 Love building tools that make complex scientific and engineering workflows accessible and scalable
🌍 Based in Den Bosch, Netherlands
⚡ Fun fact: I enjoy cooking, drone cinematography, and swimming when not debugging simulation data pipelines!

🌐 Connect with Me:

💻 Tech Stack:

Core Programming & Development

Data Engineering & Big Data

Data Analysis & Visualization

Scientific Computing & Machine Learning

Cloud & Infrastructure

🚀 What I'm Working On:

🔬 Scientific Cloud-Native Data Infrastructure & AI Integration

Building an open-source, cloud-native pipeline for large-scale molecular dynamics data. Using MinIO for scalable storage, Apache Spark and Delta Lake for transforming raw trajectories into structured formats, and Trino for fast SQL querying. Integrating MLflow for reproducible AI workflows and orchestrating everything with Apache Airflow. Focused on scalable, metadata-rich infrastructure for scientific computing.

🧬 LAMMPSKit - Production-Ready Scientific Package

Developed a modular Python toolkit for LAMMPS simulation analysis, backed by 270+ tests (94% coverage), Dockerized for portability, and powered by robust CI/CD. Achieved 60% memory savings and 40% faster performance compared to typical scientific scripting workflows.

⚛️ LAMMPS Extension for Electrochemical Simulations

Extended LAMMPS with C++ to integrate two open-source packages for novel electrochemical device simulations, navigating complex licensing and attribution challenges.

💼 Professional Focus:

🎯 Seeking opportunities in:

Scientific Software Development & Computational Materials Science
Data Engineering, Analytics & Data Stewardship
Modeling & Simulation Engineering
AI/ML Applications in Scientific Computing

🔧 Core Expertise:

Data Analysis & Insights: Statistical analysis of large scientific datasets with advanced visualization
Materials Science Modeling: Molecular dynamics simulations, DFT calculations, and multi-scale modeling
Performance Optimization: Algorithmic improvements achieving significant memory and speed gains
Data Pipeline Architecture: Real-time streaming and batch processing for scientific workflows
Data Governance: Metadata management, data quality assurance, and reproducible research practices
Full-Stack Scientific Computing: From Python APIs to C++ algorithms to cloud deployment
Production Software Development: CI/CD, automated testing, containerization, and package distribution

📊 Current Learning Journey:

🌱 Databricks Certified Data Engineer (in progress)
🌱 Cloud-native data lake architectures & data governance
🌱 Advanced statistical analysis and predictive modeling
🌱 Graph-based ML for scientific applications
🌱 Natural language interfaces for scientific databases

🎓 Background:

PhD in Materials Science & Engineering from Shanghai Jiao Tong University with expertise in computational modeling, machine learning, and numerical methods. Transitioned from pure research to building production-ready scientific software that solves real-world problems.

Key Achievement: Led IBM collaboration resulting in 10x device stability improvement through innovative simulation algorithms and data processing pipelines.

🗣️ Languages:

🇬🇧 English (Professional - C2)
🇳🇱 Dutch (Learning - Beginner)
🇨🇳 Chinese (Basic - A1)
🇮🇳 Hindi, Assamese, Bengali (Native)

💬 Let's connect! I'm always excited to discuss scientific computing, materials science research, data engineering challenges, or opportunities to make complex data more accessible through better analysis and visualization. Whether you're looking to optimize simulation workflows, design scalable data architectures, implement data governance, or bridge the gap between research and production - I'd love to hear from you!

📫 Reach out: [email protected] | LinkedIn | Portfolio

Provide feedback

Saved searches

Use saved searches to filter your results more quickly