About Me
I am a Data Scientist and Machine Learning enthusiast with a background in data analysis, machine learning, and artificial intelligence. My studies at UC Berkeley, combined with hands-on experience from internships and projects, have given me the technical expertise and analytical mindset to tackle complex data challenges and deliver innovative solutions.
Throughout my career, I have used a wide range of tools and technologies to analyze data, build predictive models, and develop software. My skills span data cleaning and preprocessing, exploratory data analysis (EDA), and data visualization, as well as implementing advanced machine learning algorithms and optimizing large-scale data pipelines.
My experience includes improving the accuracy of large language models (LLMs) using state-of-the-art architectures, developing efficient data processing systems, and contributing to projects that significantly improved data-driven decision-making.
Reach out to me here!

Data Analysis & Visualization
- Data Cleaning and Preprocessing
- Exploratory Data Analysis (EDA)
- Data Visualization
- Statistical Analysis and Hypothesis Testing
- Data Pipeline Development
- Large-Scale Data Processing
- Data Validation
- Regression Models
Machine Learning & AI
- Supervised and Unsupervised Learning
- Neural Networks and Deep Learning
- Natural Language Processing (NLP)
- Transformers
- Hyperparameter Tuning and Optimization
- Predictive Modeling
- Feature Engineering
- Sequence-to-Sequence Model Training
- Decision Trees
- Random Forests
- Support Vector Machines
Software Development
- Object-Oriented Programming (OOP)
- Software Design and Architecture
- API Development and Integration
- Debugging and Testing
- Documentation and Reporting
Tools & Technologies
- Python, Java, R, SQL
- Pandas, NumPy, Tableau, Matplotlib, Seaborn, Plotly, ggplot2
- Scikit-learn, TensorFlow, PyTorch, Hugging Face Transformers, LlamaIndex, LangChain
- Google Analytics
- Git/GitHub, Docker, Jupyter Notebooks, VS Code, Next.js
- PyPDF, FAISS, ChromaDB, Pinecone