Hello, I'm

Sid Valecha

Data Scientist & Machine Learning Engineer

Recent graduate passionate about transforming data into actionable insights, specializing in machine learning, statistical analysis, and data visualization.

Experiences

AI/ML Engineer Intern

73 Strings Inc.

June 2025 - August 2025 • New York, NY

  • Enhanced LLM-based parsers to streamline data extraction and consolidation from financial documents, reducing processing time by 30% and improving accuracy for AI-driven private equity portfolio and asset management.
  • Developed and fine-tuned a YOLO-based chart detection model, raising precision by 12% (to 93%) and recall by 11% (to 94.4%) and improving chart classification for financial analytics.
  • Trained the model on 1,700+ annotated images, achieving a mean average precision of 92.2%, and tuned hyperparameters to reduce false positives and improve classification reliability.
  • Explored applications of Large Language Models for portfolio management and presented a report recommending their integration into the company's product suite.

Full Stack Developer Intern

United Nations (Office of Information and Communications Technology)

May 2025 - June 2025 • Remote

  • Developed a file-handling solution for a research repository website using Node.js, Express, MongoDB, and Multer, streamlining document uploads, storage, and retrieval. Tested and validated API endpoints to ensure efficient backend performance.
  • Built an LLM-based Retrieval-Augmented Generation (RAG) system for a research portal, using LangChain, FAISS vector storage, and Ollama embeddings to enable fast, context-aware document retrieval. Implemented a document processing pipeline to prepare large-scale unstructured data for use in generated responses.
  • Developed a rule-based chatbot for a UNICEF learning management system that automates the resolution of user login issues, reducing support overhead.

Skills & Technologies

Key technologies, programming, and analytics skills powering impactful data-driven solutions.

Programming Languages

Python Java R SQL JavaScript

Data Science & ML

Pandas NumPy SciPy Scikit-learn XGBoost TensorFlow PyTorch Matplotlib Seaborn Hugging Face OpenCV Ultralytics YOLO

AI/ML & NLP Technologies

Natural Language Processing LangChain Ollama Embeddings Google GenAI Embeddings Prompt Engineering MLOps LLMs

Web & App Development

HTML CSS JavaScript Flask RESTful APIs Streamlit

Data Engineering & Databases

MongoDB MySQL PostgreSQL Snowflake Elasticsearch FAISS Airbyte

DevOps & Infrastructure

Git Docker Kubernetes AWS EC2 GCP Jenkins

Analytics & Visualization

Tableau Microsoft Excel Kibana

Project Management & Collaboration

Jira Trello Postman

Featured Projects

A selection of data science projects showcasing my skills in machine learning, data analysis, and visualization.

Student Financial Outcomes

Analyzed student loan debt and earnings outcomes by field of study, comparing STEM vs non-STEM fields using College Scorecard data.

R ggplot2 Statistical Analysis Data Visualization

Education

Bachelor of Science in Data Science

University of Wisconsin-Madison

Recent Graduate

  • Passionate about data analysis, machine learning, and software development
  • Relevant coursework: Machine Learning, Statistical Computing, Database Systems

Get In Touch

Whether you have a question or just want to say hi, feel free to reach out!

Email

sidvalecha4@gmail.com

Phone

+1 (848) 219-6614

Location

Madison, WI