Skip to content
View ShreyPatel4's full-sized avatar
🏠
Working from home
🏠
Working from home
  • 00:26 (UTC -12:00)

Organizations

@ganpat-university

Block or report ShreyPatel4

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
ShreyPatel4/README.md

Hi, I'm Shrey Patel πŸ‘‹

πŸš€ Data Scientist | Data Engineer | Full Stack Developer | AI Enthusiast

Welcome to my GitHub profile! I am a passionate Data Scientist and Data Engineer with over 4 years of professional experience, working in industries like healthcare, manufacturing, and fintech. My focus is on building scalable data pipelines, applying machine learning algorithms, and solving complex problems using data-driven insights.


πŸ”§ Technologies & Tools

  • Languages: Python, R, Java, C, C++, PySpark, Scala, Hive
  • Data Engineering: Docker, Jenkins, Airflow, Pentaho-ETL, Kafka, Databricks, Informatica
  • Machine Learning & AI: PyTorch, TensorFlow, CNN, RNN, LSTM, GNN, Random Forest, Decision Trees
  • Cloud Services: AWS (S3, Glue, SageMaker), Azure Databricks, Kubernetes, IBM Watson, Redshift
  • Databases: Snowflake, DynamoDB, Redshift, Hadoop, PostgreSQL, MySQL, BigQuery, HBase
  • Other Tools: Docker, Jenkins, Git, Linux, Elastic MapReduce, Lake House Architecture
  • Algorithms & Techniques: PCA, SVM, CNN, RNN, LSTM, DBN, NAS, Unsupervised NLP, DQN

πŸ† Featured Projects

Leveraged EfficientNet with ImageNet pre-training to predict the severity of pulmonary fibrosis using clinical and DICOM datasets.

  • Tech Stack: Python, TensorFlow, ImageNet, DICOM
  • Key Features: Ensemble learning, medical data prediction, early intervention strategies
  • Outcome: Achieved 68% accuracy in severity prediction, facilitating timely medical decisions

Developed a CNN-based model to detect defects in semiconductor wafers, improving manufacturing efficiency.

  • Tech Stack: Python, CNN, TensorFlow, AWS EC2
  • Key Features: Defect detection, semiconductor manufacturing, pattern recognition
  • Outcome: Achieved 94% accuracy in defect detection, reducing fabrication errors significantly

Created a synthetic data generation pipeline using Unity3D and GANs to enhance autonomous systems' training datasets.

  • Tech Stack: Unity3D, Blender, GANs, AWS EC2
  • Key Features: High-fidelity synthetic data, scaling data production for real-world scenarios
  • Outcome: Improved model accuracy by addressing edge cases, enhancing autonomous system performance

πŸ’Ό Professional Experience

Data Specialist @ CareWallet (Sep 2023 - Jan 2024)

  • Enhanced fraud detection by 35% using AWS Rekognition and restructured mobile app architecture.
  • Boosted analytical insights by 30% with a HIPAA-compliant Snowflake DB architecture.
  • Led cross-functional teams, improving application security by 12%.

Data Engineer @ Ridgeant Technologies (Jul 2021 - Aug 2023)

  • Drove 37% YoY growth by reengineering ETL pipelines and implementing dynamic pricing models.
  • Transitioned ETL processes to Informatica, boosting SQL efficiency by 95% and reducing B2B costs by 20%.
  • Increased pharmaceutical sales by 35% through planogram optimization and predictive modeling.

Software Developer @ ZF Friedrichshafen AG (Apr 2021 - Sep 2021)

  • Improved real-time data retrieval by 30% for GCP Nearby-Search API, serving over 1,000 daily searches.
  • Expanded application reach by 50% to 95 hospitals, improving healthcare data access and interaction.

🧠 What I’m currently learning / working on:

  • Exploring MLOps and improving proficiency in Kubernetes for large-scale machine learning pipelines.
  • Working on an AI-powered fitness tracker with integrated community engagement and meal recognition.
  • Enhancing skills in Generative AI for autonomous system performance and NLP tasks.

πŸ“« How to reach me:


🌱 Fun Fact:

I love applying data science techniques to everyday problems like optimizing travel routes or analyzing personal fitness data. Also, I’m a huge fan of photography and often combine it with data visualization!


Pinned Loading

  1. facebook/react facebook/react Public

    The library for web and native user interfaces.

    JavaScript 228k 46.5k

  2. facebook/infer facebook/infer Public

    A static analyzer for Java, C, C++, and Objective-C

    OCaml 14.9k 2k

  3. Advanced-Data-Predictive-Analytics Advanced-Data-Predictive-Analytics Public

    Advanced analytics which is used to make predictions about unknown Test-Cases From Test-Data. Predictive analytics uses many techniques from data mining, statistics, modeling, machine learning, and…

    Jupyter Notebook

  4. Advanced-MOOC-Result-Scraper- Advanced-MOOC-Result-Scraper- Public

    Advanced Automated Data-Mining Tool For MOOC Result to Scrap in one click.

    Python 1

  5. facebookarchive/react-360 facebookarchive/react-360 Public archive

    Create amazing 360 and VR content using React

    JavaScript 8.7k 1.2k