Viacheslav Zhenylenko,美国加州旧金山开发人员
Viacheslav is available for hire
Hire Viacheslav

Viacheslav Zhenylenko

Verified Expert  in Engineering

Machine Learning Developer

Location
旧金山,加州,美国
Toptal Member Since
June 13, 2019

Viacheslav在数据科学和软件工程方面有8年的经验, focusing on Python, 有Java和c++的生产经验. 他对从原始数据中获得的见解充满热情,并喜欢将它们转化为创造卓越的商业价值. Viacheslav精通应用高级机器学习技术, such as computer vision, NLP, product recommendation systems, networking data, 以及经典的数据科学来解决数据量大的项目.

Portfolio

创能有限公司
数据科学、机器学习、网络抓取、自然语言处理(NLP)...
Classic J. Vans INC.
Python,数据抓取,网页抓取,Selenium,亚马逊网络服务(AWS)...
Spectation Sports LLC
Python,数据科学,机器学习,数据工程,R语言,统计学...

Experience

Availability

Part-time

Preferred Environment

Eclipse, Visual Studio Code (VS Code), PyCharm, Jupyter, MacOS, Linux, Vim Text Editor, Sublime Text, Bash

The most amazing...

...我开发的项目是业界首创的, 用于RAN网络中拥塞检测的自重构AutoML系统.

Work Experience

Data Scientist

2023 - 2023
创能有限公司
  • 为数百个网站开发了并行数据收集流程.
  • 利用法学硕士进行总结, topic extraction, sentiment analysis, report generation, and other tasks.
  • 在AWS上部署了批处理、EventBridge、Lambda等工具.
Technologies: 数据科学、机器学习、网络抓取、自然语言处理(NLP), GPT, 生成预训练变压器(GPT), Data Engineering, Python, Sentiment Analysis, Data Visualization, Language Models, Large Language Models (LLMs), OpenAI, OpenAI GPT-3 API, OpenAI GPT-4 API, ChatGPT, Selenium

Data Scientist

2023 - 2023
Classic J. Vans INC.
  • 开发工具,以结构化的方式从多个web源自动收集信息.
  • 利用llm对原始文本进行命名实体提取和分类.
  • 使用带有代理的Selenium来模拟web客户机.
  • 在AWS上部署工具,包括EventBridge、ECS、Lambda、S3、QuickSight、SES等.
Technologies: Python,数据抓取,网页抓取,Selenium,亚马逊网络服务(AWS), Language Models, Machine Learning, Deep Learning, Neural Networks, Business Intelligence (BI), 生成预训练变压器(GPT), GPT, Large Language Models (LLMs), OpenAI, OpenAI GPT-3 API, OpenAI GPT-4 API, ChatGPT

Data Scientist | Engineer

2023 - 2023
Spectation Sports LLC
  • 开发了综合格斗赛前预测和赛中预测模型.
  • 利用多种机器学习技术, such as boosted trees, regression, ensembling, stacking, imputation, and others.
  • 为单个战斗机执行多步骤特征工程.
  • 从统计上确定这些数据的最有价值的特征和最佳外部来源.
  • 将模型部署并集成到现有的数据管道中, 包括MLOps方法的元素.
Technologies: Python,数据科学,机器学习,数据工程,R语言,统计学, Scikit-learn, PyTorch, Deep Learning, Amazon Web Services (AWS), Neural Networks

AI Engineer

2022 - 2023
Briefly
  • 将OpenAI模型用于提高生产力和客户成功领域的多个任务. 执行了微调、RAG、代理、函数调用等.
  • 使用Terraform开发分布式云基础设施, Amazon弹性容器服务(Amazon ECS), Amazon DynamoDB, Amazon EventBridge, Amazon Simple Queue Service (SQS), AWS Lambda, 和亚马逊简易电子邮件服务(SES).
  • 执行与外部服务和工具的API集成, including OpenAI, Google services, Slack, Notion, and Hubspot.
  • 设计并构建了一些带有业务逻辑的Django api.
Technologies: Django, Web Frameworks, PostgreSQL, SQL, 生成预训练变压器(GPT), 自然语言处理(NLP), GPT, OpenAI Gym, React, Jupyter Notebook, ChatGPT, Automation, OpenAI GPT-3 API, OpenAI GPT-4 API, APIs, HubSpot, Notion, HubSpot CRM, OpenAI, Large Language Models (LLMs), REST APIs, Language Models, Machine Learning, Deep Learning, Python, Amazon Web Services (AWS), Neural Networks, LangChain

Senior Data Engineer

2022 - 2022
Grata Inc.
  • 在Kubernetes上使用芹菜开发端到端分布式nlp地理编码管道.
  • 从公司网站和聚合器中实现抓取.
  • 在AWS上开发并部署了SageMaker网页分类模型.
  • 实现了一个带有查询查找的混合地理编码模型, query relaxation, result validation, prioritization, and fallback mechanisms.
  • 使用可用地理数据库和离线实体提取器(如libpostal和第三方地理编码服务)的组合,将其组合在一个视图中.
  • 增加了部分解析地址, 减少了95%的错误地址, 并将整体数据质量得分提高了20%.
Technologies: Python, Docker, Elasticsearch, Celery, Kubernetes, Jenkins, PostgreSQL, REST, Geocoding, Grafana, DataOps, Datadog, Pandas, NumPy, 自然语言工具包(NLTK), GIS, React, Cloud Infrastructure, SQL, Deep Learning, Amazon Web Services (AWS), Git, Agile Software Development, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, Django, Jupyter, GPT, 生成预训练变压器(GPT), 自然语言处理(NLP), Parallel Programming, PyCharm, Data Science, Flask, React Redux, Algorithms, Data Analysis, Python 3, SciPy, DevOps, Amazon SageMaker, Data Scraping, Software Development, Programming, Transformers, Concurrent Programming, Data Engineering, Analytics, API Integration, Data Validation, Data Visualization, Dashboards, Databases, Software Engineering, Scikit-learn, Matplotlib, Unit Testing, ETL, GeoPandas, Quality Assurance (QA), APIs, Jupyter Notebook, Web Frameworks, REST APIs, Machine Learning

Lead Data Scientist

2020 - 2022
Botprise, Inc.
  • 在平台上为完整的ML周期、ModelOps和MLOps开发后端. 在AWS SageMaker之上添加了一个包装器.
  • 参与了几十个自动化工作流(用例), including MLOps, analytics, DataOps, networks, ITOps, etc.
  • 使用React创建后端和前端元素进行拖放, chatbot-building application.
  • 实现和部署了数十种算法(分类), clustering, time series, NLP, 和计算机视觉)用于不同的用例.
  • 领导一个小型机器学习团队,包括计划、管理、监控和领导.
技术:亚马逊网络服务(AWS), Python, Flask, MongoDB, Apache Kafka, React Redux, REST, Docker, Kubernetes, Amazon SageMaker, SciPy, NumPy, Pandas, PyTorch, TensorFlow, Transformers, Management, PostgreSQL, 机器学习操作(MLOps), React, Cloud Infrastructure, SQL, Computer Vision, Deep Learning, Keras, Git, Agile Software Development, Linux, Object Detection, Convolutional Neural Networks, Bash, Sublime Text, Vim Text Editor, Anomaly Detection, Classification, Data Pipelines, AutoML, TCP/IP, Networks, Time Series Analysis, Concurrent Programming, Predictive Analytics, Jupyter, OpenCV, Predictive Modeling, Machine Learning Automation, GPT, 自然语言处理(NLP), 生成预训练变压器(GPT), PyCharm, Data Science, Algorithms, Python 3, AWS CloudFormation, DataOps, Software Development, Networking, AWS Lambda, Artificial Intelligence (AI), Programming, Datadog, 自然语言工具包(NLTK), Data Engineering, Analytics, API Integration, Data Validation, Data Visualization, Neural Networks, Dashboards, Databases, Software Engineering, Linear Regression, Microsoft Excel, Spreadsheets, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Plotly, APIs, Jupyter Notebook, Automation, Web Frameworks, REST APIs, Machine Learning

Data Analyst

2020 - 2020
Spin (Tier Mobility)
  • 使用汽车选择和汽车再培训模型,对一家全球电动滑板车租赁公司(数百个城市)的电动滑板车需求进行时间序列预测.
  • 执行临时数据分析并构建Looker仪表板.
  • 进行干预效果分析(促销及其他活动).
Technologies: Data Validation, Data Analysis, Data Analytics, Python, SQL, R, Data Science, Data Visualization, Google Cloud, Google Cloud Platform (GCP), Looker, BigQuery, Google BigQuery, Pandas, NumPy, Git, Agile Software Development, Linux, Cloud Infrastructure, Bash, Sublime Text, Vim Text Editor, Data Pipelines, Time Series Analysis, Predictive Analytics, Jupyter, Predictive Modeling, PyCharm, Apache Airflow, Algorithms, Docker, Python 3, SciPy, Software Development, Programming, Analytics, Business Intelligence (BI), Dashboards, Databases, Software Engineering, Linear Regression, Spreadsheets, Scikit-learn, Matplotlib, Statistical Modeling, ETL, Plotly, Jupyter Notebook, Internet of Things (IoT), REST APIs, Machine Learning

Senior MLOps Engineer

2020 - 2020
Pro Football Focus, LLC
  • 介绍并实现了MLOps技术、工具和方法.
  • 将一打单片R管道重建为分布式, modular, 以及函数式Python管道.
  • 在Dagster, Seldon, Feast等工具之上开发了MLOps层.
  • 为速度和性能对现有模型超参数进行微调.
Technologies: Data Science, Python, R, Machine Learning, React, Python 3, Seldon, Dagster, RabbitMQ, PostgreSQL, Feast, 机器学习操作(MLOps), Cloud Infrastructure, SQL, Deep Learning, Pandas, NumPy, Amazon Web Services (AWS), Git, Agile Software Development, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, 可解释人工智能(XAI), Predictive Analytics, Jupyter, Predictive Modeling, Parallel Programming, PyCharm, Visual Studio Code (VS Code), Data Analytics, Apache Airflow, REST, Algorithms, Kubernetes, Docker, SciPy, Prefect, AWS CloudFormation, DevOps, Software Development, AWS Lambda, Programming, Dask, Data Engineering, Databases, Software Engineering, Linear Regression, Sports, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Plotly, Quality Assurance (QA), Jupyter Notebook, Web Frameworks, APIs, REST APIs, DVC

Machine Learning Engineer

2020 - 2020
Plutoshift, Inc.
  • 将MLOps工具引入Seldon、Feast、Great Expectations等现有基础设施.
  • 迁移现有的硬编码模型以引入MLOps基础结构.
  • 使用Django为ml相关服务开发后端api.
  • 实现了制造传感器的分类和时间序列预测模型.
Technologies: Machine Learning, Python, Django, Convolutional Neural Networks, Object Detection, TensorFlow, PyTorch, Keras, Apache Airflow, Cloud Infrastructure, Google Cloud Platform (GCP), Azure, Cassandra, Apache Cassandra, Seldon, Feast, SQL, Deep Learning, Pandas, NumPy, Git, Agile Software Development, Linux, Google Cloud, Bash, Sublime Text, Vim Text Editor, Anomaly Detection, Classification, Data Pipelines, Time Series Analysis, Predictive Analytics, Jupyter, Predictive Modeling, PyCharm, Data Science, Flask, REST, Algorithms, Kubernetes, Docker, Python 3, SciPy, Software Development, AWS Lambda, Programming, Datadog, Data Engineering, Data Visualization, Databases, Software Engineering, Linear Regression, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Quality Assurance (QA), Jupyter Notebook, Internet of Things (IoT), APIs, REST APIs

Senior AI Developer

2019 - 2020
Akcelita
  • 在AWS上为来自监控摄像头的照片开发了一个摄取和处理管道.
  • 实验了各种非深度学习和深度学习方法并进行了测试. 我还使用并训练了带有注意力机制的暹罗神经网络,达到了95%以上的准确率.
  • 创建并与分析人员共享演示文稿,并为项目开发和维护Wiki.
技术:亚马逊网络服务(AWS), TensorFlow, Python, Docker, PyTorch, AWS Lambda, Computer Vision, Deep Learning, Machine Learning, Classification, Artificial Intelligence (AI), Cloud Infrastructure, SQL, Pandas, NumPy, Keras, Git, Agile Software Development, Linux, Convolutional Neural Networks, Bash, Sublime Text, Vim Text Editor, Data Pipelines, Predictive Analytics, Jupyter, OpenCV, Predictive Modeling, PyCharm, Data Science, Algorithms, Python 3, SciPy, Software Development, Programming, Data Visualization, Neural Networks, Software Engineering, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Jupyter Notebook, REST APIs

Team Lead

2018 - 2019
乌克兰国家科学院
  • 领导和指导一个学生团队. 我用敏捷方法定义了目标并控制了过程.
  • 创建了农作物分类和地图创建工具.
  • 手动收集数据,并通过网络抓取使用Mapillary和监督数据标签.
  • 实现并测试了DeblurGAN和其他几个经典的去模糊方法.
  • 通过微调ResNet模型,监督田间定位(YOLO)和作物分类.
Technologies: OpenCV, PyTorch, TensorFlow, Keras, Python, Python 3, Computer Vision, 生成对抗网络(GANs), Cloud Infrastructure, Deep Learning, Pandas, NumPy, Amazon Web Services (AWS), Git, Agile Software Development, Linux, Object Detection, Convolutional Neural Networks, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, Predictive Analytics, Jupyter, Predictive Modeling, PyCharm, Visual Studio Code (VS Code), Data Science, Algorithms, Docker, SciPy, Beautiful Soup, Web Scraping, Data Scraping, Software Development, AWS Lambda, Programming, GIS, API Integration, Neural Networks, Software Engineering, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, GeoPandas, Jupyter Notebook, REST APIs, Machine Learning

Senior Data Scientist

2017 - 2019
Openwave Mobility
  • 创建了一个多阶段的数据管道,从原始数据包数据(TCP/IP层)到机器学习模型的可消费输入,并在Python中实现多处理(CPython).
  • Trained, tuned, evaluated, 并比较了Python (scikit-learn)中的多个机器学习模型, Keras, XGBoost, CatBoost) and C++ (mlpack).
  • 监督数据分析和与利益相关者的沟通. 创建了一个可重用的Python工具,用于生成快速和外部可配置的数据分析报告.
  • 实现了基于专家知识聚合的自定义特征生成算法, derivatives, TCP/IP conversation delays, products, and fractions.
  • 实现了基于模型的自定义多阶段特征选择算法.
  • 在网络中部署和监控生产中的项目. 如果工具检测到拥塞,则应用优化策略. 客户报告说,视频内容的传输质量提高了20%.
Technologies: C++, Python, Python 3, Concurrent Programming, Agile Software Development, Machine Learning, Deep Learning, Time Series Analysis, Networks, Networking, TCP/IP, TensorFlow, SciPy, Scikit-learn, Pandas, NumPy, mlpack, AutoML, 可解释人工智能(XAI), Cloud Infrastructure, SQL, Amazon Web Services (AWS), Git, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Data Pipelines, Predictive Analytics, Jupyter, Predictive Modeling, Machine Learning Automation, Parallel Programming, PyCharm, Data Science, Data Analytics, Algorithms, Docker, Data Analysis, Software Development, AWS Lambda, Artificial Intelligence (AI), Programming, Data Engineering, Analytics, Data Validation, Data Visualization, Neural Networks, Databases, Software Engineering, Linear Regression, Matplotlib, Statistical Modeling, Unit Testing, ETL, Plotly, Jupyter Notebook, Automation

Data Scientist

2015 - 2017
Octetis
  • 为在线商店开发、部署并评估了一个Python混合推荐引擎.
  • 监督客户行为分析、可视化和利益相关者沟通.
  • 使用策略模式处理不同的用户粘性场景. 根据受欢迎程度(一般和基于类别)给出上下文建议。, item-to-item, and SVD. (Python, scikit-learn, SciPy).
  • 将推荐引擎集成到Django后端.
  • 随机抽样进行多次A/B测试,对系统进行评估. 与类别基线中最受欢迎的项目进行比较, 我们每回合的购买量提升了150%,收益也有所增加.
  • 用Keras为在线云站点构造器创建了一个图像超分辨率模块.
  • 利用中深度CNN,在多个模糊内核上进行训练,并通过REST将其作为服务部署.
  • 进行的调查显示,该平台用户的满意度提高了约5%.
Technologies: TensorFlow, Keras, Python, Pandas, NumPy, Python 3, PostgreSQL, Recommendation Systems, Django, Data Analysis, Data Analytics, Machine Learning, Artificial Intelligence (AI), Cloud Infrastructure, MySQL, SQL, Computer Vision, Deep Learning, Amazon Web Services (AWS), Git, Agile Software Development, Linux, Object Detection, Convolutional Neural Networks, Bash, Sublime Text, Vim Text Editor, Anomaly Detection, Classification, Data Pipelines, Time Series Analysis, Predictive Analytics, Jupyter, OpenCV, Predictive Modeling, PyCharm, Data Science, REST, Algorithms, Docker, SciPy, Jenkins, Microsoft Power BI, Software Development, AWS Lambda, Programming, Data Engineering, Analytics, API Integration, Data Validation, Data Visualization, Neural Networks, Business Intelligence (BI), Dashboards, Databases, Software Engineering, Linear Regression, Microsoft Excel, Spreadsheets, Scikit-learn, Matplotlib, Statistical Modeling, Unit Testing, ETL, Tableau, Plotly, Jupyter Notebook, REST APIs

Software Engineer Intern

2015 - 2015
Facebook
  • 使用FBLearner Flow对AdaBoost客户流失预测模型进行培训和评估.
  • 为优化执行超参数调优.
  • 使用Hive进行数据工程,使用Python处理数据.
Technologies: Apache Hive, Python, Python 3, Data Pipelines, FBLearner Flow, Machine Learning, Git, Agile Software Development, Linux, Bash, Sublime Text, Vim Text Editor, Classification, Predictive Analytics, Predictive Modeling, Data Science, Data Analytics, Algorithms, Software Development, Programming, Data Engineering, Analytics, Databases, Software Engineering, ETL

Research Intern

2014 - 2015
Samsung
  • 开发了智能键盘功能的算法(单词预测和拼写纠正).
  • 开发了n-gram的朴素贝叶斯和k -近邻(KNN)的拼写更正.
  • 使用拉普拉斯平滑和自定义键盘距离为KNN创建了更好的算法性能微调.
  • Developed algorithms with C++. 将它们与Java和Android键盘集成,并发布到App Store.
Technologies: C++, Android, Java, Machine Learning, Artificial Intelligence (AI), 生成预训练变压器(GPT), GPT, 自然语言处理(NLP), Android NDK, Git, Agile Software Development, Linux, Bash, Sublime Text, Vim Text Editor, Eclipse, Data Science, Algorithms, Software Development, Programming, Software Engineering

Software Engineer Intern

2013 - 2014
Engage Point
  • 在雅加达EE开发了一个内容管理互操作性系统. 我对应用程序使用了模型-视图-控制器框架.
  • 为应用程序的业务逻辑开发了Enterprise JavaBeans.
  • 为表示级别开发JavaServer页面.
Technologies: Java EE, Java, Linux, Git, Bash, Sublime Text, Vim Text Editor, Eclipse, Software Development, Programming, Software Engineering

Web Scraping Mapillary

这是一个使用Mapillary API在特定位置下载图像的代码示例. 它搜索地图上的特定位置, identifies the car's angle, 刮擦的只是道路近90度角的侧视图照片.

Languages

Python, SQL, Python 3, Bash, R, Snowflake, Fortran, Java, c++

Frameworks

Django, Flask, Web Frameworks, Selenium

Libraries/APIs

Scikit-learn, Keras, TensorFlow, PyTorch, Pandas, NumPy, SciPy, Beautiful Soup, Dask, 自然语言工具包(NLTK), Matplotlib, REST APIs, OpenCV, React Redux, PySpark, React

Tools

Jupyter, PyCharm, IPython Notebook, Amazon SageMaker, Geocoding, GIS, Vim Text Editor, Sublime Text, Plotly, Apache Airflow, Git, Celery, Jenkins, AWS CloudFormation, Grafana, Looker, Microsoft Power BI, AutoML, RabbitMQ, BigQuery, Microsoft Excel, Spreadsheets, Tableau, Notion, Android NDK, OpenAI Gym

Paradigms

Data Science, Agile Software Development, REST, Unit Testing, ETL, Parallel Programming, DevOps, Concurrent Programming, Anomaly Detection, Business Intelligence (BI), Automation, Management

Platforms

Docker, AWS Lambda, Jupyter Notebook, MacOS, Amazon Web Services (AWS), Linux, Apache Kafka, Kubernetes, Visual Studio Code (VS Code), Eclipse, Android, Java EE, Google Cloud Platform (GCP), Azure

Storage

MySQL, PostgreSQL, Data Pipelines, Data Validation, MongoDB, Elasticsearch, Datadog, Databases, Apache Hive, Cassandra, Google Cloud

Other

Predictive Analytics, Predictive Modeling, Machine Learning Automation, Computer Vision, Data Analytics, Deep Learning, Machine Learning, Mathematics, Applied Mathematics, Statistics, Algorithms, Data Analysis, Dagster, 机器学习操作(MLOps), Prefect, Computer Science, Web Scraping, Data Scraping, Software Development, Artificial Intelligence (AI), Programming, Computational Science, Time Series Analysis, Convolutional Neural Networks, Object Detection, Seldon, Feast, Data Visualization, Data Engineering, Analytics, Neural Networks, Software Engineering, Linear Regression, Statistical Modeling, GeoPandas, APIs, 生成预训练变压器(GPT), ChatGPT, OpenAI GPT-3 API, OpenAI GPT-4 API, OpenAI, Large Language Models (LLMs), Language Models, Recommendation Systems, 自然语言处理(NLP), Science, Scientific Computing, DataOps, Physical Science, Networking, Physics, Applied Physics, Transformers, TCP/IP, mlpack, 可解释人工智能(XAI), Cloud Infrastructure, Google BigQuery, API Integration, Dashboards, Quality Assurance (QA), GPT, Internet of Things (IoT), HubSpot, HubSpot CRM, LangChain, 生成对抗网络(GANs), Networks, FBLearner Flow, Classification, Apache Cassandra, Sports, Software Architecture, Sentiment Analysis, DVC

2019 - 2021

理论物理硕士(量子场论)

基辅国立大学-基辅,乌克兰

2018 - 2020

计算机数学与代数硕士学位

基辅国立大学-基辅,乌克兰

2010 - 2014

计算机科学与应用统计学学士学位

基辅国立大学-基辅,乌克兰

JULY 2023 - JULY 2026

AWS机器学习专业

Amazon Web Services

JANUARY 2023 - JANUARY 2026

AWS解决方案架构师助理

Amazon Web Services

OCTOBER 2022 - OCTOBER 2025

AWS认证开发者助理

Amazon Web Services

MAY 2017 - PRESENT

Data Science: Data to Insights

MITProfessionalX DSx | edX

JANUARY 2017 - PRESENT

Artificial Intelligence (AI)

ColumbiaX CSMM.101x | edX

MAY 2011 - PRESENT

Second Place

ACM-ICPC, Country level

JULY 2010 - PRESENT

Bronze Medal

国际数学奥林匹克(IMO)

MAY 2009 - PRESENT

Second Place

基辅国际物理节

Collaboration That Works

How to Work with Toptal

在数小时内,而不是数周或数月,我们的网络将为您直接匹配全球行业专家.

1

Share your needs

在与Toptal领域专家的电话中讨论您的需求并细化您的范围.
2

Choose your talent

在24小时内获得专业匹配人才的简短列表,以进行审查,面试和选择.
3

开始你的无风险人才试验

与你选择的人才一起工作,试用最多两周. 只有当你决定雇佣他们时才付钱.

Top talent is in high demand.

Start hiring