Organizations

@apache

SemyonSinchenko/README.md

Semyon Sinchenko

Education

Master's degree in solid state physics, Moscow State University, Moscow 2022

Bachelor's degree in solid state physics, Moscow Engineering Physics Institute, Moscow 2016

Skills

  • Python
  • Java
  • Scala
  • Data Engineering Stack (Apache Spark, Scala, Databricks, Airflow, Kafka, NiFi, SQL)
  • Python ML Stack (NumPy, SciPy, scikit-learn, TensorFlow 2, XGBoost, pandas)

Self-education

Expertise

  • Data Engineering
  • Data Science

Pinned

  1. apache/incubator-graphar Public

    An open source, standard data file format for graph data storage and retrieval.

    C++ · 228 stars · 45 forks

  2. mrpowers-io/tsumugi-spark Public

    SparkConnect Server plugin and protobuf messages for the Amazon Deequ Data Quality Engine.

    Python · 25 stars · 6 forks

  3. zinggAI/zingg Public

    Scalable identity resolution, entity resolution, data mastering and deduplication using ML

    Java · 968 stars · 121 forks

  4. flake8-pyspark-with-column Public

    A flake8 plugin that detects usage of withColumn in a loop or inside reduce

    Python · 20 stars

  5. apache/datafusion-comet Public

    Apache DataFusion Comet Spark Accelerator

    Rust · 853 stars · 168 forks

  6. feature-generation-benchmark Public

    A database-like benchmark of feature generation from time-series data

    Jupyter Notebook · 14 stars · 1 fork
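
The check described for flake8-pyspark-with-column (item 4) can be illustrated with a short sketch. This is not the plugin's actual implementation; it is a minimal, assumed approach using Python's standard `ast` module that flags `.withColumn(...)` calls inside `for` or `while` loop bodies. The function name `find_with_column_in_loops` and the sample snippet are hypothetical, and the `reduce` case is not covered.

```python
import ast
from typing import List


def find_with_column_in_loops(source: str) -> List[int]:
    """Return sorted line numbers of `.withColumn(...)` calls inside loop bodies.

    Hypothetical helper for illustration only; not the plugin's real code.
    """
    tree = ast.parse(source)
    hits = set()
    for node in ast.walk(tree):
        if isinstance(node, (ast.For, ast.While)):
            # Inspect only the loop body: the iterable/condition is not the
            # repeated per-iteration call chain this sketch is looking for.
            for stmt in node.body:
                for inner in ast.walk(stmt):
                    if (
                        isinstance(inner, ast.Call)
                        and isinstance(inner.func, ast.Attribute)
                        and inner.func.attr == "withColumn"
                    ):
                        hits.add(inner.lineno)
    return sorted(hits)


# The source is only parsed, never executed, so undefined names are fine.
SAMPLE = '''\
df = df.withColumn("a", col("x"))
for name in names:
    df = df.withColumn(name, col(name) + 1)
'''

print(find_with_column_in_loops(SAMPLE))  # prints [3]
```

Only the call on line 3 is reported, because the call on line 1 sits outside any loop. A real flake8 plugin would wrap a visitor like this in flake8's plugin interface and emit an error code per hit.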