Skip to content
Pro
Block or report user

Report or block yennanliu

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse
Block or report user

Report or block yennanliu

Hide content and notifications from this user.

Learn more about blocking users

Contact Support about this user’s behavior.

Learn more about reporting abuse

Report abuse

Pinned

  1. Develop models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle)

    Jupyter Notebook 3 2

  2. Repo for practical data science problems approaches, including notebook demo and working scripts

    Jupyter Notebook 5 3

  3. Resources for data science/engineering learning (tutorial / book / docker). 1) data dev env setup 2) essential data ref 3) practical demo

    Shell 5 1

  4. Collection of code for submitting Spark/Hadoop/Hive/Pig tasks to EMR (AWS Elastic MapReduce)

    Scala 1

  5. Run various ETL jobs via Airflow (Airflow as job dispense center) : 1) env set up 2) airflow DAG 3) Spark/ML/DL script

    Python 4

  6. Collections of data infrastructure development 1) Celery Job threads 2) DB maser- slave config 3) kafka-redis event fetch 4) Superset BI tool build

    Java 2

You can’t perform that action at this time.