Skip to content

This repository is mainly used to store R-programs of Data Science class of Johns Hopkins

Notifications You must be signed in to change notification settings

yelangya3826850/Projects-in-the-Data-Science-field

Repository files navigation

Projects in the field of Data Science

Introduction

This repository is mainly used to store projects of Data Science field which I have joined and worked for a period of time. You can find all the codes for the corresponding projects in the repository, and download freely. All the projects are developed by R language. The following is a list of all the projects. To see my main work of every projects, you just need to click the corresponding title. Welcome to reprint! But please let me know. When I set up to do these work, I was just a beginner of this field, so may be there are many questions remained that I have not noticed. Welcome to point them out and if you have any problems with my work welcome to contact me by e-mail: [email protected]

List of the projects that I have ever joined.

  • Machine Learning:

    Using the data from Human Activity Recognition to predict the manner in which they did the exercise. The data are collected from accelerometers on the belt, forearm, arm, and dumbell of 6 participants.

  • Cache Time-consuming Computations:

    Constructing an R function which is able to cache potentially time-consuming computations. The basic work for future big data analysis.

  • Suggestion for variables influencing motor cars' miles per gallon (MPG):

    This project is mainly to work for a certain magazine about the automobile industry. Looking at a data set of a collection of cars, they are interested in exploring the relationship between a set of variables and miles per gallon(MPG) (outcome), and mainly answer the following two questions:

    1. “Is an automatic or manual transmission better for MPG”
    2. "Quantify the MPG difference between automatic and manual transmissions"
  • Getting and Cleaning Data

    This is actually not a project but a previous work for the future analysis. You know that raw data nowadays is becoming more and more dirty and confusing, so it is important to preprocess these data to get a more clean ones. And this web page is mainly tracing the work of mine processing data.

  • Plotting

    This is actually not a project but a practice of plotting figures in R circumstance which is the basic work for the future work

  • Analysis for Influence of Air Pollutant PM2.5

    The overall goal of this project is to explore the National Emissions Inventory database and see what it say about fine particulate matter pollution in the United states over the 10-year period 1999–2008.

  • Personal Activity Analysis

    This project makes use of data from a personal activity monitoring device. This device collects data at 5 minute intervals through out the day. The data consists of two months of data from an anonymous individual collected during the months of October and November, 2012 and include the number of steps taken in 5 minute intervals each day.

About

This repository is mainly used to store R-programs of Data Science class of Johns Hopkins

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages