#

datamanipulation

Here are 54 public repositories matching this topic...

TheAlgorithms / R

Open

Add more algorithms

1

dynamitechetan commented Oct 1, 2018

Everyone is welcome to add more algorithms to this project. This repo is new so we need contributions from all.

Read more

enhancement help wanted good first issue hacktoberfest

greenbird / piri

Open

Add require_success keyword

2

thomasborgen commented Jan 26, 2021

Require success is our way of enabling the users to choose what is considered as errors and should hard fail.

When something is required to succeed, it must apply its function successfully.

ie: in casting, casting "123" to integer will succeed, but casting "test" to integer will fail.

When require_success is set to True in casting then this will trigger a hard fail and mapping wil

Read more

enhancement good first issue help wanted

Open

For all our default values add variable in constants.py

3

Open

Schema object.array[bool] should not have to be required and should be false by default

1

Find more good first issues

milesgranger / lumber-jack

High performance & light weight alternative to Pandas with ML focused tooling. (Work in progress x100!)

python rust data-mining numpy cython pandas python3 series datascience dataframe datamanipulation

Updated Aug 18, 2018
Python

yashajoshi / Cryptocurrency-Trading-Data-Analysis

Analyzing the historical cryptocurrency trading dataset, to portrait its dynamic landscape and dig into features of crypt currencies to figure out if any patterns in their price movement.

data-mining dimensionality-reduction data-analysis factor-analysis principal-component-analysis multidimensional-scaling k-means-clustering datamanipulation pandas-python

Updated Apr 21, 2020
Jupyter Notebook

RawatMeghna / KPMG-Data-Analytics-Virtual-Internship

In this online program, I completed similar tasks that KPMG Graduates do in the company. I learned what it is like working at one of the world’s best data analytics team, and built skills required to excel as a analytics consultant.

python3 tableau datavisualization dataanalysis datacleaning datamanipulation

Updated Aug 19, 2020

adzeo1047 / Data_Science

DataCamp Project Solutions

data-science machine-learning project data-visualization supervised-learning data-analysis unsupervised-learning datacamp datamanipulation pythonprojects datacamp-projects

Updated Jun 5, 2020
Jupyter Notebook

htwu1998 / DataCamp_Projects

Collections of supervised project completed using Python on DataCamp.

python dataanalysis datacleaning datamanipulation regressionanalysis

Updated Mar 21, 2021
Jupyter Notebook

timothypesi / Data-Science-Starter

This repository is for newcomers into the data science world. It Summarizes three key areas Data Exploration,Manipulation and Data Cleaning

datacleaning dataexploration datamanipulation

Updated Apr 14, 2021
Jupyter Notebook

sukeshpabba / datacamp-projects

Datacamp Projects

ggplot2 r dplyr project datacamp dataanalysis datamanipulation

Updated Mar 13, 2018
Jupyter Notebook

Algo

webaddicted / Algo

This repository contain all frequency ask interview questions in data structure and algo.

android kotlin java dart algorithms logic data-structures collections datamanipulation

Updated Apr 13, 2022
Java

iam-mhaseeb / Mastering-Data-Selection-with-Pandas

This repository is demonstration of Pandas library of Python's super powers.

python data pandas-dataframe python-library python-script pandas python3 datascience datawrangling pandas-dataframes dataengineering pandas-tutorial pandas-library dataexploration datamanipulation pandas-python

Updated Jun 1, 2020
Jupyter Notebook

R-vs-Pandas-Stack-Exchange-API

pkhiyara / R-vs-Pandas-Stack-Exchange-API

A Python data manipulation and analysis project that examines and visualizes the popularity of widely used data science tools R and Pandas across 3 Stack Exchange subcommunities (Stack Overflow, Cross Validated, Data Science) through the use of the Stack Exchange API and multiple Python libraries such as Pandas, JSON, Requests, and Matplotlib.

json data-science r analysis pandas-dataframe stackoverflow pandas data-visualization requests data-analysis matplotlib data-manipulation stackexchange stackoverflow-api stackexchange-api jsonobject datamanipulation matplotlib-pyplot jsonlibrary python-data-manipulation

Updated Jan 1, 2020
Jupyter Notebook

paperscissoroxie / nobelprizewinners

datavisualization datamanipulation importingandcleaningdata

Updated Jun 4, 2021
Jupyter Notebook

thomastrg / Predict-order-delivery-date

Prediction of the delay between the creation of an order and the beginning of the shipment. Dataset from the database of a company.

sklearn prediction pandas datamanipulation data-creation

Updated Oct 20, 2021
Jupyter Notebook

karthikbhandary2 / Analyzing_TV_Data

In this project, I used a combination of data manipulation and visualization to explore television data. I also looked at Super Bowl Data, generating insights into game outcomes, viewership, and even halftime shows.

data-science datavisualization datamanipulation

Updated Apr 13, 2022
Jupyter Notebook

madhura711 / American-Airlines--Data-Mining-and-Variance-Analysis

data-mining query excel exploratory-data-analysis sas pivot-tables data-visualization data-analysis tableau datapipeline t-sql waterfall-charts strategy-analysis variance-analysis datamanipulation

Updated Aug 20, 2018

Mrcwr2 / Power_Query_M-code_HVAC_PM_Autobuilder

Manipulating Data excel data in power query so that it works in our JLL Corrigo System and is built automatically. (Names and amounts have been changed for privacy)

data excel powerquery datacleaning datamanipulation mlanguage powerquerym

Updated Sep 27, 2021

mjoneil21 / Homework-2

Practical Computing for Data Analytics Homework 2: This project was based on census data regarding Michigan socio-economic data. Visualizations were made to bring out paterns in the data and draw conclusions. ggplot and dplyer were the primary packages used in this project for data manipulation and visualizations. A primary theme in this assignment was the differences between the Upper Peninsula and Lower Peninsula. There were vast differences in populations and the type of employment that one would typically find in each geographic region.

visualization census-data datamanipulation

Updated Feb 12, 2018
HTML

sadettindemirel / veRi_biRlestiRme

R ekosisteminde veri setleri nasıl birleştirilir?

rstats datamanipulation

Updated Jun 29, 2019
HTML

karthikbhandary2 / What_and_Where_are_the_Oldest_Businesses

An important part of business is planning for the future and ensuring that the business survives changing market conditions. Some businesses do this remarkably well and last for hundreds of years. In this project, I explored data from BusinessFinancing.co.uk on the world's oldest businesses: when were they founded, and which industries do they belong to? Like many business problems, the data we'll explore is contained in several different datasets. In order to understand the world's oldest businesses, we will first need to use joining techniques to merge our data. From there, we can use manipulation tools such as grouping and filtering to answer questions about these historic businesses.

data-science datamanipulation

Updated Apr 14, 2022
Jupyter Notebook

Surya-Murali / Data-Manipulations-in-R

Some R commands that might be handy for data manipulations and exploratory data analysis.

r exploratory-data-analysis datamanipulation

Updated May 3, 2018
R

paperscissoroxie / handwashing

statistics probability datavisualization datamanipulation importingandcleaningdata

Updated Jun 4, 2021
Jupyter Notebook

MudgalShashank / Segmenation---K-Means-Analysis

Had to develop a customer segmentation to define marketing strategy. The dataset summarized the usage behavior of about 9000 active credit card holders during the last 6 months. The file is at a customer level with 18 behavioral metrics..

analytics datascience statistical-analysis segmentation unsupervised-learning factor-analysis cluster-analysis kmeans-clustering datamanipulation

Updated Aug 27, 2018
SAS

Neviya / Analyze-International-Debt-Statistics

In this project, we are going to analyze international debt data collected by The World Bank. The dataset contains information about the amount of debt (in USD) owed by developing countries across several categories.

data sql analysis import datamanipulation

Updated Jul 26, 2021
Jupyter Notebook

YamanAlBochi / GoogleAppsAnalysis

Analyzing various apps found on the Google Play Store with the help of different python libraries. The dataset is chosen from Kaggle. It is the web scraped data of 10k Play Store apps for analyzing the Android market. It consists of in total of 10841 rows and 13 columns.

visualization python data-science data google analytics analysis store jupyter-notebook eda pandas applications datacleaning datamanipulation

Updated May 16, 2022
Jupyter Notebook

IqraImtiazz / Analyzing-TV-Data

Data cleaning and analysis is performed on data, to discover the performance of musicians in halftime show.

python pandas datacleaning datamanipulation

Updated May 31, 2022
Jupyter Notebook

rebeccasoren / SuperBowl

Load, clean, and explore Super Bowl data in the age of soaring ad costs and flashy halftime shows.

data-science datavisualization datamanipulation

Updated Oct 15, 2020
Jupyter Notebook

zkrzn / DataScienceProjects

Extraction Cleaning Manipulation Visualization Machine Learning & More

data sql big-data postgresql extraction datascience dataset business-intelligence machinelearning ab-testing dataanalysis business-analytics vizualisation datamanipulation cleaning-data-in-python

Updated Jun 11, 2022
Jupyter Notebook

sakshigupta06 / Predicting-Credit-Card-Approvals

This project builds an automatic credit card approval predictor using machine learning techniques, just like the real banks do.

python pandas-dataframe supervised-learning datamanipulation real-banks

Updated Sep 26, 2020

iamanshika / Dr.-Semmelweis-and-the-Discovery-of-Handwashing

In 1847, the Hungarian physician Ignaz Semmelweis made a breakthough discovery: he discovers handwashing. Contaminated hands was a major cause of childbed fever and by enforcing handwashing at his hospital he saved hundreds of lives.

data-visualization python3 probability-statistics intermediate-python datamanipulation importing-and-cleaning-data

Updated Oct 3, 2020
Jupyter Notebook

Improve this page

Add a description, image, and links to the datamanipulation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the datamanipulation topic, visit your repo's landing page and select "manage topics."