#
datamanipulation
Here are 54 public repositories matching this topic...
Open
1
enhancement
New feature or request
help wanted
Extra attention is needed
good first issue
Good for newcomers
hacktoberfest
2
thomasborgen
commented
Jan 26, 2021
Require success is our way of enabling the users to choose what is considered as errors and should hard fail.
When something is required to succeed, it must apply its function successfully.
ie: in casting, casting "123" to integer will succeed, but casting "test" to integer will fail.
When require_success is set to True in casting then this will trigger a hard fail and mapping wil
enhancement
New feature or request
good first issue
Good for newcomers
help wanted
Extra attention is needed
High performance & light weight alternative to Pandas with ML focused tooling. (Work in progress x100!)
-
Updated
Aug 18, 2018 - Python
Analyzing the historical cryptocurrency trading dataset, to portrait its dynamic landscape and dig into features of crypt currencies to figure out if any patterns in their price movement.
data-mining
dimensionality-reduction
data-analysis
factor-analysis
principal-component-analysis
multidimensional-scaling
k-means-clustering
datamanipulation
pandas-python
-
Updated
Apr 21, 2020 - Jupyter Notebook
In this online program, I completed similar tasks that KPMG Graduates do in the company. I learned what it is like working at one of the world’s best data analytics team, and built skills required to excel as a analytics consultant.
-
Updated
Aug 19, 2020
DataCamp Project Solutions
data-science
machine-learning
project
data-visualization
supervised-learning
data-analysis
unsupervised-learning
datacamp
datamanipulation
pythonprojects
datacamp-projects
-
Updated
Jun 5, 2020 - Jupyter Notebook
Collections of supervised project completed using Python on DataCamp.
-
Updated
Mar 21, 2021 - Jupyter Notebook
This repository is for newcomers into the data science world. It Summarizes three key areas Data Exploration,Manipulation and Data Cleaning
-
Updated
Apr 14, 2021 - Jupyter Notebook
Datacamp Projects
-
Updated
Mar 13, 2018 - Jupyter Notebook
This repository contain all frequency ask interview questions in data structure and algo.
-
Updated
Apr 13, 2022 - Java
This repository is demonstration of Pandas library of Python's super powers.
python
data
pandas-dataframe
python-library
python-script
pandas
python3
datascience
datawrangling
pandas-dataframes
dataengineering
pandas-tutorial
pandas-library
dataexploration
datamanipulation
pandas-python
-
Updated
Jun 1, 2020 - Jupyter Notebook
A Python data manipulation and analysis project that examines and visualizes the popularity of widely used data science tools R and Pandas across 3 Stack Exchange subcommunities (Stack Overflow, Cross Validated, Data Science) through the use of the Stack Exchange API and multiple Python libraries such as Pandas, JSON, Requests, and Matplotlib.
json
data-science
r
analysis
pandas-dataframe
stackoverflow
pandas
data-visualization
requests
data-analysis
matplotlib
data-manipulation
stackexchange
stackoverflow-api
stackexchange-api
jsonobject
datamanipulation
matplotlib-pyplot
jsonlibrary
python-data-manipulation
-
Updated
Jan 1, 2020 - Jupyter Notebook
-
Updated
Jun 4, 2021 - Jupyter Notebook
Prediction of the delay between the creation of an order and the beginning of the shipment. Dataset from the database of a company.
-
Updated
Oct 20, 2021 - Jupyter Notebook
In this project, I used a combination of data manipulation and visualization to explore television data. I also looked at Super Bowl Data, generating insights into game outcomes, viewership, and even halftime shows.
-
Updated
Apr 13, 2022 - Jupyter Notebook
Manipulating Data excel data in power query so that it works in our JLL Corrigo System and is built automatically. (Names and amounts have been changed for privacy)
-
Updated
Sep 27, 2021
Practical Computing for Data Analytics Homework 2: This project was based on census data regarding Michigan socio-economic data. Visualizations were made to bring out paterns in the data and draw conclusions. ggplot and dplyer were the primary packages used in this project for data manipulation and visualizations. A primary theme in this assignment was the differences between the Upper Peninsula and Lower Peninsula. There were vast differences in populations and the type of employment that one would typically find in each geographic region.
-
Updated
Feb 12, 2018 - HTML
An important part of business is planning for the future and ensuring that the business survives changing market conditions. Some businesses do this remarkably well and last for hundreds of years. In this project, I explored data from BusinessFinancing.co.uk on the world's oldest businesses: when were they founded, and which industries do they belong to? Like many business problems, the data we'll explore is contained in several different datasets. In order to understand the world's oldest businesses, we will first need to use joining techniques to merge our data. From there, we can use manipulation tools such as grouping and filtering to answer questions about these historic businesses.
-
Updated
Apr 14, 2022 - Jupyter Notebook
Some R commands that might be handy for data manipulations and exploratory data analysis.
-
Updated
May 3, 2018 - R
-
Updated
Jun 4, 2021 - Jupyter Notebook
Had to develop a customer segmentation to define marketing strategy. The dataset summarized the usage behavior of about 9000 active credit card holders during the last 6 months. The file is at a customer level with 18 behavioral metrics..
analytics
datascience
statistical-analysis
segmentation
unsupervised-learning
factor-analysis
cluster-analysis
kmeans-clustering
datamanipulation
-
Updated
Aug 27, 2018 - SAS
In this project, we are going to analyze international debt data collected by The World Bank. The dataset contains information about the amount of debt (in USD) owed by developing countries across several categories.
-
Updated
Jul 26, 2021 - Jupyter Notebook
Analyzing various apps found on the Google Play Store with the help of different python libraries. The dataset is chosen from Kaggle. It is the web scraped data of 10k Play Store apps for analyzing the Android market. It consists of in total of 10841 rows and 13 columns.
visualization
python
data-science
data
google
analytics
analysis
store
jupyter-notebook
eda
pandas
applications
datacleaning
datamanipulation
-
Updated
May 16, 2022 - Jupyter Notebook
Data cleaning and analysis is performed on data, to discover the performance of musicians in halftime show.
-
Updated
May 31, 2022 - Jupyter Notebook
Load, clean, and explore Super Bowl data in the age of soaring ad costs and flashy halftime shows.
-
Updated
Oct 15, 2020 - Jupyter Notebook
Extraction Cleaning Manipulation Visualization Machine Learning & More
data
sql
big-data
postgresql
extraction
datascience
dataset
business-intelligence
machinelearning
ab-testing
dataanalysis
business-analytics
vizualisation
datamanipulation
cleaning-data-in-python
-
Updated
Jun 11, 2022 - Jupyter Notebook
This project builds an automatic credit card approval predictor using machine learning techniques, just like the real banks do.
-
Updated
Sep 26, 2020
In 1847, the Hungarian physician Ignaz Semmelweis made a breakthough discovery: he discovers handwashing. Contaminated hands was a major cause of childbed fever and by enforcing handwashing at his hospital he saved hundreds of lives.
data-visualization
python3
probability-statistics
intermediate-python
datamanipulation
importing-and-cleaning-data
-
Updated
Oct 3, 2020 - Jupyter Notebook
Improve this page
Add a description, image, and links to the datamanipulation topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the datamanipulation topic, visit your repo's landing page and select "manage topics."
Everyone is welcome to add more algorithms to this project. This repo is new so we need contributions from all.