Code for "Efficient Data Processing in Spark" Course
PySpark functions and utilities, with examples, to assist ETL and data-modeling workflows.
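A minimal sketch of the kind of PySpark ETL step such utilities support; the input path and column names (orders.csv, order_id, amount, order_date) are assumptions made purely for illustration, not taken from the repository.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("etl-sketch").getOrCreate()

# Extract: read raw data (path and schema are hypothetical)
orders = spark.read.csv("data/orders.csv", header=True, inferSchema=True)

# Transform: deduplicate, normalize types, and derive a partition column
cleaned = (
    orders
    .dropDuplicates(["order_id"])
    .withColumn("order_date", F.to_date("order_date"))
    .withColumn("amount", F.col("amount").cast("double"))
)

# Load: write partitioned Parquet for downstream modeling
cleaned.write.mode("overwrite").partitionBy("order_date").parquet("warehouse/orders")
```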
A simple VS Code devcontainer setup for local PySpark development
Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
Repository of notebooks and related collateral used in the Databricks Demo Hub, showing how to use Databricks, Delta Lake, MLflow, and more.
Various example notebooks for working with web archives using the Archives Unleashed Toolkit, and with the derivatives it generates.
Big Data workshop in Spanish (Workshop Big Data en Español)
Classify crime incidents into different categories using PySpark
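A hedged sketch of multi-class text classification with PySpark MLlib, in the spirit of that project; the dataset path and column names (description, category) are assumptions, not the repository's actual schema.

```python
from pyspark.sql import SparkSession
from pyspark.ml import Pipeline
from pyspark.ml.feature import Tokenizer, HashingTF, StringIndexer
from pyspark.ml.classification import LogisticRegression

spark = SparkSession.builder.appName("crime-classification-sketch").getOrCreate()

# Hypothetical input: free-text "description" and a target "category" column
df = spark.read.csv("data/crime.csv", header=True, inferSchema=True)

pipeline = Pipeline(stages=[
    Tokenizer(inputCol="description", outputCol="words"),
    HashingTF(inputCol="words", outputCol="features", numFeatures=1 << 16),
    StringIndexer(inputCol="category", outputCol="label"),
    LogisticRegression(maxIter=20),
])

train, test = df.randomSplit([0.8, 0.2], seed=42)
model = pipeline.fit(train)
model.transform(test).select("description", "category", "prediction").show(5)
```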
A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics like EMR sizing, Google Colaboratory, fine-tuning PySpark jobs, and much more.
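A brief sketch of the basic DataFrame operations such a tutorial typically covers (filtering, grouping, aggregation); the sample data is invented for illustration.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("dataframe-basics").getOrCreate()

# Toy data with made-up users, cities, and spend amounts
df = spark.createDataFrame(
    [("alice", "NY", 120.0), ("bob", "SF", 75.5), ("carol", "NY", 200.0)],
    ["user", "city", "spend"],
)

# Filter, aggregate per city, and order by the computed average
(
    df.filter(F.col("spend") > 100)
      .groupBy("city")
      .agg(F.count("*").alias("users"), F.avg("spend").alias("avg_spend"))
      .orderBy(F.desc("avg_spend"))
      .show()
)
```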
Explore, analyse and visualise Betfair Historical Data Feed using PySpark.
Repository of practical approaches to data science problems, including notebook demos and working scripts | #DS | #analysis
PySpark Notebook with Docker
Hadoop 3.2 in single-node or cluster mode, with the gotty web terminal, Spark, Jupyter with PySpark, Hive, and other ecosystem tools.
My practice exercises and projects with PySpark
A PySpark course to get started with the basics for a Data Engineer
Useful scripts and notebooks for Data Science. The project was made by Miquido. https://www.miquido.com/
Cardio Monitor is a web app that helps you find out whether you are at risk of developing heart disease. The model used for prediction has an accuracy of 92%. This is the course project for the subject Big Data Analytics (BCSE0158).