Stars
Tutorials for the Hopsworks Platform
Build resilient language agents as graphs.
Jupyter Notebooks to help you get hands-on with Pinecone vector databases
An intelligent financial bot built with LangChain. It integrates multiple financial APIs to provide users with stock analysis, real-time news, and portfolio management insights. By leveraging large…
Notebooks for Large Language Models (LLMs) Specialization
A streaming SQL engine, a fast and lightweight alternative to ksqlDB and Apache Flink, 🚀 powered by ClickHouse.
docker-compose.yml files for cp-all-in-one , cp-all-in-one-community, cp-all-in-one-cloud, Apache Kafka Confluent Platform
A simple and efficient tool to parallelize Pandas operations on all available CPUs
Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRun, Artifact Registry, Terraform, Docker
🧙 Build, run, and manage data pipelines for integrating and transforming data.
A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
Automatically Visualize any dataset, any size with a single line of code. Created by Ram Seshadri. Collaborators Welcome. Permission Granted upon Request.
Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
AtsPy: Automated Time Series Models in Python (by @firmai)
ZenML 🙏: The bridge between ML and Ops. https://zenml.io.
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
A python library built to empower developers to build applications and systems with self-contained Computer Vision capabilities
Image Dataset Tool (idt) is a cli tool designed to make the otherwise repetitive and slow task of creating image datasets into a fast and intuitive process.