[go: up one dir, main page]

Skip to content
View boscoj2008's full-sized avatar
🎯
Focusing
🎯
Focusing
  • National Institute of Advanced Industrial Science & Technology
  • Tsukuba, Japan
  • 23:33 (UTC -12:00)

Block or report boscoj2008

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

C++ 2,144 130 Updated Sep 28, 2024

https://huyenchip.com/ml-interviews-book/

HTML 3,388 515 Updated Jun 12, 2024

A repository for research on medium sized language models.

Python 472 71 Updated Sep 27, 2024

contains files and scripts for training InferSent algorithm

Jupyter Notebook 2 Updated Sep 14, 2021

A Graph-Based Blocking Approach for Entity Matching Using Contrastively Learned Embeddings

Python 1 Updated Jun 7, 2023

🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT

Python 193 9 Updated Apr 17, 2023

•Scraped LinkedIn data using Selenium, cleaned and created schema in Excel. •Analyzed data using SQL, and presented insights via Power BI dashboard. •Used natural language processing to improve ski…

Jupyter Notebook 4 1 Updated Jul 24, 2023

Versatile Generative Language Model

Python 26 3 Updated Oct 29, 2022

Personal Data Engineering Projects

Jupyter Notebook 832 185 Updated Feb 8, 2023

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

Python 378 33 Updated Apr 21, 2023

Identifying authorship of ancient hebrew texts via word embeddings (skip-gram, LSTM, BERT), unsupervised clustering and evaluation.

Jupyter Notebook 3 2 Updated Jun 4, 2021

Code, notebooks and examples with ECG: Ensemble Clustering for Graphs

Jupyter Notebook 31 Updated May 13, 2022

Hierarchical graph clustering

Jupyter Notebook 38 8 Updated Jun 22, 2018

Data Engineering with Python, published by Packt

Python 604 273 Updated Jan 30, 2023

The first type of clustering algorithm discussed in this course used the spatial distribution of points to determine cluster centers and membership. The most prominent implementation of this concep…

Jupyter Notebook 2 Updated Dec 16, 2020

A clustering algorithm that automatically determines the number of clusters and works without hyperparameter fine-tuning.

Python 213 21 Updated Dec 9, 2020
Jupyter Notebook 7 1 Updated Nov 26, 2019

Three modules of extractive text summarization, including implementation of Kmeans clustering using BERT sentence embedding

Jupyter Notebook 11 2 Updated Dec 9, 2019

Combining BERT with Static Word Embedding for Categorizing Social Media

Python 2 1 Updated May 31, 2021

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 611 99 Updated Oct 19, 2021

Uniform Manifold Approximation and Projection

Python 7,387 803 Updated Aug 18, 2024

Sentence Embeddings in NLI with Iterative Refinement Encoders

Python 78 16 Updated Nov 22, 2022

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 13,849 2,092 Updated Sep 28, 2024

Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org

Python 2,141 251 Updated Apr 25, 2024

Performed entity resolution/record linkage using different types of word embedding techniques on E-Commerce datasets.

Jupyter Notebook 3 2 Updated May 21, 2020

This project shows how to perform customers segmentation using Machine Learning algorithms. Three techniques will be presented and compared: KMeans, Agglomerative Clustering ,Affinity Propagation a…

Jupyter Notebook 8 7 Updated Oct 19, 2020
Jupyter Notebook 21 9 Updated Aug 8, 2018

Implementation of ExCut: Explainable Embedding-based Clustering over Knowledge Graphs

Python 10 2 Updated Jul 7, 2021

A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.

Python 141 27 Updated Sep 4, 2024

deep entity resolution lite version

Python 11 8 Updated Nov 11, 2019
Next