boscoj2008

🎯

Focusing

John Bosco boscoj2008

🎯

Focusing

PhD researcher working on Data integration, Entity matching, and Natural language processing.

1 follower · 8 following

National Institute of Advanced Industrial Science & Technology
Tsukuba, Japan
23:33 (UTC -12:00)
boscoj2008.github.io

Stars

unum-cloud / usearch

Fast Open-Source Search & Clustering engine × for Vectors & 🔜 Strings × in C++, C, Python, JavaScript, Rust, Java, Objective-C, Swift, C#, GoLang, and Wolfram 🔍

C++ 2,144 130 Updated Sep 28, 2024

chiphuyen / ml-interviews-book

https://huyenchip.com/ml-interviews-book/

HTML 3,388 515 Updated Jun 12, 2024

mlfoundations / open_lm

A repository for research on medium sized language models.

Python 472 71 Updated Sep 27, 2024

boscoj2008 / infersent-train-2021

contains files and scripts for training InferSent algorithm

Jupyter Notebook 2 Updated Sep 14, 2021

boscoj2008 / ContextualBlocker-for-EM

A Graph-Based Blocking Approach for Entity Matching Using Contrastively Learned Embeddings

Python 1 Updated Jun 7, 2023

WHU-ZQH / ChatGPT-vs.-BERT

🎁[ChatGPT4NLU] A Comparative Study on ChatGPT and Fine-tuned BERT

Python 193 9 Updated Apr 17, 2023

nitinsharma1706 / LinkedInJobAnalytics

•Scraped LinkedIn data using Selenium, cleaned and created schema in Excel. •Analyzed data using SQL, and presented insights via Power BI dashboard. •Used natural language processing to improve ski…

Jupyter Notebook 4 1 Updated Jul 24, 2023

zlinao / VGLM

Versatile Generative Language Model

Python 26 3 Updated Oct 29, 2022

alanchn31 / Data-Engineering-Projects

Personal Data Engineering Projects

Jupyter Notebook 832 185 Updated Feb 8, 2023

JohnGiorgi / DeCLUTR

The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

Python 378 33 Updated Apr 21, 2023

JieSun1990 / NLP_Determining_Authorship_of_Hebrew_Bible

Identifying authorship of ancient hebrew texts via word embeddings (skip-gram, LSTM, BERT), unsupervised clustering and evaluation.

Jupyter Notebook 3 2 Updated Jun 4, 2021

ftheberge / Ensemble-Clustering-for-Graphs

Code, notebooks and examples with ECG: Ensemble Clustering for Graphs

Jupyter Notebook 31 Updated May 13, 2022

tbonald / paris

Hierarchical graph clustering

Jupyter Notebook 38 8 Updated Jun 22, 2018

PacktPublishing / Data-Engineering-with-Python

Data Engineering with Python, published by Packt

Python 604 273 Updated Jan 30, 2023

degr8noble / Density-Based-Clustering_method_with_python

The first type of clustering algorithm discussed in this course used the spatial distribution of points to determine cluster centers and membership. The most prominent implementation of this concep…

Jupyter Notebook 2 Updated Dec 16, 2020

josephius / star-clustering

A clustering algorithm that automatically determines the number of clusters and works without hyperparameter fine-tuning.

Python 213 21 Updated Dec 9, 2020

dozed / InferSent

Jupyter Notebook 7 1 Updated Nov 26, 2019

lingyu001 / nlp_text_summarization_implementation

Three modules of extractive text summarization, including implementation of Kmeans clustering using BERT sentence embedding

Jupyter Notebook 11 2 Updated Dec 9, 2019

israa-alghanmi / Combine_BERT_with_GloVe

Combining BERT with Static Word Embedding for Categorizing Social Media

Python 2 1 Updated May 31, 2021

xuyige / BERT4doc-Classification

Code and source for paper ``How to Fine-Tune BERT for Text Classification?``

Python 611 99 Updated Oct 19, 2021

lmcinnes / umap

Uniform Manifold Approximation and Projection

Python 7,387 803 Updated Aug 18, 2024

Helsinki-NLP / HBMP

Sentence Embeddings in NLI with Iterative Refinement Encoders

Python 78 16 Updated Nov 22, 2022

flairNLP / flair

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Python 13,849 2,092 Updated Sep 28, 2024

Accenture / AmpliGraph

Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org

Python 2,141 251 Updated Apr 25, 2024

Zelong-Chen / Empirical-Study-of-Entity-Resolution-Using-Word-Embedding

Performed entity resolution/record linkage using different types of word embedding techniques on E-Commerce datasets.

Jupyter Notebook 3 2 Updated May 21, 2020

avirichie / Customer-Segmentation-using-Unsupervised-Learning

This project shows how to perform customers segmentation using Machine Learning algorithms. Three techniques will be presented and compared: KMeans, Agglomerative Clustering ,Affinity Propagation a…

Jupyter Notebook 8 7 Updated Oct 19, 2020

xiaopeng-liao / DEC_pytorch

Jupyter Notebook 21 9 Updated Aug 8, 2018

mhmgad / ExCut

Implementation of ExCut: Explainable Embedding-based Clustering over Knowledge Graphs

Python 10 2 Updated Jul 7, 2021

brandonrobertz / SparseLSH

A Locality Sensitive Hashing (LSH) library with an emphasis on large, highly-dimensional datasets.

Python 141 27 Updated Sep 4, 2024

daqcri / deeper-lite

deep entity resolution lite version

Python 11 8 Updated Nov 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly