Welcome to the official documentation for noscrape! This wrapper simplifies the integration of noscrape into your Python projects. noscrape is a tool designed to prevent web scraping by obfuscating HTML elements using true-type fonts with shuffled unicodes.
The primary mechanism behind noscrape is the utilization of true-type fonts. noscrape generates a new version with shuffled unicodes, making it impossible to reverse-calculate them. Additionally, glyph-paths inside the font are obfuscated by randomly shifting them slightly.
noscrape is implemented using platform-specific binaries optimized for different operating systems and architectures. These binaries include:
noscrape_darwin_arm64
noscrape_darwin_x86_64
noscrape_linux_arm64
noscrape_linux_x86_64
noscrape_windows_x86_64.exe
These binaries serve as the core engine of noscrape, handling the generation of obfuscated text using true-type fonts with shuffled unicodes.
Wrapper implementations provided in languages such as PHP, Java, and Node.js facilitate communication with the noscrape binaries. They collect input data, call the appropriate noscrape binary based on the host platform, pass the input data for obfuscation, and return the obfuscated text or other outputs.
You can install the noscrape wrapper using pip:
pip install noscrape
from noscrape import Noscrape
n = Noscrape("example/example.ttf")
text = n.obfuscate("test")
b64_font = n.render()