
OCyara
OCyara performs OCR on images and PDF files to extract text content and scan it against Yara rules for malware detection.

OCyara Description
OCyara is a Python module that performs Optical Character Recognition (OCR) on image files and scans the extracted text for matches against Yara rules. It handles a range of image formats, including GIF and TIFF, and also processes images embedded in PDF files.
The module requires Python 3.5+ and is designed for Debian-based Linux distributions; it has been tested on Kali Rolling and Ubuntu 16.10. OCR is provided by tesserocr, which depends on the Tesseract OCR API. The system dependencies must be installed manually and include python3-dev, tesseract-ocr, libtesseract-dev, libleptonica-dev, and various image format libraries. On some Ubuntu LTS installations, Tesseract and Leptonica may need to be compiled manually to get full image format support.
Once the system requirements are met, OCyara is installed through pip. Cython must be installed separately because of tesserocr's dependencies.
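The OCR-then-scan workflow described above can be illustrated with a short Python sketch. This is not OCyara's own API or implementation: it calls tesserocr and yara-python directly, and the helper names (`ocr_with_tesserocr`, `compile_rules`, `scan_images`) are hypothetical, chosen only for this example. It assumes both libraries are installed and does not cover PDF extraction.

```python
from typing import Callable, Dict, Iterable, List


def ocr_with_tesserocr(image_path: str) -> str:
    """Extract text from one image file using tesserocr."""
    # Imported lazily so the module loads even where Tesseract is missing.
    import tesserocr
    return tesserocr.file_to_text(image_path)


def compile_rules(rule_source: str):
    """Compile Yara rules from a source string using yara-python."""
    import yara
    return yara.compile(source=rule_source)


def scan_images(
    image_paths: Iterable[str],
    rules,
    ocr: Callable[[str], str] = ocr_with_tesserocr,
) -> Dict[str, List[str]]:
    """Return {image_path: [matched rule names]} for images with matches.

    `rules` is any object exposing match(data=...) that returns items with
    a `.rule` attribute, as compiled yara-python rules do. `ocr` is
    injectable so a different OCR backend can be swapped in.
    """
    results: Dict[str, List[str]] = {}
    for path in image_paths:
        text = ocr(path) or ""
        # Yara scans bytes, so encode the recognised text first.
        matches = rules.match(data=text.encode("utf-8"))
        if matches:
            results[path] = [m.rule for m in matches]
    return results
```

A typical call would pass a list of image paths together with rules built by `compile_rules`. Images whose recognised text triggers no rule are left out of the result.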