k(r)ep - A high-performance string search utility

krep is a string search utility optimized for throughput when processing large files and directories. It offers multiple search algorithms and uses SIMD acceleration when available.

Note:
Krep is not intended to be a full replacement or direct competitor to feature-rich tools like grep or ripgrep. Instead, it aims to be a minimal, efficient, and pragmatic tool focused on speed and simplicity.

Krep provides the essential features needed for fast searching, without the extensive options and complexity of more comprehensive search utilities. Its design philosophy is to deliver the fastest possible search for the most common use cases, with a clean and minimal interface.

The Story Behind the Name

The name "krep" has an interesting origin. It is inspired by the Icelandic word "kreppan," which means "to grasp quickly" or "to catch firmly." I came across this word while researching efficient techniques for pattern recognition.

Just as skilled fishers identify patterns in the water to locate fish quickly, I designed "krep" to find patterns in text with maximum efficiency. The name is also short and easy to remember—perfect for a command-line utility that users might type hundreds of times per day.

Key Features

Multiple search algorithms: Boyer-Moore-Horspool, KMP, Aho-Corasick for optimal performance across different pattern types
Algorithm selection: Automatic smart selection with optional --algo override for fine-tuning
SIMD acceleration: Uses SSE4.2, AVX2, or NEON instructions when available for blazing-fast searches
Memory-mapped I/O: Maximizes throughput when processing large files
Multi-threaded search: Automatically parallelizes searches across available CPU cores
Regex support: POSIX Extended Regular Expression searching
Multiple pattern search: Efficiently search for multiple patterns simultaneously using Aho-Corasick
Recursive directory search: Skip binary files and common non-code directories
Gitignore support: Respect .gitignore files during recursive search with --gitignore
Stdin pattern input: Read patterns from stdin with -f - for seamless pipeline integration
Colored output: Highlights matches for better readability
UI/UX refresh (v2.2): New terminal color palette, clearer -o line index styling, and a redesigned help screen
Specialized algorithms: Optimized handling for single-character and short patterns
Match limiting: Stop searching a file after a specified number of matching lines is found

What's New in v2.2.0

Refined terminal-first UI with a cleaner, more legible color theme
Better visual hierarchy in -o mode (filename, line index, match highlight)
Improved --help layout with grouped sections and clearer scanning

Installation

Using Homebrew (macOS)

If you are on macOS and have Homebrew installed, you can install krep easily:

brew install krep

Building from Source

# Clone the repository
git clone https://github.com/davidesantangelo/krep.git
cd krep

# Build and install
make
sudo make install

# Uninstall
sudo make uninstall

The binary will be installed to /usr/local/bin/krep by default.

Requirements

GCC or compatible C compiler
POSIX-compliant system (Linux, macOS, BSD)
pthread support

Build Options

Override default optimization settings in the Makefile:

# Disable architecture-specific optimizations
make ENABLE_ARCH_DETECTION=0

Usage

krep [OPTIONS] PATTERN [FILE | DIRECTORY]
krep [OPTIONS] -e PATTERN [FILE | DIRECTORY]
krep [OPTIONS] -f FILE [FILE | DIRECTORY]
krep [OPTIONS] -s PATTERN STRING_TO_SEARCH
krep [OPTIONS] PATTERN < FILE
cat FILE | krep [OPTIONS] PATTERN
echo 'pattern' | krep -f - [FILE | DIRECTORY]

Usage Examples

Search for a fixed string in a file:

krep -F "value: 100%" config.ini

Search recursively:

krep -r "function" ./project

Search recursively respecting .gitignore:

krep -r --gitignore "TODO" ./project

Read patterns from stdin (pipe-friendly):

echo 'pattern' | krep -f - target.txt

Whole word search (matches only complete words):

krep -w 'cat' samples/text.en

Use with piped input:

cat krep.c | krep 'c'

Command Line Options

-i, --ignore-case              Case-insensitive search
-c, --count                    Count matching lines only
-o, --only-matching            Print only the matched parts of lines
-e PATTERN, --pattern=PATTERN  Specify pattern(s). Can be used multiple times.
-f FILE, --file=FILE           Read patterns from FILE, one per line. Use - for stdin.
-m NUM, --max-count=NUM        Stop searching each file after finding NUM matching lines.
-E, --extended-regexp          Use POSIX Extended Regular Expressions
-F, --fixed-strings            Interpret pattern as fixed string(s) (default unless -E is used)
-r, --recursive                Recursively search directories
--gitignore                    Respect .gitignore files during recursive search
--algo=ALGO                    Force search algorithm: auto (default), bm (Boyer-Moore), kmp (KMP)
-t NUM, --threads=NUM          Use NUM threads for file search (default: auto)
-s STRING, --string=STRING     Search in the provided STRING instead of file(s)
-w, --word-regexp              Match only whole words
--color[=WHEN]                 Control color output ('always', 'never', 'auto')
--no-simd                      Explicitly disable SIMD acceleration
-v, --version                  Show version information
-h, --help                     Show help message

Performance Benchmarks

Benchmarks use the English subtitles sample hosted at burntsushi.net:

curl -LO 'https://burntsushi.net/stuff/subtitles2016-sample.en.gz'
gzip -dk subtitles2016-sample.en.gz

You can reproduce the krep vs ripgrep comparison with:

make bench-rg
# optional: RUNS=7 bash test/benchmark_krep_vs_rg.sh Sherlock

krep v2.1.0 vs ripgrep (warm cache, 7 runs average baseline)

Pattern      krep avg real (s)   ripgrep avg real (s)   Speedup
`the`        0.175714            0.330000               1.88x
`Sherlock`   0.041429            0.080000               1.93x

Measured on macOS ARM64 with test/benchmark_krep_vs_rg.sh. Results vary by CPU, storage and cache state.

How Krep Works

Krep achieves its high performance through several key techniques:

1. Smart Algorithm Selection

Krep automatically selects the optimal search algorithm based on the pattern and available hardware:

Boyer-Moore-Horspool for most literal string searches
Knuth-Morris-Pratt (KMP) for very short patterns and repetitive patterns
memchr optimization for single-character patterns
SIMD Acceleration (SSE4.2, AVX2, or NEON) for compatible hardware
Regex Engine for regular expression patterns
Aho-Corasick for efficient multiple pattern matching (auto-selected with multiple -e patterns)

Use --algo=bm or --algo=kmp to override the automatic selection for single-pattern literal searches.
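
To make the first item in the list concrete, here is a minimal sketch of a Boyer-Moore-Horspool search in C. It is not taken from krep's source: the function name `bmh_find` and its signature are hypothetical, and krep's real implementation may differ (for example in how it handles case folding or SIMD paths). The sketch only shows the core idea, a per-byte shift table that lets the search skip ahead after a mismatch.

```c
#include <limits.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper (not krep's API): returns the offset of the first
 * occurrence of pat in text, or (size_t)-1 when there is none. */
size_t bmh_find(const char *text, size_t n, const char *pat, size_t m)
{
    const unsigned char *t = (const unsigned char *)text;
    const unsigned char *p = (const unsigned char *)pat;
    size_t shift[UCHAR_MAX + 1];
    size_t i;

    if (m == 0 || m > n)
        return (size_t)-1;

    /* A byte that never occurs in the pattern allows a full skip of m. */
    for (i = 0; i <= UCHAR_MAX; i++)
        shift[i] = m;
    /* For bytes in pat[0..m-2], shift so their last occurrence lines up. */
    for (i = 0; i + 1 < m; i++)
        shift[p[i]] = m - 1 - i;

    /* The shift is chosen from the text byte under the pattern's last slot. */
    for (i = 0; i + m <= n; i += shift[t[i + m - 1]]) {
        if (memcmp(t + i, p, m) == 0)
            return i;
    }
    return (size_t)-1;
}
```

Every shift is at least 1, so the loop always advances. Longer patterns tend to produce larger skips, which fits the statement above that this algorithm serves most literal searches while very short patterns are handled differently.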

2. Multi-threading Architecture

Krep utilizes parallel processing to dramatically speed up searches:

Automatically detects available CPU cores
Divides large files into chunks for parallel processing
Implements thread pooling for maximum efficiency
Optimized thread count selection based on file size
Careful boundary handling to ensure no matches are missed
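
The boundary handling mentioned in the last item can be illustrated with a small sketch. It assumes one common approach, which may not be the one krep uses: each chunk owns the matches that start inside it, and the scan is allowed to read up to pattern length minus one bytes past the chunk end. The names `count_in_chunk` and `count_chunked` are hypothetical, the chunks run sequentially here rather than on worker threads, and the sketch counts occurrences rather than matching lines, to keep the focus on the boundary arithmetic.

```c
#include <stddef.h>
#include <string.h>

/* Count occurrences of pat whose first byte lies in [start, end).
 * A match may extend past `end`, so one that straddles a chunk boundary
 * is still seen, and it is counted by exactly one chunk. */
size_t count_in_chunk(const char *buf, size_t len, size_t start, size_t end,
                      const char *pat, size_t m)
{
    size_t count = 0;
    size_t i;

    if (m == 0 || m > len)
        return 0;
    if (end > len - m + 1)
        end = len - m + 1;  /* last start position where pat still fits */
    for (i = start; i < end; i++) {
        if (memcmp(buf + i, pat, m) == 0)
            count++;
    }
    return count;
}

/* Split buf into `chunks` roughly equal ranges and sum the per-chunk
 * counts. In a threaded design each range would go to one worker.
 * Note: len * c can overflow for enormous inputs; a real tool would guard it. */
size_t count_chunked(const char *buf, size_t len, const char *pat, size_t m,
                     size_t chunks)
{
    size_t total = 0;
    size_t c;

    if (chunks == 0)
        chunks = 1;
    for (c = 0; c < chunks; c++) {
        size_t start = len * c / chunks;
        size_t end = len * (c + 1) / chunks;
        total += count_in_chunk(buf, len, start, end, pat, m);
    }
    return total;
}
```

Because ownership is decided by the start offset alone, the total does not depend on how many chunks the buffer is split into.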

3. Memory-Mapped I/O

Instead of traditional read operations, Krep:

Memory maps files for direct access by the CPU
Significantly reduces I/O overhead
Enables CPU cache optimization
Progressive prefetching for larger files

4. Optimized Data Structures

Zero-copy architecture where possible
Efficient match position tracking
Lock-free aggregation of results
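
One simple way to track match positions is a growable array of offset pairs. The sketch below is an assumption about what such a structure could look like, not a copy of krep's internals: the type names `match_pos` and `match_list` and both functions are hypothetical. It stores only offsets into the searched buffer, which is in the spirit of the zero-copy point above, since no matched text is duplicated.

```c
#include <stddef.h>
#include <stdlib.h>

typedef struct {
    size_t start;   /* offset of the first matched byte */
    size_t end;     /* offset one past the last matched byte */
} match_pos;

typedef struct {
    match_pos *items;
    size_t count;
    size_t capacity;
} match_list;

/* Append one match. Capacity doubles when full, so appends are amortized
 * constant time. Returns 0 on success, -1 if memory runs out. */
int match_list_push(match_list *list, size_t start, size_t end)
{
    if (list->count == list->capacity) {
        size_t new_cap = list->capacity ? list->capacity * 2 : 16;
        match_pos *grown = realloc(list->items, new_cap * sizeof *grown);
        if (grown == NULL)
            return -1;          /* the old array is still valid */
        list->items = grown;
        list->capacity = new_cap;
    }
    list->items[list->count].start = start;
    list->items[list->count].end = end;
    list->count++;
    return 0;
}

/* Release the array and reset the list to its empty state. */
void match_list_free(match_list *list)
{
    free(list->items);
    list->items = NULL;
    list->count = 0;
    list->capacity = 0;
}
```

A zero-initialized `match_list` is a valid empty list. In a multi-threaded search, giving each worker its own list and merging afterwards would avoid sharing a single array between threads.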

5. Skipping Non-Relevant Content

When using recursive search (-r), Krep automatically:

Skips common binary file types
Ignores version control directories (.git, .svn)
Bypasses dependency directories (node_modules, venv)
Detects binary content to avoid searching non-text files
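
The checks in this list can be sketched as two small predicates. The directory names come from the list above; everything else is an assumption for illustration. In particular, treating a NUL byte in the first 8192 bytes as a sign of binary content is a widely used heuristic, but it is not confirmed to be krep's rule, and the names `should_skip_dir` and `looks_binary` are hypothetical.

```c
#include <stddef.h>
#include <string.h>

/* Returns 1 if a directory with this name should not be descended into. */
int should_skip_dir(const char *name)
{
    static const char *const skipped[] = {
        ".git", ".svn", "node_modules", "venv"
    };
    size_t i;

    for (i = 0; i < sizeof skipped / sizeof skipped[0]; i++) {
        if (strcmp(name, skipped[i]) == 0)
            return 1;
    }
    return 0;
}

/* Returns 1 if the buffer looks like binary data. Assumed heuristic:
 * text files rarely contain NUL bytes, so one NUL in the first 8192
 * bytes is taken as a sign of binary content. */
int looks_binary(const char *buf, size_t len)
{
    size_t probe = len < 8192 ? len : 8192;

    return memchr(buf, '\0', probe) != NULL;
}
```

Checking only a fixed-size prefix keeps the test cheap for large files, at the cost of missing binary data that appears later in the file.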

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Author

Davide Santangelo - GitHub

License

This project is licensed under the BSD 2-Clause License. See the LICENSE file for details.
