regreplaceR

regreplaceR

This R package provides 4 tiny regex tools to extract, match, and replace named regex groups in strings. It contains in addition to 3 functions groups(), match_group() and replace_group() and an R6 class Replacer which is thought as the main interface of this package.

Installation

You can install this package by installing it from CRAN or from this GitHub repository.

# install directly from CRAN
install.packages("regreplaceR")

# or install directly from this github repository

# install the devtools package if not already installed
install.packages("devtools")

# install the package directly from this repository
devtools::install_github("gwangjinkim/regreplaceR")

Overview

The package offers three core functions to interact with strings using regular expressions with named groups:

Additionally, it includes an R6 class called Replacer that encapsulates these functionalities for a more object-oriented approach to regex-based operations.

I couldn’t use match() and replace() as generic functions, because there are already generic functions which require a specific set of arguments. By using an R6 class, I was free to choose the arguments for match() and replace().

1. groups()

This function extracts all named groups from a string based on a provided regex pattern.

library(regreplaceR)

pattern <- "(?P<name>\\w+) is (?P<age>\\d+)"
s <- "Jane is 25"
result <- groups(pattern, s)

# Output:
# $name
# [1] "Jane"
#
# $age
# [1] "25"

2. match_group()

Extract the value of a specific named group from the string.

library(regreplaceR)

pattern <- "(?P<name>\\w+) is (?P<age>\\d+)"
s <- "Jane is 25"
name <- match_group(pattern, s, "name")

# Output:
# [1] "Jane"

3. replace_group()

Replace the value of a specific named group with a new string.

library(regreplaceR)

pattern <- "(?P<name>\\w+) is (?P<age>\\d+)"
s <- "Jane is 25"
modified <- replace_group(pattern, s, "name", "John")

# Output:
# [1] "John is 25"

4. The Replacer Class

The Replacer class in the regreplace package provides utilities for regex operations, such as matching and replacing named groups.

Example Usage

library(regreplaceR)

# Create a new Replacer object
r <- Replacer$new(pattern = ".*?_(?P<date>\\d{8}-\\d{6})(?P<ext>\\..+$)")

# Match a group within a string
date <- r$match("file_20230905-123456.txt", "date")

# Replace the value of a matched group
modified_string <- r$replace("file_20230905-123456.txt", "date", "20240905-123456")

## simpler example:

# Create a Replacer object with a regex pattern
r <- Replacer$new(pattern="(?P<name>\\w+) is (?P<age>\\d+)")
   
# Match the "name" group in the string
name_match <- r$match("Jane is 25", "name")
print(name_match)  # Should print "Jane"
#> [1] "Jane"

# Match the "age" group in the string
age_match <- r$match("Jane is 25", "age")
print(age_match)  # Should print "25"
#> [1] "25"

# Replace the "name" group in the string
replaced_string <- r$replace("Jane is 25", "name", "John")
print(replaced_string)  # Should print "John is 25"
#> [1] "John is 25"

Class Methods

License

This package is licensed under the MIT License. See LICENSE for details.

Happy regexing! If you have any questions or find any bugs, please feel free to open an issue on the GitHub repository.