09/28: gnu: Add r-textclean.
From: guix-commits
Subject: 09/28: gnu: Add r-textclean.
Date: Mon, 15 Mar 2021 05:55:28 -0400 (EDT)
lbraun pushed a commit to branch master
in repository guix.
commit 804fad34e8e0f74483e987cfe5f6a496c1debe74
Author: Lars-Dominik Braun <ldb@leibniz-psychology.org>
AuthorDate: Mon Mar 15 09:40:05 2021 +0100
gnu: Add r-textclean.
* gnu/packages/cran.scm (r-textclean): New variable.
---
gnu/packages/cran.scm | 35 +++++++++++++++++++++++++++++++++++
1 file changed, 35 insertions(+)
diff --git a/gnu/packages/cran.scm b/gnu/packages/cran.scm
index b8a57cd..7f6003a 100644
--- a/gnu/packages/cran.scm
+++ b/gnu/packages/cran.scm
@@ -27510,3 +27510,38 @@ and word lists.")
three, ... Ordinals are also available, first, second, third, ... and
indefinite article choice, \"a\" or \"an\".")
(license license:gpl2)))
+
+(define-public r-textclean
+ (package
+ (name "r-textclean")
+ (version "0.9.3")
+ (source
+ (origin
+ (method url-fetch)
+ (uri (cran-uri "textclean" version))
+ (sha256
+ (base32
+ "0kgjh6c4f14qkjc4fds7q7rpf4nkma3p0igm54fplmm3p853nvrz"))))
+ (properties `((upstream-name . "textclean")))
+ (build-system r-build-system)
+ (propagated-inputs
+ `(("r-data-table" ,r-data-table)
+ ("r-english" ,r-english)
+ ("r-glue" ,r-glue)
+ ("r-lexicon" ,r-lexicon)
+ ("r-mgsub" ,r-mgsub)
+ ("r-qdapregex" ,r-qdapregex)
+ ("r-stringi" ,r-stringi)
+ ("r-textshape" ,r-textshape)))
+ (home-page
+ "https://github.com/trinker/textclean")
+ (synopsis "Text Cleaning Tools")
+ (description
+ "Tools to clean and process text. Tools are geared at checking for
+substrings that are not optimal for analysis and replacing or removing them
+(normalizing) with more analysis friendly substrings (see Sproat, Black, Chen,
+Kumar, Ostendorf, & Richards (2001) @url{doi:10.1006/csla.2001.0169}) or
+extracting them into new variables. For example, emoticons are often used in
+text but not always easily handled by analysis algorithms. The
+@code{replace_emoticon()} function replaces emoticons with word equivalents.")
+ (license license:gpl2)))