Commonly Consumed Code Commodities
The reusables library is a cookbook of python functions and classes that programmers may find themselves often recreating.
It includes:
- Archiving and extraction for zip, tar, gz, bz2, and rar
- Path (file and folders) management
- Fast logging setup and tools
- Namespace (dict to class modules with child recursion)
- Friendly datetime formatting
- Config to dict parsing
- Common regular expressions and file extensions
- Helpful wrappers
- Bash analogues
- Easy downloading
- Multiprocessing helpers
Reusables is on PyPI, so can be easily installed with pip or easy_install.
pip install reusables
There are no required decencies. If this doesn't work, it's broken, raise a github issue.
Reusables is designed to not require any imports outside the standard library, but can be supplemented with those found in the requirements.txt file for additional functionality.
CI tests run on:
- Python 2.6+
- Python 3.3+
- Pypy
Examples are provided below, and the API documentation can always be found at readthedocs.org.
Please note this is currently in development. Any item in development prior to a major version (1.x, 2.x) may change. Once at a major version, no breaking changes are planned to occur within that version.
import reusables
reusables.find_files_list(".", ext=reusables.exts.pictures)
# ['/home/user/background.jpg', '/home/user/private.png']
reusables.archive("reusables", name="reuse", archive_type="zip")
# 'C:\\Users\\Me\\Reusables\\reuse.zip'
reusables.extract("test/test_structure.zip", "my_archive")
# All files in the zip will be extracted into directory "my_archive"
reusables.config_dict('my_config.cfg')
# {'Section 1': {'key 1': 'value 1', 'key2': 'Value2'}, 'Section 2': {}}
reusables.count_files(".")
# 405
reusables.file_hash("test_structure.zip", hash_type="sha256")
# 'bc11af55928ab89771b9a141fa526a5b8a3dbc7d6870fece9b91af5d345a75ea'
reusables.safe_path('/home/user/eViL User\0\\/newdir$^&*/new^%file.txt')
# '/home/user/eViL User__/newdir____/new__file.txt'
reusables.run("echo 'hello there!'", shell=True)
# CompletedProcess(args="echo 'hello there!'", returncode=0, stdout='hello there!\n')
reusables.cut("abcdefghi")
# ['ab', 'cd', 'ef', 'gh', 'i']
One of the most reusables pieces of code is the find_files. It is always appearing on stackoverflow and forums of how to implement os.walk or glob; here's both.
reusables.find_files_list(".", name="*reuse*", depth=2)
# ['.\\test\\test_reuse.py', '.\\test\\test_reuse_datetime.py',
# '.\\test\\test_reuse_logging.py', '.\\test\\test_reuse_namespace.py']
# match_case works for both ext and name
# depth of 1 means this working directory only, no further
reusables.find_files_list(ext=".PY", depth=1, match_case=True)
# []
reusables.find_files_list(ext=".py", depth=1, match_case=True)
# ['.\\setup.py']
reusables.find_files_list(name="setup", ext=".py", match_case=True)
# ['.\\setup.py']
reusables.find_files_list(name="Setup", ext=".py", match_case=True)
# []
Check out Box, a much improved version as its own library.
Dictionary management class, similar to Bunch, but designed so that sub-dictionaries are recursively made into namespaces.
my_breakfast = {"spam": {"eggs": {"sausage": {"bacon": "yummy"}}}}
namespace_breakfast = reusables.Namespace(**my_breakfast)
print(namespace_breakfast.spam.eggs.sausage.bacon)
# yummy
print(namespace_breakfast.spam.eggs['sausage'].bacon)
# yummy
str(namespace_breakfast['spam'].eggs)
# "{'sausage': {'bacon': 'yummy'}}"
namespace_breakfast.to_dict()
#{'spam': {'eggs': {'sausage': {'bacon': 'yummy'}}}}
dict(namespace_breakfast)
# {'spam': <Namespace: {'eggs': {'sausage': {'bacon': '...>}
# This is NOT the same as .to_dict() as it is not recursive
logger = reusables.setup_logger(__name__)
# By default it adds a stream logger to sys.stderr
logger.info("Test")
# 2016-04-25 19:32:45,542 __main__ INFO Test
There are multiple log formatters provided, as well as additional helper functions. All helper functions will accept either the logger object or the name of the logger.
reusables.remove_stream_handlers(__name__)
# remove_file_handlers() and remove_all_handlers() also available
reusables.add_stream_handler(__name__, log_format=reusables.log_formats.detailed)
r.add_rotating_file_handler(__name__, "my.log", level=logging.INFO)
logger.info("Example log entry")
# 2016-12-14 20:56:55,446 : 315147 MainThread : reusables.log INFO Example log entry
open("my.log").read()
# 2016-12-14 20:56:55,446 - __builtin__ INFO Example log entry
Provided log formats
Feel free to provide your own formats, aided by the docs. However this includes some commonly used ones you may find useful. they are all stored in the Namespace "reusables.log_formats" (feel free to use it as a dict as stated above).
- Because ReStructuredText tables don't preserve whitespace (even with literals),
- which is important to show distinction in these formatters, here's it in a code block instead.
reusables.log_formats.keys()
# ['common', 'level_first', 'threaded', 'easy_read', 'easy_thread', 'detailed']
logger = reusables.setup_logger(__name__, log_format=reusables.log_formats.threaded)
reusables.add_timed_rotating_file_handler(logger, "timed.log", level=logging.ERROR, log_format=reusables.log_formats.detailed)
+--------------+--------------------------------------------------------------------------------------+ | Formatter | Example Output | +==============+======================================================================================+ | easy_read | 2016-04-26 21:17:51,225 - example_logger INFO example log message | | | 2016-04-26 21:17:59,074 - example_logger ERROR Something broke | +--------------+--------------------------------------------------------------------------------------+ | detailed | 2016-04-26 21:17:51,225 : 7020 MainThread : example_logger INFO example log message | | | 2016-04-26 21:17:59,074 : 14868 MainThread : example_logger ERROR Something broke | +--------------+--------------------------------------------------------------------------------------+ | level_first | INFO - example_logger - 2016-04-26 21:17:51,225 - example log message | | | ERROR - example_logger - 2016-04-26 21:17:59,074 - Something broke | +--------------+--------------------------------------------------------------------------------------+ | threaded | 7020 MainThread : example log message | | | 14868 MainThread : Something broke | +--------------+--------------------------------------------------------------------------------------+ | easy_thread | 7020 MainThread : example_logger INFO example log message | | | 14868 MainThread : example_logger ERROR Something broke | +--------------+--------------------------------------------------------------------------------------+ | common | 2016-04-26 21:17:51,225 - example_logger - INFO - example log message | | | 2016-04-26 21:17:59,074 - example_logger - ERROR - Something broke | +--------------+--------------------------------------------------------------------------------------+
It's common to be looking for a specific type of file.
if file_path.endswith(reusables.exts.pictures):
print("{} is a picture file".format(file_path))
That's right, str.endswith (as well as str.startswith) accept a tuple to search.
File Type | Extensions |
---|---|
pictures | .jpeg .jpg .png .gif .bmp .tif .tiff .ico .mng .tga .psd .xcf .svg .icns |
video | .mkv .avi .mp4 .mov .flv .mpeg .mpg .3gp .m4v .ogv .asf .m1v .m2v .mpe .ogv .wmv .rm .qt .3g2 .asf .vob |
music | .mp3 .ogg .wav .flac .aif .aiff .au .m4a .wma .mp2 .m4a .m4p .aac .ra .mid .midi .mus .psf |
documents | .doc .docx .pdf .xls .xlsx .ppt .pptx .csv .epub .gdoc .odt .rtf .txt .info .xps .gslides .gsheet .pages .msg .tex .wpd .wps .csv |
archives | .zip .rar .7z .tar.gz .tgz .gz .bzip .bzip2 .bz2 .xz .lzma .bin .tar |
cd_images | .iso .nrg .img .mds .mdf .cue .daa |
scripts | .py .sh .bat |
binaries | .msi .exe |
markup | .html .htm .xml .yaml .json .raml .xhtml .kml |
unique
There are tons of wrappers for caching and saving inputs and outputs, this is a different take that requires the function returns a result not yet provided.
@reusables.unique(max_retries=100, error_text="All UIDs taken!")
def gen_small_uid():
return random.randint(0, 100)
time_it
Easily time the execution time of a function, using the high precision perf_conuter on Python 3.3+, otherwise clock.
@reusables.time_it()
def test_it():
return time.sleep(float(f"0.{random.randint(1, 9)}"))
Use the Python interpreter as much as a shell? Here's some handy helpers to fill the void. (Please don't do 'import *' in production code, this is used as an easy to use example using the interpreter interactively.)
> These are not imported by default with "import reusables", as they are designed to be imported only in an interactive shell
Some commands from other areas are also included where they are highly applicable in both instances, such as 'touch' and 'download'.
from reusables.cli import *
cd("~") # Automatic user expansion unlike os.chdir()
pwd()
# '/home/user'
pushd("Downloads")
# ['Downloads', '/home/user']
pwd()
# '/home/user/Downloads'
popd()
# ['/home/user']
ls("-lah") # Uses 'ls' on linux and 'dir' on windows
# total 1.5M
# drwxr-xr-x 49 james james 4.0K Nov 1 20:09 .
# drwxr-xr-x 3 root root 4.0K Aug 21 2015 ..
# -rw-rw-r-- 1 james james 22K Aug 22 13:21 picture.jpg
# -rw------- 1 james james 17K Nov 1 20:08 .bash_history
cmd("ifconfig") # Shells, decodes and prints 'reusables.run' output
# eth0 Link encap:Ethernet HWaddr de:ad:be:ef:00:00
# inet addr:10.0.2.5 Bcast:10.0.2.255 Mask:255.255.255.0
# ...
download('https://www.python.org/ftp/python/README.html', save_to_file=False)
# 2016-11-02 10:37:23,644 - reusables.web INFO Downloading https://www.python.org/ftp/python/README.html (2.3 KB) to memory
# b'<PRE>\nPython Distribution...
Easy formatting for datetime objects. Also parsing for ISO formatted time.
reusables.datetime_format("Wake up {son}, it's {hours}:{minutes} {periods}!"
"I don't care if it's a {day-fullname}, {command}!",
son="John",
command="Get out of bed!")
# "Wake up John, it's 09:51 AM! I don't care if it's a Saturday, Get out of bed!!"
reusables.datetime_from_iso('2019-03-10T12:56:55.031863')
# datetime.datetime(2019, 3, 10, 12, 56, 55, 31863)
Examples based on Mon Mar 28 13:27:11 2016
Format | Mapping | Example |
---|---|---|
{12-hour} | %I | 01 |
{24-hour} | %H | 13 |
{seconds} | %S | 14 |
{minutes} | %M | 20 |
{microseconds} | %f | 320944 |
{time-zone} | %Z | |
{years} | %y | 16 |
{years-full} | %Y | 2016 |
{months} | %m | 03 |
{months-name} | %b | Mar |
{months-full} | %B | March |
{days} | %d | 28 |
{week-days} | %w | 1 |
{year-days} | %j | 088 |
{days-name} | %a | Mon |
{days-full} | %A | Monday |
{mon-weeks} | %W | 13 |
{date} | %x | 03/28/16 |
{time} | %X | 13:27:11 |
{date-time} | %C | Mon Mar 28 13:27:11 2016 |
{utc-offset} | %Z | |
{periods} | %p | PM |
{iso-format} | %Y-%m-%dT%H:%M:%S | 2016-03-28T13:27:11 |
How can I help? / Why doesn't it do what I want it too?
Please feel free to make suggestions in the 'issues' section of github, or to be super duper helpful go ahead and submit a PR for the functionality you want to see! Only requirements are that it's well thought out and is more in place here rather than it's own project (to be merged will need documentation and basic unittests as well, but not a requirement for opening the PR). Please don't hesitate if you're new to python! Even the smallest PR contributions will earn a mention in a brand new Contributors section.
Unrar not installed?
A common error to see, especially on Windows based systems, is: "rarfile.RarCannotExec: Unrar not installed? (rarfile.UNRAR_TOOL='unrar')"
This is probably because unrar is not downloaded or linked properly. Download UnRAR from http://www.rarlab.com/rar_add.htm and follow these instructions before trying again: http://rarfile.readthedocs.org/en/latest/faq.html?highlight=windows#how-can-i-get-it-work-on-windows
The MIT License (MIT)
Copyright (c) 2014-2020 Chris Griffith
Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
This does not claim to provide the most accurate, fastest or most 'pythonic' way to implement these useful snippets, this is simply designed for easy reference. Any contributions that would help add functionality or improve existing code is warmly welcomed!