Tap is a typed modernization of Python's argparse library.
Tap provides the following benefits:
- Static type checking
- Code completion
- Source code navigation (e.g. go to definition and go to implementation)
See this poster, which we presented at PyCon 2020, for a presentation of some of the relevant concepts we used to guide the development of Tap.
Tap requires Python 3.6+
To install Tap from PyPI run:
pip install typed-argument-parser
To install Tap from source, run the following commands:
git clone https://github.com/swansonk14/typed-argument-parser.git
cd typed-argument-parser
pip install -e .
To see this, let's look at an example:
"""main.py"""
from tap import Tap
class SimpleArgumentParser(Tap):
name: str # Your name
language: str = 'Python' # Programming language
package: str = 'Tap' # Package name
stars: int # Number of stars
max_stars: int = 5 # Maximum stars
args = SimpleArgumentParser().parse_args()
print(f'My name is {args.name} and I give the {args.language} package '
f'{args.package} {args.stars}/{args.max_stars} stars!')
You use Tap the same way you use standard argparse.
>>> python main.py --name Jesse --stars 5
My name is Jesse and I give the Python package Tap 5/5 stars!
The equivalent argparse code is:
"""main.py"""
from argparse import ArgumentParser
parser = ArgumentParser()
parser.add_argument('--name', type=str, required=True,
help='Your name')
parser.add_argument('--language', type=str, default='Python',
help='Programming language')
parser.add_argument('--package', type=str, default='Tap',
help='Package name')
parser.add_argument('--stars', type=int, required=True,
help='Number of stars')
parser.add_argument('--max_stars', type=int, default=5,
help='Maximum stars')
args = parser.parse_args()
print(f'My name is {args.name} and I give the {args.language} package '
f'{args.package} {args.stars}/{args.max_stars} stars!')
The advantages of being Python-native include being able to:
- Overwrite convenient built-in methods (e.g.
process_args
ensures consistency among arguments) - Add custom methods
- Inherit from your own template classes
Now we are going to highlight some of our favorite features and give examples of how they work in practice.
Arguments are specified as class variables defined in a subclass of Tap
. Variables defined as name: type
are required arguments while variables defined as name: type = value
are not required and default to the provided value.
class MyTap(Tap):
required_arg: str
default_arg: str = 'default value'
Single line comments are automatically parsed into the help string provided when running python main.py -h
. The type and default values of arguments are also provided in the help string.
"""main.py"""
from tap import Tap
class MyTap(Tap):
x: float # What am I?
pi: float = 3.14 # I'm pi!
args = MyTap().parse_args()
Running python main.py -h
results in the following:
>>> python main.py -h
usage: demo.py --x X [--pi PI] [-h]
optional arguments:
--x X (float, required) What am I?
--pi PI (float, default=3.14) I'm pi!
-h, --help show this help message and exit
Python's argparse provides a number of advanced argument parsing features with the add_argument
method. Since Tap is a wrapper around argparse, Tap provides all of the same functionality.
To make use of this functionality, first define arguments as class variables as usual, then override Tap's add_arguments
and use self.add_argument
just as you would use argparse's add_argument
.
from tap import Tap
class MyTap(Tap):
positional_argument: str
list_of_three_things: List[str]
argument_with_really_long_name: int
def add_arguments(self):
self.add_argument('positional_argument')
self.add_argument('--list_of_three_things', nargs=3)
self.add_argument('-arg', '--argument_with_really_long_name')
Tap automatically handles all of the following types:
str, int, float, bool
Optional, Optional[str], Optional[int], Optional[float], Optional[bool]
List, List[str], List[int], List[float], List[bool]
Set, Set[str], Set[int], Set[float], Set[bool]
Tuple, Tuple[Type1, Type2, etc.], Tuple[Type, ...]
Literal
Each is automatically parsed to their respective types, just like argparse.
If an argument arg
is specified as arg: bool
or arg: bool = False
, then adding the --arg
flag to the command line will set arg
to True
. If arg
is specified as arg: bool = True
, then adding --arg
sets arg
to False
.
Note that if the Tap
instance is created with explicit_bool=True
, then booleans can be specified on the command line as --arg True
or --arg False
rather than --arg
. Additionally, booleans can be specified by prefixes of True
and False
with any capitalization as well as 1
or 0
(e.g. for True, --arg tRu
, --arg T
, --arg 1
all suffice).
These arguments are parsed in exactly the same way as str
, int
, float
, and bool
. Note bools can be specified using the same rules as above and that Optional
is equivalent to Optional[str]
.
If an argument arg
is a List
, simply specify the values separated by spaces just as you would with regular argparse. For example, --arg 1 2 3
parses to arg = [1, 2, 3]
.
Identical to List
but parsed into a set rather than a list.
Tuples can be used to specify a fixed number of arguments with specified types using the syntax Tuple[Type1, Type2, etc.]
(e.g. Tuple[str, int, bool, str]
). Tuples with a variable number of arguments are specified by Tuple[Type, ...]
(e.g. Tuple[int, ...]
). Note Tuple
defaults to Tuple[str, ...]
.
Literal is analagous to argparse
's choices, which specifies the values that an argument can take. For example, if arg can only be one of 'H', 1, False, or 1.0078 then you would specify that arg: Literal['H', 1, False, 1.0078]
. For instance, --arg False
assigns arg to False and --arg True
throws error. The Literal
type was introduced in Python 3.8 (PEP 586) and can be imported with from typing_extensions import Literal
.
More complex types must be specified with the type
keyword argument in add_argument
, as in the example below.
def to_number(string: str):
return float(string) if '.' in string else int(string)
class MyTap(Tap):
number: Union[int, float]
def add_arguments(self):
self.add_argument('--number', type=to_number)
With complex argument parsing, arguments often end up having interdependencies. This means that it may be necessary to disallow certain combinations of arguments or to modify some arguments based on other arguments.
To handle such cases, simply override process_args
and add the required logic. process_args
is automatically called when parse_args
is called.
class MyTap(Tap):
package: str
is_cool: bool
stars: int
def process_args(self):
# Validate arguments
if self.is_cool and self.stars < 4:
raise ValueError('Cool packages cannot have fewer than 4 stars')
# Modify arguments
if self.package == 'Tap':
self.is_cool = True
self.stars = 5
Similar to argparse's parse_known_args
, Tap is capable of parsing only arguments that it is aware of without raising an error due to additional arguments. This can be done by calling parse_args
with known_only=True
. The remaining un-parsed arguments are then available by accessing the extra_args
field of the Tap object.
class MyTap(Tap):
package: str
args = MyTap().parse_args(['--package', 'Tap', '--other_arg', 'value'], known_only=True)
print(args.extra_args) # ['--other_arg', 'value']
It is sometimes useful to define a template Tap and then subclass it for different use cases. Since Tap is a native Python class, inheritance is built-in, making it easy to customize from a template Tap.
In the example below, StarsTap
and AwardsTap
inherit the arguments (package
and is_cool
) and the methods (process_args
) from BaseTap
.
class BaseTap(Tap):
package: str
is_cool: bool
def process_args(self):
if self.package == 'Tap':
self.is_cool = True
class StarsTap(BaseTap):
stars: int
class AwardsTap(BaseTap):
awards: List[str]
Tap uses Python's pretty printer to print out arguments in an easy-to-read format.
"""main.py"""
from tap import Tap
from typing import List
class MyTap(Tap):
package: str
is_cool: bool = True
awards: List[str] = ['amazing', 'wow', 'incredible', 'awesome']
args = MyTap().parse_args()
print(args)
Running python main.py --package Tap
results in:
>>> python main.py
{'awards': ['amazing', 'wow', 'incredible', 'awesome'],
'is_cool': True,
'package': 'Tap'}
Tap makes reproducibility easy, especially when running code in a git repo.
Specifically, Tap has a method called get_reproducibility_info
that returns a dictionary containing all the information necessary to replicate the settings under which the code was run. This dictionary includes:
- Python command
- The Python command that was used to run the program
- Ex.
python main.py --package Tap
- Time
- The time when the command was run
- Ex.
Thu Aug 15 00:09:13 2019
- Git root
- The root of the git repo containing the code
- Ex.
/Users/swansonk14/typed-argument-parser
- Git url
- The url to the git repo, specifically pointing to the current git hash (i.e. the hash of HEAD in the local repo)
- Ex. https://github.com/swansonk14/typed-argument-parser/tree/446cf046631d6bdf7cab6daec93bf7a02ac00998
- Uncommited changes
- Whether there are any uncommitted changes in the git repo (i.e. whether the code is different from the code at the above git hash)
- Ex.
True
orFalse
Tap has a method called save
which saves all arguments, along with the reproducibility info, to a JSON file.
"""main.py"""
from tap import Tap
class MyTap(Tap):
package: str
is_cool: bool = True
stars: int = 5
args = MyTap().parse_args()
args.save('args.json')
After running python main.py --package Tap
, the file args.json
will contain:
{
"is_cool": true,
"package": "Tap",
"reproducibility": {
"command_line": "python main.py --package Tap",
"git_has_uncommitted_changes": false,
"git_root": "/Users/swansonk14/typed-argument-parser",
"git_url": "https://github.com/swansonk14/typed-argument-parser/tree/446cf046631d6bdf7cab6daec93bf7a02ac00998",
"time": "Thu Aug 15 00:18:31 2019"
},
"stars": 5
}
Note: More complex types will be encoded in JSON as a pickle string.
Arguments can be loaded from a JSON file rather than parsed from the command line.
"""main.py"""
from tap import Tap
class MyTap(Tap):
package: str
is_cool: bool = True
stars: int = 5
args = MyTap()
args.load('args.json')
Note: All required arguments (in this case package
) must be present in the JSON file if not already set in the Tap object.
Arguments can be loaded from a Python dictionary rather than parsed from the command line.
"""main.py"""
from tap import Tap
class MyTap(Tap):
package: str
is_cool: bool = True
stars: int = 5
args = MyTap()
args.from_dict({
'package': 'Tap',
'stars': 20
})
Note: As with load
, all required arguments must be present in the dictionary if not already set in the Tap object. All values in the provided dictionary will overwrite values currently in the Tap object.