Python New Module Guide

January 4, 2025

Python new project guide

I’ve written this mostly as a guide to myself. I’m a little new to Python (so this is the perfect time for me to write this), but not new to programming.

This is not about the Python language or syntax, but about the right modules to use to set up a skeleton project.

This doc came out of painfully learning these things one by one.

Use `uv`

uv is an extremely fast Python package and project manager. It replaces pip, pipx, venv, and most of what you’d use hatch for. Install it following the official instructions.

Creating a new project with `uv init --package`

Use uv init --package my-package to create a new project:

$ uv init --package todo
$ cd todo
$ tree
.
├── pyproject.toml
├── README.md
└── src
    └── todo
        └── __init__.py

This sets up pyproject.toml with hatchling as the build backend and a script entry point:

[project.scripts]
todo = "todo:main"

[build-system]
requires = ["hatchling"]
build-backend = "hatchling.build"
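
The todo = "todo:main" line means "import the todo package and call its main function". As a rough sketch, the src/todo/__init__.py that uv generates looks something like this (the exact greeting text is an assumption and may differ between uv versions):

```python
# src/todo/__init__.py (approximate generated contents)

def main() -> None:
    # Called by the `todo` script entry point declared in pyproject.toml.
    print("Hello from todo!")


if __name__ == "__main__":
    main()
```

Replace the body of main with your real command-line logic; the entry point keeps working as long as the function name stays the same.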

Now you can run your package:

# Run a command in the project's virtual environment
$ uv run python -c 'import todo'

# Run the script entry point
$ uv run todo

uv automatically creates and manages the .venv virtual environment for you.

Adding dependencies

# Add a runtime dependency
$ uv add requests

# Add a development dependency
$ uv add --dev pytest ruff

Installing CLI tools globally

Instead of pipx, use uv tool install:

$ uv tool install ruff
$ uv tool install httpie

This installs tools in isolated environments and makes them available globally.

Test utilities

The presence of tests/__init__.py makes tests a package in its own right, so you can create a support utility at tests/config_reader.py and use it from tests/mytest.py:

from . import config_reader
from todo import some_feature

# use it
config_reader.read(...)

This is handy for common functionality across multiple tests.
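
Here is a minimal sketch of what such a helper and a test using it could look like. The read signature, the JSON fixture format and the test name are all made up for illustration; both files are shown in one block so it runs on its own.

```python
import json
from pathlib import Path


# --- tests/config_reader.py (hypothetical helper) ---
def read(path: Path) -> dict:
    """Load a JSON fixture file and return its contents as a dict."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


# --- tests/mytest.py (would start with: from . import config_reader) ---
def test_reads_fixture(tmp_path: Path) -> None:
    # tmp_path is pytest's built-in per-test temporary directory fixture.
    fixture = tmp_path / "config.json"
    fixture.write_text('{"name": "todo"}', encoding="utf-8")
    assert read(fixture) == {"name": "todo"}
```

In the real layout the test would call config_reader.read(...) through the relative import rather than read(...) directly.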

Use `xdg_base_dirs` for common directories

Use xdg_base_dirs instead of hardcoding paths such as ~/.todo/config.yaml.

xdg_config_home() for config files
xdg_cache_home() for cache
xdg_data_home() for data
xdg_state_home() for history and logs

E.g.:

from xdg_base_dirs import xdg_config_home

config_file = xdg_config_home() / "polyopen/config.yaml"

Then read the config file with mashumaro.
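
To make it concrete what xdg_config_home() saves you from writing, here is a stdlib-only sketch of the lookup rule from the XDG Base Directory specification. It is an illustration of the rule, not the library's actual code; the config_home name and the todo/config.yaml path are made up.

```python
import os
from pathlib import Path
from typing import Mapping


def config_home(env: Mapping[str, str] = os.environ) -> Path:
    """Resolve the user config directory following the XDG rule."""
    value = env.get("XDG_CONFIG_HOME", "")
    path = Path(value)
    # The spec treats unset, empty, and relative values as invalid,
    # in which case the default ~/.config applies.
    if value and path.is_absolute():
        return path
    return Path.home() / ".config"


# Hypothetical application path built on top of the resolved directory.
config_file = config_home() / "todo" / "config.yaml"
```

The other three functions follow the same pattern with different variables and defaults, which is exactly the kind of detail that is easier to leave to the library.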

`rich-argparse` over `argparse`

argparse is great.

Except that I want a couple of extra features from my --help text:

1. I want it to automatically adjust to my terminal width.
2. I want to be able to have separate paragraphs in the help text.
3. I want to show default values.

With vanilla argparse I can get 1. or 2. but not both, and with some hacks I can get 3. also.
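
To make the trade-off between the first two concrete, here is a small stdlib-only sketch. The description text and the fixed_width helper are made up for illustration, and the width is pinned to 60 columns so the result doesn't depend on the terminal.

```python
import argparse

# One long line, a blank line, then a second paragraph.
DESCRIPTION = (
    "Manage a todo list from the command line, with enough words here to "
    "force the help formatter to wrap this paragraph.\n"
    "\n"
    "Second paragraph with more detail."
)


def fixed_width(cls, width=60):
    # argparse calls formatter_class(prog=...); pin the width for stable output.
    return lambda prog: cls(prog, width=width)


def build_help(formatter_class) -> str:
    parser = argparse.ArgumentParser(
        prog="todo",
        description=DESCRIPTION,
        formatter_class=formatter_class,
    )
    return parser.format_help()


# Default formatter: wraps to the width, but merges the paragraphs.
wrapped = build_help(fixed_width(argparse.HelpFormatter))

# Raw formatter: keeps the paragraph break, but does no wrapping at all.
raw = build_help(fixed_width(argparse.RawDescriptionHelpFormatter))

print(wrapped)
print(raw)
```

The default formatter re-flows everything into a single wrapped paragraph, while RawDescriptionHelpFormatter keeps the blank line but leaves the long line as it was written.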

But with rich-argparse I can get all three:

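import argparse
from rich_argparse.contrib import ParagraphRichHelpFormatter
from rich.markdown import Markdown

# Example texts (placeholders); a blank line separates paragraphs.
description = """
Manage your todo list.

A second paragraph with more detail.
"""
foo_help = "An example option."

# Combine default-value display with paragraph-aware rich formatting.
class MyHelpFormatter(
    argparse.ArgumentDefaultsHelpFormatter,
    ParagraphRichHelpFormatter,
):
    pass

parser = argparse.ArgumentParser(
    description=Markdown(description, style="argparse.text"),
    formatter_class=MyHelpFormatter,
)

# The default value here is only an example, so that --help has one to show.
parser.add_argument("--foo", default="bar", help=foo_help)

args = parser.parse_args()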

As an added bonus, it looks much nicer than vanilla argparse.

Consider `mashumaro` for your JSON and YAML needs

Sure, you can use json for JSON and pyyaml for YAML, but then you’re stuck with untyped dicts for your data.

I’ve been using mashumaro with dataclasses instead, which gives you type-safe @dataclass objects.

# Basically, JSON, YAML (and TOML) all work the same way.
import mashumaro.codecs.json as json_codec
import mashumaro.codecs.yaml as yaml_codec
from dataclasses import dataclass

@dataclass
class NestedField:
    bool_field: bool

@dataclass
class SerializeMe:
    string_field: str
    int_field: int
    nested: NestedField

obj = SerializeMe(
    string_field="hello world",
    int_field=42,
    nested=NestedField(bool_field=True)
)

json_text = json_codec.encode(obj, SerializeMe)

print(json_text)
# Prints
# {"string_field": "hello world", "int_field": 42, "nested": {"bool_field": true}}

# Recall that JSON is a subset of YAML, so you can load JSON as YAML.
obj = yaml_codec.decode(json_text, SerializeMe)

print(obj)
# Prints
# SerializeMe(string_field='hello world', int_field=42, nested=NestedField(bool_field=True))
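
For contrast, this is roughly what the plain json route looks like using only the standard library. The classes are the same as in the example above; the variable names are just for illustration.

```python
import json
from dataclasses import dataclass


@dataclass
class NestedField:
    bool_field: bool


@dataclass
class SerializeMe:
    string_field: str
    int_field: int
    nested: NestedField


text = '{"string_field": "hello world", "int_field": 42, "nested": {"bool_field": true}}'

# json.loads returns a plain dict: keys and value types are unchecked.
data = json.loads(text)

# Unpacking into the dataclass "works", but nested is silently left as a dict.
naive = SerializeMe(**data)
print(type(naive.nested).__name__)

# Doing it properly by hand means writing every nested conversion yourself.
manual = SerializeMe(
    string_field=data["string_field"],
    int_field=data["int_field"],
    nested=NestedField(**data["nested"]),
)
print(manual)
```

That hand-written conversion is the part that grows with every new field and nested class, and it is the part mashumaro takes over.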
