DEV Community

Chi Cong, Nguyen
Chi Cong, Nguyen

Posted on

Python Best Practices for Data Engineer

Logging, Error Handling, Environment Variables, and Argument Parsing

In this guide, we'll explore best practices for using various Python functionalities to build robust and maintainable applications. We'll cover logging, exception handling, environment variable management, and argument parsing, with code samples and recommendations for each.

1. Logging
2. Try-Exception Handling
3. Using python-dotenv for Environment Variables
1. Argparse

Logging

Logging is a crucial aspect of Python development, as it allows you to track the execution of your program, identify issues, and aid in debugging. Here's how to effectively incorporate logging into your Python projects:

# example.py
import logging

# Configure the logging format and level
logging.basicConfig(
    format='%(asctime)s - %(levelname)s - %(message)s',
    level=logging.INFO
)

# Example usage
logging.info('This is an informational message.')
logging.warning('This is a warning message.')
logging.error('This is an error message.')
logging.debug('This is a debug message.')
Enter fullscreen mode Exit fullscreen mode

Best Practices:

  • Use meaningful log levels (debug, info, warning, error, critical) to provide valuable context. Avoid logging sensitive information, such as credentials or personal data.
  • Ensure log messages are concise and informative, helping you quickly identify and resolve issues.
  • Configure log file rotation and retention to manage storage and performance.
  • Use structured logging (e.g., with JSON) for better machine-readability and analysis.

Try Exception Handling

Proper exception handling is essential for creating robust and resilient applications. Here's an example of using try-except blocks in Python:

def divide_numbers(a, b):
    try:
        result = a / b
        return result
    except ZeroDivisionError:
        logging.error('Error: Division by zero')
        return None
    except TypeError:
        logging.error('Error: Invalid input types')
        return None
    except Exception as e:
        logging.error(f'Unexpected error: {e}')
        return None
Enter fullscreen mode Exit fullscreen mode

Best Practices:

  • Catch specific exceptions (e.g., ZeroDivisionError, TypeError) to handle known issues.
  • Use a broad Exception block to catch any unexpected errors.
  • Log the exception details for better debugging and error reporting.
  • Provide meaningful error messages that help users understand the problem.
  • Consider using custom exception classes for domain-specific errors.
  • Implement graceful error handling to ensure your application remains responsive and doesn't crash.

Using dotenv

Environment variables are a common way to store sensitive or configuration-specific data, such as API keys, database credentials, or feature flags. The python-dotenv library makes it easy to manage these variables:

#.env
API_KEY=xxxxxxx
DATABASE_URL=xxxxxxxxx
DB_PASSWORD=xxxxxxxx
DB_NAME=xxxxxx

#main.py
from dotenv import load_dotenv
import os

# Load environment variables from a .env file
load_dotenv()

# Access environment variables
api_key = os.getenv('API_KEY')
database_url = os.getenv('DATABASE_URL')
Enter fullscreen mode Exit fullscreen mode

Best Practices:

  • Store environment variables in a .env file, which should be excluded from version control.
  • Use a .env.example file to document the required environment variables.
  • Load environment variables at the start of your application, before accessing them.
  • Provide default values or raise clear errors if required environment variables are missing.
  • Use environment variables for sensitive data, not for general configuration.
  • Organize environment variables by context (e.g., DATABASE_URL, AWS_ACCESS_KEY).

Argparse

The argparse module in Python allows you to easily handle command-line arguments and options. Here's an example:

import argparse

parser = argparse.ArgumentParser(description='My Python Application')
parser.add_argument('-n', '--name', type=str, required=True, help='Name of the user')
parser.add_argument('-a', '--age', type=int, default=30, help='Age of the user')
parser.add_argument('-v', '--verbose', action='store_true', help='Enable verbose output')

args = parser.parse_args()

print(f'Name: {args.name}')
print(f'Age: {args.age}')
if args.verbose:
    logging.info('Verbose mode enabled.')
Enter fullscreen mode Exit fullscreen mode

Best Practices:

  • Use descriptive and concise argument names.
  • Provide helpful descriptions for each argument.
  • Specify the appropriate data types for arguments (e.g., type=str, type=int).
  • Set required=True for mandatory arguments.
  • Provide default values for optional arguments.
  • Use boolean flags (e.g., action='store_true') for toggles or switches.
  • Integrate argument parsing with your application's logging and error handling.

Putting It All Together

Here's a combined code snippet that demonstrates the usage of all the functionalities covered in this guide:

import logging
from dotenv import load_dotenv
import os
import argparse

# Configure logging
logging.basicConfig(
    format='%(asctime)s - %(levelname)s - %(message)s',
    level=logging.INFO
)

# Load environment variables
load_dotenv()
api_key = os.getenv('API_KEY')
database_url = os.getenv('DATABASE_URL')

# Define command-line arguments
parser = argparse.ArgumentParser(description='My Python Application')
parser.add_argument('-n', '--name', type=str, required=True, help='Name of the user')
parser.add_argument('-a', '--age', type=int, default=30, help='Age of the user')
parser.add_argument('-v', '--verbose', action='store_true', help='Enable verbose output')
args = parser.parse_args()

# Example function with exception handling
def divide_numbers(a, b):
    try:
        result = a / b
        return result
    except ZeroDivisionError:
        logging.error('Error: Division by zero')
        return None
    except TypeError:
        logging.error('Error: Invalid input types')
        return None
    except Exception as e:
        logging.error(f'Unexpected error: {e}')
        return None

# Example usage
if __:
    name = args.name
    age = args.age
    print(f'Name: {name}')
    print(f'Age: {age}')

    if args.verbose:
        logging.info('Verbose mode enabled.')
        logging.info(f'API Key: {api_key}')
        logging.info(f'Database URL: {database_url}')

    result = divide_numbers(10, 2)
    if result is not None:
        print(f'Division result: {result}')
Enter fullscreen mode Exit fullscreen mode

This code demonstrates the integration of logging, exception handling, environment variable management, and argument parsing. It includes best practices for each functionality, such as using appropriate log levels, catching specific exceptions, securely accessing environment variables, and providing helpful command-line argument descriptions.

Top comments (0)