SDK Style Guide

This document outlines the coding and documentation standards for our SDK. Our goal is to ensure a consistent, clear, and maintainable codebase. We adhere to established Python standards (PEP 8 and PEP 257) and specifically utilize spaCy’s docstring style.

General Coding Standards

PEP 8 Compliance:
Follow PEP 8 for overall code style, including naming conventions, line lengths, and whitespace usage.
Readable and Maintainable Code:
Write clear, self-explanatory code. Refactor and comment where necessary to improve clarity.
Consistent Formatting:
Use our style guide and formatting tools (e.g., linters, formatters) to maintain consistency across the codebase.

Docstring Conventions

We follow spaCy’s docstring style to ensure our documentation is clear and consistent.

Overview

Triple Double Quotes:
Use """ for all docstrings.
Placement:
Place the docstring immediately after the function, method, class, or module definition.
Imperative Mood:
Write summaries in the imperative mood (e.g., "Return", "Process", "Compute").

Structure

One-Line Summary:
- A concise description of what the function/class/module does.
Blank Line:
- Insert a blank line after the summary if the docstring contains additional detail.
Extended Description:
- Provide any extra information necessary to understand the code’s purpose or behavior.
Parameter Section (Args):
- List each parameter, its type, and a brief description.
Returns Section:
- Describe the return type and the meaning of the returned value.
Raises Section (if applicable):
- List any exceptions that might be raised by the function.

Example

Below is an example demonstrating the spaCy docstring style at the method level:

def process_text(text: str) -> Doc:
    """
    Process a text string and return a spaCy Doc object.

    This function tokenizes and annotates the input text using spaCy's NLP pipeline.

    Args:
        text (str): The input text to be processed.

    Returns:
        Doc: The processed spaCy Doc object.

    Raises:
        ValueError: If the input text is empty.
    """
    if not text:
        raise ValueError("Input text must not be empty.")
    # Processing code goes here

Below is an example demonstrating the spaCy docstring style at the module level:

"""
text_utils.py

This module provides utilities for processing text data. It leverages spaCy's NLP
pipeline to tokenize, annotate, and analyze text, offering helper classes and functions
to simplify common text processing tasks.
"""

import spacy

Below is an example demonstrating spaCy docstring style at the class level:

class TextProcessor:
    """
    A class for processing and analyzing text data using spaCy.

    This class provides methods to tokenize, lemmatize, and extract named entities from text.
    It utilizes spaCy's NLP pipeline to annotate the text and offers helper methods for
    common text processing tasks.

    Attributes:
        nlp (spacy.language.Language): The spaCy NLP model used for processing text.
    """

    def __init__(self, model: str = "en_core_web_sm"):
        """
        Initialize the TextProcessor with the specified spaCy model.

        Args:
            model (str): The name of the spaCy model to load (default is "en_core_web_sm").
        """
        self.nlp = spacy.load(model)

    def process(self, text: str):
        """
        Process the input text and return the annotated spaCy Doc object.

        Args:
            text (str): The input text to process.

        Returns:
            spacy.tokens.Doc: The processed document with tokens, entities, and annotations.

        Raises:
            ValueError: If the input text is empty.
        """
        if not text:
            raise ValueError("Input text must not be empty.")
        return self.nlp(text)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

STYLE_GUIDE.md

STYLE_GUIDE.md

SDK Style Guide

Table of Contents

General Coding Standards

Docstring Conventions

Overview

Structure

Example

Files

STYLE_GUIDE.md

Latest commit

History

STYLE_GUIDE.md

File metadata and controls

SDK Style Guide

Table of Contents

General Coding Standards

Docstring Conventions

Overview

Structure

Example