Splitting a Large Class and Multiple Inheritance in Python

When I started refactoring EFB Telegram Master Channel (ETM) for 2.0 updates, I was investigating ways to organize code into different files in a decent manner. In this article I’d like to talk about the strategy I used, comparing to another codebase I was reading back then, itchat.

In ETM version 1, most of the code is written the heavy and ugly 1675-line-long __init__.py. As more features planned to be added to ETM, it was really hard for me to navigate through the code, which have brought up my need of refactoring this huge thing.

Back then (which, surprisingly, was over 2 years ago), the main reference I had on a large enough project was itchat. Their code structure hasn’t been changing much since then. itchat did have a reasonably large code repository, but the way it splits its functions is rather unideal.

The way itchat did to have all functioned defined at root level of each file, and have a loader function that “loads” these methods to an object called core which contains some configuration data. To the Python interpreter, this method indeed works, thanks to its dynamic typing. But this looks really bad when you were trying to work with the code, as IDE usually can’t give any hint with objects defined in this way. That also happens when you try to work on the library itself, despite every function starts with a self in their arguments.

Then I went on looking for other common practices on breaking down a large class, some suggested importing functions inside a function, other using multiple inheritance. ^[Ref.] The former is not much different from what itchat was doing, and the latter looked promising at the beginning. I went on to do some experiment with multiple inheritance, and found that it does provide better autocomplete with IDE, but only in the main class. I can’t see one subclass from another one in the IDE. That is still reasonable as all those subclasses only comes together in the main class, they are not aware of each other.

core.pycomponents/__init__.pycomponents/component_1.pycomponents/component_2.py


from .components import load_components
class Core:
def method_1(self, param_1, param_2, param_3):
"""Doc string goes here."""
raise NotImplementedError()
def method_2(self):
    &quot;&quot;&quot;Doc string goes here.&quot;&quot;&quot;
    raise NotImplementedError()

def method_3(self, param_1):
    &quot;&quot;&quot;Doc string goes here.&quot;&quot;&quot;
    raise NotImplementedError()

load_components(Core)


from .component_1 import load_component_1
from .component_2 import load_component_2
def load_components(core):

load_component_1(core)

load_component_2(core)


def load_contact(core):
    core.method_1 = method_1
    core.method_2 = method_2
def method_1(self, param_1, param_2, param_3):
# Actual implementation
...
def method_2(self):

# Actual implementation

...


def load_contact(core):
    core.method_3 = method_3
def method_3(self, param_1):

# Actual implementation

...

I thought to myself, why can’t I just make some more classes and let them reference each other? Turns out that worked pretty well for me. I split my functions into several different “manager” classes, each of which is initialized with a reference to the main class. These classes are instantiated in topological order such that classes being referred to by others are created earlier. In ETM, the classes that are being referred to are usually those data providers utilities, namely ExperimentalFlagsManager, DatabaseManager, and TelegramBotManager.

__init__.pyflags.pydb.pychat_binding.py


from .flags import ExperimentalFlagsManager
from .db import DatabaseManager
from .chat_binding import ChatBindingManager
class TelegramChannel():

def init(self):

self.flags: ExperimentalFlagsManager = ExperimentalFlagsManager(self)

self.db: DatabaseManager = DatabaseManager(self)

self.chat_binding: ChatBindingManager = ChatBindingManager(self)


from typing import TYPE_CHECKING
if TYPE_CHECKING:
# Avoid cycle import for type checking
from . import TelegramChannel
class ExperimentalFlagsManager:

def init(channel: 'TelegramChannel'):

self.channel = channel

...


from typing import TYPE_CHECKING
from .flags import ExperimentalFlagsManager
if TYPE_CHECKING:
# Avoid cycle import for type checking
from . import TelegramChannel
class DatabaseManager:

def init(channel: 'TelegramChannel'):

self.channel: 'TelegramChannel' = channel

self.flags: ExperimentalFlagsManager = channel.flags

...


from typing import TYPE_CHECKING
from .chat_binding import ChatBindingManager
from .db import DatabaseManager
if TYPE_CHECKING:
# Avoid cycle import for type checking
from . import TelegramChannel
class ChatBindingManager:

def init(channel: 'TelegramChannel'):

self.channel: 'TelegramChannel' = channel

self.flags: ExperimentalFlagsManager = channel.flags

self.db: DatabaseManager = channel.db

...

While going on refactoring ETM, I learnt that multiple inheritance in Python is also used in another way – mixins. Mixins are classes that are useful when you want to add a set of features to many other classes. This has enlightened me when I was trying to deal with constantly adding references of the gettext translator in all manager classes.

I added a mixin called LocaleMixin that extracts the translator functions (gettext and ngettext) from the main class reference (assuming they are guaranteed to be there), and assign a local property that reflects these methods.


class LocaleMixin:
    channel: 'TelegramChannel'

    @property
    def _(self):
        return self.channel.gettext

    @property
    def ngettext(self):
        return self.channel.ngettext

When the mixin classes is added to the list of inherited classes, the IDE can properly recognise these helper properties, and their definitions are consolidated in the same place. I find it more organised that the previous style.

In the end, I find that simply creating classes for each component of my code turns out to be the most organised, and IDE-friendly way to breakdown a large class, and mixins are helpful to make references or helper functions available to multiple classes.

函数

函数就像「代码的魔法工具箱」，把常用的功能打包起来，随用随取。让我们用做奶茶的比喻来理解它~ ‍ 一、函数是什么？想象你开奶茶店：原料（水果、牛奶）→ 输入参数制作流程 → 函数内部的代码成品奶茶 → 返回值代码示例： # 定义「做奶茶」函数 def make_milk_tea(tea_base, toppi ..

控制流

控制流就像「程序的交通指挥官」，它决定代码该走哪条路、重复做什么事。让我们用最生活化的方式理解它~ ‍ 一、控制流是什么？想象你每天出门前：如果下雨 → 带伞（条件判断）重复刷牙 1 分钟 → 直到刷干净（循环）这就是生活中的控制流！编程中也一样 ‍ 二、条件判断：如果...就... 1️⃣ 最简单的 i ..

常用数据结构

数据结构就像「收纳数据的各种容器」️，不同的容器适合存放不同类型的数据。让我们用最生活化的方式认识它们吧~ ‍ 一、列表（List）→ 购物车特点：有顺序的容器可以随时增删改用方括号 [] 表示 # 创建购物车 cart = ['苹果', '笔记本', '️铅笔'] # 常用操作 cart.append('咖啡 ..

变量与数据类型

一、变量：就像贴标签的小盒子比喻：想象你有一个小盒子，上面贴着「零食盒」的标签，里面装了饼干。在编程中：变量名 = 盒子的标签（比如 my_snack）数据 = 盒子里的东西（比如 '饼干'）代码例子： # 把'饼干'放进叫my_snack的盒子里 my_snack = '饼干' # 查看盒子里有什么 prin ..

配置虚拟环境

虚拟环境管理（venv/pipenv/virtualenv/conda）为什么需要虚拟环境？隔离项目依赖：不同项目可能需要不同版本的 Python 或第三方库避免全局污染：防止系统 Python 环境被意外修改依赖可重现：方便团队协作和部署 1. venv（Python 内置，一般使用这个就够了，其他的知道有就 ..

Python 入门

1. 环境搭建 1.1 安装 Python 官网下载：https://www.python.org/downloads/ [图片] 下载完成以后打开 exe 文件，一定要勾选**Add Python to PATH**，点击“Install Now”开始安装就行。安装完成后，按下 Win + R 组合键，打开“运行” ..

欢迎来到这里！

我们正在构建一个小众社区，大家在这里相互信任，以平等 • 自由 • 奔放的价值观进行分享交流。最终，希望大家能够找到与自己志同道合的伙伴，共同成长。

关于

Splitting a Large Class and Multiple Inheritance in Python

相关帖子

函数

控制流

常用数据结构

变量与数据类型

配置虚拟环境

认识开发工具

Python 入门

欢迎来到这里！

近期热议

推荐标签标签

组织简介

用爱发电组织的核心驱动力：

最新标签

Splitting a Large Class and Multiple Inheritance in Python

相关帖子

函数

控制流

常用数据结构

变量与数据类型

配置虚拟环境

认识开发工具

Python 入门

欢迎来到这里！

近期热议

推荐标签 标签

组织简介

用爱发电组织的核心驱动力：

最新标签

推荐标签标签