Filtering out 4-byte (utf8mb4) characters in Python

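    def replace_utf8mb4(self, v):
        """Replace 4-byte unicode characters by REPLACEMENT CHARACTER"""
        import re
        # Match anything outside U+0000-U+D7FF and U+E000-U+FFFF: characters above
        # U+FFFF (4 bytes in UTF-8) and the surrogate range U+D800-U+DFFF.
        INVALID_UTF8_RE = re.compile(u'[^\u0000-\uD7FF\uE000-\uFFFF]', re.UNICODE)
        # sub() returns a new string rather than modifying v, so return the result.
        return INVALID_UTF8_RE.sub(u'\uFFFD', v)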
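
The same idea can be sketched as a standalone function, assuming Python 3 (the original is a method on an unnamed class). The names `NON_BMP_RE` and `strip_non_bmp` and the `replacement` parameter are hypothetical, chosen only for illustration. Compiling the pattern once at module level avoids rebuilding it on every call.

```python
import re

# Characters above U+FFFF need 4 bytes in UTF-8, which MySQL's 3-byte
# "utf8" charset cannot store. Lone surrogates (U+D800-U+DFFF) match too.
NON_BMP_RE = re.compile(r'[^\u0000-\uD7FF\uE000-\uFFFF]')


def strip_non_bmp(text, replacement='\uFFFD'):
    """Return text with every character outside the BMP replaced."""
    return NON_BMP_RE.sub(replacement, text)


# Usage: the emoji U+1F600 is replaced, other characters are untouched.
cleaned = strip_non_bmp('hi \U0001F600')
# cleaned == 'hi \ufffd'
```

Passing `replacement=''` drops the characters instead of substituting U+FFFD, which may be preferable when the marker itself is unwanted in stored data.
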
Original article: https://www.cnblogs.com/sullian/p/3492430.html