[libc-commits] [libc] [libc][wctype][codegen] Add generation script for conversion data (PR #170868)
Muhammad Bassiouni via libc-commits
libc-commits at lists.llvm.org
Fri Dec 5 12:17:10 PST 2025
================
@@ -0,0 +1,28 @@
+#!/usr/bin/env python3
+#
+# ===- Fetch files necessary for wctype generator ------------*- python -*--==#
+#
+# Part of the LLVM Project, under the Apache License v2.0 with LLVM Exceptions.
+# See https://llvm.org/LICENSE.txt for license information.
+# SPDX-License-Identifier: Apache-2.0 WITH LLVM-exception
+#
+# ==------------------------------------------------------------------------==#
+#
+# This file is meant to be run manually by maintainers to fetch the latest
+# unicode data files from unicode.org necessary for generating wctype data.
+# All rights to the data belong to unicode.org.
+
+from urllib.request import urlretrieve
+
+
+def fetch_unicode_data_files(
+ llvm_project_root_path: str,
+ files=["UnicodeData.txt"],
+ base_url="https://www.unicode.org/Public/UCD/latest/ucd/",
----------------
bassiounix wrote:
Would it really matter? I mean we would only fetch new files version every new release once, which could be 6 months for example. I think we then would need to get the latest version always, not go to the code, modify the version, then re-fetch. Just one fetch every new release for the latest version.
https://github.com/llvm/llvm-project/pull/170868
More information about the libc-commits
mailing list