diff --git a/.fmf/version b/.fmf/version
new file mode 100644
index 0000000..d00491f
--- /dev/null
+++ b/.fmf/version
@@ -0,0 +1 @@
+1
diff --git a/Makefile b/Makefile
new file mode 100644
index 0000000..fc567de
--- /dev/null
+++ b/Makefile
@@ -0,0 +1,4 @@
+update-sources:
+	spectool -g -S -f unicode-ucd.spec
+	dos2unix copyright.html
+	fedpkg new-sources UCD.zip Unihan.zip
diff --git a/copyright.html b/copyright.html
deleted file mode 100644
index ae3646f..0000000
--- a/copyright.html
+++ /dev/null
@@ -1,339 +0,0 @@
-Unicode Terms of Use
-Unicode® Terms of Use
-
-For the general privacy policy governing access to this site, see
-the Unicode Privacy Policy. For trademark usage, see
-the Unicode® Consortium Name and Trademark Usage Policy.
-
-A. Unicode Copyright.
-   1. Copyright © 1991-2018 Unicode, Inc. All rights reserved.
-   2. Certain documents and files on this website contain a legend
-      indicating that "Modification is permitted." Any person is
-      hereby authorized, without fee, to modify such documents and
-      files to create derivative works conforming to the Unicode®
-      Standard, subject to Terms and Conditions herein.
-   3. Any person is hereby authorized, without fee, to view, use,
-      reproduce, and distribute all documents and files, subject to the Terms and Conditions
-      herein.
-   4. Further specifications of rights and restrictions pertaining
-      to the use of the particular set of data files known as the
-      "Unicode Character Database" can be found in the
-      License.
-   5. Each version of the Unicode Standard has further
-      specifications of rights and restrictions of use. For the book
-      editions (Unicode 5.0 and earlier), these are found on the back
-      of the title page.
-      The online code charts carry specific restrictions. All
-      other files, including online documentation of the core specification
-      for Unicode 6.0 and later, are covered under these general Terms of
-      Use.
-   6. No license is granted to "mirror" the Unicode website where
-      a fee is charged for access to the "mirror" site.
-   7. Modification is not permitted with respect to this document.
-      All copies of this document must be verbatim.
-B. Restricted Rights Legend. Any technical
-   data or software which is licensed to the United States of
-   America, its agencies and/or instrumentalities under this
-   Agreement is commercial technical data or commercial computer
-   software developed exclusively at private expense as defined in
-   FAR 2.101, or DFARS 252.227-7014 (June 1995), as applicable. For
-   technical data, use, duplication, or disclosure by the Government
-   is subject to restrictions as set forth in DFARS 202.227-7015
-   Technical Data, Commercial and Items (Nov 1995) and this
-   Agreement. For Software, in accordance with FAR 12-212 or DFARS
-   227-7202, as applicable, use, duplication or disclosure by the
-   Government is subject to the restrictions set forth in this
-   Agreement.
-C. Warranties and Disclaimers.
-   1. This publication and/or website may include technical or
-      typographical errors or other inaccuracies. Changes are
-      periodically added to the information herein; these changes will
-      be incorporated in new editions of the publication and/or
-      website. Unicode may make improvements and/or changes in the
-      product(s) and/or program(s) described in this publication
-      and/or website at any time.
-   2. If this file has been purchased on magnetic or optical media
-      from Unicode, Inc. the sole and exclusive remedy for any claim
-      will be exchange of the defective media within ninety (90) days
-      of original purchase.
-   3. EXCEPT AS PROVIDED IN SECTION C.2, THIS PUBLICATION AND/OR
-      SOFTWARE IS PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND EITHER
-      EXPRESS, IMPLIED, OR STATUTORY, INCLUDING, BUT NOT LIMITED TO,
-      ANY WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR
-      PURPOSE, OR NON-INFRINGEMENT. UNICODE AND ITS LICENSORS ASSUME
-      NO RESPONSIBILITY FOR ERRORS OR OMISSIONS IN THIS PUBLICATION
-      AND/OR SOFTWARE OR OTHER DOCUMENTS WHICH ARE REFERENCED BY OR
-      LINKED TO THIS PUBLICATION OR THE UNICODE WEBSITE.
-D. Waiver of Damages. In no event shall
-   Unicode or its licensors be liable for any special, incidental,
-   indirect or consequential damages of any kind, or any damages
-   whatsoever, whether or not Unicode was advised of the possibility
-   of the damage, including, without limitation, those resulting from
-   the following: loss of use, data or profits, in connection with
-   the use, modification or distribution of this information or its
-   derivatives.
-E. Trademarks & Logos.
-   1. The Unicode Word Mark and the Unicode Logo are
-      trademarks of Unicode, Inc. “The Unicode Consortium” and “Unicode,
-      Inc.” are trade names of Unicode, Inc. Use of the information and
-      materials found on this website indicates your acknowledgement of
-      Unicode, Inc.’s exclusive worldwide rights in the Unicode Word Mark, the
-      Unicode Logo, and the Unicode trade names.
-   2. The Unicode Consortium Name and Trademark Usage Policy
-      (“Trademark Policy”) are incorporated herein by reference and you agree
-      to abide by the provisions of the Trademark Policy, which may be
-      changed from time to time in the sole discretion of Unicode, Inc.
-   3. All third party trademarks referenced herein are the property of their respective owners.
-F. Miscellaneous.
-   1. Jurisdiction and Venue. This server is operated from
-      a location in the State of California, United States of America.
-      Unicode makes no representation that the materials are
-      appropriate for use in other locations. If you access this
-      server from other locations, you are responsible for compliance
-      with local laws. This Agreement, all use of this site and any
-      claims and damages resulting from use of this site are governed
-      solely by the laws of the State of California without regard to
-      any principles which would apply the laws of a different
-      jurisdiction. The user agrees that any disputes regarding this
-      site shall be resolved solely in the courts located in Santa
-      Clara County, California. The user agrees said courts have
-      personal jurisdiction and agree to waive any right to transfer
-      the dispute to any other forum.
-   2. Modification by Unicode Unicode shall have the right
-      to modify this Agreement at any time by posting it to this site.
-      The user may not assign any part of this Agreement without
-      Unicode’s prior written consent.
-   3. Taxes. The user agrees to pay any taxes arising from
-      access to this website or use of the information herein, except
-      for those based on Unicode’s net income.
-   4. Severability. If any provision of this
-      Agreement is declared invalid or unenforceable, the remaining
-      provisions of this Agreement shall remain in effect.
-   5. Entire Agreement. This Agreement constitutes the
-      entire agreement between the parties.
-
-EXHIBIT 1
-UNICODE, INC. LICENSE AGREEMENT - DATA FILES AND SOFTWARE
-
-Unicode Data Files include all data files under the directories
-http://www.unicode.org/Public/, http://www.unicode.org/reports/,
-http://www.unicode.org/cldr/data/, http://source.icu-project.org/repos/icu/, and
-http://www.unicode.org/utility/trac/browser/.
-
-Unicode Data Files do not include PDF online code charts under the
-directory http://www.unicode.org/Public/.
-
-Software includes any source code published in the Unicode Standard
-or under the directories
-http://www.unicode.org/Public/, http://www.unicode.org/reports/,
-http://www.unicode.org/cldr/data/, http://source.icu-project.org/repos/icu/, and
-http://www.unicode.org/utility/trac/browser/.
-
-NOTICE TO USER: Carefully read the following legal agreement.
-BY DOWNLOADING, INSTALLING, COPYING OR OTHERWISE USING UNICODE INC.'S
-DATA FILES ("DATA FILES"), AND/OR SOFTWARE ("SOFTWARE"),
-YOU UNEQUIVOCALLY ACCEPT, AND AGREE TO BE BOUND BY, ALL OF THE
-TERMS AND CONDITIONS OF THIS AGREEMENT.
-IF YOU DO NOT AGREE, DO NOT DOWNLOAD, INSTALL, COPY, DISTRIBUTE OR USE
-THE DATA FILES OR SOFTWARE.
-
-COPYRIGHT AND PERMISSION NOTICE
-
-Copyright © 1991-2018 Unicode, Inc. All rights reserved.
-Distributed under the Terms of Use in http://www.unicode.org/copyright.html.
-
-Permission is hereby granted, free of charge, to any person obtaining
-a copy of the Unicode data files and any associated documentation
-(the "Data Files") or Unicode software and any associated documentation
-(the "Software") to deal in the Data Files or Software
-without restriction, including without limitation the rights to use,
-copy, modify, merge, publish, distribute, and/or sell copies of
-the Data Files or Software, and to permit persons to whom the Data Files
-or Software are furnished to do so, provided that either
-(a) this copyright and permission notice appear with all copies
-of the Data Files or Software, or
-(b) this copyright and permission notice appear in associated
-Documentation.
-
-THE DATA FILES AND SOFTWARE ARE PROVIDED "AS IS", WITHOUT WARRANTY OF
-ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE
-WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND
-NONINFRINGEMENT OF THIRD PARTY RIGHTS.
-IN NO EVENT SHALL THE COPYRIGHT HOLDER OR HOLDERS INCLUDED IN THIS
-NOTICE BE LIABLE FOR ANY CLAIM, OR ANY SPECIAL INDIRECT OR CONSEQUENTIAL
-DAMAGES, OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE,
-DATA OR PROFITS, WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER
-TORTIOUS ACTION, ARISING OUT OF OR IN CONNECTION WITH THE USE OR
-PERFORMANCE OF THE DATA FILES OR SOFTWARE.
-
-Except as contained in this notice, the name of a copyright holder
-shall not be used in advertising or otherwise to promote the sale,
-use or other dealings in these Data Files or Software without prior
-written authorization of the copyright holder.
-
-
-Access to Copyright and terms of use
-Last updated: 2018/1/6 5:55:11
-
\ No newline at end of file
diff --git a/license.txt b/license.txt
new file mode 100644
index 0000000..ee8e69b
--- /dev/null
+++ b/license.txt
@@ -0,0 +1,39 @@
+UNICODE LICENSE V3
+
+COPYRIGHT AND PERMISSION NOTICE
+
+Copyright © 1991-2024 Unicode, Inc.
+
+NOTICE TO USER: Carefully read the following legal agreement. BY
+DOWNLOADING, INSTALLING, COPYING OR OTHERWISE USING DATA FILES, AND/OR
+SOFTWARE, YOU UNEQUIVOCALLY ACCEPT, AND AGREE TO BE BOUND BY, ALL OF THE
+TERMS AND CONDITIONS OF THIS AGREEMENT. IF YOU DO NOT AGREE, DO NOT
+DOWNLOAD, INSTALL, COPY, DISTRIBUTE OR USE THE DATA FILES OR SOFTWARE.
+
+Permission is hereby granted, free of charge, to any person obtaining a
+copy of data files and any associated documentation (the "Data Files") or
+software and any associated documentation (the "Software") to deal in the
+Data Files or Software without restriction, including without limitation
+the rights to use, copy, modify, merge, publish, distribute, and/or sell
+copies of the Data Files or Software, and to permit persons to whom the
+Data Files or Software are furnished to do so, provided that either (a)
+this copyright and permission notice appear with all copies of the Data
+Files or Software, or (b) this copyright and permission notice appear in
+associated Documentation.
+
+THE DATA FILES AND SOFTWARE ARE PROVIDED "AS IS", WITHOUT WARRANTY OF ANY
+KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF
+MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT OF
+THIRD PARTY RIGHTS.
+
+IN NO EVENT SHALL THE COPYRIGHT HOLDER OR HOLDERS INCLUDED IN THIS NOTICE
+BE LIABLE FOR ANY CLAIM, OR ANY SPECIAL INDIRECT OR CONSEQUENTIAL DAMAGES,
+OR ANY DAMAGES WHATSOEVER RESULTING FROM LOSS OF USE, DATA OR PROFITS,
+WHETHER IN AN ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION,
+ARISING OUT OF OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THE DATA
+FILES OR SOFTWARE.
+
+Except as contained in this notice, the name of a copyright holder shall
+not be used in advertising or otherwise to promote the sale, use or other
+dealings in these Data Files or Software without prior written
+authorization of the copyright holder.
diff --git a/plans/parse.fmf b/plans/parse.fmf
new file mode 100644
index 0000000..c1627f9
--- /dev/null
+++ b/plans/parse.fmf
@@ -0,0 +1,5 @@
+summary: Basic smoke test
+discover:
+    how: fmf
+execute:
+    how: tmt
diff --git a/sources b/sources
index 86a2e98..99f4856 100644
--- a/sources
+++ b/sources
@@ -1,2 +1,2 @@
-SHA512 (UCD.zip) = 954a499ff072727c8778e711fb1753834adf09277c9e9a2592c02b59971860f9a407c7e9985ed1e6bdda7a3cf3b1cd6316599c4bffe8b0625413f874486830c1
-SHA512 (Unihan.zip) = a045e79545f1ec3529a8556d41c73b20809979e002f98314068c6f91daa79abb2ab3ab383090651b55fb55e343d8fcf67bea822d93d6e3221a206d58ef9e3a7b
+SHA512 (UCD.zip) = 34bf4f3760b72f34d31b9135820458a92efc01be4e068446031849ba8e36b9e30adecb2b7721239441cc026e70a33ad780a753464a6a635cea222c34767755d6
+SHA512 (Unihan.zip) = 8b0d117fed39600c62864dd0c955b2fe0a52cfc975288fcf58cb770aaf476355f52b2a603e11330184a07d89c705c0fea55eeb9a044c8f1ba3ba79f5f32d0b33
diff --git a/tests/parse/main.fmf b/tests/parse/main.fmf
new file mode 100644
index 0000000..f972409
--- /dev/null
+++ b/tests/parse/main.fmf
@@ -0,0 +1,6 @@
+summary: parse unicode data
+require:
+- unicode-ucd
+- python3
+test: python3 test.py
+framework: shell
diff --git a/tests/parse/test.py b/tests/parse/test.py
new file mode 100644
index 0000000..cfae90c
--- /dev/null
+++ b/tests/parse/test.py
@@ -0,0 +1,16 @@
+#!/usr/bin/env python3
+
+codepoints = 0
+
+file = '/usr/share/unicode/ucd/UnicodeData.txt'
+
+with open(file, mode='rt', encoding='ascii') as unicode_data:
+    for line in unicode_data.readlines():
+        codepoint_string, name, category = line.split(';')[:3]
+        codepoint = int(codepoint_string, 16)
+        char = chr(codepoint)
+        codepoints = codepoints + 1
+
+print(codepoints)
+# Unicode 17
+assert(codepoints == 40575)
diff --git a/unicode-ucd.spec b/unicode-ucd.spec
index 83d5e10..b9655ac 100644
--- a/unicode-ucd.spec
+++ b/unicode-ucd.spec
@@ -3,17 +3,20 @@
 %global ucddir %{unicodedir}/ucd
 
 Name: unicode-ucd
-Version: 11.0.0
+Version: 17.0.0
 Release: 1%{?dist}
 Summary: Unicode Character Database
 
-# https://fedoraproject.org/wiki/Licensing/MIT#Modern_Style_without_sublicense_.28Unicode.29
-License: MIT
+# http://www.unicode.org/terms_of_use.html in ReadMe.txt redirects to:
+# http://www.unicode.org/copyright.html
+# which links to https://www.unicode.org/license.txt
+# https://github.com/spdx/license-list-XML/issues/2105
+License: Unicode-3.0
 URL: http://www.unicode.org/ucd/
-Source0: http://www.unicode.org/Public/zipped/%{version}/UCD.zip
-# http://www.unicode.org/terms_of_use.html referenced in ReadMe.txt redirects to:
-Source1: http://www.unicode.org/copyright.html
-Source2: http://www.unicode.org/Public/zipped/%{version}/Unihan.zip
+# update with "fbrnch update-sources -f"
+Source0: https://www.unicode.org/Public/%{version}/ucd/UCD.zip
+Source1: https://www.unicode.org/Public/%{version}/ucd/Unihan.zip
+Source2: https://www.unicode.org/license.txt
 BuildArch: noarch
 
 %description
@@ -45,13 +48,12 @@ grep -q "%{version}" ReadMe.txt || (echo "zip file seems not %{version}" ; exit
 %install
 mkdir -p %{buildroot}%{ucddir}
 cp -ar . %{buildroot}%{ucddir}
-cp -p %{SOURCE2} %{buildroot}%{ucddir}
-
-cp -p %{SOURCE1} .
+cp -p %{SOURCE1} %{buildroot}%{ucddir}
+cp %{SOURCE2} .
 
 
 %files
-%doc copyright.html
+%license license.txt
 %dir %{unicodedir}
 %{ucddir}
 %exclude %{ucddir}/Unihan.zip
@@ -61,6 +63,93 @@ cp -p %{SOURCE1} .
 %changelog
+* Wed Sep 10 2025 Jens Petersen - 17.0.0-1
+- https://www.unicode.org/versions/Unicode17.0.0/
+- https://blog.unicode.org/2025/09/unicode-170-release-announcement.html
+
+* Fri Jul 25 2025 Fedora Release Engineering - 16.0.0-3
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_43_Mass_Rebuild
+
+* Sun Jan 19 2025 Fedora Release Engineering - 16.0.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_42_Mass_Rebuild
+
+* Thu Sep 19 2024 Jens Petersen - 16.0.0-1
+- update for https://unicode.org/versions/Unicode16.0.0/
+
+* Sat Jul 20 2024 Fedora Release Engineering - 15.1.0-4
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_41_Mass_Rebuild
+
+* Mon Jul 15 2024 Parag Nemade - 15.1.0-3
+- Update the license tag to Unicode-3.0
+
+* Sat Jan 27 2024 Fedora Release Engineering - 15.1.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_40_Mass_Rebuild
+
+* Sat Sep 16 2023 Jens Petersen - 15.1.0-1
+- Unicode 15.1 released
+
+* Thu Aug 31 2023 Jens Petersen - 15.1.0-0.2
+- add license.txt
+
+* Thu Aug 31 2023 Jens Petersen - 15.1.0-0.1
+- update to draft 15.1.0: https://unicode.org/versions/Unicode15.1.0/
+- do not add copyright.html file from website
+- add missing Unicode-TOU license tag
+
+* Mon Aug 21 2023 Parag Nemade - 15.0.0-4
+- Migrate to SPDX license expression
+
+* Sat Jul 22 2023 Fedora Release Engineering - 15.0.0-3
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_39_Mass_Rebuild
+
+* Sat Jan 21 2023 Fedora Release Engineering - 15.0.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_38_Mass_Rebuild
+
+* Wed Sep 14 2022 Jens Petersen - 15.0.0-1
+- https://www.unicode.org/versions/Unicode15.0.0/ (#2126234)
+- http://blog.unicode.org/2022/09/announcing-unicode-standard-version-150.html
+
+* Sat Jul 23 2022 Fedora Release Engineering - 14.0.0-3
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_37_Mass_Rebuild
+
+* Sat Jan 22 2022 Fedora Release Engineering - 14.0.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_36_Mass_Rebuild
+
+* Mon Sep 20 2021 Jens Petersen - 14.0.0-1
+- update to Unicode 14
+- https://www.unicode.org/versions/Unicode14.0.0/
+
+* Fri Jul 23 2021 Fedora Release Engineering - 13.0.0-4
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_35_Mass_Rebuild
+
+* Wed Jan 27 2021 Fedora Release Engineering - 13.0.0-3
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_34_Mass_Rebuild
+
+* Wed Jul 29 2020 Fedora Release Engineering - 13.0.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_33_Mass_Rebuild
+
+* Wed Mar 11 2020 Jens Petersen - 13.0.0-1
+- update to Unicode 13
+- https://www.unicode.org/versions/Unicode13.0.0/
+
+* Fri Jan 31 2020 Fedora Release Engineering - 12.1.0-3
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_32_Mass_Rebuild
+
+* Sat Jul 27 2019 Fedora Release Engineering - 12.1.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_31_Mass_Rebuild
+
+* Fri May 10 2019 fedora-toolbox - 12.1.0-1
+- update to Unicode 12.1
+
+* Fri Mar 8 2019 Jens Petersen - 12.0.0-1
+- update to Unicode 12
+
+* Sun Feb 03 2019 Fedora Release Engineering - 11.0.0-3
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_30_Mass_Rebuild
+
+* Sat Jul 14 2018 Fedora Release Engineering - 11.0.0-2
+- Rebuilt for https://fedoraproject.org/wiki/Fedora_29_Mass_Rebuild
+
 * Thu Jun 21 2018 Jens Petersen - 11.0.0-1
 - update to Unicode 11
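
Review note on tests/parse/test.py: the smoke test counts one record per line of UnicodeData.txt, so 40575 is a line count, not a count of assigned code points (large ranges appear in that file as a pair of `First`/`Last` lines). A minimal sketch of the same per-line check, pulled into a function so it can be tried without the package installed; the name `count_records` and the three-line `SAMPLE` excerpt are hypothetical and not part of this patch:

```python
import io


def count_records(lines):
    """Count UnicodeData.txt-style records, checking that each code point parses."""
    records = 0
    for line in lines:
        if not line.strip():
            continue  # skip blank lines defensively
        # Same field split as tests/parse/test.py: code point, name, general category.
        codepoint_string, name, category = line.split(';')[:3]
        codepoint = int(codepoint_string, 16)
        chr(codepoint)  # raises ValueError if the value is above U+10FFFF
        records += 1
    return records


# Hypothetical excerpt in the UnicodeData.txt field layout (illustration only).
SAMPLE = (
    "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;\n"
    "4E00;<CJK Ideograph, First>;Lo;0;L;;;;;N;;;;;\n"
    "9FFF;<CJK Ideograph, Last>;Lo;0;L;;;;;N;;;;;\n"
)

print(count_records(io.StringIO(SAMPLE)))  # prints 3
```

The two range lines in the sample contribute two records, which is why the asserted total in the test changes with each Unicode release and has to be updated alongside `Version:`.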