├── LICENCE ├── README.md ├── composer.json ├── example.php ├── idna_convert.class.php ├── transcode_wrapper.php └── uctc.php /LICENCE: -------------------------------------------------------------------------------- 1 | GNU LESSER GENERAL PUBLIC LICENSE 2 | Version 2.1, February 1999 3 | 4 | Copyright (C) 1991, 1999 Free Software Foundation, Inc. 5 | 51 Franklin St, Boston, MA 02110, United States 6 | Everyone is permitted to copy and distribute verbatim copies 7 | of this license document, but changing it is not allowed. 8 | 9 | [This is the first released version of the Lesser GPL. It also counts 10 | as the successor of the GNU Library Public License, version 2, hence 11 | the version number 2.1.] 12 | 13 | Preamble 14 | 15 | The licenses for most software are designed to take away your 16 | freedom to share and change it. By contrast, the GNU General Public 17 | Licenses are intended to guarantee your freedom to share and change 18 | free software--to make sure the software is free for all its users. 19 | 20 | This license, the Lesser General Public License, applies to some 21 | specially designated software packages--typically libraries--of the 22 | Free Software Foundation and other authors who decide to use it. You 23 | can use it too, but we suggest you first think carefully about whether 24 | this license or the ordinary General Public License is the better 25 | strategy to use in any particular case, based on the explanations below. 26 | 27 | When we speak of free software, we are referring to freedom of use, 28 | not price. Our General Public Licenses are designed to make sure that 29 | you have the freedom to distribute copies of free software (and charge 30 | for this service if you wish); that you receive source code or can get 31 | it if you want it; that you can change the software and use pieces of 32 | it in new free programs; and that you are informed that you can do 33 | these things. 34 | 35 | To protect your rights, we need to make restrictions that forbid 36 | distributors to deny you these rights or to ask you to surrender these 37 | rights. These restrictions translate to certain responsibilities for 38 | you if you distribute copies of the library or if you modify it. 39 | 40 | For example, if you distribute copies of the library, whether gratis 41 | or for a fee, you must give the recipients all the rights that we gave 42 | you. You must make sure that they, too, receive or can get the source 43 | code. If you link other code with the library, you must provide 44 | complete object files to the recipients, so that they can relink them 45 | with the library after making changes to the library and recompiling 46 | it. And you must show them these terms so they know their rights. 47 | 48 | We protect your rights with a two-step method: (1) we copyright the 49 | library, and (2) we offer you this license, which gives you legal 50 | permission to copy, distribute and/or modify the library. 51 | 52 | To protect each distributor, we want to make it very clear that 53 | there is no warranty for the free library. Also, if the library is 54 | modified by someone else and passed on, the recipients should know 55 | that what they have is not the original version, so that the original 56 | author's reputation will not be affected by problems that might be 57 | introduced by others. 58 | 59 | Finally, software patents pose a constant threat to the existence of 60 | any free program. We wish to make sure that a company cannot 61 | effectively restrict the users of a free program by obtaining a 62 | restrictive license from a patent holder. Therefore, we insist that 63 | any patent license obtained for a version of the library must be 64 | consistent with the full freedom of use specified in this license. 65 | 66 | Most GNU software, including some libraries, is covered by the 67 | ordinary GNU General Public License. This license, the GNU Lesser 68 | General Public License, applies to certain designated libraries, and 69 | is quite different from the ordinary General Public License. We use 70 | this license for certain libraries in order to permit linking those 71 | libraries into non-free programs. 72 | 73 | When a program is linked with a library, whether statically or using 74 | a shared library, the combination of the two is legally speaking a 75 | combined work, a derivative of the original library. The ordinary 76 | General Public License therefore permits such linking only if the 77 | entire combination fits its criteria of freedom. The Lesser General 78 | Public License permits more lax criteria for linking other code with 79 | the library. 80 | 81 | We call this license the "Lesser" General Public License because it 82 | does Less to protect the user's freedom than the ordinary General 83 | Public License. It also provides other free software developers Less 84 | of an advantage over competing non-free programs. These disadvantages 85 | are the reason we use the ordinary General Public License for many 86 | libraries. However, the Lesser license provides advantages in certain 87 | special circumstances. 88 | 89 | For example, on rare occasions, there may be a special need to 90 | encourage the widest possible use of a certain library, so that it becomes 91 | a de-facto standard. To achieve this, non-free programs must be 92 | allowed to use the library. A more frequent case is that a free 93 | library does the same job as widely used non-free libraries. In this 94 | case, there is little to gain by limiting the free library to free 95 | software only, so we use the Lesser General Public License. 96 | 97 | In other cases, permission to use a particular library in non-free 98 | programs enables a greater number of people to use a large body of 99 | free software. For example, permission to use the GNU C Library in 100 | non-free programs enables many more people to use the whole GNU 101 | operating system, as well as its variant, the GNU/Linux operating 102 | system. 103 | 104 | Although the Lesser General Public License is Less protective of the 105 | users' freedom, it does ensure that the user of a program that is 106 | linked with the Library has the freedom and the wherewithal to run 107 | that program using a modified version of the Library. 108 | 109 | The precise terms and conditions for copying, distribution and 110 | modification follow. Pay close attention to the difference between a 111 | "work based on the library" and a "work that uses the library". The 112 | former contains code derived from the library, whereas the latter must 113 | be combined with the library in order to run. 114 | 115 | GNU LESSER GENERAL PUBLIC LICENSE 116 | TERMS AND CONDITIONS FOR COPYING, DISTRIBUTION AND MODIFICATION 117 | 118 | 0. This License Agreement applies to any software library or other 119 | program which contains a notice placed by the copyright holder or 120 | other authorized party saying it may be distributed under the terms of 121 | this Lesser General Public License (also called "this License"). 122 | Each licensee is addressed as "you". 123 | 124 | A "library" means a collection of software functions and/or data 125 | prepared so as to be conveniently linked with application programs 126 | (which use some of those functions and data) to form executables. 127 | 128 | The "Library", below, refers to any such software library or work 129 | which has been distributed under these terms. A "work based on the 130 | Library" means either the Library or any derivative work under 131 | copyright law: that is to say, a work containing the Library or a 132 | portion of it, either verbatim or with modifications and/or translated 133 | straightforwardly into another language. (Hereinafter, translation is 134 | included without limitation in the term "modification".) 135 | 136 | "Source code" for a work means the preferred form of the work for 137 | making modifications to it. For a library, complete source code means 138 | all the source code for all modules it contains, plus any associated 139 | interface definition files, plus the scripts used to control compilation 140 | and installation of the library. 141 | 142 | Activities other than copying, distribution and modification are not 143 | covered by this License; they are outside its scope. The act of 144 | running a program using the Library is not restricted, and output from 145 | such a program is covered only if its contents constitute a work based 146 | on the Library (independent of the use of the Library in a tool for 147 | writing it). Whether that is true depends on what the Library does 148 | and what the program that uses the Library does. 149 | 150 | 1. You may copy and distribute verbatim copies of the Library's 151 | complete source code as you receive it, in any medium, provided that 152 | you conspicuously and appropriately publish on each copy an 153 | appropriate copyright notice and disclaimer of warranty; keep intact 154 | all the notices that refer to this License and to the absence of any 155 | warranty; and distribute a copy of this License along with the 156 | Library. 157 | 158 | You may charge a fee for the physical act of transferring a copy, 159 | and you may at your option offer warranty protection in exchange for a 160 | fee. 161 | 162 | 2. You may modify your copy or copies of the Library or any portion 163 | of it, thus forming a work based on the Library, and copy and 164 | distribute such modifications or work under the terms of Section 1 165 | above, provided that you also meet all of these conditions: 166 | 167 | a) The modified work must itself be a software library. 168 | 169 | b) You must cause the files modified to carry prominent notices 170 | stating that you changed the files and the date of any change. 171 | 172 | c) You must cause the whole of the work to be licensed at no 173 | charge to all third parties under the terms of this License. 174 | 175 | d) If a facility in the modified Library refers to a function or a 176 | table of data to be supplied by an application program that uses 177 | the facility, other than as an argument passed when the facility 178 | is invoked, then you must make a good faith effort to ensure that, 179 | in the event an application does not supply such function or 180 | table, the facility still operates, and performs whatever part of 181 | its purpose remains meaningful. 182 | 183 | (For example, a function in a library to compute square roots has 184 | a purpose that is entirely well-defined independent of the 185 | application. Therefore, Subsection 2d requires that any 186 | application-supplied function or table used by this function must 187 | be optional: if the application does not supply it, the square 188 | root function must still compute square roots.) 189 | 190 | These requirements apply to the modified work as a whole. If 191 | identifiable sections of that work are not derived from the Library, 192 | and can be reasonably considered independent and separate works in 193 | themselves, then this License, and its terms, do not apply to those 194 | sections when you distribute them as separate works. But when you 195 | distribute the same sections as part of a whole which is a work based 196 | on the Library, the distribution of the whole must be on the terms of 197 | this License, whose permissions for other licensees extend to the 198 | entire whole, and thus to each and every part regardless of who wrote 199 | it. 200 | 201 | Thus, it is not the intent of this section to claim rights or contest 202 | your rights to work written entirely by you; rather, the intent is to 203 | exercise the right to control the distribution of derivative or 204 | collective works based on the Library. 205 | 206 | In addition, mere aggregation of another work not based on the Library 207 | with the Library (or with a work based on the Library) on a volume of 208 | a storage or distribution medium does not bring the other work under 209 | the scope of this License. 210 | 211 | 3. You may opt to apply the terms of the ordinary GNU General Public 212 | License instead of this License to a given copy of the Library. To do 213 | this, you must alter all the notices that refer to this License, so 214 | that they refer to the ordinary GNU General Public License, version 2, 215 | instead of to this License. (If a newer version than version 2 of the 216 | ordinary GNU General Public License has appeared, then you can specify 217 | that version instead if you wish.) Do not make any other change in 218 | these notices. 219 | 220 | Once this change is made in a given copy, it is irreversible for 221 | that copy, so the ordinary GNU General Public License applies to all 222 | subsequent copies and derivative works made from that copy. 223 | 224 | This option is useful when you wish to copy part of the code of 225 | the Library into a program that is not a library. 226 | 227 | 4. You may copy and distribute the Library (or a portion or 228 | derivative of it, under Section 2) in object code or executable form 229 | under the terms of Sections 1 and 2 above provided that you accompany 230 | it with the complete corresponding machine-readable source code, which 231 | must be distributed under the terms of Sections 1 and 2 above on a 232 | medium customarily used for software interchange. 233 | 234 | If distribution of object code is made by offering access to copy 235 | from a designated place, then offering equivalent access to copy the 236 | source code from the same place satisfies the requirement to 237 | distribute the source code, even though third parties are not 238 | compelled to copy the source along with the object code. 239 | 240 | 5. A program that contains no derivative of any portion of the 241 | Library, but is designed to work with the Library by being compiled or 242 | linked with it, is called a "work that uses the Library". Such a 243 | work, in isolation, is not a derivative work of the Library, and 244 | therefore falls outside the scope of this License. 245 | 246 | However, linking a "work that uses the Library" with the Library 247 | creates an executable that is a derivative of the Library (because it 248 | contains portions of the Library), rather than a "work that uses the 249 | library". The executable is therefore covered by this License. 250 | Section 6 states terms for distribution of such executables. 251 | 252 | When a "work that uses the Library" uses material from a header file 253 | that is part of the Library, the object code for the work may be a 254 | derivative work of the Library even though the source code is not. 255 | Whether this is true is especially significant if the work can be 256 | linked without the Library, or if the work is itself a library. The 257 | threshold for this to be true is not precisely defined by law. 258 | 259 | If such an object file uses only numerical parameters, data 260 | structure layouts and accessors, and small macros and small inline 261 | functions (ten lines or less in length), then the use of the object 262 | file is unrestricted, regardless of whether it is legally a derivative 263 | work. (Executables containing this object code plus portions of the 264 | Library will still fall under Section 6.) 265 | 266 | Otherwise, if the work is a derivative of the Library, you may 267 | distribute the object code for the work under the terms of Section 6. 268 | Any executables containing that work also fall under Section 6, 269 | whether or not they are linked directly with the Library itself. 270 | 271 | 6. As an exception to the Sections above, you may also combine or 272 | link a "work that uses the Library" with the Library to produce a 273 | work containing portions of the Library, and distribute that work 274 | under terms of your choice, provided that the terms permit 275 | modification of the work for the customer's own use and reverse 276 | engineering for debugging such modifications. 277 | 278 | You must give prominent notice with each copy of the work that the 279 | Library is used in it and that the Library and its use are covered by 280 | this License. You must supply a copy of this License. If the work 281 | during execution displays copyright notices, you must include the 282 | copyright notice for the Library among them, as well as a reference 283 | directing the user to the copy of this License. Also, you must do one 284 | of these things: 285 | 286 | a) Accompany the work with the complete corresponding 287 | machine-readable source code for the Library including whatever 288 | changes were used in the work (which must be distributed under 289 | Sections 1 and 2 above); and, if the work is an executable linked 290 | with the Library, with the complete machine-readable "work that 291 | uses the Library", as object code and/or source code, so that the 292 | user can modify the Library and then relink to produce a modified 293 | executable containing the modified Library. (It is understood 294 | that the user who changes the contents of definitions files in the 295 | Library will not necessarily be able to recompile the application 296 | to use the modified definitions.) 297 | 298 | b) Use a suitable shared library mechanism for linking with the 299 | Library. A suitable mechanism is one that (1) uses at run time a 300 | copy of the library already present on the user's computer system, 301 | rather than copying library functions into the executable, and (2) 302 | will operate properly with a modified version of the library, if 303 | the user installs one, as long as the modified version is 304 | interface-compatible with the version that the work was made with. 305 | 306 | c) Accompany the work with a written offer, valid for at 307 | least three years, to give the same user the materials 308 | specified in Subsection 6a, above, for a charge no more 309 | than the cost of performing this distribution. 310 | 311 | d) If distribution of the work is made by offering access to copy 312 | from a designated place, offer equivalent access to copy the above 313 | specified materials from the same place. 314 | 315 | e) Verify that the user has already received a copy of these 316 | materials or that you have already sent this user a copy. 317 | 318 | For an executable, the required form of the "work that uses the 319 | Library" must include any data and utility programs needed for 320 | reproducing the executable from it. However, as a special exception, 321 | the materials to be distributed need not include anything that is 322 | normally distributed (in either source or binary form) with the major 323 | components (compiler, kernel, and so on) of the operating system on 324 | which the executable runs, unless that component itself accompanies 325 | the executable. 326 | 327 | It may happen that this requirement contradicts the license 328 | restrictions of other proprietary libraries that do not normally 329 | accompany the operating system. Such a contradiction means you cannot 330 | use both them and the Library together in an executable that you 331 | distribute. 332 | 333 | 7. You may place library facilities that are a work based on the 334 | Library side-by-side in a single library together with other library 335 | facilities not covered by this License, and distribute such a combined 336 | library, provided that the separate distribution of the work based on 337 | the Library and of the other library facilities is otherwise 338 | permitted, and provided that you do these two things: 339 | 340 | a) Accompany the combined library with a copy of the same work 341 | based on the Library, uncombined with any other library 342 | facilities. This must be distributed under the terms of the 343 | Sections above. 344 | 345 | b) Give prominent notice with the combined library of the fact 346 | that part of it is a work based on the Library, and explaining 347 | where to find the accompanying uncombined form of the same work. 348 | 349 | 8. You may not copy, modify, sublicense, link with, or distribute 350 | the Library except as expressly provided under this License. Any 351 | attempt otherwise to copy, modify, sublicense, link with, or 352 | distribute the Library is void, and will automatically terminate your 353 | rights under this License. However, parties who have received copies, 354 | or rights, from you under this License will not have their licenses 355 | terminated so long as such parties remain in full compliance. 356 | 357 | 9. You are not required to accept this License, since you have not 358 | signed it. However, nothing else grants you permission to modify or 359 | distribute the Library or its derivative works. These actions are 360 | prohibited by law if you do not accept this License. Therefore, by 361 | modifying or distributing the Library (or any work based on the 362 | Library), you indicate your acceptance of this License to do so, and 363 | all its terms and conditions for copying, distributing or modifying 364 | the Library or works based on it. 365 | 366 | 10. Each time you redistribute the Library (or any work based on the 367 | Library), the recipient automatically receives a license from the 368 | original licensor to copy, distribute, link with or modify the Library 369 | subject to these terms and conditions. You may not impose any further 370 | restrictions on the recipients' exercise of the rights granted herein. 371 | You are not responsible for enforcing compliance by third parties with 372 | this License. 373 | 374 | 11. If, as a consequence of a court judgment or allegation of patent 375 | infringement or for any other reason (not limited to patent issues), 376 | conditions are imposed on you (whether by court order, agreement or 377 | otherwise) that contradict the conditions of this License, they do not 378 | excuse you from the conditions of this License. If you cannot 379 | distribute so as to satisfy simultaneously your obligations under this 380 | License and any other pertinent obligations, then as a consequence you 381 | may not distribute the Library at all. For example, if a patent 382 | license would not permit royalty-free redistribution of the Library by 383 | all those who receive copies directly or indirectly through you, then 384 | the only way you could satisfy both it and this License would be to 385 | refrain entirely from distribution of the Library. 386 | 387 | If any portion of this section is held invalid or unenforceable under any 388 | particular circumstance, the balance of the section is intended to apply, 389 | and the section as a whole is intended to apply in other circumstances. 390 | 391 | It is not the purpose of this section to induce you to infringe any 392 | patents or other property right claims or to contest validity of any 393 | such claims; this section has the sole purpose of protecting the 394 | integrity of the free software distribution system which is 395 | implemented by public license practices. Many people have made 396 | generous contributions to the wide range of software distributed 397 | through that system in reliance on consistent application of that 398 | system; it is up to the author/donor to decide if he or she is willing 399 | to distribute software through any other system and a licensee cannot 400 | impose that choice. 401 | 402 | This section is intended to make thoroughly clear what is believed to 403 | be a consequence of the rest of this License. 404 | 405 | 12. If the distribution and/or use of the Library is restricted in 406 | certain countries either by patents or by copyrighted interfaces, the 407 | original copyright holder who places the Library under this License may add 408 | an explicit geographical distribution limitation excluding those countries, 409 | so that distribution is permitted only in or among countries not thus 410 | excluded. In such case, this License incorporates the limitation as if 411 | written in the body of this License. 412 | 413 | 13. The Free Software Foundation may publish revised and/or new 414 | versions of the Lesser General Public License from time to time. 415 | Such new versions will be similar in spirit to the present version, 416 | but may differ in detail to address new problems or concerns. 417 | 418 | Each version is given a distinguishing version number. If the Library 419 | specifies a version number of this License which applies to it and 420 | "any later version", you have the option of following the terms and 421 | conditions either of that version or of any later version published by 422 | the Free Software Foundation. If the Library does not specify a 423 | license version number, you may choose any version ever published by 424 | the Free Software Foundation. 425 | 426 | 14. If you wish to incorporate parts of the Library into other free 427 | programs whose distribution conditions are incompatible with these, 428 | write to the author to ask for permission. For software which is 429 | copyrighted by the Free Software Foundation, write to the Free 430 | Software Foundation; we sometimes make exceptions for this. Our 431 | decision will be guided by the two goals of preserving the free status 432 | of all derivatives of our free software and of promoting the sharing 433 | and reuse of software generally. 434 | 435 | NO WARRANTY 436 | 437 | 15. BECAUSE THE LIBRARY IS LICENSED FREE OF CHARGE, THERE IS NO 438 | WARRANTY FOR THE LIBRARY, TO THE EXTENT PERMITTED BY APPLICABLE LAW. 439 | EXCEPT WHEN OTHERWISE STATED IN WRITING THE COPYRIGHT HOLDERS AND/OR 440 | OTHER PARTIES PROVIDE THE LIBRARY "AS IS" WITHOUT WARRANTY OF ANY 441 | KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING, BUT NOT LIMITED TO, THE 442 | IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR 443 | PURPOSE. THE ENTIRE RISK AS TO THE QUALITY AND PERFORMANCE OF THE 444 | LIBRARY IS WITH YOU. SHOULD THE LIBRARY PROVE DEFECTIVE, YOU ASSUME 445 | THE COST OF ALL NECESSARY SERVICING, REPAIR OR CORRECTION. 446 | 447 | 16. IN NO EVENT UNLESS REQUIRED BY APPLICABLE LAW OR AGREED TO IN 448 | WRITING WILL ANY COPYRIGHT HOLDER, OR ANY OTHER PARTY WHO MAY MODIFY 449 | AND/OR REDISTRIBUTE THE LIBRARY AS PERMITTED ABOVE, BE LIABLE TO YOU 450 | FOR DAMAGES, INCLUDING ANY GENERAL, SPECIAL, INCIDENTAL OR 451 | CONSEQUENTIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THE 452 | LIBRARY (INCLUDING BUT NOT LIMITED TO LOSS OF DATA OR DATA BEING 453 | RENDERED INACCURATE OR LOSSES SUSTAINED BY YOU OR THIRD PARTIES OR A 454 | FAILURE OF THE LIBRARY TO OPERATE WITH ANY OTHER SOFTWARE), EVEN IF 455 | SUCH HOLDER OR OTHER PARTY HAS BEEN ADVISED OF THE POSSIBILITY OF SUCH 456 | DAMAGES. 457 | 458 | END OF TERMS AND CONDITIONS 459 | 460 | How to Apply These Terms to Your New Libraries 461 | 462 | If you develop a new library, and you want it to be of the greatest 463 | possible use to the public, we recommend making it free software that 464 | everyone can redistribute and change. You can do so by permitting 465 | redistribution under these terms (or, alternatively, under the terms of the 466 | ordinary General Public License). 467 | 468 | To apply these terms, attach the following notices to the library. It is 469 | safest to attach them to the start of each source file to most effectively 470 | convey the exclusion of warranty; and each file should have at least the 471 | "copyright" line and a pointer to where the full notice is found. 472 | 473 | 474 | Copyright (C) 475 | 476 | This library is free software; you can redistribute it and/or 477 | modify it under the terms of the GNU Lesser General Public 478 | License as published by the Free Software Foundation; either 479 | version 2.1 of the License, or (at your option) any later version. 480 | 481 | This library is distributed in the hope that it will be useful, 482 | but WITHOUT ANY WARRANTY; without even the implied warranty of 483 | MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU 484 | Lesser General Public License for more details. 485 | 486 | You should have received a copy of the GNU Lesser General Public 487 | License along with this library; if not, write to the Free Software 488 | Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA 489 | 490 | Also add information on how to contact you by electronic and paper mail. 491 | 492 | You should also get your employer (if you work as a programmer) or your 493 | school, if any, to sign a "copyright disclaimer" for the library, if 494 | necessary. Here is a sample; alter the names: 495 | 496 | Yoyodyne, Inc., hereby disclaims all copyright interest in the 497 | library `Frob' (a library for tweaking knobs) written by James Random Hacker. 498 | 499 | , 1 April 1990 500 | Ty Coon, President of Vice 501 | 502 | That's all there is to it! 503 | -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- 1 | IDNA Convert (idna_convert.class.php) 2 | ===================================== 3 | http://idnaconv.phlymail.de mailto:phlymail@phlylabs.de 4 | 5 | (c) 2004-2014 phlyLabs, Berlin 6 | 7 | Introduction 8 | ------------ 9 | 10 | The class idna_convert allows to convert internationalized domain names 11 | (see RFC 3490, 3491, 3492 and 3454 for detials) as they can be used with various 12 | registries worldwide to be translated between their original (localized) form 13 | and their encoded form as it will be used in the DNS (Domain Name System). 14 | 15 | The class provides two public methods, encode() and decode(), which do exactly 16 | what you would expect them to do. You are allowed to use complete domain names, 17 | simple strings and complete email addresses as well. That means, that you might 18 | use any of the following notations: 19 | 20 | - www.nörgler.com 21 | - xn--nrgler-wxa 22 | - xn--brse-5qa.xn--knrz-1ra.info 23 | 24 | Errors, incorrectly encoded or invalid strings will lead to either a FALSE 25 | response (when in strict mode) or to only partially converted strings. 26 | You can query the occured error by calling the method get_last_error(). 27 | 28 | Unicode strings are expected to be either UTF-8 strings, UCS-4 strings or UCS-4 29 | arrays. The default format is UTF-8. For setting different encodings, you can 30 | call the method setParams() - please see the inline documentation for details. 31 | ACE strings (the Punycode form) are always 7bit ASCII strings. 32 | 33 | **ATTENTION:** As of version 0.6.0 this class is written in the OOP style of PHP5. 34 | Since PHP4 is no longer actively maintained, you should switch to PHP5 as fast as 35 | possible. 36 | We expect to see no compatibility issues with the upcoming PHP6, too. 37 | 38 | **ATTENTION:** BC break! As of version 0.6.4 the class per default allows the German 39 | ligature ß to be encoded as the DeNIC, the registry for .DE allows domains 40 | containing ß. 41 | In older builds "ß" was mapped to "ss". Should you still need this behaviour, 42 | see example 5 below. 43 | 44 | **ATTENTION:** As of version 0.8.0 the class fully supports IDNA 2008. Thus the 45 | aforementioned parameter is deprecated and replaced by a parameter to switch 46 | between the standards. See the updated example 5 below. 47 | 48 | Files 49 | ----- 50 | idna_convert.class.php - The actual class 51 | 52 | example.php - An example web page for converting 53 | 54 | transcode_wrapper.php - Convert various encodings, see below 55 | 56 | uctc.php - phlyLabs' Unicode Transcoder, see below 57 | 58 | ReadMe.txt - This file 59 | 60 | LICENCE - The LGPL licence file 61 | 62 | The class is contained in idna_convert.class.php. 63 | 64 | 65 | Examples 66 | -------- 67 | 1. Say we wish to encode the domain name nörgler.com: 68 | 69 | ```php 70 | // Include the class 71 | require_once('idna_convert.class.php'); 72 | // Instantiate it 73 | $IDN = new idna_convert(); 74 | // The input string, if input is not UTF-8 or UCS-4, it must be converted before 75 | $input = utf8_encode('nörgler.com'); 76 | // Encode it to its punycode presentation 77 | $output = $IDN->encode($input); 78 | // Output, what we got now 79 | echo $output; // This will read: xn--nrgler-wxa.com 80 | ``` 81 | 82 | 2. We received an email from a punycoded domain and are willing to learn, how 83 | the domain name reads originally 84 | 85 | ```php 86 | // Include the class 87 | require_once('idna_convert.class.php'); 88 | // Instantiate it 89 | $IDN = new idna_convert(); 90 | // The input string 91 | $input = 'andre@xn--brse-5qa.xn--knrz-1ra.info'; 92 | // Encode it to its punycode presentation 93 | $output = $IDN->decode($input); 94 | // Output, what we got now, if output should be in a format different to UTF-8 95 | // or UCS-4, you will have to convert it before outputting it 96 | echo utf8_decode($output); // This will read: andre@börse.knörz.info 97 | ``` 98 | 99 | 3. The input is read from a UCS-4 coded file and encoded line by line. By 100 | appending the optional second parameter we tell enode() about the input 101 | format to be used 102 | 103 | ```php 104 | // Include the class 105 | require_once('idna_convert.class.php'); 106 | // Instantiate it 107 | $IDN = new dinca_convert(); 108 | // Iterate through the input file line by line 109 | foreach (file('ucs4-domains.txt') as $line) { 110 | echo $IDN->encode(trim($line), 'ucs4_string'); 111 | echo "\n"; 112 | } 113 | ``` 114 | 115 | 4. We wish to convert a whole URI into the IDNA form, but leave the path or 116 | query string component of it alone. Just using encode() would lead to mangled 117 | paths or query strings. Here the public method encode_uri() comes into play: 118 | 119 | ```php 120 | // Include the class 121 | require_once('idna_convert.class.php'); 122 | // Instantiate it 123 | $IDN = new idna_convert(); 124 | // The input string, a whole URI in UTF-8 (!) 125 | $input = 'http://nörgler:secret@nörgler.com/my_päth_is_not_ÄSCII/'); 126 | // Encode it to its punycode presentation 127 | $output = $IDN->encode_uri($input); 128 | // Output, what we got now 129 | echo $output; // http://nörgler:secret@xn--nrgler-wxa.com/my_päth_is_not_ÄSCII/ 130 | ``` 131 | 132 | 5. To support IDNA 2008, the class needs to be invoked with an additional 133 | parameter. This can also be achieved on an instance. 134 | 135 | ```php 136 | // Include the class 137 | require_once('idna_convert.class.php'); 138 | // Instantiate it 139 | $IDN = new idna_convert(array('idn_version' => 2008)); 140 | // Sth. containing the German letter ß 141 | $input = 'meine-straße.de'); 142 | // Encode it to its punycode presentation 143 | $output = $IDN->encode_uri($input); 144 | // Output, what we got now 145 | echo $output; // xn--meine-strae-46a.de 146 | // Switch back to old IDNA 2003, the original standard 147 | $IDN->set_parameter('idn_version', 2003); 148 | // Sth. containing the German letter ß 149 | $input = 'meine-straße.de'); 150 | // Encode it to its punycode presentation 151 | $output = $IDN->encode_uri($input); 152 | // Output, what we got now 153 | echo $output; // meine-strasse.de 154 | ``` 155 | 156 | Transcode wrapper 157 | ----------------- 158 | In case you have strings in different encoding than ISO-8859-1 and UTF-8 you might need to 159 | translate these strings to UTF-8 before feeding the IDNA converter with it. 160 | PHP's built in functions utf8_encode() and utf8_decode() can only deal with ISO-8859-1. 161 | Use the file transcode_wrapper.php for the conversion. It requires either iconv, libiconv 162 | or mbstring installed together with one of the relevant PHP extensions. 163 | The functions you will find useful are 164 | encode_utf8() as a replacement for utf8_encode() and 165 | decode_utf8() as a replacement for utf8_decode(). 166 | 167 | Example usage: 168 | ```php 169 | encode($mystring); 175 | ?> 176 | ``` 177 | 178 | UCTC - Unicode Transcoder 179 | ------------------------- 180 | Another class you might find useful when dealing with one or more of the Unicode encoding 181 | flavours. The class is static, it requires PHP5. It can transcode into each other: 182 | - UCS-4 string / array 183 | - UTF-8 184 | - UTF-7 185 | - UTF-7 IMAP (modified UTF-7) 186 | All encodings expect / return a string in the given format, with one major exception: 187 | UCS-4 array is jsut an array, where each value represents one codepoint in the string, i.e. 188 | every value is a 32bit integer value. 189 | 190 | Example usage: 191 | ```php 192 | 197 | ``` 198 | 199 | Contact us 200 | ---------- 201 | In case of errors, bugs, questions, wishes, please don't hesitate to contact us 202 | under the email address above. 203 | 204 | The team of phlyLabs 205 | http://phlylabs.de 206 | mailto:phlymail@phlylabs.de 207 | -------------------------------------------------------------------------------- /composer.json: -------------------------------------------------------------------------------- 1 | { 2 | "name": "phpwhois/idna-convert", 3 | "description": "idna-convert library based on http://idnaconv.phlymail.de/", 4 | "keywords": ["PHP","IDN","IDNA"], 5 | "homepage": "http://idnaconv.phlymail.de/", 6 | "type": "library", 7 | "license": "LGPL-2.1", 8 | "authors": [ 9 | { 10 | "name": "Matthias Sommerfeld", 11 | "email": "phlymail@phlylabs.de", 12 | "role": "Developer" 13 | }, 14 | { 15 | "name": "Dmitry Lukashin", 16 | "email": "dmitry@lukashin.ru", 17 | "role": "Maintainer" 18 | } 19 | ], 20 | "autoload": { 21 | "files": [ 22 | "idna_convert.class.php" 23 | ] 24 | }, 25 | "extra": { 26 | "branch-alias": { 27 | "dev-master": "0.9.x-dev" 28 | } 29 | } 30 | } -------------------------------------------------------------------------------- /example.php: -------------------------------------------------------------------------------- 1 | $idn_version)); 8 | 9 | $version_select = ''."\n"; 28 | } 29 | } 30 | ?> 31 | 32 | 33 | 34 | phlyLabs Punycode Converter 35 | 36 | 37 | 51 | 52 | 53 |
54 |
phlyLabs' pure PHP IDNA Converter

55 | 56 | See the RFCs 3490, 57 | 3491, 58 | 3492 and 59 | 3454 as well as 60 | 5890, 61 | 5891, 62 | 5892, 63 | 5893 and 64 | RFC5894.
65 |
66 |
67 |
68 | Dieser Konverter erlaubt die Übersetzung von Domainnamen zwischen der Punycode- und der 69 | Unicode-Schreibweise.
70 | Geben Sie einfach den Domainnamen im entsprechend bezeichneten Feld ein und klicken Sie dann auf den darunter 71 | liegenden Button. Sie können einfache Domainnamen, komplette URLs (wie http://jürgen-müller.de) 72 | oder Emailadressen eingeben.
73 |
74 | Stellen Sie aber sicher, dass Ihr Browser den Zeichensatz UTF-8 unterstützt.
75 |
76 | Wenn Sie Interesse an der zugrundeliegenden PHP-Klasse haben, können Sie diese 77 | hier herunterladen.
78 |
79 | Diese Klasse wird ohne Garantie ihrer Funktionstüchtigkeit bereit gestellt. Nutzung auf eigene Gefahr.
80 | Um sicher zu stellen, dass eine Zeichenkette korrekt umgewandelt wurde, sollten Sie diese immer zurückwandeln 81 | und das Ergebnis mit Ihrer ursprünglichen Eingabe vergleichen.
82 |
83 | Fehler und Probleme können Sie gern an team@phlymail.de senden.
84 | 85 | This converter allows you to transfer domain names between the encoded (Punycode) notation 86 | and the decoded (UTF-8) notation.
87 | Just enter the domain name in the respective field and click on the button right below it to have 88 | it converted. Please note, that you might even enter complete domain names (like jürgen-müller.de) 89 | or a email addresses.
90 |
91 | Make sure, that your browser is capable of the UTF-8 character encoding.
92 |
93 | For those of you interested in the PHP source of the underlying class, you might 94 | download it here.
95 |
96 | Please be aware, that this class is provided as is and without any liability. Use at your own risk.
97 | To ensure, that a certain string has been converted correctly, you should convert it both ways and compare the 98 | results.
99 |
100 | Please feel free to report bugs and problems to: team@phlymail.com.
101 | 102 |
103 |
104 | 105 | 106 | 107 | 108 | 109 | 110 | 111 | 112 | 113 | 120 | 126 | 127 | 128 |
Original (Unicode)Punycode (ACE)
114 |
115 |
116 | 117 | 118 |
119 |
121 |
122 |
123 | 124 |
125 |
129 |
130 | Version used: 0.9.0; © 2004-2014 phlyLabs Berlin; part of phlyMail 131 |
132 | 133 | -------------------------------------------------------------------------------- /transcode_wrapper.php: -------------------------------------------------------------------------------- 1 | 7 | * @version 0.1.0 8 | */ 9 | 10 | /** 11 | * Convert a string from any of various encodings to UTF-8 12 | * 13 | * @param string String to encode 14 | *[@param string Encoding; Default: ISO-8859-1] 15 | *[@param bool Safe Mode: if set to TRUE, the original string is retunred on errors] 16 | * @return string The encoded string or false on failure 17 | * @since 0.0.1 18 | */ 19 | function encode_utf8($string = '', $encoding = 'iso-8859-1', $safe_mode = false) 20 | { 21 | $safe = ($safe_mode) ? $string : false; 22 | if (strtoupper($encoding) == 'UTF-8' || strtoupper($encoding) == 'UTF8') { 23 | return $string; 24 | } elseif (strtoupper($encoding) == 'ISO-8859-1') { 25 | return utf8_encode($string); 26 | } elseif (strtoupper($encoding) == 'WINDOWS-1252') { 27 | return utf8_encode(map_w1252_iso8859_1($string)); 28 | } elseif (strtoupper($encoding) == 'UNICODE-1-1-UTF-7') { 29 | $encoding = 'utf-7'; 30 | } 31 | if (function_exists('mb_convert_encoding')) { 32 | $conv = @mb_convert_encoding($string, 'UTF-8', strtoupper($encoding)); 33 | if ($conv) return $conv; 34 | } 35 | if (function_exists('iconv')) { 36 | $conv = @iconv(strtoupper($encoding), 'UTF-8', $string); 37 | if ($conv) return $conv; 38 | } 39 | if (function_exists('libiconv')) { 40 | $conv = @libiconv(strtoupper($encoding), 'UTF-8', $string); 41 | if ($conv) return $conv; 42 | } 43 | return $safe; 44 | } 45 | 46 | /** 47 | * Convert a string from UTF-8 to any of various encodings 48 | * 49 | * @param string String to decode 50 | *[@param string Encoding; Default: ISO-8859-1] 51 | *[@param bool Safe Mode: if set to TRUE, the original string is retunred on errors] 52 | * @return string The decoded string or false on failure 53 | * @since 0.0.1 54 | */ 55 | function decode_utf8($string = '', $encoding = 'iso-8859-1', $safe_mode = false) 56 | { 57 | $safe = ($safe_mode) ? $string : false; 58 | if (!$encoding) $encoding = 'ISO-8859-1'; 59 | if (strtoupper($encoding) == 'UTF-8' || strtoupper($encoding) == 'UTF8') { 60 | return $string; 61 | } elseif (strtoupper($encoding) == 'ISO-8859-1') { 62 | return utf8_decode($string); 63 | } elseif (strtoupper($encoding) == 'WINDOWS-1252') { 64 | return map_iso8859_1_w1252(utf8_decode($string)); 65 | } elseif (strtoupper($encoding) == 'UNICODE-1-1-UTF-7') { 66 | $encoding = 'utf-7'; 67 | } 68 | if (function_exists('mb_convert_encoding')) { 69 | $conv = @mb_convert_encoding($string, strtoupper($encoding), 'UTF-8'); 70 | if ($conv) return $conv; 71 | } 72 | if (function_exists('iconv')) { 73 | $conv = @iconv('UTF-8', strtoupper($encoding), $string); 74 | if ($conv) return $conv; 75 | } 76 | if (function_exists('libiconv')) { 77 | $conv = @libiconv('UTF-8', strtoupper($encoding), $string); 78 | if ($conv) return $conv; 79 | } 80 | return $safe; 81 | } 82 | 83 | /** 84 | * Special treatment for our guys in Redmond 85 | * Windows-1252 is basically ISO-8859-1 -- with some exceptions, which get accounted for here 86 | * @param string Your input in Win1252 87 | * @param string The resulting ISO-8859-1 string 88 | * @since 3.0.8 89 | */ 90 | function map_w1252_iso8859_1($string = '') 91 | { 92 | if ($string == '') return ''; 93 | $return = ''; 94 | for ($i = 0; $i < strlen($string); ++$i) { 95 | $c = ord($string{$i}); 96 | switch ($c) { 97 | case 129: $return .= chr(252); break; 98 | case 132: $return .= chr(228); break; 99 | case 142: $return .= chr(196); break; 100 | case 148: $return .= chr(246); break; 101 | case 153: $return .= chr(214); break; 102 | case 154: $return .= chr(220); break; 103 | case 225: $return .= chr(223); break; 104 | default: $return .= chr($c); break; 105 | } 106 | } 107 | return $return; 108 | } 109 | 110 | /** 111 | * Special treatment for our guys in Redmond 112 | * Windows-1252 is basically ISO-8859-1 -- with some exceptions, which get accounted for here 113 | * @param string Your input in ISO-8859-1 114 | * @param string The resulting Win1252 string 115 | * @since 3.0.8 116 | */ 117 | function map_iso8859_1_w1252($string = '') 118 | { 119 | if ($string == '') return ''; 120 | $return = ''; 121 | for ($i = 0; $i < strlen($string); ++$i) { 122 | $c = ord($string{$i}); 123 | switch ($c) { 124 | case 196: $return .= chr(142); break; 125 | case 214: $return .= chr(153); break; 126 | case 220: $return .= chr(154); break; 127 | case 223: $return .= chr(225); break; 128 | case 228: $return .= chr(132); break; 129 | case 246: $return .= chr(148); break; 130 | case 252: $return .= chr(129); break; 131 | default: $return .= chr($c); break; 132 | } 133 | } 134 | return $return; 135 | } 136 | 137 | ?> -------------------------------------------------------------------------------- /uctc.php: -------------------------------------------------------------------------------- 1 | 15 | * @copyright 2003-2009 phlyLabs Berlin, http://phlylabs.de 16 | * @version 0.0.6 2009-05-10 17 | */ 18 | class uctc { 19 | private static $mechs = array('ucs4', /*'ucs4le', 'ucs4be', */'ucs4array', /*'utf16', 'utf16le', 'utf16be', */'utf8', 'utf7', 'utf7imap'); 20 | private static $allow_overlong = false; 21 | private static $safe_mode; 22 | private static $safe_char; 23 | 24 | /** 25 | * The actual conversion routine 26 | * 27 | * @param mixed $data The data to convert, usually a string, array when converting from UCS-4 array 28 | * @param string $from Original encoding of the data 29 | * @param string $to Target encoding of the data 30 | * @param bool $safe_mode SafeMode tries to correct invalid codepoints 31 | * @return mixed False on failure, String or array on success, depending on target encoding 32 | * @access public 33 | * @since 0.0.1 34 | */ 35 | public static function convert($data, $from, $to, $safe_mode = false, $safe_char = 0xFFFC) 36 | { 37 | self::$safe_mode = ($safe_mode) ? true : false; 38 | self::$safe_char = ($safe_char) ? $safe_char : 0xFFFC; 39 | if (self::$safe_mode) self::$allow_overlong = true; 40 | if (!in_array($from, self::$mechs)) throw new Exception('Invalid input format specified'); 41 | if (!in_array($to, self::$mechs)) throw new Exception('Invalid output format specified'); 42 | if ($from != 'ucs4array') eval('$data = self::'.$from.'_ucs4array($data);'); 43 | if ($to != 'ucs4array') eval('$data = self::ucs4array_'.$to.'($data);'); 44 | return $data; 45 | } 46 | 47 | /** 48 | * This converts an UTF-8 encoded string to its UCS-4 representation 49 | * 50 | * @param string $input The UTF-8 string to convert 51 | * @return array Array of 32bit values representing each codepoint 52 | * @access private 53 | */ 54 | private static function utf8_ucs4array($input) 55 | { 56 | $output = array(); 57 | $out_len = 0; 58 | $inp_len = strlen($input); 59 | $mode = 'next'; 60 | $test = 'none'; 61 | for ($k = 0; $k < $inp_len; ++$k) { 62 | $v = ord($input{$k}); // Extract byte from input string 63 | 64 | if ($v < 128) { // We found an ASCII char - put into stirng as is 65 | $output[$out_len] = $v; 66 | ++$out_len; 67 | if ('add' == $mode) { 68 | if (self::$safe_mode) { 69 | $output[$out_len-2] = self::$safe_char; 70 | $mode = 'next'; 71 | } else { 72 | throw new Exception('Conversion from UTF-8 to UCS-4 failed: malformed input at byte '.$k); 73 | } 74 | } 75 | continue; 76 | } 77 | if ('next' == $mode) { // Try to find the next start byte; determine the width of the Unicode char 78 | $start_byte = $v; 79 | $mode = 'add'; 80 | $test = 'range'; 81 | if ($v >> 5 == 6) { // &110xxxxx 10xxxxx 82 | $next_byte = 0; // Tells, how many times subsequent bitmasks must rotate 6bits to the left 83 | $v = ($v - 192) << 6; 84 | } elseif ($v >> 4 == 14) { // &1110xxxx 10xxxxxx 10xxxxxx 85 | $next_byte = 1; 86 | $v = ($v - 224) << 12; 87 | } elseif ($v >> 3 == 30) { // &11110xxx 10xxxxxx 10xxxxxx 10xxxxxx 88 | $next_byte = 2; 89 | $v = ($v - 240) << 18; 90 | } elseif (self::$safe_mode) { 91 | $mode = 'next'; 92 | $output[$out_len] = self::$safe_char; 93 | ++$out_len; 94 | continue; 95 | } else { 96 | throw new Exception('This might be UTF-8, but I don\'t understand it at byte '.$k); 97 | } 98 | if ($inp_len-$k-$next_byte < 2) { 99 | $output[$out_len] = self::$safe_char; 100 | $mode = 'no'; 101 | continue; 102 | } 103 | 104 | if ('add' == $mode) { 105 | $output[$out_len] = (int) $v; 106 | ++$out_len; 107 | continue; 108 | } 109 | } 110 | if ('add' == $mode) { 111 | if (!self::$allow_overlong && $test == 'range') { 112 | $test = 'none'; 113 | if (($v < 0xA0 && $start_byte == 0xE0) || ($v < 0x90 && $start_byte == 0xF0) || ($v > 0x8F && $start_byte == 0xF4)) { 114 | throw new Exception('Bogus UTF-8 character detected (out of legal range) at byte '.$k); 115 | } 116 | } 117 | if ($v >> 6 == 2) { // Bit mask must be 10xxxxxx 118 | $v = ($v-128) << ($next_byte*6); 119 | $output[($out_len-1)] += $v; 120 | --$next_byte; 121 | } else { 122 | if (self::$safe_mode) { 123 | $output[$out_len-1] = ord(self::$safe_char); 124 | $k--; 125 | $mode = 'next'; 126 | continue; 127 | } else { 128 | throw new Exception('Conversion from UTF-8 to UCS-4 failed: malformed input at byte '.$k); 129 | } 130 | } 131 | if ($next_byte < 0) { 132 | $mode = 'next'; 133 | } 134 | } 135 | } // for 136 | return $output; 137 | } 138 | 139 | /** 140 | * Convert UCS-4 string into UTF-8 string 141 | * See utf8_ucs4array() for details 142 | * @access private 143 | */ 144 | private static function ucs4array_utf8($input) 145 | { 146 | $output = ''; 147 | foreach ($input as $v) { 148 | if ($v < 128) { // 7bit are transferred literally 149 | $output .= chr($v); 150 | } elseif ($v < (1 << 11)) { // 2 bytes 151 | $output .= chr(192+($v >> 6)).chr(128+($v & 63)); 152 | } elseif ($v < (1 << 16)) { // 3 bytes 153 | $output .= chr(224+($v >> 12)).chr(128+(($v >> 6) & 63)).chr(128+($v & 63)); 154 | } elseif ($v < (1 << 21)) { // 4 bytes 155 | $output .= chr(240+($v >> 18)).chr(128+(($v >> 12) & 63)).chr(128+(($v >> 6) & 63)).chr(128+($v & 63)); 156 | } elseif (self::$safe_mode) { 157 | $output .= self::$safe_char; 158 | } else { 159 | throw new Exception('Conversion from UCS-4 to UTF-8 failed: malformed input at byte '.$k); 160 | } 161 | } 162 | return $output; 163 | } 164 | 165 | private static function utf7imap_ucs4array($input) 166 | { 167 | return self::utf7_ucs4array(str_replace(',', '/', $input), '&'); 168 | } 169 | 170 | private static function utf7_ucs4array($input, $sc = '+') 171 | { 172 | $output = array(); 173 | $out_len = 0; 174 | $inp_len = strlen($input); 175 | $mode = 'd'; 176 | $b64 = ''; 177 | 178 | for ($k = 0; $k < $inp_len; ++$k) { 179 | $c = $input{$k}; 180 | if (0 == ord($c)) continue; // Ignore zero bytes 181 | if ('b' == $mode) { 182 | // Sequence got terminated 183 | if (!preg_match('![A-Za-z0-9/'.preg_quote($sc, '!').']!', $c)) { 184 | if ('-' == $c) { 185 | if ($b64 == '') { 186 | $output[$out_len] = ord($sc); 187 | $out_len++; 188 | $mode = 'd'; 189 | continue; 190 | } 191 | } 192 | $tmp = base64_decode($b64); 193 | $tmp = substr($tmp, -1 * (strlen($tmp) % 2)); 194 | for ($i = 0; $i < strlen($tmp); $i++) { 195 | if ($i % 2) { 196 | $output[$out_len] += ord($tmp{$i}); 197 | $out_len++; 198 | } else { 199 | $output[$out_len] = ord($tmp{$i}) << 8; 200 | } 201 | } 202 | $mode = 'd'; 203 | $b64 = ''; 204 | continue; 205 | } else { 206 | $b64 .= $c; 207 | } 208 | } 209 | if ('d' == $mode) { 210 | if ($sc == $c) { 211 | $mode = 'b'; 212 | continue; 213 | } 214 | $output[$out_len] = ord($c); 215 | $out_len++; 216 | } 217 | } 218 | return $output; 219 | } 220 | 221 | private static function ucs4array_utf7imap($input) 222 | { 223 | return str_replace('/', ',', self::ucs4array_utf7($input, '&')); 224 | } 225 | 226 | private static function ucs4array_utf7($input, $sc = '+') 227 | { 228 | $output = ''; 229 | $mode = 'd'; 230 | $b64 = ''; 231 | while (true) { 232 | $v = (!empty($input)) ? array_shift($input) : false; 233 | $is_direct = (false !== $v) ? (0x20 <= $v && $v <= 0x7e && $v != ord($sc)) : true; 234 | if ($mode == 'b') { 235 | if ($is_direct) { 236 | if ($b64 == chr(0).$sc) { 237 | $output .= $sc.'-'; 238 | $b64 = ''; 239 | } elseif ($b64) { 240 | $output .= $sc.str_replace('=', '', base64_encode($b64)).'-'; 241 | $b64 = ''; 242 | } 243 | $mode = 'd'; 244 | } elseif (false !== $v) { 245 | $b64 .= chr(($v >> 8) & 255). chr($v & 255); 246 | } 247 | } 248 | if ($mode == 'd' && false !== $v) { 249 | if ($is_direct) { 250 | $output .= chr($v); 251 | } else { 252 | $b64 = chr(($v >> 8) & 255). chr($v & 255); 253 | $mode = 'b'; 254 | } 255 | } 256 | if (false === $v && $b64 == '') break; 257 | } 258 | return $output; 259 | } 260 | 261 | /** 262 | * Convert UCS-4 array into UCS-4 string (Little Endian at the moment) 263 | * @access private 264 | */ 265 | private static function ucs4array_ucs4($input) 266 | { 267 | $output = ''; 268 | foreach ($input as $v) { 269 | $output .= chr(($v >> 24) & 255).chr(($v >> 16) & 255).chr(($v >> 8) & 255).chr($v & 255); 270 | } 271 | return $output; 272 | } 273 | 274 | /** 275 | * Convert UCS-4 string (LE in the moment) into UCS-4 garray 276 | * @access private 277 | */ 278 | private static function ucs4_ucs4array($input) 279 | { 280 | $output = array(); 281 | 282 | $inp_len = strlen($input); 283 | // Input length must be dividable by 4 284 | if ($inp_len % 4) { 285 | throw new Exception('Input UCS4 string is broken'); 286 | } 287 | // Empty input - return empty output 288 | if (!$inp_len) return $output; 289 | 290 | for ($i = 0, $out_len = -1; $i < $inp_len; ++$i) { 291 | if (!($i % 4)) { // Increment output position every 4 input bytes 292 | $out_len++; 293 | $output[$out_len] = 0; 294 | } 295 | $output[$out_len] += ord($input{$i}) << (8 * (3 - ($i % 4) ) ); 296 | } 297 | return $output; 298 | } 299 | } 300 | ?> --------------------------------------------------------------------------------