• Register
    • Help

    striker  0 Items
    Currently Supporting
    • Home
    • News
    • Forum
    • Wiki
    • Support
      • Manage Subscriptions
      • FAQ
      • Support For
        • VaultWiki 4.x Series
        • VaultWiki.org Site
    • What's New?
    • Buy Now
    • Manual
    • 
    • Support
    • VaultWiki 3.x Series
    • Task
    • Autolinks and Lower/Uppercase articles

    1. Welcome to VaultWiki.org, home of the wiki add-on for vBulletin and XenForo!

      VaultWiki allows your existing forum users to collaborate on creating and managing a site's content pages. VaultWiki is a fully-featured and fully-supported wiki solution for vBulletin and XenForo.

      The VaultWiki Team encourages you to join our community of forum administrators and check out VaultWiki for yourself.

    Issue: Autolinks and Lower/Uppercase articles

    • Issue Tools
      • View Changes
    1. issueid=2623 March 23, 2012 5:43 PM
      nasko nasko is offline
      New Member
      Autolinks and Lower/Uppercase articles

      So if we have an article with an english title such as Self Employed then autolinking in the forum works for:
      Self Employed
      self employed

      But if we have an article with title in bulgarian it only autolinks it in the forum when it is written exactly in the same way. You don't have this problem here. Here is an example:

      проба

      This will not autolink on our website, if the article title is Проба.

      I would guess that is because the forum uses Windows-1251
      LLinks for articles with bulgarian titles look like that: http://www.bghelp.co.uk/wiki/Eiieieae

      So is there a way to fix that, without converting the database to UTF-8, which will be done when we upgrade to VB 4.
    Issue Details
    Issue Number 2623
    Issue Type Task
    Project VaultWiki 3.x Series
    Category Non-Wiki Threads / Posts
    Status Rejected
    Priority 5 - Minor Bugs / Small Tweaks
    Target Version 3.0.16
    Resolved Version (none)
    Milestone (none)
    Software DependencyAny
    Votes to perform 0
    Votes not to perform 0
    Attachments 0
    Assigned Users (none)
    Tags (none)




    1. March 23, 2012 6:14 PM
      pegasus pegasus is offline
      VaultWiki Team
      There is no way to fix that without converting the database to UTF-8, since unicode comparison needs to be applied when performing the regular expression. If the character input is not also UTF-8, then the results won't be 1:1 (and typically characters are dropped), so we just don't do it.
      Reply Reply  
    2. March 23, 2012 6:26 PM
      nasko nasko is offline
      New Member
      I understood that it can't be done without converting to UTF-8, but can you explain the rest in simple english, cause it sounds like alien language to me Please

      Are you saying that in Windows-1251 there is no way to distinguish between upper and lower letter?

      Thank you.
      Reply Reply  
    3. March 24, 2012 12:29 AM
      pegasus pegasus is offline
      VaultWiki Team
      We pass the message text through what is called a regular expression (RegEx). I believe this is Perl code that is executed by PHP. In the regular expression, we specify for it to ignore differences in case. By default, it either uses the ASCII character set or the server default character set. Either way, we can't change the set that it uses with certainty except by telling it to use unicode (UTF-8), and this is only if PHP is compiled with multi-byte character support.

      The characters in character sets are numbered, and contains information about which characters are different cases of each other. The first 255 characters in a set are typically the same no matter which one is used, from what I remember (directly the ASCII set).

      In the case of Windows-1251 input into RegEx, the compiler can't distinguish upper-case or lower-case for characters that aren't within those first 255 characters - it doesn't know what id # pairs are upper or lower case because it's not familiar with the character set we're using. As a result, even though the characters may be upper/lower-case pairs, it reports them as different characters, and the auto-link is not recognized as valid.
      Reply Reply  
    + Reply

    Assigned Users
    Loading Please Wait
    Tags
    Loading Please Wait
    • Contact Us
    • License Agreement
    • Privacy
    • Terms
    • Top
    All times are GMT -4. The time now is 7:16 PM.
    This site uses cookies to help personalize content, to tailor your experience, and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Learn more… Accept Remind me later
  • striker
    Powered by vBulletin® Version 4.2.5 Beta 2
    Copyright © 2023 vBulletin Solutions Inc. All rights reserved.
    Search Engine Optimisation provided by DragonByte SEO (Pro) - vBulletin Mods & Addons Copyright © 2023 DragonByte Technologies Ltd.
    Copyright © 2008 - 2013 VaultWiki Team, Cracked Egg Studios, LLC.