• Register
    • Help

    striker  0 Items
    Currently Supporting
    • Home
    • News
    • Forum
    • Wiki
    • Support
      • Manage Subscriptions
      • FAQ
      • Support For
        • VaultWiki 4.x Series
        • VaultWiki.org Site
    • What's New?
    • Buy Now
    • Manual
    • 
    • Support
    • VaultWiki 4.x Series
    • Bug
    • Consecutive periods and apostrophs are imported as unknown symbol

    1. Welcome to VaultWiki.org, home of the wiki add-on for vBulletin and XenForo!

      VaultWiki allows your existing forum users to collaborate on creating and managing a site's content pages. VaultWiki is a fully-featured and fully-supported wiki solution for vBulletin and XenForo.

      The VaultWiki Team encourages you to join our community of forum administrators and check out VaultWiki for yourself.

    Issue: Consecutive periods and apostrophs are imported as unknown symbol

    • Issue Tools
      • View Changes
    1. issueid=5003 April 7, 2017 6:41 AM
      Alfa1 Alfa1 is offline
      Distinguished Member
      Consecutive periods and apostrophs are imported as unknown symbol

      Please see bug report thread 300113 on my live site. Consecutive periods and apostrophes are imported as unknown symbol which are not displayed except in the editor.
      This is most likely related to: Special Characters imported as (???????)
    Issue Details
    Issue Number 5003
    Issue Type Bug
    Project VaultWiki 4.x Series
    Category Importing
    Status Fixed
    Priority 1 - Security / Login / Data Loss
    Affected Version 4.0.17
    Fixed Version 4.0.18
    Milestone (none)
    Software DependencyXenForo 1.x
    License TypePaid
    Users able to reproduce bug 0
    Users unable to reproduce bug 0
    Attachments 0
    Assigned Users (none)
    Tags (none)




    1. April 7, 2017 11:10 AM
      pegasus pegasus is offline
      VaultWiki Team
      In the examples given, from how these appear in the screenshot, these are most likely not periods or apostrophes but:
      - MSWord curly quotes
      - MSWord ellipsis
      - and if your other report about "minus" sign is related, MSWord ndash and/or mdash.

      It is usually hard to get MSWord characters converted to UTF-8. vBulletin's editor had a button that was intended to remove MSWord formatting, but I suspect many users did not use it.

      I know some other users had issues with importing these characters in the past, but for at least one case that I can remember, it ended up being related to the user's forum's bad words filter, rather than a problem with the importer itself.
      Reply Reply  
    2. April 7, 2017 3:42 PM
      Alfa1 Alfa1 is offline
      Distinguished Member
      mhm. This really is a tricky issue which affects so many of my articles.
      MS Word may well be a cause, as vbulletin had no drafts feature and users frequently lost their content. So MSW was the logical solution.

      But what this all means is that unless I come up with some solution, I basically 1100 articles of which a significant share will be affected by this. Fixing all that up manually is not realistic. Not unless the issues can be easily identified/located and mass fixed.
      Reply Reply  
    3. April 12, 2017 2:44 PM
      Alfa1 Alfa1 is offline
      Distinguished Member
      Please see bug report 300113 on my live site.
      Reply Reply  
    4. May 6, 2017 6:13 PM
      pegasus pegasus is offline
      VaultWiki Team
      While we did the conversion almost exactly the same way as XenForo import, there were some differences which resulted in this issue:
      - To improve accuracy when the site contains multiple character sets, the character set detection algorithm is used (XenForo only uses the character set of the default language, which may be incorrect if content was posted using a different forum language).
      - Due to a bug in PHP, it is not possible to detect character sets like Windows-1252 or CP-1252; however, they are similar to ISO-8859-1, and are detected as such (something VaultWiki's importer was already aware of).
      - Due to a bug in PHP, converting from ISO-8859-1 does not include high-byte characters; these are only converted when converting from Windows-1252 (something XenForo's importer was aware of, but not VaultWiki's).

      This is fixed in the next release by treating a detected character set of ISO-8859-1 as Windows-1252, since Windows-1252 cannot be detected and it includes ISO-8859-1 anyway.

      Additionally, the automatic correction of the database character set from blank to latin1 had not been working for a few versions; the charset had to be manually entered in the import config file. The setting is obsolete in the next release anyway, since it introduces form-based import configs, with validation for settings like these.

      I suspect that this change will resolve all your other similar reports as Duplicates, but I will check each case individually.
      Reply Reply  
    + Reply

    Assigned Users
    Loading Please Wait
    Tags
    Loading Please Wait
    • Contact Us
    • License Agreement
    • Privacy
    • Terms
    • Top
    All times are GMT -4. The time now is 5:53 PM.
    This site uses cookies to help personalize content, to tailor your experience, and to keep you logged in if you register.
    By continuing to use this site, you are consenting to our use of cookies.
    Learn more… Accept Remind me later
  • striker
    Powered by vBulletin® Version 4.2.5 Beta 2
    Copyright © 2025 vBulletin Solutions Inc. All rights reserved.
    Search Engine Optimisation provided by DragonByte SEO (Pro) - vBulletin Mods & Addons Copyright © 2025 DragonByte Technologies Ltd.
    Copyright © 2008 - 2024 VaultWiki Team, Cracked Egg Studios, LLC.